this post was submitted on 04 Feb 2024

209 points (92.0% liked)

Games

17090 readers

283 users here now

Video game news oriented community. No NanoUFO is not a bot :)

Posts.

News oriented content (general reviews, previews or retrospectives allowed).
Broad discussion posts (preferably not only about a specific game).
No humor/memes etc..
No affiliate links
No advertising.
No clickbait, editorialized, sensational titles. State the game in question in the title. No all caps.
No self promotion.
No duplicate posts, newer post will be deleted unless there is more discussion in one of the posts.
No politics.

Comments.

No personal attacks.
Obey instance rules.
No low effort comments(one or two words, emoji etc..)
Please use spoiler tags for spoilers.

My goal is just to have a community where people can go and see what new game news is out for the day and comment on it.

Other communities:

Beehaw.org gaming

Lemmy.ml gaming

lemmy.ca pcgaming

founded 2 years ago

MODERATORS

nanoUFO@sh.itjust.works

209

Palworld server costs near $500K per month as network engineer is ordered to 'never let the service go down no matter what' (www.pcgamer.com)

submitted 11 months ago by nanoUFO@sh.itjust.works to c/games@sh.itjust.works

46 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] saint@group.lt 56 points 11 months ago (4 children)

well this is probably PR as there is no such system nor it can be made that can have 100% uptime. not talking about the fact that network engineers rarely work with servers :)

[–] PoTayToes@sh.itjust.works 36 points 11 months ago (2 children)

Not 100% but 99.9%... IIRC Guild Wars 2 servers had like 1 actual outage in 11 years. They have pretty amazing structure.

[–] drislands@lemmy.world 19 points 11 months ago (1 children)

Fun fact, uptime goals are measured in nines -- for example, 99.9% is three nines of uptime. If that one outage lasted an entire day, and they were never down at any other time, that would indeed be three nines of uptime.

[–] fibojoly@sh.itjust.works 6 points 11 months ago (2 children)

Yeah, my net admin colleagues explained that one to me a while back because the bosses were making similar uninformed demands ("this needs to never go down!" "Sure, here is how much that costs"). It was very enlightening :)

[–] sugar_in_your_tea@sh.itjust.works 2 points 11 months ago (1 children)

And by "never," a lot of bosses really mean, "you're working overtime if it goes down." It always comes down to cost.

[–] fibojoly@sh.itjust.works 4 points 11 months ago

Those colleagues are pretty much all quitting now, after their over time calculation has been modified because it was costing the company too much, while for other teams like mine it stayed the same. It's hilarious seeing the complete panic from the higher ups realizing how badly they fucked up. And I don't think they actually really realize...

[–] Kushia@lemmy.ml 2 points 11 months ago

Once I got a serious response to that from a manager saying that he could go on eBay, buy his own servers and do it himself. My response was to quit.

[–] saint@group.lt -2 points 11 months ago

a lot of things are possible if you are lucky enough ;)

[–] p03locke@lemmy.dbzer0.com 13 points 11 months ago (2 children)

well this is probably PR as there is no such system nor it can be made that can have 100% uptime.

Five-nines is entirely possible with enough resources and competent outage-minded engineers.

[–] send_me_your_ink@lemmynsfw.com 6 points 11 months ago (1 children)

Hell. Five nines is doable with eks, a single engineer and thinking through your changes before pushing them to prod. Ask me how I know...

[–] p03locke@lemmy.dbzer0.com 6 points 11 months ago (1 children)

Operations like this don't have a single engineer. The more complex the project, the higher the risk of complications and outages. It's not a matter of "oh, just think harder about your changes".

Ask me how I know...

[–] starman2112@sh.itjust.works 3 points 11 months ago

If you've got a rant, I'm all ears

[–] saint@group.lt -3 points 11 months ago (1 children)

Agree, but five nines are not 100% ;) Anyway - this discussion reminds me of Technical Report 85.7 - Jim Gray, which might be of the interest to some of you.

[–] masterspace@lemmy.ca 4 points 11 months ago

Distinguishing between 5 nines and 100% is just semantics in any discussion outside of contractual ones.

[–] Evotech@lemmy.world 5 points 11 months ago

If you just threaten your employees enough they river go down /s

[–] zeusbottom@sh.itjust.works 2 points 11 months ago (1 children)

This is a software development business, which is a positively bananas trade no matter what’s getting written. And the smaller the business, the more hats network guys wear. We work with everything from the server app down to the coffee machine fueling the devs. And 100% uptime isn’t the most crazy demand I’ve heard. I’m sure Chujo is busier than a one-armed paper hanger with jock itch.

At least he’s got money to throw at his hosting company. Scaling up would have been much slower in the old days.

[–] Meloku 3 points 11 months ago (2 children)

I'm not versed in videogame network infrastructures, but wouldn't be enough just having a load balancer and a couple of instances to ensure "100% uptime"? At least before all instances and the load balancer itself decide to join a suicidal pact, but more instances mean less chance of a critical event happening, no?

[–] jpeps@lemmy.world 3 points 11 months ago

At a press level, sure, and the same for the average user. Legally speaking these numbers do have significance, though. Amazon Web Services (at least at one time) offer a guarantee of 99.99% uptime for their infrastructure. That 0.001% covers things like once a year outages that make the news. A 10000th of a year is actually a tangible amount of time and not even Amazon is confident enough to ignore it.

[–] zeusbottom@sh.itjust.works 1 points 11 months ago

Depends on the cloud provider. AWS, as an example, have up to three “availability zones” within a single data center. If the customer needs HA, they are encouraged to run their applications in separate availability zones. It means different subnets within the VPC, redundant LBs spread across those zones, and more.

There is also probably DNS-based global load balancing across different data centers.

That’s just the hosting infrastructure. I’m sure Chujo works on the office LAN as well. He might wear the infosec hat also, which means he’s up to his eyeballs in firewall policy.

I don’t envy my brethren in software development orgs. Been there, done that, got that t-shirt long ago.