Some quick facts:
- We're running around 400 physical servers
- A quarter of those make up our growing OpenStack cluster, to which we are migrating the bulk of the other 75%
- 130 million unique visitors hit our web platform every month, leading to daily peak traffic hitting 1 Gbps
- This excludes all the cacheable traffic, which mainly lives on several CDNs through which we serve over 3PB a month
- We are using Puppet to manage all this, together with a lot of in-house tooling
The job
As Linux System Engineer at Spil Games, you will be part of the Hosting team, responsible for enhancing and maintaining our hosting platform.
You will need to identify any changes that are needed to ensure stability, reliability and growth of our platform and our sites. You will be part of the team that ensures 24/7 availability while avoiding and actively removing technical debt and SPOFs. The duties of this role include:
- Responding to and resolving incidents and alerts
- Maximizing the performance and availability of our services
- Executing growth and capacity planning
- Communicating with stakeholders to support their use of our hosting platform
- Working on interesting projects with the latest technologies and hardware
About Hosting
The Hosting team consists of 8-10 hard-working colleagues that work closely together with the Development departments as well as the Data teams. The core of our stack is our growing OpenStack cluster supporting over 125 projects plus over 30TB of object storage in the Swift cluster. Besides this, a large fleet of servers running MySQL, Hadoop and Memcache are running. All of this is managed with Puppet, Ansible, and a lot of dedication! It is defended by firewalls, ACLs, common sense, and Nerf guns.
We operate on values and principles rather than rules, policies, and procedures. This means that you'll have a lot of freedom in terms of doing what you think is best, but it also comes with great individual responsibility. We encourage people to be different and to think for themselves!
Some things we're big on: innovation, making mistakes (to learn from), elegant solutions to technical challenges, openness and transparency, open source, lifelong learning, Python, and League of Legends.
Things we're not fans of: bureaucracy, making the same mistakes again, slowness, repetition, hierarchy, Java, and Internet Explorer.