Senior Site Reliability Engineer

Crytek GmbH

Role details

Contract type

Temporary contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Frankfurt, Germany

Tech stack

Java

Amazon Web Services (AWS)

C++

Cloud Computing

Profiling

Data Centers

Software Debugging

Disaster Recovery

Python

Network Security

Linux System Administration

Reliability Engineering

Ansible

Prometheus

Scripting (Bash/Python/Go/Ruby)

Computer Network Operations

Performance Testing

Grafana

Caching

Containerization

Deployment Automation

Bare Metal

Terraform

Job description

Crytek is looking for an experienced Senior Site Reliability Engineer to support Hunt: Showdown's NetOps department in our Frankfurt Studio. The person in this position will serve as the key liaison between development teams and the network operations team. They will drive operational excellence, lead infrastructure initiatives, and work closely with production and architecture to ensure systems are highly available, scalable, and efficient. This position includes both operational and strategic responsibilities. This role is based on-site at our headquarters in Frankfurt, Germany, where you'll collaborate with world-class developers and benefit from our attractive relocation package.

Responsibilities

Lead initiatives to improve reliability, scalability, and performance across our live game infrastructure.
Serve as subject matter expert and mentor to junior and mid-level engineers.
Operate and maintain hosted/cloud data-center environments on a daily basis.
Install, configure, and patch system and game software.
Define, monitor, and improve SLIs/SLOs to maintain 99.9%+ uptime.
Own incident response and root cause analysis processes; create and maintain runbooks and playbooks.
Evaluate and implement new technologies, conducting POCs and driving them to production.
Maintain accurate, up-to-date documentation for systems, workflows, and processes.
Lead capacity planning, scaling strategies, and disaster recovery efforts.
Continuously optimize the reliability, observability, and cost efficiency of critical infrastructure.

Requirements

Previous experience as a Site Reliability Engineer, Platform Engineer, or similar role.
Proven experience designing and operating large-scale, high-availability systems.
Strong Linux administration skills.
Experience with containerization and orchestration technologies.
Experience in CI/CD pipelines, automated deployment, and infrastructure as code.
Solid understanding of network security principles.
Hands-on experience with both bare-metal and cloud environments (preferably AWS).
Proficient in automation tools such as Ansible and Terraform.
Skilled with observability tools like OpenTelemetry, Prometheus, Mimir, and Grafana.
Deep understanding of scalability, profiling, debugging, and performance testing.
Strong grasp of web stack fundamentals (REST, HTTP, CDN, caching).
Experience setting up monitoring, metrics, and proactive alerting for production systems (Go, Java, C++).
Proficient in scripting with shell and Python.
Excellent communication and documentation skills in English.
Willingness to relocate to Frankfurt.

Pluses

Experience with Zero Trust Networks, WireGuard, Nomad, MAAS, Foreman.
Knowledge of capacity forecasting and cost optimization for large-scale systems.

About the company

Company Apartment: To help you get settled, we provide you with a fully furnished company apartment during your first three months in Frankfurt.
Public Transport Pass: Discover Frankfurt by bus, tram and metro - free of charge.
Gym Card: A healthy body is a healthy mind. We offer a membership at the premium gym chain Fitness First in Germany. Work out, join group fitness classes, or relax in the wellness facilities.
State-of-the-art Office: We've recently moved into a brand-new, modern office located in the heart of Frankfurt. Our new workspace is designed to inspire creativity and collaboration, with open areas, quiet zones, and top-tier facilities - all just steps away from public transport, restaurants, bars and cultural hotspots.
International Environment: We truly embody diversity at Crytek. With employees from over 42 different countries, we define ourselves by our cultural diversity.
German Classes: Understanding the local culture will make your stay abroad more enjoyable, and Crytek supports that by offering German language courses for you and your family.
Events: Join us at our exciting company events, including new starter breakfasts, summer and winter parties, our annual trip to Gamescom in Cologne, and many more!
Vacation Days