Site Reliability Engineer

gridscale GmbH
Köln, Germany
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Köln, Germany

Tech stack

Bash
Unix
Cloud Computing
Linux
Distributed Systems
Fault Tolerance
Identity and Access Management
Systems Analysis
Python
Key Management
OpenStack
Reliability Engineering
Ansible
Software Systems
Kubernetes
Infrastructure Automation Frameworks
Terraform
Go

Job description

As a Site Reliability Engineer, you'll be part of a team responsible for building the framework to package and deploy software solutions on top of OPCP. You'll work on the conception, automation, and operation of our platform and drive continuous improvements. You'll help new teams onboard with the framework and OPCP. We're looking for someone who is comfortable in a security-oriented environment with a high degree of automation (GitOps). You are used to maintaining an overview of ambiguous situations, analyzing systems, and making well-founded decisions based on this analysis, * Drive the ongoing development of our Kubernetes stack and implement GitOps workflows (e.g., with FluxCD)

  • Support other teams and help them onboard with OPCP
  • Actively participate in system analysis and derive improvements, even when initial requirements are unclear
  • Develop and maintain our Infrastructure as Code using tools like Ansible and Terraform.
  • Participate in an on-call rotation.

Requirements

Do you have experience in UNIX?, Do you have a Bachelor's degree?, * Deep knowledge of Linux/Unix system administration and internals

  • Hands-on experience with cloud platforms, especially OpenStack, including infrastructure provisioning and management
  • Proficiency with Infrastructure as Code tools such as Terraform and Ansible
  • Experience with containerization and orchestration technologies, especially Kubernetes
  • Skilled in building and maintaining CI/CD pipelines
  • Hands-on experience with modern observability stacks, including metrics, logs and traces
  • Proficiency in scripting and automation using Python, Bash, and Go
  • Understanding of security fundamentals including IAM, secrets management, hardening and compliance
  • Understanding of distributed systems concepts such as the CAP theorem, consensus and fault tolerance
  • Strong problem-solving skills, adaptability and strict documentation discipline
  • Ability to thrive in collaborative product-team environments with a strong ownership mentality and a blameless culture mindset
  • Strong communication skills and a passion for continuous improvement

Benefits & conditions

Pulled from the full job description

  • Gym membership
  • Flexible schedule, * Exciting work in a highly innovative and international environment with cutting-edge technologies
  • 32 vacation days, increasing with length of service
  • Flexible working hours, home office options, and a secure permanent position with market- and performance-based compensation
  • Employer-funded pension plan and an attractive insurance package
  • OVHcloud covers 50% of public transportation costs
  • Up to €400 annual financial contribution from OVHcloud towards sports activities (gym membership, sports classes, etc.)
  • Through Corporate Benefits, you receive attractive discounts at numerous shops and companies
  • We contribute to the leasing of your cargo bike
  • Regular company events and free cold and hot beverages

About the company

At gridscale we build and maintain cloud-infrastructure of the next generation for our customers. Our ambition is to provide the world's simplest and most reliable IaaS and PaaS solution. 

gridscales’ intuitive interface design empowers people without in-depth IT knowledge to manage their IT infrastructure while a Kubernetes environment facilitates the management of cloud-native workloads. 

Apply for this position