Site Reliability Engineer (all genders)
envelio GmbH
Köln, Germany
6 days ago
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English, German
Job location: Remote; Köln, Germany
Tech stack
Agile Methodologies
Systems Engineering
Cloud Computing
Continuous Delivery
Continuous Integration
Distributed Systems
Monitoring of Systems
Python
PostgreSQL
Linux System Administration
Systems Development Life Cycle
RabbitMQ
Redis
Reliability Engineering
Ansible
Prometheus
TypeScript
Datadog
SaltStack
Grafana
Multi-Cloud
GitLab
CloudFormation
GitLab CI
Kubernetes
Infrastructure Automation Frameworks
Terraform
Docker
Job description
How You Make an Impact
- You maintain Kubernetes clusters across multiple clouds and on-premise environments, ensuring they are reliable, secure, and cost-effective
- You develop and maintain infrastructure-as-code (Terraform, SaltStack) to manage 100+ customer instances with layered configuration
- You design and maintain observability (monitoring, alerting, SLOs) so that production issues surface early and are resolved quickly
- You own and evolve secrets management, certificate automation, and security tooling across the platform
- You reduce operational toil through automation, better tooling, and solid runbooks
- You participate in incident response and root cause analysis, and you drive follow-ups so the same issues do not recur
- You collaborate with development squads and the Operations team to improve the overall reliability of the IGP
What We Offer
- Join us fully remote or at our lovely office in Cologne in a hybrid working mode
- Option for remote work from abroad (up to three months per year from anywhere in the EU or the USA)
- State-of-the-art technology and a modern tech stack
- Excellent hardware equipment (16-inch MacBooks, 2 screens at your workplace)
- 30 vacation days + 3 corporate holidays
- Support for your health through sports membership partnerships
- Flexible use of a monthly mobility budget (e.g. JobRad, public transport)
- Time and resources for individual growth
- envelio pension plan
- Regular company and team events
Requirements
- You have proven experience running production workloads on Kubernetes in a cloud or hybrid environment
- You are comfortable with Linux administration, networking, and distributed systems
- You have hands-on experience with infrastructure-as-code tools such as Terraform or CloudFormation
- You have worked with configuration management tools like SaltStack, Ansible, or Chef
- You have experience with container and orchestration technology (Docker, Kubernetes, Helm) in production
- You understand monitoring and observability and have worked with tools like Datadog, Prometheus, or Grafana
- You communicate effectively in asynchronous, remote-first environments
- You are curious, enjoy learning, and are open to using AI tools in your daily work
- You are business-fluent in English (Level C1)
- You have experience as a software developer, ideally with languages like Python or Go
- Nice to have: German language skills
How We Develop Software
- Agile way of working with Kanban, in cooperation across all squads
- Continuous integration / Continuous delivery
- Working in small batches with fast reviews
- Knowledge sharing sessions between developers
- "You Code It, You Own It": team responsibility for specific functional areas of the product
- Blameless post-mortems and culture of continuous improvement
Our Tech Stack
- Multi-cloud, hybrid on-prem setup with Kubernetes and Helm as the common denominator
- Application primarily written in Python and TypeScript
- Standard backing services like PostgreSQL, RabbitMQ, Redis
- GitLab & GitLab CI for managing the Software Delivery Lifecycle
- Terraform for Infrastructure as Code