Senior DevOps Engineer

European Recruitment
Municipality of Vitoria-Gasteiz, Spain
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Remote
Municipality of Vitoria-Gasteiz, Spain

Tech stack

API
Amazon Web Services (AWS)
Continuous Integration
DevOps
GitHub
Prometheus
Software Deployment
Grafana
Infrastructure as Code (IaC)
Kubernetes
Kafka
Terraform
Microservices

Job description

A leading scheduling platform, used by over 30 million people, is hiring a Senior DevOps Engineer to work remotely in Spain. We are looking for a Senior DevOps Platform Engineer who is not just an operator, but a true builder, architect, and continuous learner. If you thrive on owning complex systems and enjoy enabling teams to ship features with speed and confidence, this is the role for you. You will be responsible for ensuring the platform remains stable, reliable, highly optimised, and a first-class experience for your fellow engineers.

What You Will Be Doing

Infrastructure as Code (IaC) & GitOps: Own and evolve the platform using a modern, fully GitOps-driven, Kubernetes-native approach. Our core stack includes Flux2, Crossplane, and a mix of first- and third-party Kubernetes operators.

Kubernetes Architecture: Operate, scale, and secure complex microservices architectures on Kubernetes. You will be the go-to expert supporting product teams with application deployment best practices.

Cluster Lifecycle Management: Provision, upgrade, and maintain the health of our Kubernetes clusters using tools like kOps and Cluster API.

AWS Cloud Infrastructure: Manage and provision robust cloud resources on AWS leveraging Crossplane and Terraform.

CI/CD Engineering: Design, build, and maintain fast, reliable CI pipelines using GitHub Actions to support safe, frequent, and automated releases.

Observability & Telemetry: Implement, refine, and champion world-class monitoring, tracing, and profiling. Our toolkit includes Prometheus, Grafana, Jaeger, and Pyroscope.

Incident Response: Lead incident management, conduct thorough root cause analysis, and drive long-term systemic improvements to enhance platform resilience.

Platform Optimization: Continuously seek out and implement improvements focused on developer experience, cost efficiency, and security hardening.

Requirements

Kubernetes Mastery: At least 3 years of dedicated, hands-on experience operating Kubernetes in a production environment. You possess a deep understanding of its internals, networking, and operational best practices.

Ownership Mentality: You view the platform as a crucial product for both developers and end-users, taking full responsibility for its reliability and performance.

Pragmatic Engineering Mindset: A track record of building reliable, understandable systems that are easy for others to use and maintain.

Nice to Have

Kafka Expertise: Familiarity with operating and scaling Kafka in a microservices environment is a strong plus. We use the Strimzi Kafka Operator.

Controller Development: Experience building or contributing to Kubernetes controllers (ideally in Go), including developing custom controllers from scratch.

About the company

Join a rapidly expanding SaaS company that is revolutionising its sector by developing an innovative suite of AI-powered products!
