Site Reliability Engineer

Vitoriagasteiz
Legutio, Spain
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
€ 65K

Job location

Remote
Legutio, Spain

Tech stack

Amazon Web Services (AWS)
Amazon Web Services (AWS)
Cloud Computing
Continuous Integration
Disaster Recovery
Github
Redis
Datadog
Cloud Platform System
Delivery Pipeline
Kubernetes
Kafka
Terraform

Requirements

About the Opportunity This is not a maintenance-focused role. My client is looking for someone who enjoys solving complex infrastructure challenges, improving reliability, and building internal platforms that help engineering teams move faster with confidence. - Spain · 100% Remote - Senior level (+5 years of experience) - ️ Platform Engineering team - Up to 65k (according to profile) What You'll Work On - Observability & Incident Response - Improve monitoring, alerting, and operational visibility across the platform to help teams detect and resolve issues faster. - Disaster Recovery & Reliability - Strengthen disaster recovery practices, platform resilience, and recovery processes across critical infrastructure. - CI/CD & Delivery Workflows - Improve deployment workflows, automation, and platform delivery practices to enable safer and faster releases. - Cloud Platform & Kubernetes - Work on Kubernetes infrastructure, scalability, networking, and reliability across a modern AWS-based platform. - FinOps & Infrastructure Efficiency - Help engineering teams improve infrastructure visibility, scalability, and cost efficiency across cloud environments. Tech Stack - Cloud & Infrastructure: AWS, Kubernetes (EKS), Terraform / OpenTofu - Observability: Datadog - CI/CD & GitOps: GitHub Actions, ArgoCD, Helm, Kustomize - Data & Platform: Kafka, Redis, OpenSearch, RDS, S3 What We're Looking For - Strong production experience with AWS - Hands-on Kubernetes experience in production environments - Experience with Terraform or OpenTofu - Strong understanding of observability and monitoring practices - Experience with CI/CD pipelines and modern deployment workflows - Solid SRE fundamentals around reliability, incidents, and recovery - Strong ownership mentality and problem-solving skills Additional Information - Permanent remote setup from Spain - Shared on-call rotation - International engineering environment - Fast-growing product company with strong technical challenges Open to engineers looking for high-impact infrastructure ownership inside a modern cloud-native environment.

Apply for this position