Site Reliability Engineer

Vitoriagasteiz

Legutio, Spain

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

€ 65K

Job location

Remote

Legutio, Spain

Tech stack

Amazon Web Services (AWS)

Cloud Computing

Continuous Integration

Disaster Recovery

Github

Redis

Datadog

Cloud Platform System

Delivery Pipeline

Kubernetes

Kafka

Terraform

Requirements

About the Opportunity This is not a maintenance-focused role. My client is looking for someone who enjoys solving complex infrastructure challenges, improving reliability, and building internal platforms that help engineering teams move faster with confidence. - Spain · 100% Remote - Senior level (+5 years of experience) - ️ Platform Engineering team - Up to 65k (according to profile) What You'll Work On - Observability & Incident Response - Improve monitoring, alerting, and operational visibility across the platform to help teams detect and resolve issues faster. - Disaster Recovery & Reliability - Strengthen disaster recovery practices, platform resilience, and recovery processes across critical infrastructure. - CI/CD & Delivery Workflows - Improve deployment workflows, automation, and platform delivery practices to enable safer and faster releases. - Cloud Platform & Kubernetes - Work on Kubernetes infrastructure, scalability, networking, and reliability across a modern AWS-based platform. - FinOps & Infrastructure Efficiency - Help engineering teams improve infrastructure visibility, scalability, and cost efficiency across cloud environments. Tech Stack - Cloud & Infrastructure: AWS, Kubernetes (EKS), Terraform / OpenTofu - Observability: Datadog - CI/CD & GitOps: GitHub Actions, ArgoCD, Helm, Kustomize - Data & Platform: Kafka, Redis, OpenSearch, RDS, S3 What We're Looking For - Strong production experience with AWS - Hands-on Kubernetes experience in production environments - Experience with Terraform or OpenTofu - Strong understanding of observability and monitoring practices - Experience with CI/CD pipelines and modern deployment workflows - Solid SRE fundamentals around reliability, incidents, and recovery - Strong ownership mentality and problem-solving skills Additional Information - Permanent remote setup from Spain - Shared on-call rotation - International engineering environment - Fast-growing product company with strong technical challenges Open to engineers looking for high-impact infrastructure ownership inside a modern cloud-native environment.

Role details

Job location

Tech stack

Requirements

Apply for this position

Good distractions

Moments

Videos View all