Site Reliability Engineer

Municipality of Madrid, Spain
11 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
€ 80K

Job location

Municipality of Madrid, Spain

Tech stack

Amazon Web Services (AWS)
Cloud Computing
Computer Programming
Software Design Patterns
DevOps
Distributed Systems
Fault Tolerance
Reliability Engineering
Cloud Platform System
Reliability of Systems
Kubernetes
Terraform

Job description

  • Own the availability and performance of the platform services.
  • Design and implement automation solutions for operational tasks.
  • Develop observability solutions to identify performance issues.
  • Lead incident response and drive continuous improvement.
  • Analyze system performance data for capacity planning.
  • Define and maintain Service Level Objectives.
  • Optimize cloud resource utilization to reduce costs.
  • Collaborate with engineering teams to improve system reliability.

Requirements

A global cybersecurity leader is seeking a Site Reliability Engineer in Madrid, Spain. The role involves ensuring the reliability and performance of a massive-scale platform while improving system observability and automating operational tasks. The ideal candidate has experience in Site Reliability Engineering or DevOps, is proficient in programming (preferably Go), and has deep knowledge of cloud infrastructure (AWS or GCP). This position offers a competitive salary, benefits, and a vibrant workplace culture.

  • Experience in Site Reliability Engineering, DevOps, or supporting large-scale distributed systems.
  • Strong programming skills in at least one language (preferably Go).
  • Hands-on experience with major cloud platform services (AWS or GCP).
  • Understanding of distributed system design patterns and fault tolerance.
  • Proficiency with Infrastructure as Code tools like Terraform.
  • Experience with monitoring and observability tools.
  • Proven incident management track record.

  • Site Reliability Engineering
  • Programming in Go
  • Deep cloud expertise
  • Distributed systems knowledge
  • Infrastructure as Code with Terraform
  • Container orchestration with Kubernetes
  • Observability expertise
  • CI/CD pipelines
  • Data-driven approach
  • Strong communication skills
