Lead Site Reliability Engineer

Holland & Barrett
Charing Cross, United Kingdom
1 month ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Charing Cross, United Kingdom

Tech stack

Amazon Web Services (AWS)
Bash
Cloud Computing
Computer Programming
Distributed Systems
Amazon DynamoDB
Identity and Access Management
Python
Load Testing
Reliability Engineering
Prometheus
Datadog
Grafana
CloudFormation
Build Tools
Functional Programming
Terraform
DevSecOps
Go

Job description

  • Architect and improve cloud-native systems with reliability as a first-class principle.
  • Shape SLIs/SLOs, error budgets, capacity planning, and performance strategies.
  • Continuously evolve availability, efficiency, and resilience across our platforms.

Technical Leadership That Raises the Bar

  • Mentor SREs, platform engineers, and developers across the organisation.
  • Champion automation, observability, DevSecOps, and modern operational practices.
  • Influence engineering culture and architectural direction.

Operational Excellence

  • Own and lead high-severity incident response with calm, clarity, and technical depth.
  • Run world-class post-incident reviews and drive meaningful, measurable improvements.
  • Strengthen monitoring, alerting, on-call practices, and reliability processes.
  • Support resilience validation through load testing, stress testing, and chaos engineering.

Automation, Tooling & Engineering Efficiency

  • Build tools and automation that remove toil and accelerate teams.
  • Develop CI/CD pipelines and Infrastructure-as-Code environments.
  • Drive consistency, repeatability, and self-service across engineering.

Cross-Team Collaboration

  • Partner with Security, Platform, and Engineering teams to align reliability with security and resilience goals.
  • Lead teams toward better design, operational readiness, and measurable service health.
  • Contribute to documentation, runbooks, and operational processes that scale.

Requirements

  • 5-8+ years in SRE, Platform, Cloud Infrastructure, or operational engineering roles.
  • Hands-on experience architecting and improving large-scale, distributed systems.
  • Strong coding proficiency in Python, Go, Bash, or similar automation-focused languages.
  • Expertise with observability stacks: Datadog, Prometheus, Grafana, OpenTelemetry.
  • Deep AWS experience across EC2, EKS, Lambda, VPC, DynamoDB, S3, CloudFront, RDS, IAM, KMS, and more.
  • Proficiency with Terraform, CloudFormation, or AWS CDK.
  • Incident response leadership and root-cause analysis expertise.
  • Excellent documentation and communication skills.
  • Strong analytical and troubleshooting abilities.

Bonus

  • Experience mentoring or leading engineers within SRE or platform teams.
  • Experience with load testing, stress testing, and chaos engineering.
  • A passion for uplifting engineering culture through tooling, automation, and reliability-first thinking.

Benefits & conditions

  • A modern engineering culture built on autonomy, experimentation, and learning.
  • The chance to create real impact across critical customer and internal platforms.
  • A collaborative team that values innovation, continuous improvement, and technical excellence.

If you're ready to lead reliability for platforms with massive real-world impact, we'd love to meet you.

  • Health Cash Plan
  • Life Assurance
  • Bonus Scheme - Based on company & personal performance
  • Virtual GP
  • Private Medical Care
  • FREE at-home blood test kit
  • Holiday Purchase option
  • Pension Contribution scheme
  • Access to 'Wellhub' with gyms, studios and wellbeing apps

Discounts & Savings

  • 25% Colleague Discount with FREE Standard Delivery
  • Exclusive Discounts from a wide range of partners
  • £/€50 Annual Product Allowance to spend in store

About the company

Holland & Barrett is an equal opportunity employer. We welcome diverse perspectives and are committed to creating an inclusive environment for all colleagues. We understand that when our colleagues are listened to, respected and valued for who they are, we build an organisation with belonging at its heart, making health and wellness a way of life for everyone.
