Site Reliability Engineer
CBS Butler Limited
West Bletchley, United Kingdom
2 days ago
Role details
Contract type: Temporary contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Compensation: £101K
Job location: West Bletchley, United Kingdom
Tech stack
.NET
Amazon Web Services (AWS)
Azure
Bash
Cloud Computing
Continuous Integration
Linux
Distributed Systems
Event-Driven Programming
GitHub
Python
PCI Data Security Standards
PowerShell
Reliability Engineering
Prometheus
Message Oriented Middleware
Scripting (Bash/Python/Go/Ruby)
Istio
Grafana
Concourse
Git Flow
Kubernetes
Rancher
CloudWatch
API Gateway
Terraform
Go
Job description
- Operate and enhance our Kubernetes platform across AWS, Azure, and on-prem.
- Lead incident response, problem management, and root cause analysis.
- Deliver cluster lifecycle work: upgrades, patching, node pools, CNI/CSI, ingress, and Rancher operations.
- Own observability, dashboards, alerting, and SLOs/SLIs.
- Implement GitOps (Fleet) and reduce toil through automation and strong governance.
- Apply secure API gateway and WAF patterns.
- Work with distributed system patterns, including event brokers and asynchronous messaging.
- Maintain security posture: CVE remediation, GRC controls, scanning pipelines.
Requirements
- Deep knowledge of Kubernetes, Rancher, GitOps, Linux, and cloud networking.
- Understanding of API gateway and WAF patterns.
- Experience with distributed systems and event-driven architectures.
- Strong automation/scripting (Python, Go, Bash, PowerShell, .NET).
- IaC:
  - Terraform for foundational/bootstrap cluster provisioning.
  - Crossplane as an orchestration layer (leveraging Terraform providers).
- Ability to work securely within PCI DSS / GDPR patterns.
- CI/CD: Concourse, GitHub Actions, Azure DevOps.
- Observability: Grafana, Prometheus, Jaeger/Tempo, CloudWatch, Loki, OpenTelemetry.
Nice to have
- AWS operational experience.
- Service mesh (Istio/Kuma).
- Hybrid cloud experience (AWS + Azure + on-prem).
- Payments or regulated industry background.
About the company
Join a leading global IT consultancy and digital transformation organisation at the forefront of cloud, automation and secure platform engineering. We're looking for a Kubernetes-first engineer who wants to own and evolve a modern, enterprise-scale platform spanning AWS, Azure and on-prem. This is a hands-on role with real influence over reliability, security and architecture.