Site Reliability Engineer (SRE)
Role details
Job location
Tech stack
Job description
We are looking for an experienced Site Reliability Engineer (DV, NPPV3 cleared) to support highly secure, business-critical systems in West Midlands, working across Linux environments, cloud and on-prem infrastructure, container platforms (Docker/Kubernetes), CI/CD pipelines and enterprise monitoring tools., * Maintain and improve the reliability, availability, scalability and performance of critical production systems.
- Design, implement and optimise monitoring, alerting and observability solutions.
- Lead incident management, root cause analysis and post-incident reviews in line with SRE best practices.
- Build and enhance CI/CD pipelines to support safe, repeatable deployments.
- Automate operational tasks to reduce toil and improve system stability.
- Work closely with DevOps, Platform, Security and Development teams in a highly regulated environment.
- Support capacity planning, performance tuning, disaster recovery and business continuity planning.
- Produce and maintain clear technical documentation and operational runbooks.
Requirements
- Proven experience working as a Site Reliability Engineer, DevOps Engineer or similar role.
- Active DV clearance and NPPV3 clearance (both essential).
- Strong Linux system administration experience in enterprise environments.
- Experience with cloud and platform technologies such as AWS, Azure or private cloud platforms.
- Hands-on experience with containerisation and orchestration (Docker, Kubernetes, OpenShift).
- Strong knowledge of monitoring and observability tools such as Prometheus, Grafana, ELK/OpenSearch, Splunk or Datadog.
- Experience with infrastructure as code tools including Terraform, Ansible, Puppet or Chef.
- Scripting and automation skills using Bash, Python, PowerShell or Go.
- Experience supporting CI/CD tooling such as Jenkins, GitLab CI, GitHub Actions or Azure DevOps.
- Strong understanding of networking, security principles and access controls in secure environments.
- Ability to troubleshoot complex, distributed systems under pressure.
If you are a Site Reliability Engineer (DV, NPPV3 cleared) looking for a high-impact contract in a secure Birmingham-based environment, we would love to hear from you. Apply today - professional references required.