Site Reliability Engineer

Charles Simon Associates Ltd
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
£ 95K

Job location

Remote

Tech stack

Application Performance Management
Azure
Bash
Cloud Computing Security
Distributed Systems
Python
Load Testing
Log Analysis
Powershell
Reliability Engineering
Web Applications
Datadog
Pulumi
Scripting (Bash/Python/Go/Ruby)
Grafana
Cloudformation
Kubernetes
Azure
Puppet
Terraform
Microservices

Job description

Site Reliability Engineer - (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) - Permanent - Remote, * Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance.

  • Building and maintaining monitoring / observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics).
  • Managing Infrastructure as Code (Terraform, Pulumi, CloudFormation) for scalable, repeatable deployments.
  • Automating with PowerShell, Python, or Bash to drive efficiency.
  • Supporting Kubernetes and AKS environments in production.
  • Leading incident response, postmortems, and continuous improvement processes.
  • Driving cost optimisation, capacity planning, and load testing.
  • Championing best practices in cloud security and resilience.

Requirements

  • Proven Site Reliability Engineering background.

  • Strong Terraform skills with live environment deployment.

  • Kubernetes / AKS expertise.

  • Scripting in PowerShell, Python or Bash.

  • Monitoring experience (Datadog preferred, Azure or Grafana considered).

  • Background in web applications and distributed systems. Desirable Skills :

  • Knowledge of Microservices Architecture.

  • Familiarity with Kanban.

  • Experience with Puppet or Chef If you're passionate about Site Reliability Engineering and want to work in an environment where "that will do" is never good enough, this role is for you. Site Reliability Engineer - (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) - Permanent - Remote

Apply for this position