Senior DevOps Engineer

Digital Waffle
Nottingham, United Kingdom
13 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
£ 75K

Job location

Remote
Nottingham, United Kingdom

Tech stack

Microsoft Active Directory
Application Performance Management
Azure
Bash
Cloud Computing
Computer Programming
Continuous Integration
DevOps
DNS
Python
Log Analysis
Uptime
Windows Server
Powershell
Reliability Engineering
Ansible
Datadog
Load Balancing
Grafana
Kubernetes
Puppet
Terraform
ELK
Jenkins

Job description

You'll play a key role in ensuring system stability, scalability, and performance across cloud and on-prem environments. Working closely with software and platform teams, you'll design, implement, and manage solutions that improve service reliability and accelerate development delivery.

Key Skills:

  • Azure
  • Terraform
  • Kubernetes
  • Azure DevOps

What You'll Be Doing

  • Managing and improving production environments to ensure uptime, resilience, and scalability.
  • Using automation and infrastructure-as-code to streamline deployments and eliminate manual work.
  • Monitoring and tuning system performance across multiple services and environments.
  • Supporting development teams with deployment pipelines, CI/CD processes, and platform tools.
  • Troubleshooting complex application and infrastructure challenges.
  • Championing observability, incident response, and continuous improvement within SRE practices.

Requirements

  • Strong experience with Microsoft Azure and cloud-native technologies.
  • Deep knowledge of Terraform, Kubernetes, and App Services.
  • Experience building CI/CD pipelines with Azure DevOps.
  • Experience in a Site Reliability, DevOps, or Platform Engineering role.
  • Solid scripting or programming ability (PowerShell, Bash, Python, or similar).
  • Familiarity with monitoring and observability tools such as Datadog, Azure Application Insights, or Log Analytics.
  • Excellent collaboration and communication skills with the ability to work cross-functionally.
  • A proactive mindset with a genuine passion for automation and operational excellence.

Nice to Have

  • Knowledge of Windows Server environments and network fundamentals (DNS, load balancing, Active Directory).
  • Understanding of SLOs, SLIs, and modern incident management frameworks.
  • Familiarity with infrastructure tools such as Ansible, Puppet, Chef, Jenkins, Grafana, or ELK Stack.
  • Awareness of security and compliance best practices in cloud operations.

Apply for this position