Senior DevOps Engineer
Digital Waffle
Nottingham, United Kingdom
13 days ago
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Compensation: £75K
Job location: Remote, Nottingham, United Kingdom
Tech stack
Microsoft Active Directory
Application Performance Management
Azure
Bash
Cloud Computing
Computer Programming
Continuous Integration
DevOps
DNS
Python
Log Analysis
Uptime
Windows Server
Powershell
Reliability Engineering
Ansible
Datadog
Load Balancing
Grafana
Kubernetes
Puppet
Terraform
ELK
Jenkins
Job description
You'll play a key role in ensuring system stability, scalability, and performance across cloud and on-prem environments. Working closely with software and platform teams, you'll design, implement, and manage solutions that improve service reliability and accelerate software delivery.
Key Skills:
- Azure
- Terraform
- Kubernetes
- Azure DevOps
What You'll Be Doing
- Managing and improving production environments to ensure uptime, resilience, and scalability.
- Using automation and infrastructure-as-code to streamline deployments and eliminate manual work.
- Monitoring and tuning system performance across multiple services and environments.
- Supporting development teams with deployment pipelines, CI/CD processes, and platform tools.
- Troubleshooting complex application and infrastructure challenges.
- Championing observability, incident response, and continuous improvement within SRE practices.
Requirements
- Strong experience with Microsoft Azure and cloud-native technologies.
- Deep knowledge of Terraform, Kubernetes, and App Services.
- Experience building CI/CD pipelines with Azure DevOps.
- Experience in a Site Reliability, DevOps, or Platform Engineering role.
- Solid scripting or programming ability (PowerShell, Bash, Python, or similar).
- Familiarity with monitoring and observability tools such as Datadog, Azure Application Insights, or Log Analytics.
- Excellent collaboration and communication skills with the ability to work cross-functionally.
- A proactive mindset with a genuine passion for automation and operational excellence.
Nice to Have
- Knowledge of Windows Server environments and network fundamentals (DNS, load balancing, Active Directory).
- Understanding of SLOs, SLIs, and modern incident management frameworks.
- Familiarity with infrastructure tools such as Ansible, Puppet, Chef, Jenkins, Grafana, or ELK Stack.
- Awareness of security and compliance best practices in cloud operations.