DevOps Engineer
Role details
Job location
Tech stack
Job description
We are seeking an experienced DevOps Engineer to join a high-performing project team in Central London. In this role, you will take technical ownership of cloud and on-premises infrastructure, providing leadership and mentorship to junior engineers while driving the architecture and evolution of a critical customer platform.
You will act as a senior technical authority, collaborating directly with customers and stakeholders to shape infrastructure strategy, ensure operational excellence, and deliver robust, scalable solutions. This role includes participation in an out-of-hours support rota for the production environment.
Key Responsibilities Lead the design, build, and continuous improvement of infrastructure solutions, setting technical direction for the team. Own cloud and on-premises environments end-to-end, ensuring security, resilience, and performance. Mentor and support junior DevOps engineers, conducting code and infrastructure reviews. Drive platform automation initiatives to reduce manual toil and improve deployment pipelines. Act as an escalation point for complex operational incidents, leading root cause analysis and remediation. Collaborate with customers and internal stakeholders to define infrastructure requirements and present technical solutions. Participate in the scheduled out-of-hours production support rota. Travel to data centres and offices as required (approximately 3-4 times per month).
Essential Skills and Experience AWS and Microsoft Azure: Deep hands-on expertise across both platforms, including architecture, cost management, and security best practices. On-premises/VMware: Proven experience designing and operating hybrid cloud environments. Containers and Orchestration: Expert-level knowledge of Docker and Kubernetes, including production-grade cluster management. CI/CD: Extensive experience architecting and maintaining pipelines with Jenkins or Azure DevOps. Version Control: Advanced use of GitLab, including branching strategies and pipeline integration. Programming/Scripting: Strong proficiency in Python, Go, and/or Bash; ability to write maintainable, production-quality code. GNU/Linux: Deep system-level understanding, including performance tuning and hardening. Windows Server: Proficiency in Windows Server 2016+ including ADFS, ADCS, and ADDS. Networking: Strong understanding of IP networking, physical network topology, and network security.Desirable Skills and Experience
Experience with monitoring and observability tooling such as Prometheus, Kibana, Nagios, or Splunk. Familiarity with backup and recovery platforms such as CommVault. Experience working in security-cleared or regulated environments (SC/DV). Relevant certifications (eg AWS Solutions Architect, CKA, Azure DevOps Expert). Strong communication and stakeholder management skills, with the ability to present complex technical concepts clearly. Experience contributing to or leading incident management and post-incident review processes.
What We Offer A collaborative, inclusive, and supportive working culture where your expertise is valued. Hybrid working arrangements with genuine flexibility. Meaningful opportunities for technical leadership and career progression. Access to training, certifications, and professional development in a wide range of technologies. The opportunity to work on high-impact projects with national significance
Requirements
AWS and Microsoft Azure: Deep hands-on expertise across both platforms, including architecture, cost management, and security best practices. On-premises/VMware: Proven experience designing and operating hybrid cloud environments. Containers and Orchestration: Expert-level knowledge of Docker and Kubernetes, including production-grade cluster management. CI/CD: Extensive experience architecting and maintaining pipelines with Jenkins or Azure DevOps. Version Control: Advanced use of GitLab, including branching strategies and pipeline integration. Programming/Scripting: Strong proficiency in Python, Go, and/or Bash; ability to write maintainable, production-quality code. GNU/Linux: Deep system-level understanding, including performance tuning and hardening. Windows Server: Proficiency in Windows Server 2016+ including ADFS, ADCS, and ADDS. Networking: Strong understanding of IP networking, physical network topology, and network security.Desirable Skills and Experience
Experience with monitoring and observability tooling such as Prometheus, Kibana, Nagios, or Splunk. Familiarity with backup and recovery platforms such as CommVault. Experience working in security-cleared or regulated environments (SC/DV). Relevant certifications (eg AWS Solutions Architect, CKA, Azure DevOps Expert). Strong communication and stakeholder management skills, with the ability to present complex technical concepts clearly. Experience contributing to or leading incident management and post-incident review processes.
Benefits & conditions
A collaborative, inclusive, and supportive working culture where your expertise is valued. Hybrid working arrangements with genuine flexibility. Meaningful opportunities for technical leadership and career progression. Access to training, certifications, and professional development in a wide range of technologies. The opportunity to work on high-impact projects with national significance