Cloud Operations Engineer
Role details
Job location
Tech stack
Job description
We are hiring multiple Cloud Operations Engineers and Lead Engineers to join a highly secure, mission-critical cloud operations team.
The role is open to a broad range of backgrounds, including Computer Science graduates, Linux-focused infrastructure engineers, Kubernetes/platform engineers, and individuals from live service or service desk environments with strong incident management experience.
This is a hands-on operational engineering role focused on maintaining stability, availability, and performance of a complex, secure cloud platform operating at scale., * Provide frontline operational support for secure cloud infrastructure and platform users
- Troubleshoot and resolve critical incidents across live production systems
- Lead or support incident response, escalation, and coordination during shifts
- Operate within a 24/7 rota supporting high-priority workloads and services
- Follow, maintain, and improve operational runbooks and incident procedures
- Identify opportunities to reduce operational toil and improve service reliability
- Support mentoring and knowledge sharing for junior engineers (senior roles)
- Engage with internal stakeholders and third parties during critical incidents
Technical Environment
- Linux (strong hands-on experience required)
- Kubernetes (deployment, troubleshooting, and platform support)
- Infrastructure as Code (Terraform or similar tools)
- Cloud-native networking and system troubleshooting
- Observability and monitoring tools
- APIs and integration services
- Secure, restricted, air-gapped cloud environments
Requirements
- Strong experience working with Linux-based systems in production environments
- Background in live service support, infrastructure operations, or platform engineering
- Experience troubleshooting system, application, or network-level issues
- Exposure to Kubernetes and/or containerised environments
- Understanding of infrastructure, networking, and operational support principles
- Ability to operate in high-pressure, incident-driven environments
- Willingness to learn and operate within highly secure cloud architectures
Desirable Experience
- Kubernetes administration or advanced troubleshooting experience
- Infrastructure as Code experience (Terraform or similar)
- Exposure to observability and monitoring platforms
- Experience working in 24/7 operational environments
- Prior experience coordinating shifts or leading small technical teams
deep expertise in secure cloud operations, Kubernetes platforms, and large-scale infrastructure engineering.
Benefits & conditions
Package: Competitive salary depending on experience plus shift allowance