Cloud Operations Engineer

Anson McCade
Cheltenham, United Kingdom
6 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Junior
Compensation
£50K

Job location

Cheltenham, United Kingdom

Tech stack

Cloud Computing
Cloud Computing Security
Cloud Engineering
Linux
Monitoring of Systems
Linux System Administration
Kubernetes
Information Technology
Terraform

Job description

We are hiring multiple Cloud Operations Engineers and Lead Engineers to join a highly secure, mission-critical cloud operations team.

The role is open to a broad range of backgrounds, including Computer Science graduates, Linux-focused infrastructure engineers, Kubernetes/platform engineers, and individuals from live service or service desk environments with strong incident management experience.

This is a hands-on operational engineering role focused on maintaining stability, availability, and performance of a complex, secure cloud platform operating at scale.

  • Provide frontline operational support for secure cloud infrastructure and platform users
  • Troubleshoot and resolve critical incidents across live production systems
  • Lead or support incident response, escalation, and coordination during shifts
  • Operate within a 24/7 rota supporting high-priority workloads and services
  • Follow, maintain, and improve operational runbooks and incident procedures
  • Identify opportunities to reduce operational toil and improve service reliability
  • Support mentoring and knowledge sharing for junior engineers (senior roles)
  • Engage with internal stakeholders and third parties during critical incidents

Technical Environment

  • Linux (strong hands-on experience required)
  • Kubernetes (deployment, troubleshooting, and platform support)
  • Infrastructure as Code (Terraform or similar tools)
  • Cloud-native networking and system troubleshooting
  • Observability and monitoring tools
  • APIs and integration services
  • Secure, restricted, air-gapped cloud environments

Requirements

  • Strong experience working with Linux-based systems in production environments
  • Background in live service support, infrastructure operations, or platform engineering
  • Experience troubleshooting system, application, or network-level issues
  • Exposure to Kubernetes and/or containerised environments
  • Understanding of infrastructure, networking, and operational support principles
  • Ability to operate in high-pressure, incident-driven environments
  • Willingness to learn and operate within highly secure cloud architectures

Desirable Experience

  • Kubernetes administration or advanced troubleshooting experience
  • Infrastructure as Code experience (Terraform or similar)
  • Exposure to observability and monitoring platforms
  • Experience working in 24/7 operational environments
  • Prior experience coordinating shifts or leading small technical teams

deep expertise in secure cloud operations, Kubernetes platforms, and large-scale infrastructure engineering.

Benefits & conditions

Package: Competitive salary depending on experience plus shift allowance
