Principal Site Reliability Engineer x5 (SC Cleared) in Nationwide

Energy Jobline
Olathe, United States of America
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 90K

Job location

Olathe, United States of America

Tech stack

Bash
Databases
Continuous Integration
Python
PostgreSQL
Openshift
Red Hat Enterprise Linux - RHEL
Reliability Engineering
Site Reliability Engineering Practices
Prometheus
Working Model 2D
Istio
Grafana
Multi-Cloud
Kubernetes

Job description

We are seeking experienced Principal Site Reliability Engineers (SRE) to join a high-performing engineering team delivering resilient, cloud- platforms for UK-based customers. These roles blend senior technical leadership with hands-on delivery, covering both project-based work and the ongoing reliability, scalability, and security of critical services.

You'll work closely with other senior engineers in small, collaborative teams, taking ownership of platform reliability, setting best practices, and mentoring others. The role supports critical infrastructure, requires participation in an on-call rota, and operates within a hybrid working model across UK offices, client sites, and home.

Key responsibilities

Design, build, and maintain highly available, scalable, and resilient platforms, prioritising standardisation, reuse, and automation

Champion GitOps-first approaches, minimising manual configuration ("ClickOps")

Lead and contribute to Site Reliability Engineering practices, including error budgets, SLOs, SLIs, and incident management

Work in agile delivery teams, aligning engineering outcomes to customer and service reliability goals

Operate within defined on-call rotas, supporting services underpinning critical infrastructure

Provide technical leadership and mentorship, developing the capability of engineers across teams

Promote and embed best practices in reliability, security, observability, and automation

Contribute to the evolution of cloud- and SRE standards, patterns, and platform strategiesSkills and experience

Requirements

Proven leadership experience in Site Reliability Engineering or senior platform engineering roles

Strong expertise in Kubernetes and OpenShift (CKA/CKS certifications beneficial)

Experience designing complex multi-cloud or hybrid architectures

Hands-on knowledge of service mesh technologies such as Istio

Experience with enterprise-grade databases, including EDB Postgres

Deep understanding of observability and monitoring stacks, such as Prometheus, Grafana, Loki, Tempo, and LogiStack

Strong Infrastructure as Code experience using tools such as Helm or Kustomize

Proficiency in scripting and automation, including Bash and Python

CI/CD and GitOps pipeline management using tools such as ArgoCD, FluxCD, or Tekton

Experience with Red Hat ACM/ACS and advanced container networking (e.g. Submariner)

A strong focus on reliability, automation, and operational excellence

About the company

Energy Jobline is the largest and fastest growing global Energy Job Board and Energy Hub. We have an audience reach of over 7 million energy professionals, 400,000+ monthly advertised global energy and engineering jobs, and work with the leading energy companies worldwide. We focus on the Oil & Gas, Renewables, Engineering, Power, and Nuclear markets as well as emerging technologies in EV, Battery, and Fusion. We are committed to ensuring that we offer the most exciting career opportunities from around the world for our jobseekers.

Apply for this position