Senior Site Reliability Engineer

Glasgow, United Kingdom
10 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Glasgow, United Kingdom

Tech stack

Cloud Computing
Reliability Engineering
Software Engineering
Delivery Pipeline
Grafana
Reliability of Systems
Kubernetes

Job description

We're working with a mission-led technology business that continues to invest heavily in its platform and engineering function as part of a wider modernisation and cloud transformation journey.

This is not a traditional ops role. They are looking for a Senior SRE who can influence engineering teams, drive reliability standards across the organisation, and help shape how modern software is built, deployed, observed, and supported at scale.

The environment is heavily engineering-focused, with a strong platform mindset, modern cloud infrastructure, and a business-wide push around observability, resilience, automation, and developer enablement. You'll operate in a consultative capacity across multiple engineering teams, helping improve reliability and production readiness while influencing best practice across the wider technology organisation.

Key responsibilities

Driving reliability best practices across engineering teams and platforms
Helping define and mature SLOs, SLIs and SLAs across critical services
Working closely with software engineers to improve system resilience, observability, scalability, and performance
Supporting teams with incident management, root cause analysis, and production readiness
Building and improving monitoring, alerting, and observability tooling
Driving automation and reducing operational toil through engineering solutions
Influencing architecture decisions from a reliability and scalability perspective
Embedding SRE principles into product and platform engineering teams

Tech environment

GCP / Kubernetes / Terraform
Prometheus / Grafana / Observability tooling
CI/CD pipelines and modern engineering practices
Infrastructure as Code
Distributed systems and cloud-native environments

Requirements

What they're looking for

Strong experience working within modern SRE or Platform Engineering environments
Previous software engineering or development background would be highly beneficial
Experience defining and implementing SLIs, SLOs and SLAs in production environments
Strong understanding of observability, monitoring, alerting, and reliability engineering principles
Comfortable influencing engineering teams and operating in a consultative / enablement-style role
Experience working within cloud-native environments and modern infrastructure platforms

This is a genuinely strong opportunity for someone who enjoys solving complex reliability challenges while influencing how engineering teams build and operate software at scale.

If interested, drop me a message directly or apply below.

Apply for this position