SRE DevOps Engineer
Role details
Job location
Tech stack
Job description
Exciting opportunity for SRE DevOps Engineer to join our Client Servicing and Engagement platform. As SRE DevOps Engineer you'll be responsible for ensuring our products run reliably, are scalable, and perform optimally in production environments.
You'll supervise and manage these aspects to ensure our products meet the expected Service Level Objectives (SLOs) while also being accountable for one or more areas of the cloud infrastructure resources alongside supervising the work of the SREs in that area. You'll focus on observability of your technical areas and prioritising the operational service improvements to best improve the SLOs.
What you'll be doing:
- Maintain, support & improve our cloud infrastructure in a specific technical area
- Investigate, fix and remove service issues with an engineering mindset
- Identify ways in which to improve observability continuously
- Identify toil relentlessly and design automated solutions to remove it
- Prioritise operational service improvements to meet or increase SLO
- Lead incident post-mortems
- Working with PO to balance run, change and improve stories in each sprint based on metrics e.g. outage rate increasing
- Support building a strong team by mentoring early career engineers to advance their technical skills, and by undertaking technical interviews to enable us to hire new engineers
Why join us?
We're transforming at pace. Investing billions in our people, data and tech to change the way we meet the needs of our 28 million customers. We're growing, and we'd love you to be part of the journey.
Requirements
We're looking for an individual with 5+ years' experience across a broad skillset, capable of applying technical leadership across:
- Kubernetes (Vital Requirement to understand) and Service Mesh (Istio is currently used)
- CI/CD Automation for Build & Release (Important to have experience in 1 or more tooling solutions)
- Automated Unit/Integration/Load/Performance Testing
- Observability, Logging, Monitoring & Alerting
- Experience programming in at least two (but not all!) of the following languages: Java, Python, Go, C++, JavaScript, TypeScript, PowerShell or Bash/Shell
- Cloud platform technologies across Azure and GCP, including Azure App Gateway, API Management, AKS, Cosmos DB, Azure SQL, Azure Firewall, and GCP services such as Cloud Load Balancing, Apigee, GKE, Firestore/Cloud SQL, and VPC Firewall Rules
- Proficient in monitoring tools e.g. Azure Monitor/Log Analytics/Dynatrace, Security Cloud & API, Encryption & Certificates
We know that great talent comes from many backgrounds. Whilst this job advert may reference specific years of experience, we recognise that skills are developed in many ways, so if you have relevant, transferable experience, we encourage you to apply.
Benefits & conditions
We were one of the first major organisations to set goals on diversity in senior roles, create a menopause health package, and a dedicated Working with Cancer Initiative.
We also offer a wide-ranging benefits package, which includes:
-
A generous pension contribution of up to 15%
-
An annual performance-related bonus
-
Share schemes including free shares
-
Benefits you can adapt to your lifestyle, such as discounted shopping
-
28 days' holiday, with bank holidays on top
-
A range of wellbeing initiatives and generous parental leave policies