Site Reliability Engineer

Universal Music Group

Charing Cross, United Kingdom

2 months ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

£40K

Job location

Remote

Charing Cross, United Kingdom

Tech stack

Java

Microsoft Windows

Amazon Web Services (AWS)

Azure

Linux

Distributed Systems

IT Management

Python

Ansible

Prometheus

Datadog

Cloud Platform System

Grafana

Reliability of Systems

Kubernetes

CloudWatch

Terraform

Splunk

Dynatrace

Docker

ServiceNow

Job description

As a Site Reliability Engineer, you won't just be supporting systems; you'll be ensuring the services that connect artists and fans around the globe are always on.

System Reliability & Performance:

Design, build, and maintain the availability, scalability, and performance of critical services.
Develop and maintain robust monitoring, alerting, and observability systems (e.g., using AWS CloudWatch, Dynatrace) to ensure rapid issue detection and resolution.
Monitor infrastructure capacity and performance, providing analysis and suggestions for service delivery improvement.

Automation & Efficiency:

Drive the automation of repetitive operational tasks, including infrastructure provisioning, deployments, and scaling.
Create and maintain scripts and custom code to support and enhance our operational toolset.
Support and optimize CI/CD pipelines to improve deployment speed and reliability.

Incident Management & Collaboration:

Participate in an on-call rotation to troubleshoot and mitigate production incidents.
Lead post-incident reviews and root cause analyses to implement lasting solutions.
Partner with engineering and IT stakeholders to embed SRE best practices (SLOs, error budgets) into the design and development lifecycle.

Requirements

Required Experience & Skills:

A strong background in systems administration (Linux/Windows) in a large-scale environment.
Proficiency in at least one programming language (e.g., Python, Go, Java).
Hands-on experience with a major cloud platform (AWS, GCP, or Azure), with a strong preference for AWS.
Solid understanding of networking, containers (Docker, Kubernetes), and Infrastructure as Code (e.g., Terraform, Ansible).
Experience with modern monitoring and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk, Dynatrace).
Proven analytical and problem-solving abilities with experience in a high-pressure environment.
Excellent communication skills and the ability to foster a collaborative team environment.

Preferred Experience & Skills:

Bachelor's degree in an IT-related field.
Experience managing large-scale, distributed systems for a global organization.
Familiarity with IT governance standards like ITIL.
Direct experience with ServiceNow for IT service management.
Knowledge of chaos engineering, resilience testing, and advanced capacity planning.

About the company

It's the passionate and dedicated team at Universal Music who help make us the world's leading music company. From A&R to finance, legal to digital, sales to marketing, Universal Music is the place to grow and develop your career within a truly commercial and innovative business that leads in everything it does.

We are UMG, the Universal Music Group. We are the world's leading music company. In everything we do, we are committed to artistry, innovation and entrepreneurship. We own and operate a broad array of businesses engaged in recorded music, music publishing, merchandising, and audiovisual content in more than 60 countries. We identify and develop recording artists and songwriters, and we produce, distribute and promote the most critically acclaimed and commercially successful music to delight and entertain fans around the world.

As a key member of our Global Technical Operations team, you will be responsible for the reliability, scalability, and performance of the critical systems that power a global enterprise. By blending a software engineering mindset with operational expertise, you will engineer solutions that improve system reliability, automate complex processes, and reduce manual toil. You will be an essential partner to our development, infrastructure, and security teams, driving a culture of resilience and continuous improvement across the organization.