Site Reliability Engineer

GoDaddy, LLC

Manchester, United Kingdom

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Shift work

Languages

English

Experience level

Senior

Compensation

£ 120K

Job location

Remote

Manchester, United Kingdom

Tech stack

Java

.NET

Agile Methodologies

Amazon Web Services (AWS)

Azure

Cloud Computing

Continuous Integration

Software Debugging

DevOps

Disaster Recovery

DNS

Information Management

Python

Network Service

Scrum

Reliability Engineering

Ansible

Prometheus

Software Engineering

TCP/IP

Tcpdump

Datadog

Data Logging

Load Balancing

Grafana

Amazon Web Services (AWS)

Gitlab

Cloudformation

Kubernetes

Information Technology

Terraform

ELK

Jenkins

Microservices

Requirements

Iron Mountain is seeking a skilled and motivated DevOps Engineer to join our global information management team in UK (100% remote).In this role, you will be responsible for providing critical technical support for applications and hardware, managing our observability strategy, and ensuring the global delivery and performance of our services. You will collaborate closely with network services, software engineering, and development teams to maintain a highly reliable and scalable environment.What You'll Do (Responsibilities)In this role, you will:Design and Automate Infrastructure: Build and maintain cloud infrastructure on AWS, GCP, or Azure, utilizing tools like Terraform, Ansible, or CloudFormation to automate provisioning.Optimize CI/CD Pipelines: Develop and manage continuous integration and deployment pipelines using Jenkins, GitLab, or ArgoCD to streamline software delivery.Enhance Observability: Implement monitoring and logging systems (Prometheus, Grafana, Datadog) to define alerts, dashboards, and log-based metrics that improve application availability.Lead Incident Management: Respond to real-time outages, perform root cause analysis, and participate in on-call rotations to ensure rapid service restoration.Ensure Application Sustainment: Oversee the full lifecycle of critical applications, including upgrades, patching, and scaling services to meet global demand while maintaining strict SLOs.Drive Reliability and Security: Implement self-healing systems, disaster recovery strategies, and security best practices, including regular vulnerability patching and audits.What You'll Bring (Skills & Qualifications)Specific requirements:Must have UK passport, be UK based for more than five consecutive years and able to obtain Security ClearanceThe ideal candidate will have:Previous experience as DevOps Engineer with focus on SRE Engineer.Strong technical expertise in managed Kubernetes services and cloud networking concepts (VPC, DNS, Load Balancers, and TCP/IP).Proven ability in complex troubleshooting using debugging tools like tcpdump or strace and log aggregation tools like the ELK stack or Splunk.Software Development skills in Python, Java, or .Net, along with experience developing scalable microservices and REST APIs.A Bachelor's Degree in Computer Science, Engineering, or a related field.Preferred Certifications: Scrum Master, PMP, or Agile SAFe certification.What We Offer (Benefits):Location: England (100% remote).Competitive Compensation: Salary and benefits aligned with your professional experience.Work-Life Balance: Flexible work options and alternative arrangements to support your personal needs.Health & Wellness: Comprehensive health, wellness, and retirement plans.Growth Opportunities: Access to continuous learning and professional development to accelerate your career.Ready to join the team?Apply Today! Do not miss the chance to join a global leader in storage and information management services. Similar jobs

Benefits & conditions

Head of Site Reliability Engineering & Infrastructure Location: Manchester (Hybrid) Contract: Permanent, Full-time Salary: Up to £80,000 + Share Options Incentive Scheme Morson Edge have partnered with a Global Tech oragnsation in their search for a Head of Site..., Site Reliability Engineer (SRE) - Defence / National Security - £75k - Farnborough - Hybrid A permanent opportunity for an experienced Site Reliability Engineer who enjoys building secure, automated, and highly reliable platforms. This role sits within a defence and...

About the company

Job Title: Site Reliability Engineer (Bare Metal Infrastructure) Client: Most Elite FinTech Firm in London Salary: Up to £150k+ Bonus + Full Package Location: London (Hybrid) One of London's most elite fintech firms is hiring a Site Reliability Engineer to join a seriously...