Site Reliability Engineer

GoDaddy, LLC
Manchester, United Kingdom
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Shift work
Languages
English
Experience level
Senior
Compensation
£ 120K

Job location

Remote
Manchester, United Kingdom

Tech stack

Java
.NET
Agile Methodologies
Amazon Web Services (AWS)
Azure
Cloud Computing
Continuous Integration
Software Debugging
DevOps
Disaster Recovery
DNS
Information Management
Python
Network Service
Scrum
Reliability Engineering
Ansible
Prometheus
Software Engineering
TCP/IP
Tcpdump
Datadog
Data Logging
Load Balancing
Grafana
Gitlab
Cloudformation
Kubernetes
Information Technology
Terraform
ELK
Jenkins
Microservices

Requirements

Iron Mountain is seeking a skilled and motivated DevOps Engineer to join our global information management team in the UK (100% remote).

In this role, you will be responsible for providing critical technical support for applications and hardware, managing our observability strategy, and ensuring the global delivery and performance of our services. You will collaborate closely with network services, software engineering, and development teams to maintain a highly reliable and scalable environment.

What You'll Do (Responsibilities)

In this role, you will:

Design and Automate Infrastructure: Build and maintain cloud infrastructure on AWS, GCP, or Azure, using tools like Terraform, Ansible, or CloudFormation to automate provisioning.
Optimize CI/CD Pipelines: Develop and manage continuous integration and deployment pipelines using Jenkins, GitLab, or ArgoCD to streamline software delivery.
Enhance Observability: Implement monitoring and logging systems (Prometheus, Grafana, Datadog) to define alerts, dashboards, and log-based metrics that improve application availability.
Lead Incident Management: Respond to real-time outages, perform root cause analysis, and participate in on-call rotations to ensure rapid service restoration.
Ensure Application Sustainment: Oversee the full lifecycle of critical applications, including upgrades, patching, and scaling services to meet global demand while maintaining strict SLOs.
Drive Reliability and Security: Implement self-healing systems, disaster recovery strategies, and security best practices, including regular vulnerability patching and audits.

What You'll Bring (Skills & Qualifications)

Specific requirements:

Must hold a UK passport, have been UK based for more than five consecutive years, and be able to obtain Security Clearance.

The ideal candidate will have:

Previous experience as a DevOps Engineer with a focus on site reliability engineering (SRE).
Strong technical expertise in managed Kubernetes services and cloud networking concepts (VPC, DNS, Load Balancers, and TCP/IP).
Proven ability in complex troubleshooting using debugging tools like tcpdump or strace and log aggregation tools like the ELK stack or Splunk.
Software development skills in Python, Java, or .NET, along with experience developing scalable microservices and REST APIs.
A Bachelor's Degree in Computer Science, Engineering, or a related field.
Preferred certifications: Scrum Master, PMP, or Agile SAFe certification.

What We Offer (Benefits)

Location: England (100% remote).
Competitive Compensation: Salary and benefits aligned with your professional experience.
Work-Life Balance: Flexible work options and alternative arrangements to support your personal needs.
Health & Wellness: Comprehensive health, wellness, and retirement plans.
Growth Opportunities: Access to continuous learning and professional development to accelerate your career.

Ready to join the team?

Apply Today! Do not miss the chance to join a global leader in storage and information management services.

