Cloud Engineering Specialist - SRE

Qt Group
Manchester, United Kingdom
10 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Manchester, United Kingdom

Tech stack

Agile Methodologies
Application Performance Management
Cloud Computing
Cloud Computing Security
Cloud Engineering
Cloud Storage
Computer Networks
Continuous Delivery
Continuous Integration
Disaster Recovery
Prometheus
Systems Integration
Virtualization Technology
Grafana
Operating Systems
Cloud Migration
Terraform

Job description

At Enterprise cloud, our purpose is to provide the best cloud to our customers. As we continue to redefine ourselves as a modern, innovative and purposeful organisation, we are investing heavily in automation and engineering excellence across our platforms. We are looking for an experienced SRE to join us. In this role you will help strengthen observability, reliability and operational excellence across our on-prem cloud estate. You will work closely with product owners and engineering.

  • Partner with Product Owners and engineering leads to embed reliability into roadmaps, backlogs, and delivery decisions.
  • Apply SRE principles (SLIs, SLOs, error budgets) to maintain service reliability, performance, and scalability.
  • Enhance observability across metrics, logs, traces, and events to ensure services are observable by design.
  • Manage infrastructure as code and CI/CD environments, delivering improvements and supporting operational changes.
  • Lead incident response and root cause analysis, driving effective resolution, post-incident reviews, and long-term prevention.
  • Work with cross-functional engineering teams to remove technical barriers, reduce toil, and improve service operability.
  • Provide hands-on engineering support, validating technical decisions and promoting best practices.
  • Foster a culture of curiosity, experimentation, and first-principles thinking to strengthen engineering excellence.

Cloud Deployment, Cloud Strategy, IT Service Delivery, Cloud Security, Cloud Architecture/Design, Computer Networking, Cloud Migration, Virtualisation, Operating Systems, Agile Methodologies, Cloud Operations, Continuous Integration/Continuous Deployment, Automation & Orchestration, Cloud Storage, Decision Making, Growth Mindset, Inclusive Leadership

Requirements

  • Deep understanding of SRE concepts: SLIs, SLOs, SLAs and error budgets
  • Proven ability to design and implement reliable environments
  • Hands-on experience with monitoring tools, application insights, and integrations with tools such as Prometheus and Grafana
  • Infrastructure as Code skills, e.g. Terraform
  • Advanced knowledge of VMware technology
  • Experience with CI/CD, automation and monitoring tools
  • Experience with disaster recovery planning and chaos engineering practices
  • Experience implementing identity governance and security frameworks

Benefits & conditions

Looking in:
  • Leading inclusively and safely: I inspire and build trust through self-awareness, honesty and integrity.
  • Owning outcomes: I take the right decisions that benefit the broader organisation.

Looking out:
  • Delivering for the customer: I execute brilliantly on clear priorities that add value to our customers and the wider business.
  • Commercially savvy: I demonstrate strong commercial focus, bringing an external perspective to decision-making.

Looking to the future:
  • Growth mindset: I experiment and identify opportunities for growth for both myself and the organisation.
  • Building for the future: I build diverse future-ready teams where all individuals can be at their best.
