Cloud Engineering Specialist - SRE
Role details
Job location
Tech stack
Job description
At Enterprise cloud our purpose is to provide the best cloud to our customers. As we continue redefining into a modern, innovative and purposeful organisation, we are investing heavily in automation and engineering excellence across our platforms. We are looking for an experienced SRE to join us. In this role you will help strengthen observability, reliability and operational excellence across our on prem cloud estate. You will work closely with product owners and engineering., * Partner with Product Owners and engineering leads to embed reliability into roadmaps, backlogs, and delivery decisions.
- Apply SRE principles (SLIs, SLOs, error budgets) to maintain service reliability, performance, and scalability.
- Enhance observability across metrics, logs, traces, and events to ensure services are observable by design.
- Manage infrastructure as code and CI/CD environments, delivering improvements and supporting operational changes.
- Lead incident response and root cause analysis, driving effective resolution, post incident reviews, and long term prevention.
- Work with cross functional engineering teams to remove technical barriers, reduce toil, and improve service operability.
- Provide hands on engineering support, validating technical decisions and promoting best practices.
- Foster a culture of curiosity, experimentation, and first principles thinking to strengthen engineering excellence., Cloud Deployment Cloud Strategy IT Service Delivery Cloud Security Cloud Architecture/Design Computer Networking Cloud Migration Virtualisation Operating Systems Agile Methodologies Cloud Operations Continuous Integration/Continuous Deployment Automation & Orchestration Cloud Storage Decision Making Growth Mindset Inclusive Leadership
Requirements
- Deep understanding of SRE concepts SLIs, SLOs, SLAs and error budgets
- Proven ability to design and implement reliable environments
- Hands-on experience with monitoring tools, application insights, integrations with tools such as Prometheus and Grafana
- Infrastructure as Code skills e.g. Terraform
- Advanced knowledge of vmware technology
- Experience with CI/CD, automation and monitoring tools
- Experience with disaster recovery planning and chaos engineering practices
- Experience implementing identity governance and security frameworks
Benefits & conditions
Looking in: Leading inclusively and Safely I inspire and build trust through self-awareness, honesty and integrity. Owning outcomes I take the right decisions that benefit the broader organisation.
Looking out: Delivering for the customer I execute brilliantly on clear priorities that add value to our customers and the wider business. Commercially savvy I demonstrate strong commercial focus, bringing an external perspective to decision-making.
Looking to the future: Growth mindset I experiment and identify opportunities for growth for both myself and the organisation. Building for the future I build diverse future-ready teams where all individuals can be at their best.