Lead Site Reliability Engineer | Copperleaf

IFS

Staines-upon-Thames, United Kingdom

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Remote

Staines-upon-Thames, United Kingdom

Tech stack

Amazon Web Services (AWS)

Application Performance Management

Azure

Bash

Software as a Service

Cloud Computing

Cloud Engineering

DevOps

Distributed Systems

Infrastructure as a Service (IaaS)

Python

Log Analysis

SQL Azure

Powershell

Systems Development Life Cycle

Reliability Engineering

SQL Databases

Data Logging

Scripting (Bash/Python/Go/Ruby)

Cloud Monitoring

Kubernetes

Bicep

Terraform

Microservices

Job description

Our Cloud Operations Team, a crucial component of our Software as a Service (SaaS) offering, also delivers Infrastructure as a Service (IaaS) to IFS Copperleaf. Built on the foundation of Site Reliability Engineering, we are expanding. Our commitment is to the reliability and uptime of our services, and we consistently aim to automate processes and minimize manual labor. We are currently seeking a mid senior level cloud engineer to contribute to these services and assist in enhancing the operational aspects of each service.

As a Lead Site Reliability Engineer (SRE) specializing in Azure, you will play a pivotal role in architecting, operating, and optimizing our cloud infrastructure. You will lead initiatives to ensure the reliability, scalability, and security of our Azure-based SaaS offerings. You'll mentor junior engineers, drive automation, and partner with development teams to deliver robust, high-availability solutions.

Key Responsibilities

Lead the design, implementation, and continuous improvement of Azure-based infrastructure for high-availability, mission-critical SaaS services.
Architect and automate deployment pipelines using Azure DevOps, ARM/Bicep, Terraform, and related tools.
Own and enhance monitoring, alerting, and incident response for Azure resources (App Services, AKS, SQL, Storage, Networking, etc.).
Drive root cause analysis and resolution of complex production incidents, collaborating across teams.
Define and enforce SLOs, SLIs, and SLAs for Azure-hosted SaaS services.
Champion security best practices, including identity, access, secrets, and certificate management in Azure.
Mentor and coach junior SREs and CloudOps engineers.
Partner with development teams to embed reliability and operational excellence into the SDLC.
Evaluate and implement new Azure features and services to improve reliability, performance, and cost efficiency.
Document architecture, runbooks, and operational procedures for Azure environments.

Requirements

Do you have experience in Terraform?, * 5+ years' experience in SRE, Cloud Operations, or DevOps roles, with at least 3 years focused on Microsoft Azure.

Deep expertise in Azure services (App Services, AKS, Azure SQL, Storage, Networking, Security Center, Monitor, etc.).
Strong automation and scripting skills (PowerShell, Python, Bash, or similar).
Proven experience with Infrastructure as Code (Terraform, ARM/Bicep).
Advanced troubleshooting of distributed systems, networking, and application performance in Azure.
Solid understanding of microservices, container orchestration (Kubernetes/AKS), and CI/CD pipelines.
Experience with monitoring, logging, and observability tools (Azure Monitor, Log Analytics, Application Insights).
Strong grasp of security protocols, certificate and secret management, and compliance in Azure.
Demonstrated ability to lead incident response and post-mortem analysis.
Excellent communication skills and a passion for mentoring others., * Azure certifications (e.g., Azure Solutions Architect, Azure DevOps Engineer).
Experience with hybrid or multi-cloud environments, including AWS.
Familiarity with cost management and optimization in Azure.
Experience supporting large-scale SaaS platforms.

Benefits & conditions

We embrace flexibility and hybrid work opportunities to support diverse needs and lifestyles, while also valuing inclusive workplace experiences. By fostering a sense of community, we drive innovation, strengthen connections, and nurture belonging. Our commitment ensures you can work in a way that suits you best, while also engaging with colleagues to share ideas and build meaningful relationships.

About the company

IFS is a billion-dollar revenue company with 7000+ employees on all continents. Our leading AI technology is the backbone of our award-winning enterprise software solutions, enabling our customers to be their best when it really matters-at the Moment of Service . Our commitment to internal AI adoption has allowed us to stay at the forefront of technological advancements, ensuring our colleagues can unlock their creativity and productivity, and our solutions are always cutting-edge. Copperleaf is the world's leading AI powered Asset Investment Planning (AIP) solution, enabling organizations to make better decisions - faster, smarter and with more confidence. Together Copperleaf and IFS offer the first end-to-end asset lifecycle management solution. Underpinned by Industrial AI, the combining of Copperleaf and IFS will allow our asset intensive customers to deliver on their moment of service through strategic allocation and execution of CAPEX and OPEX; balancing expenditure, business objectives, risk and optimal asset performance. At IFS, we're flexible, we're innovative, and we're focused not only on how we can engage with our customers but on how we can make a real change and have a worldwide impact. We help solve some of society's greatest challenges, fostering a better future through our agility, collaboration, and trust. We celebrate diversity and understand our responsibility to reflect the diverse world we work in. We are committed to promoting an inclusive workforce that fully represents the many different cultures, backgrounds, and viewpoints of our customers, our partners, and our communities. As a truly international company serving people from around the globe, we realize that our success is tantamount to the respect we have for those different points of view. By joining our team, you will have the opportunity to be part of a global, diverse environment; you will be joining a winning team with a commitment to sustainability; and a company where we get things done so that you can make a positive impact on the world. We're looking for innovative and original thinkers to work in an environment where you can #MakeYourMoment so that we can help others make theirs. With the power of our AI-driven solutions, we empower our team to change the status quo and make a real difference. If you want to change the status quo, we'll help you make your moment. Join Team Purple. Join IFS., Copperleaf IFS 's software helps some of the world's largest energy firms make better strategic decisions.