Senior SRE Cloud Enablement Engineer
Role details
Job location
Tech stack
Job description
We are seeking a Senior Site Reliability & Cloud Enablement Engineer to join the SRE Team. In this role, you will design, build, scale and operate highly available distributed systems and cloud applications that power intelligent buildings and spaces worldwide redefining how people live, work, learn and play.
Our team's scope is broad, with a strong focus on enabling product reliability and delivery velocity by building the platforms, standards, runbooks and guardrails that engineering teams rely on to scale safely. We emphasize data-driven reliability practices, strong cross-team collaboration and operational excellence. In addition to reliability engineering, we focus on cloud enablement by empowering teams to adopt cloud capabilities safely and efficiently at scale.
We are a highly distributed team where remote work is the norm. Our culture values trust, respect, ownership, asynchronous communication, and customer impact.
How you will contribute:
You will lead through influence, applying your unique combination of cloud infrastructure expertise and software engineering skills to solve complex reliability, scalability and operability challenges across our products and services. You will enable teams by designing reusable cloud patterns, automation and guardrails to accelerate product delivery.
Key Tasks & Responsibilities (Essential Functions)
- Monitor and improve service availability, performance and operability, ensuring systems meet defined SLAs and SLOs.
- Define and evolve reliability metrics (SLIs and SLOs) and ensure they are observable and actionable.
- Serve as an escalation point for critical incidents. Lead incident response and blameless postmortems.
- Drive proactive reliability improvements through problem identification, root cause analysis, remediation and systemic fixes.
- Partner with development teams to design, deploy and manage cloud infrastructure using modern best practices.
- Establish and evolve cloud standards, guardrails, and best practices that balance autonomy, security, reliability and cost efficiency.
- Collaborate with product, platform, QA, and security team throughout the software lifecycle to deliver reliable, scalable, and compliant systems.
- Advocate for changes that improve system reliability and engineering velocity.
Requirements
- 5+ years of experience in software engineering and SRE / DevOps roles supporting production systems.
- Strong hands-on experience with C#, Python, Java, or Go particularly for tooling and automation.
- Deep experience in at least one major cloud environment - Azure strongly preferred.
- Hands-On experience with Kubernetes and containerized workloads (AKS, Helm, Kustomize)
- Experience with Infrastructure as Code using Bicep or Terraform in complex cloud environments.
- Strong background in observability platforms and monitoring strategies (Datadog, Prometheus etc.)
- Working knowledge of GitOps and experience building and maintaining CI/CD Pipelines (Jenkins, Azure DevOps)
- Systems-oriented mindset with a focus on availability, resilience and enabling teams to operate effectively in the cloud.
- Proven ability to collaborate across teams to diagnose issues, identify root causes, and drive resolutions.
- Experience leading or participating in incident response and post-incident reviews.
- Solid understanding of change & release management practices.
- Ability to learn quickly, adapt to new technologies, and operate with minimal supervision.
- Strong written and verbal communication skills, with the ability to influence senior stakeholders and guide engineering teams.
- Curiosity, adaptability and demonstrated habit of leaving systems better than you found them
- Ability to travel for business purposes as needed.
Preferred Skills and Experience
- Experience working with IoT platforms or distributed device networks.
- Experience with disaster recovery, security, and compliance.
- Experience defining SLAs, SLOs, and SLIs in partnership with product teams.
- Familiarity with FinOps principles and cloud cost optimization strategies.
Benefits & conditions
The range for this position is $120,800 to $200,800. Placement within this range may vary, depending on the applicant's experience and geographic location. Acuity offers generous benefits including health care, dental coverage, vision plans, 401K benefits, and commissions/incentive compensation depending on role. For a list of our benefits, click here.