Site Reliability Engineer Manager ( Healthcare Domain)
Role details
Job location
Tech stack
Requirements
10+ years in engineering, operations, or SRE roles
5+ years leading SRE, platform, or reliability-focused teams
Proven experience implementing SRE practices at scale (SLIs, SLOs, error budgets)
Strong background in cloud environments (AWS, Azure, Google Cloud Platform)
Hands-on experience with observability tools (Splunk, AppDynamics, Prometheus, etc.)
Experience in incident management and production operations at scale
Ability to operate effectively in high-pressure and complex enterprise environments
Preferred Qualifications
Experience driving organizational transformation (not just technical implementation)
Strong understanding of CI/CD, DevOps, and automation practices
Experience working in regulated or large enterprise environments
Familiarity with AIOps or advanced automation strategies, Increased adoption of SLOs and reliability standards
Reduction in high-severity incidents over time
Improved MTTR and operational efficiency
Increased adoption of standardized observability practices
Reduction in reactive, ticket-driven work across teams
Clear alignment between SRE, PSE, and application teams