Site Reliability Engineer Manager (Healthcare Domain)

Vaarida Technologies LLC
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Remote

Tech stack

Amazon Web Services (AWS)
Azure
Continuous Integration
DevOps
Site Reliability Engineering Practices
Prometheus
Google Cloud Platform
Cloud Platform System
Grafana
MTTR
Splunk
AppDynamics

Requirements

10+ years in engineering, operations, or SRE roles

5+ years leading SRE, platform, or reliability-focused teams

Proven experience implementing SRE practices at scale (SLIs, SLOs, error budgets)

Strong background in cloud environments (AWS, Azure, Google Cloud Platform)

Hands-on experience with observability tools (Splunk, AppDynamics, Prometheus, etc.)

Experience in incident management and production operations at scale

Ability to operate effectively in high-pressure and complex enterprise environments

Preferred Qualifications

Experience driving organizational transformation (not just technical implementation)

Strong understanding of CI/CD, DevOps, and automation practices

Experience working in regulated or large enterprise environments

Familiarity with AIOps or advanced automation strategies

Success Measures

Increased adoption of SLOs and reliability standards

Reduction in high-severity incidents over time

Improved MTTR and operational efficiency

Increased adoption of standardized observability practices

Reduction in reactive, ticket-driven work across teams

Clear alignment between SRE, PSE, and application teams
