Site Reliability Engineer Manager (Healthcare Domain)

Vaarida Technologies LLC

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Remote

Tech stack

Amazon Web Services (AWS)

Azure

Continuous Integration

DevOps

Site Reliability Engineering Practices

Prometheus

Google Cloud Platform

Cloud Platform System

Grafana

MTTR

Splunk

AppDynamics

Requirements

10+ years in engineering, operations, or SRE roles

5+ years leading SRE, platform, or reliability-focused teams

Proven experience implementing SRE practices at scale (SLIs, SLOs, error budgets)

Strong background in cloud environments (AWS, Azure, Google Cloud Platform)

Hands-on experience with observability tools (Splunk, AppDynamics, Prometheus, etc.)

Experience in incident management and production operations at scale

Ability to operate effectively in high-pressure and complex enterprise environments

Preferred Qualifications

Experience driving organizational transformation (not just technical implementation)

Strong understanding of CI/CD, DevOps, and automation practices

Experience working in regulated or large enterprise environments

Familiarity with AIOps or advanced automation strategies

Increased adoption of SLOs and reliability standards

Reduction in high-severity incidents over time

Improved MTTR and operational efficiency

Increased adoption of standardized observability practices

Reduction in reactive, ticket-driven work across teams

Clear alignment between SRE, PSE, and application teams
