Staff, Site Reliability Engineer (SRE)

Sprinter Health

San Francisco, United States of America

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

$160K - $255K

Job location

San Francisco, United States of America

Tech stack

Amazon Web Services (AWS)

Computing Platforms

Bash

Cloud Computing

Cloud Computing Security

Continuous Integration

Disaster Recovery

Distributed Systems

Amazon DynamoDB

Identity and Access Management

Python

Key Management

Node.js

Software Architecture

Reliability Engineering

Systems Architecture

TypeScript

Data Logging

Scripting (Bash/Python/Go/Ruby)

Cloud Platform System

CloudFormation

GraphQL

React Native

Functional Programming

Terraform

Marketplace

Serverless Computing

Job description

We're looking for a Staff Site Reliability Engineer who wants to build the reliability, infrastructure, and security foundations that power last-mile healthcare delivery at scale.

At Sprinter, you'll work on the operational backbone behind products that blend logistics, patient experience, safety, and medical operations. Our systems help determine whether patients get access to care, whether clinicians are routed efficiently, whether internal teams can operate effectively, and whether our platform can scale securely and reliably as the business grows.

This role is ideal for someone who wants broad ownership across reliability, cloud infrastructure, security, observability, automation, and platform design. You'll help raise the operational bar across engineering, reduce toil through infrastructure as code and scripting, strengthen our security posture, and guide architectural decisions that make our systems more resilient over time.

If you want to make meaningful technical decisions, work across engineering and operations, and help shape the foundation of how a high-growth healthcare company scales, this is that role.

Design, build, and improve the infrastructure that powers Sprinter's patient care, clinician operations, internal tooling, and partner-facing systems
Improve reliability across distributed systems, cloud infrastructure, CI/CD, observability, and incident response
Raise the security baseline across cloud infrastructure, access controls, secrets management, identity, and operational workflows
Build and maintain infrastructure as code using Terraform and related tooling
Automate manual infrastructure and operational processes through scripting, tooling, and platform improvements
Partner with engineering teams to improve system architecture, deployment practices, monitoring, logging, and alerting
Troubleshoot complex issues across infrastructure, application, data, and operational boundaries
Help define reliability, security, and infrastructure standards that allow Sprinter to scale without creating brittle systems
Support incident response practices, postmortems, operational readiness, and continuous improvement across engineering
Make pragmatic tradeoffs between reliability, security, speed, and simplicity in a fast-moving startup environment

Requirements

Do you have experience in TypeScript?

Spent 8+ years in site reliability engineering, platform engineering, infrastructure engineering, security engineering, or related technical roles
Led high-impact infrastructure, reliability, platform, or security projects end to end with minimal oversight
Built and operated production systems in cloud environments, ideally AWS and/or GCP
Worked deeply with infrastructure as code, ideally Terraform
Improved observability, monitoring, logging, alerting, and incident response practices across engineering teams
Automated infrastructure, deployment, or operational workflows using scripting languages such as Python, Bash, or TypeScript
Improved cloud security, access management, secrets management, networking, or operational controls
Troubleshot production issues across application, infrastructure, networking, and deployment layers
Worked in environments where reliability, security, ambiguity, and speed all matter
Made technical decisions that balanced immediate business needs with long-term scalability, reliability, and maintainability

You've built or scaled infrastructure in health tech, logistics, marketplace, fintech, or other operationally complex environments
You've worked in mid- or growth-stage startups where speed, ambiguity, and pragmatic decision-making were required
You have experience improving security posture in a practical, engineering-friendly way
You've helped establish reliability standards, incident response practices, or platform patterns across an engineering org
You're comfortable working directly with product engineers, data teams, operations, security stakeholders, and technical leadership
You have experience mentoring engineers and raising the operational bar across a broader engineering team
You've worked in regulated environments and understand the importance of privacy, security, and compliance best practices
You have people management experience or interest in growing into broader technical leadership over time

Hiring Manager Interview: Experience and technical depth
Technical Interview: SRE fundamentals, observability, incident response, and disaster recovery
Soft Skills Interview: Collaboration style and compatibility with the teams this person will support
Reference Checks: Validation of performance and working style

Terraform and infrastructure-as-code tooling
AWS
GCP
TypeScript
Python
Bash
CI/CD systems
Monitoring, logging, and observability platforms
Identity, access, and secrets management systems
Cloud networking and infrastructure tooling
Container and deployment systems
Serverless AWS, including AppSync, DynamoDB, Lambda, Amplify, CloudFormation, and Node
GraphQL
React Native and React Native for Web

Benefits & conditions

3.6 out of 5 stars
San Francisco, CA
Hybrid work
$160,000 - $255,000 a year
Full-time

Food provided
Parental leave
Health insurance
401(k) matching
Paid time off
Vision insurance
Health savings account

We are a hybrid company based in the Bay Area with offices in both San Francisco and Menlo Park. For this requisition, we are open to remote candidates but will prioritize candidates who are local. We care about work-life balance and understand that there will be times where flexibility is needed.

Meaningful pre-IPO equity
Medical, dental, and vision plans 100% paid for you and your dependents
Flexible PTO + 10 paid holidays per year
401(k) with match
16-week parental leave policy for birthing parent, 8 weeks for all other parents
HSA + FSA contributions
Life insurance, plus short and long-term disability coverage
Free daily lunch in-office
Annual learning stipend
Relocation assistance

About the company

At Sprinter Health, our mission is to reimagine how people access care by bringing it directly to their homes. Nearly 30% of patients in the U.S. skip preventive or chronic care simply because they can't get to a doctor's office. For many, the ER becomes their first touchpoint with the healthcare system, driving over $300B in avoidable costs every year. By using the same technologies that power leading marketplace and last-mile platforms, we deliver care where people are, especially those who need it most. So far, we've supported more than 2 million patients across 22 states, completed 130,000+ in-home visits, and maintained a 92 NPS. Our team of clinicians, technologists, and operators has raised over $125M from investors like a16z, General Catalyst, GV, and Accel, and has a multi-year runway.
