Staff, Site Reliability Engineer (SRE)

Sprinter Health
San Francisco, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$255K

Job location

San Francisco, United States of America

Tech stack

Amazon Web Services (AWS)
Computing Platforms
Bash
Cloud Computing
Cloud Computing Security
Continuous Integration
Disaster Recovery
Distributed Systems
Amazon DynamoDB
Identity and Access Management
Python
Key Management
Node.js
Software Architecture
Reliability Engineering
Systems Architecture
TypeScript
Data Logging
Scripting (Bash/Python/Go/Ruby)
Cloud Platform System
CloudFormation
GraphQL
React Native
Functional Programming
Terraform
Marketplace
Serverless Computing

Job description

We're looking for a Staff Site Reliability Engineer who wants to build the reliability, infrastructure, and security foundations that power last-mile healthcare delivery at scale.

At Sprinter, you'll work on the operational backbone behind products that blend logistics, patient experience, safety, and medical operations. Our systems help determine whether patients get access to care, whether clinicians are routed efficiently, whether internal teams can operate effectively, and whether our platform can scale securely and reliably as the business grows.

This role is ideal for someone who wants broad ownership across reliability, cloud infrastructure, security, observability, automation, and platform design. You'll help raise the operational bar across engineering, reduce toil through infrastructure as code and scripting, strengthen our security posture, and guide architectural decisions that make our systems more resilient over time.

If you want to make meaningful technical decisions, work across engineering and operations, and help shape the foundation of how a high-growth healthcare company scales, this is that role.

  • Design, build, and improve the infrastructure that powers Sprinter's patient care, clinician operations, internal tooling, and partner-facing systems
  • Improve reliability across distributed systems, cloud infrastructure, CI/CD, observability, and incident response
  • Raise the security baseline across cloud infrastructure, access controls, secrets management, identity, and operational workflows
  • Build and maintain infrastructure as code using Terraform and related tooling
  • Automate manual infrastructure and operational processes through scripting, tooling, and platform improvements
  • Partner with engineering teams to improve system architecture, deployment practices, monitoring, logging, and alerting
  • Troubleshoot complex issues across infrastructure, application, data, and operational boundaries
  • Help define reliability, security, and infrastructure standards that allow Sprinter to scale without creating brittle systems
  • Support incident response practices, postmortems, operational readiness, and continuous improvement across engineering
  • Make pragmatic tradeoffs between reliability, security, speed, and simplicity in a fast-moving startup environment

Requirements

  • Spent 8+ years in site reliability engineering, platform engineering, infrastructure engineering, security engineering, or related technical roles
  • Led high-impact infrastructure, reliability, platform, or security projects end to end with minimal oversight
  • Built and operated production systems in cloud environments, ideally AWS and/or GCP
  • Worked deeply with infrastructure as code, ideally Terraform
  • Improved observability, monitoring, logging, alerting, and incident response practices across engineering teams
  • Automated infrastructure, deployment, or operational workflows using scripting languages such as Python, Bash, or TypeScript
  • Improved cloud security, access management, secrets management, networking, or operational controls
  • Troubleshot production issues across application, infrastructure, networking, and deployment layers
  • Worked in environments where reliability, security, ambiguity, and speed all matter
  • Made technical decisions that balanced immediate business needs with long-term scalability, reliability, and maintainability

  • You've built or scaled infrastructure in health tech, logistics, marketplace, fintech, or other operationally complex environments
  • You've worked in mid- or growth-stage startups where speed, ambiguity, and pragmatic decision-making were required
  • You have experience improving security posture in a practical, engineering-friendly way
  • You've helped establish reliability standards, incident response practices, or platform patterns across an engineering org
  • You're comfortable working directly with product engineers, data teams, operations, security stakeholders, and technical leadership
  • You have experience mentoring engineers and raising the operational bar across a broader engineering team
  • You've worked in regulated environments and understand the importance of privacy, security, and compliance best practices
  • You have people management experience or interest in growing into broader technical leadership over time

  • Hiring Manager Interview: Experience and technical depth
  • Technical Interview: SRE fundamentals, observability, incident response, and disaster recovery
  • Soft Skills Interview: Collaboration style and compatibility with the teams this person will support
  • Reference Checks: Validation of performance and working style

  • Terraform and infrastructure-as-code tooling
  • AWS
  • GCP
  • TypeScript
  • Python
  • Bash
  • CI/CD systems
  • Monitoring, logging, and observability platforms
  • Identity, access, and secrets management systems
  • Cloud networking and infrastructure tooling
  • Container and deployment systems
  • Serverless AWS, including AppSync, DynamoDB, Lambda, Amplify, CloudFormation, and Node.js
  • GraphQL
  • React Native and React Native for Web

Benefits & conditions

3.6 out of 5 stars
San Francisco, CA
Hybrid work
$160,000 - $255,000 a year, Full-time

  • Food provided
  • Parental leave
  • Health insurance
  • 401(k) matching
  • Paid time off
  • Vision insurance
  • Health savings account

We are a hybrid company based in the Bay Area with offices in both San Francisco and Menlo Park. For this requisition, we are open to remote candidates but will prioritize candidates who are local. We care about work-life balance and understand that there will be times when flexibility is needed.

  • Meaningful pre-IPO equity
  • Medical, dental, and vision plans 100% paid for you and your dependents
  • Flexible PTO + 10 paid holidays per year
  • 401(k) with match
  • 16-week parental leave policy for birthing parent, 8 weeks for all other parents
  • HSA + FSA contributions
  • Life insurance, plus short- and long-term disability coverage
  • Free daily lunch in-office
  • Annual learning stipend
  • Relocation assistance

About the company

At Sprinter Health, our mission is to reimagine how people access care by bringing it directly to their homes. Nearly 30% of patients in the U.S. skip preventive or chronic care simply because they can't get to a doctor's office. For many, the ER becomes their first touchpoint with the healthcare system, driving over $300B in avoidable costs every year. By using the same technologies that power leading marketplace and last-mile platforms, we deliver care where people are, especially those who need it most. So far, we've supported more than 2 million patients across 22 states, completed 130,000+ in-home visits, and maintained a 92 NPS. Our team of clinicians, technologists, and operators has raised over $125M from investors like a16z, General Catalyst, GV, and Accel, and has a multi-year runway.
