Back

Senior Site Reliability Engineer

LYREBIRD LLC

3 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Remote

Tech stack

Amazon Web Services (AWS)

Amazon Web Services (AWS)

Cloud Computing

DevOps

Distributed Systems

Identity and Access Management

Reliability Engineering

Kubernetes

Amazon Web Services (AWS)

Functional Programming

Amazon Web Services (AWS)

Docker

Job description

We're hiring a Senior SRE to own the reliability, scalability, and performance of our production systems as we continue to grow.

At Lyrebird, you won't just respond to incidents. You'll design the systems and standards that prevent them. That means building infrastructure that scales cleanly, creating deployment patterns that reduce risk, and ensuring we can detect and resolve issues before they impact users.

This is a broad role that sits across platform engineering, DevOps, and security. You'll be responsible for ensuring our systems are resilient under load, observable in real time, and able to scale as usage increases., * Keep production systems online and restore them quickly when they fail

Lead and manage incidents, making high-quality decisions under pressure
Design and implement scalable infrastructure and deployment patterns
Build and improve CI/CD pipelines and release systems
Improve monitoring, telemetry, and observability across the stack
Own cloud infrastructure, security, and access controls
Work closely with engineers to ensure systems are built to scale from day one

Requirements

5-7 years experience in SRE, platform engineering, or DevOps roles
Strong AWS experience (ECS/Fargate, EC2, Lambda, SQS, IAM)
Experience running and scaling production systems
Strong understanding of distributed systems and scaling approaches
Hands-on experience with Docker and containerised environments
Experience with Kubernetes or ECS

How you work

You take ownership and follow things through
You're proactive and comfortable operating with ambiguity
You stay calm and make good decisions during incidents
You focus on solving problems end to end
You're willing to roll up your sleeves and get into the detail

About the company

Lyrebird Health builds AI-powered tools that reduce the administrative burden on clinicians and improve the quality and accessibility of healthcare.

Apply for this position