Senior SRE I
Waystar, Inc
Atlanta, United States of America
2 days ago
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Job location: Atlanta, United States of America
Tech stack
Java
Amazon Web Services (AWS)
Azure
Bash
Databases
Software Debugging
Linux
DevOps
Distributed Systems
GitHub
Python
Networking Basics
NoSQL
Reliability Engineering
Site Reliability Engineering Practices
Ansible
Prometheus
Ruby
Runbook
Software Deployment
Datadog
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Grafana
Reliability of Systems
CloudFormation
Containerization
GitLab CI
Kubernetes
Infrastructure Automation Frameworks
Information Technology
Terraform
Splunk
Docker
Jenkins
Go
Job description
- Design, implement, and maintain automation for infrastructure provisioning, configuration management, and application deployments across various environments (on-premise and cloud).
- Proactively monitor system health, performance, and availability, utilizing a range of observability tools and defining key performance indicators (KPIs) and service level objectives (SLOs).
- Lead the investigation and resolution of complex production incidents, perform root cause analysis, and implement preventative measures to minimize future occurrences.
- Collaborate with development teams to ensure software is designed for reliability, scalability, and operational efficiency, participating in architectural reviews and providing expert guidance.
- Develop and maintain robust incident response procedures, runbooks, and disaster recovery plans.
- Contribute to the evolution of our SRE practices, tooling, and standards, driving continuous improvement and knowledge sharing within the team.
- Participate in an on-call rotation to provide 24/7 support for critical production systems.
- Mentor junior SREs and contribute to the growth and development of the team.
- Evaluate and implement new technologies and solutions to enhance system reliability and operational efficiency.
Requirements
- Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
- 5+ years of experience in a Site Reliability Engineering, DevOps, or closely related infrastructure engineering role.
- Strong proficiency in at least one scripting/programming language (e.g., Python, Go, Java, Ruby, Bash).
- Extensive experience with cloud platforms (AWS, Azure, GCP) including services related to compute, networking, storage, and databases.
- Deep understanding of Linux operating systems and networking fundamentals.
- Proven experience with infrastructure as code tools (e.g., Terraform, CloudFormation, Ansible).
- Solid experience with CI/CD pipelines and related tools (e.g., Jenkins, GitLab CI, GitHub Actions).
- Demonstrable expertise in monitoring and alerting systems (e.g., Prometheus, Grafana, Datadog, Splunk).
- Strong problem-solving skills with a methodical approach to debugging complex distributed systems.
- Excellent communication and collaboration skills, with the ability to work effectively across cross-functional teams.
- Experience with containerization technologies (Docker, Kubernetes) is highly desirable.
- Familiarity with database technologies (relational and NoSQL) and their operational challenges.
Benefits & conditions
- Competitive total rewards (base salary + bonus, if applicable)
- Customizable benefits package (3 medical plans with Health Savings Account company match)
- Generous paid time off for non-exempt team members, starting at 3 weeks plus 13 paid holidays (including 2 personal floating holidays); flexible time off plus 13 paid holidays for exempt team members
- Paid parental leave (including maternity + paternity leave)
- Education assistance opportunities and free LinkedIn Learning access
- Free mental health and family planning programs, including adoption assistance and fertility support
- 401(k) program with company match
- Pet insurance
- Employee resource groups
About the company
Through a smart platform and better experience, Waystar helps providers simplify healthcare payments and yield powerful results throughout the complete revenue cycle.
Waystar's healthcare payments platform combines innovative, cloud-based technology, robust data, and unparalleled client support to streamline workflows and improve financials so providers can focus on what matters most: their patients and communities. Waystar is trusted by 1M+ providers, 1K+ hospitals and health systems, and is connected to over 5K commercial and Medicaid/Medicare payers. We are deeply committed to living out our organizational values: honesty; kindness; passion; curiosity; fanatical focus; best work, always; making it happen; and joyful, optimistic & fun.
Waystar products have won multiple Best in KLAS® or Category Leader awards since 2010 and earned multiple #1 rankings from Black Book surveys since 2012. The Waystar platform supports more than 500,000 providers, 1,000 health systems and hospitals, and 5,000 payers and health plans. For more information, visit waystar.com or follow @Waystar (https://twitter.com/Waystar) on Twitter.