Director, Site Reliability Engineering
Role details
Job location
Tech stack
Job description
The Director, Site Reliability Engineering (SRE) will lead reliability, performance, and observability initiatives for a portfolio of Vertafore products. This role owns SLIs/SLOs, incident response, automation, and CI/CD practices for assigned product families. Directors will manage multiple teams and collaborate with Product Development, Architecture, Cloud Operations, Information Security, and other SRE leaders to ensure operational excellence. This role is responsible for bridging the gap between development and operations by applying a software engineering mindset to system administration. You will own the lifecycle of services - from inception and design, through deployment, operation, and refinement., * Product Reliability Leadership
o Define and enforce SLIs/SLOs for a subset of Vertafore flagship products.
o Drive observability strategy across application and infrastructure layers.
- Release Engineering & Toil Reduction
o Oversee CI/CD pipelines for product deployments using tools like GitLab, Jenkins, Ansible, LaunchDarkly.
o Monitor and cap "Toil" (manual, repetitive operational work) at 50% using Automation and AI tools, ensuring the team spends the remaining time on project work that scales the system.
- Error Budget Management
o Manage "Error Budgets" to balance the velocity of feature releases with the stability of the platform, ensuring clear consequences when budgets are exhausted.
- Incident Management
o Define and participate in 24x7 on-call rotations for assigned products; ensure rapid resolution and blameless postmortems.
- Cross-Functional Collaboration
o Partner with Cloud Ops on capacity planning, OS patching (app tier), and load balancing (ALB, F5).
o Align reliability goals with product roadmaps and customer SLAs.
- Team Leadership
o Manage a group of Managers and Engineers, mentor teams on automation, observability, and reliability best practices.
Why Vertafore is the place for you: *Canada Only
- The opportunity to work in a space where modern technology meets a stable and vital industry
- Medical, vision & dental plans
- Life, AD&D
- Short Term and Long Term Disability
- Pension Plan & Employer Match
- Maternity, Paternity and Parental Leave
- Employee and Family Assistance Program (EFAP)
- Education Assistance
- Additional programs - Employee Referral and Internal Recognition
Requirements
Do you have experience in Team management?, Do you have a Bachelor's degree?, The selected candidate must be legally authorized to work in the United States., * Bachelor's degree in Computer Science, Information Systems, or related field.
-
15+ years in Software Engineering, SRE, DevOps, or reliability roles; 8+ years in leadership.
-
Proven ability to leverage software engineering principles and practices to solve reliability and operational challenges.
-
Expertise in CI/CD, observability, and incident response.
-
Strong AWS knowledge and experience with container orchestration.
-
Proven ability to lead reliability programs across multiple SaaS products.
-
Experience architecting applications or infrastructure for high-growth cloud platforms.
-
Experience in B2B SaaS environments involving large-scale distributed systems.
-
Proven leadership communicating and influencing at team, peer, and leadership levels.
-
Demonstrated experience driving operational excellence through metrics and KPIs.
-
(Preferred) Background supporting financial services, healthcare, or regulated industries.
Benefits & conditions
Pulled from the full job description
- Tuition reimbursement
- Pet insurance
- AD&D insurance
- Parental leave
- 401(k)
- Health insurance
- 401(k) matching
Full job description
$175,000 - $220,000 + VIP Bonus
Our fast-paced and collaborative environment inspires us to create, think, and challenge each other in ways that make our solutions and our teams better. Whether you're interested in engineering or development, marketing or sales, or something else - if this sounds like you, then we'd love to hear from you!
We are headquartered in Denver, Colorado, with offices in the US, Canada, and India.
$175,000 - $220,000 / year + Bonus, * The opportunity to work in a space where modern technology meets a stable and vital industry
- We have a Flexible First work environment! Our North America team members use our offices for collaboration, community and team-building, with members asked to sometimes come into an office and/or travel depending on job responsibilities. Other times, our teams work from home or a similar environment.
- Medical, vision & dental plans
- PPO & high-deductible options
- Health Savings Account & Flexible Spending Accounts Options:
- Health Care FSA
- Dental & Vision FSA
- Dependent Care FSA
- Commuter FSA
- Life, AD&D (Basic & Supplemental), and Disability
- 401(k) Retirement Savings Plain & Employer Match
- Supplemental Plans - Pet insurance, Hospital Indemnity, and Accident Insurance
- Parental Leave & Adoption Assistance
- Employee Assistance Program (EAP)
- Education & Legal Assistance
- Additional programs - Tuition Reimbursement, Employee Referral, Internal Recognition, and Wellness
- Commuter Benefits (Denver)