Director, DevOps & Cloud Infrastructure
RxBenefits Inc
Salt Lake City, United States of America
2 months ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
SeniorJob location
Salt Lake City, United States of America
Tech stack
Artificial Intelligence
Amazon Web Services (AWS)
Azure
Backup Devices
Bash
Cloud Computing
Cloud Engineering
Continuous Integration
DevOps
Disaster Recovery
Github
Integrated Development Environments
Python
Octopus Deploy
Scrum
Reliability Engineering
Software Vulnerability Management
Datadog
Data Logging
Pulumi
Scripting (Bash/Python/Go/Ruby)
Grafana
Mttr
Cloudformation
Gitlab-ci
Kubernetes
Deployment Automation
Terraform
New Relic (SaaS)
Devsecops
Docker
Jenkins
Go
Job description
- Own uptime, availability, scalability, and performance of all production systems.
- Define and manage SLOs, SLAs, error budgets, and incident response practices.
- Lead post-incident reviews and drive systemic reliability improvements.
- Implement observability standards (logging, metrics, tracing).
Cloud & Infrastructure
- Own cloud infrastructure strategy (AWS, Azure, hybrid).
- Lead infrastructure-as-code (Terraform, CloudFormation, ARM, etc.).
- Working with GRC, Ensure disaster recovery, backup, and business continuity plans are tested and compliant.
- Monitor and optimize cloud spend through cost governance and FinOps practices.
DevOps, CI/CD & Engineering Enablement
- Own CI/CD pipelines, deployment automation, and release strategies.
- Enable safe, frequent releases (blue/green, canary, feature flags).
- Standardize DevOps tooling and platform capabilities across teams.
- Partner with Engineering to remove friction and increase delivery velocity.
Monitoring, Alerting, and Event Management
- Set plan and manage execution of dashboards, availability management and reporting.
- Align with Product Engineering teams to define NFRs related to definition, instrumentation and logging
Security & Compliance
- Embed security into DevOps practices (DevSecOps).
- Partner with Security, Legal, and Compliance on audits and certifications (SOC 2, HIPAA, HITRUST, PCI, etc.).
- Ensure secrets management, access controls, and vulnerability remediation.
Leadership & Strategy
- Build and lead DevOps, SRE, and Cloud Engineering teams.
- Define the DevOps operating model (centralized, embedded, hybrid).
- Establish KPIs for reliability, deployment frequency, MTTR, and cost efficiency.
- Partner closely with Engineering, Product, IT, Security, and Data teams.
- Contribute to long-term technology and architecture roadmap.
Requirements
- 10+ years of hands-on experience in DevOps, SRE, cloud engineering, and infrastructure.
- 5+ years as a Director in leadership/people management role (leading managers and/or large teams)
- Deep expertise in modern tools and practices:
- Cloud platforms (AWS, Azure).
- CI/CD (GitHub Actions, GitLab CI, Jenkins, ArgoCD).
- Containers & orchestration (Kubernetes, Docker, Helm).
- Infrastructure as Code (Terraform, Pulumi, Crossplane).
- Monitoring/Observability (DataDog, Sumo, Grafana, ELK, Datadog, New Relic).
- Scripting/automation (Python, Go, Bash).
- Strong understanding of Agile/Scrum/SAFe methodologies.
- Proven track record of building high-performance teams and driving cultural change.
- Excellent communication, strategic thinking, and cross-functional collaboration skills.
- Experience with large-scale, high-availability environments.
Preferred Skills/Experience:
- Relevant certifications (e.g., AWS Certified DevOps Engineer, Certified Kubernetes Administrator, Terraform Associate, SRE-related).
- Experience in regulated industries (healthcare) or with ML/AI infrastructure.
- Background in cost management and cloud financial operations (FinOps).