Senior DevOps Engineer
Role details
Job location
Tech stack
Job description
We are hiring a Senior DevOps Engineer to drive the reliability, scalability, and operational health of our realtime communications platform. You will be responsible for the day-to-day excellence of services supporting audio/video conferencing, recording, and live-streaming. This role requires a hands-on expert who can bridge the gap between complex infrastructure engineering and seamless product delivery within a global team environment., * Ensuring reliability and operations by owning the SLO/SLI framework for specific real-time services while tracking latency, availability, jitter, and packet loss metrics.
- Acting as an on-call responder during critical outages, guiding technical resolution, and participating in a global follow-the-sun rotation.
- Facilitating blameless postmortems and implementing technical action items to prevent recurring issues.
- Promoting preventive engineering by integrating automated guardrails into the development lifecycle to uphold architectural best practices before production deployment.
- Supervising the release management process for real-time services, executing high-frequency deployments with minimal risk through versioning and complex rollout orchestration across environments. Developing scalable infrastructure using Terraform and adhering to GitOps principles to ensure consistent configurations.
- Optimizing CI/CD pipelines with tools like GitHub Actions and Jenkins, employing canary releases and blue/green strategies for safe deployments.
- Collaborating with software engineering teams to provide operational insights during the design phase, advocating for reliability-focused practices.
- Working alongside global engineering hubs to offer technical context and maintain comprehensive documentation, including runbooks and KBAs.
Requirements
- Possess 6+ years of experience in DevOps, SRE, or infrastructure engineering, demonstrating success in supporting large-scale, latency-sensitive production systems.
- Gain expertise in supporting media-focused platforms such as video conferencing, streaming, gaming, or high-frequency trading environments.
- Display knowledge of real-time protocols, including WebRTC, RTP/RTCP, and SFU/MCU topologies, with practical application in production settings.
- Develop advanced skills in cloud platforms like AWS, GCP, or Azure, and manage Kubernetes environments using tools like Helm and ArgoCD.
- Use deployment orchestration tools and progressive delivery methods, including Canary, Blue/Green deployments, and Feature Flags, to enhance release processes.
- Understand networking principles, including DNS, Load Balancing, and CDN architecture, to ensure system reliability and performance.
- Write automation scripts using Python, Go, or Bash to create tools and improve incident response efficiency.
- Occasional weekend work may be required
- Ability to work across the globe or multiple time zones
Benefits & conditions
$98 900,00
Maximum: $228 700,00
In addition to the base salary and/or OTE listed Zoom has a Total Direct Compensation philosophy that takes into consideration; base salary, bonus and equity value.
Note: Starting pay will be based on a number of factors and commensurate with qualifications & experience.
We also have a location based compensation structure; there may be a different range for candidates in this and other locations
At Zoom, we offer a window of at least 5 days for you to apply because we believe in giving you every opportunity. Below is the potential closing date, just in case you want to mark it on your calendar. We look forward to receiving your application!
Anticipated Position Close Date, As part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways. Click Learn for more information.