Senior Site Reliability Engineer
Role details
Job location
Tech stack
Job description
As a Senior Site Reliability Engineer, you can anticipate opportunities to work on our hybrid systems across the globe. You will be responsible for installing, configuring, and monitoring new systems within a network of global data undefined. Additionally, you will patch and maintain thousands of physical and cloud systems worldwide. To streamline operations, you will develop automation to reduce repetitive tasks and analyze and address performance bottlenecks. Furthermore, you will update and troubleshoot user access permissions, resolve network connectivity issues, and maintain system firewalls.
About the team
Zoom's SRE team is committed to delivering customer happiness, improving business efficiency, and promoting agility through innovation, data-driven insights, and automation. Our impact is reflected in smooth user experiences, optimized processes, and support for Zoom's expansion in the realm of communication and collaboration.
Responsibilities
Being a technical leader to the team and within the greater organization. Monitor and analyze system performance metrics to identify and address potential issues proactively and automatically. Ensure code quality through excellent documentation and thorough testing pipelines. Collaborate with other teams to troubleshoot system performance issues and promote SRE best practices.Optimize system deployment, configuration, performance, and uptime.
Requirements
- Demonstrate 5-10 years of hands-on experience in Site Reliability Engineering, DevOps, or Production Operations roles.
- Have a deep understanding of Linux fundamentals with a focus on Ubuntu systems. CPU/Memory/IO/Network troubleshooting and optimization.
- Have experience with bare metal infrastructure and datacenter operations, including proficiency in operating system deployment tools (Foreman, Cobbler, MAAS etc.)
- Have experience with core network services like DNS, NTP, DHCP, and HTTP.
- Demonstrate the ability to automate repetitive tasks to eliminate manual toil.
- Demonstrate skills with Ansible, Terraform, Kubernetes and Python. Unit and integration testing.
- Implement or improve existing CI Pipelines: Jenkins, GitLab CI, etc.
- Have experience with observability and monitoring tools. Tune alerts to reduce noise and improve response times. Develop metrics and dashboards.
- Clear written and verbal communication skills.
Benefits & conditions
$98 900,00
Maximum: $228 700,00
In addition to the base salary and/or OTE listed Zoom has a Total Direct Compensation philosophy that takes into consideration; base salary, bonus and equity value.
Note: Starting pay will be based on a number of factors and commensurate with qualifications & experience.
We also have a location based compensation structure; there may be a different range for candidates in this and other locations
At Zoom, we offer a window of at least 5 days for you to apply because we believe in giving you every opportunity. Below is the potential closing date, just in case you want to mark it on your calendar. We look forward to receiving your application!, As part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways.