Senior DevOps Engineer
Role details
Job location
Tech stack
Job description
As a SRE Engineer at Sonos, you will be jointly responsible with the cloud application team for the reliability, observability and security of all cloud services we have in production for serving millions of customers using Sonos Voice Control and other AI-powered product experience. What You'll Do
- Support our Developer's experience via automated pipelines that includes timely feedback and a seamless path to release-level quality
- Maintain and administer Sonos Voice Control team's cloud infrastructure
- Monitor, debug and improve system performance and reliability
- Research, implement and democratize security best practices
- Troubleshoot and resolve issues in development, test, and production environments
- Monitor our cloud spending and implement proactive recommendations in order to reduce the cost of ownership of our infrastructure
- Document processes, configurations, and solutions for internal knowledge sharing
- Mentor and guide junior DevOps engineers
- Apply industry best practices to continuously improve our engineering processes, and cloud infrastructure components
- This role includes compensated on-call duty
Requirements
Do you have experience in Terraform?, * 7+ years of experience in a DevOps or SRE role
- Excellent English written and verbal communication skills.
- Great analytical skills, ability to lead technical discussion and adapt to audiences not proficient in DevOps and SRE.
- Great sense of service, reactivity to support requests from internal customers.
- Experience in scripting languages like Python, Bash, or Go for automation and tooling
- Cloud Platforms expert-level proficiency in at least one major cloud provider, including networking, security, compute, storage, and managed services.
- Deep knowledge of Kubernetes principles, configuration and administration.
- Infrastructure as Code: Deep experience with tools like Terraform, Terragrunt.
- CI/CD: Ability to design, implement, and maintain robust, secure, and scalable CI/CD pipelines.
- Monitoring, Logging, and Alerting: Expert-level experience with observability stacks (Prometheus/Grafana, Datadog). Ability to define and implement effective Service Level Objectives and Indicators.
- System and Application Reliability: Practical experience applying SRE principles like gamedays, chaos testing, Root Cause Analysis.
- Database Management: Familiarity with the operational aspects of running and scaling common databases.
- Security Best Practices: Knowledge of security principles in the cloud, including network security, identity and access management, and secret management.
Preferred Qualifications:
- Knowledge of Amazon Web Services (AWS) services beyond the basics
- Expertise with Kubernetes management toolings such ArgoCD, Karpenter,...
- Experience with MLOps principles and tools (Kserve, AI Gateway, Kubeflow, MLflow, …) for managing the machine learning lifecycle.
- Experience with monitoring tools like Datadog or Grafana
- Practical experience with MongoDB
- Practical experience with Github actions
Benefits & conditions
Research shows that candidates from underrepresented backgrounds often don't apply for roles if they don't meet all the criteria. If you don't have 100% of the skills listed, we strongly encourage you to apply if interested. Visa Sponsorship: Sonos is unable to sponsor or take over sponsorship of an employment visa for this role at this time. We ask that applicants be authorized to work for any French employer, both now and in the future. #LI-hybrid Your profile will be reviewed and you'll hear from us once we have an update. At Sonos we take the time to hire right and appreciate your patience.