DevOps Engineer
Role details
Job location
Tech stack
Job description
At some point in the future, every business will answer their phone with voice AI. We are making sure that when they build that AI agent, the experience of doing so doesn't suck.
We're growing incredibly fast and need someone who thrives in a dynamic, high-pressure environment to work in our engineering. This role is critical to the success of our company. We're looking for someone who is roll up their sleeves and dig in, and double down when things get tough.
We're looking for someone who is highly technical, driven, and fun to work with. You'll be working directly with our engineers, customers support team, and product team.
NOTE: This is an engineering position. You will be writing code, a lot of it and it will be a lot of work… if you're looking for a cushy corporate job where nothing get done, I recommend checking out some of our competitors. If you're looking to be a part of a winning team and to make a real difference read on. This role is in office 5 days a week in San Francisco.
Your day to day:
- Design, build, and operate highly reliable cloud infrastructure that powers real-time voice AI systems with extremely low latency and high availability.
- Own Kubernetes clusters end-to-end: provisioning, scaling, upgrades, networking, and debugging production incidents under real customer load.
- Build, maintain, and evolve infrastructure as code using tools like Terraform, Pulumi, or CloudFormation to ensure repeatable, auditable, and secure environments across staging and production.
- Create and operate CI/CD pipelines that enable fast, safe iteration across multiple microservices and teams.
- Design and maintain observability systems (metrics, logs, traces, alerting) to detect failures early and rapidly diagnose production issues.
- Partner with backend engineers to translate application requirements into scalable, secure infrastructure and clean deployment workflows.
- Harden systems through strong security practices including IAM, secrets management, network isolation, and least-privilege access controls.
- Optimize cloud performance and costs while maintaining reliability, developer velocity, and customer experience.
- Implement and operate GitOps-driven deployment workflows, using Git as the source of truth for infrastructure and application state, enabling safe, auditable, and automated rollouts.
- Lead incident response: investigate outages, coordinate fixes, write postmortems, and drive systemic reliability improvements.
- Continuously improve resilience through load testing, chaos testing, capacity planning, and proactive infrastructure upgrades., * Backend: Python, microservices, async programming
- Cloud & Infrastructure: AWS, GCP, Kubernetes, Redis, ArgoCD, GitOps
- Databases: Firebase, Supabase (PostgreSQL)
- Frontend: Next.js
- Observability & Monitoring: Datadog, logging, metrics, tracing
- Telephony & Voice AI: SIP, voice APIs, real-time call handling
- Other tools & practices: CI/CD, automated testing, resilient architecture
Requirements
Do you have experience in Python?, * 5+ years as a DevOps engineer
- Experience writing async web apps using fast api in Python
- Builder of APIs, Clouds, CI/CD pipelines
- Experience with IaC, AWS, Database Management at scale
- Understanding of good architecture, security practices
- Strong technical and communication skills
- Extensive experience with AWS & Kubernetes