Senior Devops / Platform Engineer
Role details
Job location
Tech stack
Job description
We build and operate a fully-automated Speech Analytics SaaS platform that runs on Kubernetes across AWS and GCP. Our infrastructure processes roughly 160,000 hours of audio per month with a 99%+ uptime SLA, serving enterprise customers who rely on mission-critical analytics. About the Platform
The platform is built on modern, cloud-native technologies: Kubernetes, the Argo ecosystem, MongoDB, ElasticSearch, and 100% Terraform-driven infrastructure. It auto-scales from dozens to more than 1,000 Kubernetes nodes based on demand. In addition to the core SaaS product, we deliver managed solutions (Autopilot and Copilot) and create AI-based services packaged as containerized, Terraform-ready modules for seamless integration into customer cloud environments (AWS, GCP, Azure)., * Design, build, and maintain multi-cloud infrastructure on AWS and GCP.
- Operate and optimize Kubernetes clusters (GKE, EKS) at scale (up to ~1 000 nodes).
- Lead infrastructure modernization and cloud-migration initiatives.
- Implement cost-optimization strategies across cloud providers.
- Manage Argo Workflows and ArgoCD for GitOps-based deployments.
- Build and maintain end-to-end Infrastructure as Code with Terraform (modular, reusable, multi-cloud).
- Develop internal automation tooling and scripts in Python, Bash, and Go.
- Implement zero-downtime deployment strategies.
- Deploy and manage production MongoDB, ElasticSearch, and other core services.
- Package and deploy workloads using Helm, Docker, and GitOps pipelines.
- Ensure 99%+ uptime through robust monitoring, incident response, and observability.
- Support delivery of AI containerized solutions ready for customer environments.
- Build comprehensive observability across all platform components.
- Implement security best practices and compliance requirements.
- Drive post-incident reviews and continuous improvement.
Requirements
Senior DevOps / Platform Engineer - a technical decision-maker who will help design, automate, and operate our cloud-native platform across AWS and GCP. You'll manage Kubernetes at scale, build highly automated CI/CD workflows, and collaborate with engineering teams to ensure reliable delivery of SaaS features and AI-driven products., * 5+ years of experience as a DevOps, SRE, or Platform Engineer in production environments.
- Strong hands-on Kubernetes experience (GKE and/or EKS) managing clusters at scale.
- Expert-level Terraform and Infrastructure as Code workflows.
- Multi-cloud experience with both AWS and GCP.
- Proven experience with CI/CD, GitOps, ArgoCD, and Argo Workflows.
- Solid Docker and Helm expertise for containerized deployments.
- Strong scripting/programming skills in Python and Bash.
- Experience running production-grade, scalable, and secure cloud systems.
- Comfortable with incident response and on-call responsibilities.
Nice to Have
- Programming for tooling development (Python, Bash, Go).
- Experience with observability stacks (Prometheus, Grafana, Elastic, OpenTelemetry).
- Hands-on AI/ML workloads in containerized environments.
- MongoDB and ElasticSearch operations at scale.
- Experience with cost-optimization strategies in cloud environments.
- Contributions to open-source DevOps/platform projects.
- AWS/GCP certifications.
Benefits & conditions
- Competitive salary.
- Fully remote work with flexible hours.
- 23 days of vacation plus Spanish public holidays.
- Growth & impact - real ownership of platform decisions.
- Direct collaboration with leadership on technical strategy.
- Continuous learning with modern cloud-native, DevOps, and AI tooling.
- Mentor and grow the team as we scale.
- Visible impact on products used by enterprise customers.
- Engineering-driven culture that values automation and best practices.
- Async-first communication and respect for work-life balance.
- Blameless post-mortems and learning from incidents.