Senior Site Reliability Engineer

Manychat
San Francisco, United States of America
8 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Barcelona, Spain

Tech stack

PHP
Agile Methodologies
Artificial Intelligence
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Ubuntu (Operating System)
Cloud Computing
Cloud Computing Security
Continuous Integration
Software Debugging
Linux
Github
Identity and Access Management
Python
PostgreSQL
Nginx
Reliability Engineering
Ansible
Prometheus
Reverse Proxy
Grafana
Amazon Web Services (AWS)
Kubernetes
Cloudwatch
Terraform

Job description

We're looking for a Cloud Infrastructure Engineer who thrives at the crossroads of classic Linux and AWS infrastructure and modern Site Reliability Engineering. This is a high-impact, hybrid role designed for someone who can manage cloud resources, harden Kubernetes clusters, and shape a more reliable and developer-friendly platform. You'll take over key responsibilities from our current Infra Lead who is transitioning to a software-focused role, giving you immediate ownership and space to shine. What You'll Do

  • Maintain and harden AWS infrastructure (EC2, ALB/NLB, WAF, IAM, CloudWatch)
  • Operate and evolve our EKS clusters powering Python-based AI services
  • Migrate existing services to Kubernetes using Terraform and Helm
  • Codify infrastructure with Terraform and manage host-level automation via Ansible
  • Build and improve CI/CD pipelines with GitHub Actions
  • Own observability efforts: Prometheus, Grafana, alerting, and on-call readiness
  • Support OS-level patching, certs, WAF rules, and general infra hygiene
  • Partner with engineers to guide best practices and drive platform reliability
  • Create clean, maintainable infrastructure documentation and playbooks
  • Occasionally support rare off-hours incidents (don't worry, really rare), * Professional development budget for conference tickets, online courses, and other relevant resources to help you grow.
  • Flexible benefits package to tailor perks that matters most for you.
  • Hybrid work and generous leave options to prioritize your work-life balance.
  • In-office perks, including free meals and snacks.
  • Company-funded sport activities, annual offsites, and team-building events.

Manychat is an Equal Opportunity Employer. We're committed to building a diverse and inclusive team. We do not discriminate against qualified employees or applicants because of race, color, religion, gender identity, sex, sexual orientation, pregnancy, national origin, ancestry, citizenship, age, marital status, physical disability, or any other characteristic protected by local law or ordinance.

If you have individual needs that may require an accommodation during the interview process, please indicate this in your application. We will do our best to provide assistance throughout your interview process to ensure you're set up for success. #J-18808-Ljbffr

Requirements

  • 5+ years of experience managing Linux in production (Ubuntu, Amazon Linux)
  • Strong experience with Kubernetes (ideally EKS), Helm, and Terraform
  • Comfort with running and debugging Python workloads in containers
  • Solid understanding of networking, IAM, and cloud security best practices
  • Hands-on Nginx experience (Ingress and reverse proxy setups)
  • Excellent communication skills; you can explain complex infra to devs clearly

Nice To Have Skills

  • Strong Ansible skills beyond the basics
  • PostgreSQL or Amazon RDS tuning and operations experience
  • Deep understanding of observability tools (Prometheus, Grafana, Loki, etc.)
  • Familiarity with PHP production environments
  • Experience with TDD, CI/CD best practices, and agile development
  • Any previous SRE-like exposure such as building resilience, automation, or incident tooling

About the company

Manychat is a leading Chat Marketing platform that helps businesses engage with their customers on Instagram, Facebook Messenger, WhatsApp, and Telegram.

Manychat is a Meta Official Business Partner, backed by top investors, including Bessemer Venture Partners.

More than one million companies choose Manychat to power their business-to-customer conversations, from small businesses to leading global brands. No matter the use case — generating leads, increasing engagement, providing 24/7 customer support, accepting payments, and beyond — Manychat helps businesses grow their ROI and revenue.

Manychat was founded in 2015 and currently has 200+ team members in five global offices — Barcelona, Amsterdam, Austin, Sao Paolo, Yerevan.

Manyсhat is a rich web application built with TypeScript, React Hooks and Redux Toolkit on the frontend side and PHP 8.1 and Python on the backend side. 

Manychat engineers prioritize both technology and user experience to deliver value to customers, striving to provide simple and reliable solutions.

Apply for this position