Senior DevOps Engineer
Role details
Job location
Tech stack
Job description
We are seeking a Senior DevOps Engineer to lead the infrastructure strategy as we enter the final stage of our platform transition. With core development nearing completion, your primary objective is to architect and enforce the resilient, high-availability infrastructure required for launch and scaling. You will take full ownership of our deployment pipelines and scaling capabilities, ensuring the stability while orchestrating a seamless migration of our customers to the new architecture., * Own Kubernetes: Take ownership of the entire lifecycle of our production Kubernetes environments, ensuring they are secure, monitored, and highly available.
-
Own AWS: Though our platform is entirely cloud agnostic and will run in multiple cloud providers, the large majority of our production instances will be hosted in AWS, and as such, you will take ownership of managing our infrastructure in AWS.
-
Architect & Build: Help design and build our platform, choose tools and technologies that best suit our platform to the architectural needs of our customers.
-
Mentoring: Mentoring and educating two other DevOps engineers who will work alongside you.
-
Collaborate with Customers: Support the Customer Success team by helping hospital IT departments with their most technical questions.
-
Strengthen Infrastructure: Design and maintain core networking infrastructure, including VPCs, firewalls, and VPNs.
-
Enable Engineering: Collaborate closely with backend, frontend, and machine learning engineers to support their infrastructure needs and empower them to work efficiently., Argo CD, Gitlab, Terraform, Atlantis, CloudFormation
-
Networking Cilium, Gateway API, ALB Loadbalancer, VPC Peering/ AWS Direct Connect
-
Day-2-operations Argo Rollouts, Argo CD, cert-manager, External Secrets, external-dns, Karpenter
-
Observability OpenTelemetry LGTM (Loki, Grafana, Tempo, Mimir), Alloy
-
Cloud Providers AWS, GCP, AliCloud, Hetzner
-
Messaging NATS, Pub/Sub, protobuf, gRPC
-
Storage Postgres, MongoDB, MinIO, S3, GCS
-
Backend Go, Python, Rust.
Requirements
Do you have experience in gRPC?, We're looking for a seasoned engineer with a strong background in building and scaling resilient cloud infrastructure., * Kubernetes Mastery: Deep, hands-on expertise in deploying, managing, and troubleshooting containerized applications on Kubernetes in a production environment.
- AWS Expertise: 7+ years of experience designing and implementing secure, scalable infrastructure in cloud providers, of which a considerable amount is in AWS.
- Infrastructure as Code: Expert-level proficiency in IaC and a mindset to automate everything.
- Strong Networking Foundation: A solid understanding of cloud networking principles (VPCs, DNS, load balancing, firewalls, VPNs).
- CI/CD Skills: A strong background in building and maintaining CI/CD pipelines (e.g., using ArgoCD, Jenkins, GitLab CI, or similar).
- Programming Skills: Though writing code won't be a daily responsibility, you need the ability to read and understand our applications written in Go, Python (and Rust).
- Tech Stack Familiarity: Knowing each and every tool in our stack is not required, but there should be a good overlap between our stack and your experience.
Nice-to-Haves:
- Medical Data Experience: Familiarity with medical data formats like DICOM is a plus.
- Working in a highly regulated environment like the medical or financial sector., * Quality-Minded Builder: You care about code quality and maintainability as much as speed. You write infrastructure that is clean, well-organized, and designed to be understood by future maintainers.
- Excellent Communicator: You can lead technical discussions with both internal teams and external customers with clarity and confidence.
- Proactive & Independent: You are a natural problem-solver who takes initiative and finds solutions.
- Strategic Thinker: You can align your technical priorities with overarching company goals.
- Flexible & Open-Minded: You can change your mind when presented with new information and thrive in an environment of constructive technical debate.