Senior Site Reliability Engineer
MLabs
5 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
SeniorJob location
Remote
Tech stack
Amazon Web Services (AWS)
Cloud Computing
Computer Networks
Continuous Integration
Disaster Recovery
Distributed Systems
Reliability Engineering
Blockchain
Systems Architecture
Mttr
Containerization
Kubernetes
Solidity
Terraform
Job description
- Systems Architecture: Design and operate highly available, multi-region distributed systems with rigorous recovery strategies (RTO/RPO).
- Infrastructure as Code: Own large-scale IaC using Terraform, developing reusable modules and multi-account patterns with policy guardrails.
- Kubernetes Orchestration: Scale production environments (EKS, GKE, or AKS) utilizing GitOps (ArgoCD), Helm, and strict network policies.
- CI/CD Leadership: Build secure pipelines supporting blue/green and canary deployments, artifact signing (SBOM), and automated rollback strategies.
- SRE Advocacy: Define and improve SLOs, error budgets, and observability metrics to drive measurable reductions in MTTR.
- Collaboration: Partner with the Head of SRE and VP of Engineering to translate complex business requirements into reliable, secure platform services.
Requirements
Do you have experience in Terraform?, * 7+ years of experience in SRE, Platform Engineering, or Infrastructure Engineering operating production distributed systems.
- Multi-Cloud Mastery: Deep expertise in AWS or GCP, with experience running multi-region production environments and disaster recovery testing.
- Containerization: Hands-on experience with Kubernetes at scale, including GitOps workflows and production-grade security controls.
- Security Mindset: Strong background in Zero Trust principles, secrets management (Vault), and compliance frameworks (SOC 2, HIPAA, or NIST).
- Tooling: Extensive experience with Terraform-first infrastructure in large-scale, real-world environments.
Nice to Have:
- Experience with distributed ledger technology (DLT) or blockchain systems, particularly private/consortium deployments.
- Familiarity with EVM-based systems and smart contract tooling (Solidity, Hardhat).
- Experience operating active-active, globally distributed architectures.
- Background in supporting financial services or other highly regulated industries.
Benefits & conditions
- Incentive Package: Competitive base salary with Performance Bonuses.
- Ownership: Equity and Token participation.
- Future-Proofing: 401k and comprehensive health insurance (for US-based employees).
- Innovation: The opportunity to build a "greenfield" platform from scratch within a stable, venture-backed organization.
- Impact: Work on infrastructure that powers the world's leading organizations across multiple sectors.