Backend Engineer- INTL- LATAM
Role details
Job location
Tech stack
Job description
Insight Global is seeking a for a Backend Engineer with Kubernetes infrastructure knowledge to own the design, automation, and reliability of stateful workloads running on Azure AKS. This role focuses on building highly available, self-healing systems for databases and storage-backed services, eliminating manual intervention and "snowflake" configurations while meeting 99.99% uptime targets.
What You'll Do
-Own the full lifecycle of Kubernetes StatefulSets, including provisioning, scaling, upgrades, and graceful failover
-Design and implement high-availability architectures using pod anti-affinity, topology spread constraints, and zonal resilience
-Optimize and tune Azure persistent storage (Premium/Ultra Disks, Azure NetApp Files) via CSI drivers
-Build and automate disaster recovery workflows, including snapshot, restore, and rapid state reconciliation
-Provision and manage stateful infrastructure using Terraform and infrastructure-as-code best practices
-Create observability and alerting for PV utilization, disk pressure, replication lag, and storage health
-Ensure cluster upgrades and node rotations happen with zero manual data migration
Requirements
7+ Years of experience as a software engineer
-2 years of experience hands-on with Kubernetes StatefulSets, PVCs, and CSI
-Strong experience operating stateful services in production (e.g., Postgres, ClickHouse, Elasticsearch)
-Cloud knowledge- Azure preferred
-Experience automating infrastructure using Terraform
-Strong scripting or development skills in Go, Python, or Bash -Experience building GitOps workflows for complex, ordered deployments
-Background in custom controllers, operators, or lifecycle hooks (PreStop/PostStart)
-Experience with disaster recovery testing, RTO/RPO optimization, and snapshot automation
-Strong observability mindset (dashboards, alerts, SLOs for stateful systems)
-Prior ownership of large-scale distributed systems on Kubernetes