Software Developer - Cloud SRE DevOps
Estuate Inc.
Palo Alto, United States of America
yesterday
Role details
Contract type
Temporary contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
SeniorJob location
Palo Alto, United States of America
Tech stack
API
Artificial Intelligence
Amazon Web Services (AWS)
Cloud Engineering
Code Review
Computer Programming
Programming Tools
Distributed Systems
Identity and Access Management
Python
Prometheus
Service Design
Software Engineering
Datadog
Google Cloud Platform
Istio
Grafana
Multi-Agent Systems
Multi-Cloud
Backend
Kubernetes
Production Code
Api Gateway
Terraform
Serverless Computing
Job description
We re hiring a Sr. Software Developer to design, build, and operate the platform and backend systems that power us at scale. You ll own core infrastructure from Kubernetes and cloud-native services to APIs and developer tooling and work closely with product, AI, and security teams. This is a hands-on, high-ownership role where you write production code, design systems, lead code reviews, and make the engineering organization more effective.
Requirements
- 6+ years of software engineering with deep backend and infrastructure focus.
- Strong programming skills in Python and/or Go you ship production code, not just scripts and configs.
- Deep, hands-on Kubernetes experience building and operating clusters, not just deploying to them.
- Proven experience designing and operating distributed systems in production.
- Cloud-native fluency across AWS and/or Google Cloud Platform compute, storage, IAM, networking, and managed services.
- Experience with infrastructure-as-code (Terraform or similar) and CI/CD pipelines.
- Familiarity with applied AI tooling and patterns agentic AI tools (Claude, LiteLLM), AI gateways, agent frameworks and being able to build backend services that integrate with them.
- Strong system design and architectural judgment.
- Clear communicator who partners well across product, security, and AI teams.
Nice to Have
- Observability stacks Prometheus, Grafana, Datadog, OpenTelemetry.
- Multi-cloud or hybrid infrastructure experience (AWS, Google Cloud Platform, on-prem).
- Familiarity with API gateways, AI gateways, and policy/authorization frameworks (ABAC, OPA).
- Service mesh or platform-as-a-service design experience.
- Track record of improving engineering productivity at scale.