Senior Internal Infrastructure Engineer
Role details
Job location
Tech stack
Job description
Our client is seeking a Senior Internal Infrastructure Engineer to lead the design and operation of secure, multi-environment platform infrastructure across Azure, Azure Government, and AWS. You will work at the intersection of platform engineering, SRE, security, and distributed systems, enabling engineering teams to ship faster while maintaining strict reliability and security standards.
This role spans GitOps, infrastructure as code, Kubernetes, observability, edge networking, and support for IoT, streaming, and ML workloads. They are looking for someone exceptional, someone who can own systems end-to-end and raise the bar across the entire platform.
What You'll Work On
- Architecting and operating secure infrastructure across Azure, Azure Gov, and AWS
- Building GitOps pipelines and reusable infrastructure modules (OpenTofu / Terraform)
- Running and scaling Kubernetes platforms (Helm, multi-cluster environments)
- Designing observability systems (metrics, logs, traces, alerting with Grafana)
- Supporting IoT, streaming, and real-time pipelines (AWS IoT Core, Kinesis)
- Operating edge networking and distributed sensor deployments
- Enabling secure ML / AI workloads across cloud and edge environments
- Strengthening platform security (IAM, secrets, encryption, policy, zero trust)
How They Think About Engineering
- Build systems that produce work, not one-off fixes
- Automate everything that can be automated
- Use AI as a force multiplier across development and operations
- Create guardrails that allow engineers to move fast safely
- Build platforms that other engineers love to use
Requirements
- 7+ years in infrastructure, platform, or SRE roles with ownership of production systems
- Deep experience in Azure, ideally in regulated or high-security environments (Azure Gov)
- Strong AWS experience, especially with IoT Core, Kinesis, and streaming architectures
- Expert-level Kubernetes experience (including Helm)
- Strong GitOps background with a track record of improving delivery systems
- Deep experience with infrastructure as code (Terraform / OpenTofu)
- Strong observability experience (Grafana, modern telemetry stacks)
- Experience with edge systems, distributed deployments, or remote telemetry pipelines
- Experience supporting ML / AI workloads in production environments
- Strong security depth across IAM, networking, secrets, and policy enforcement
- Experience with multi-cluster or hybrid cloud and edge Kubernetes environments
- Must be eligible for a U.S. security clearance
- You should be someone who sees systems clearly, identifies weaknesses quickly, and fixes them permanently.
Nice to Have
- Experience with FedRAMP, NIST, CMMC, IL4 / IL5, or similar frameworks
- Experience with service meshes, policy engines (OPA / Gatekeeper), or supply chain security
What Success Looks Like
- Faster delivery with fewer failures
- Secure, auditable infrastructure changes through GitOps
- Reliable operation of distributed systems across cloud and edge
- Strong, scalable foundations for IoT, streaming, and ML workloads