Data Center Infrastructure Engineer (TS)
Role details
Job location
Tech stack
Job description
We are seeking an experienced Data Center Infrastructure Engineer to design, implement, monitor, and manage enterprise-grade data center infrastructure. This engineer will ensure the reliability, scalability, and efficiency of all mission-critical operations across servers, storage systems, networking architecture, and power/cooling systems. The role blends infrastructure engineering, automation, modernization planning, and operational sustainment. The selected candidate will support data center operations at Joint Base Anacostia-Bolling and must maintain an active TS/SCI clearance., * Design, engineer, and implement infrastructure solutions for data center environments, including compute platforms, storage systems, networking, power, and cooling.
- Monitor and maintain data center systems to ensure high availability, performance, and operational continuity.
- Manage containerized and cloud-native infrastructure using technologies such as Kubernetes, Rancher, Helm, and Docker.
- Deploy and manage distributed storage technologies including Rook, Ceph, MinIO, S3-compatible systems, and PortWorx.
- Implement infrastructure-as-code (IaC) using Terraform, Ansible, and Desired State Configuration (DSC).
- Engineer and support load balancing, service networking, and cluster-level connectivity.
- Develop automation scripts and operational tooling using Python, PowerShell, and shell scripting.
- Support observability and monitoring systems to ensure proactive detection and incident response for infrastructure components.
- Participate in major incident response efforts and post-incident reviews, implementing corrective and preventive measures.
- Oversee data center physical operations including rack/stack tasks, cabling, hardware provisioning, and asset management.
- Collaborate with engineering, cybersecurity, cloud, and application teams to support enterprise modernization efforts.
Required Certifications (at least two):
- Security +
- Cloud Associate (AWS Solutions Architect Associate / Azure AZ-104 / Google Associate Cloud Engineer)
- Terraform Associate
- Cloud Professional/Architect (AWS Solutions Architect Professional / Azure Architect Expert)
- CKA (Certified Kubernetes Administrator)
Requirements
- AWS DevOps Engineer or Azure AZ-400
- CCSP
- Advanced observability tool certifications (Datadog, New Relic, Dynatrace, etc.)
- Incident management or SRE-focused training
Required Technical Knowledge:
Strong understanding of data center and cloud-native technologies including:
- Kubernetes, Rancher, Helm, Docker
- Cilium, Rook, Ceph, MinIO, S3, PortWorx
- Load balancing, ingress, and distributed networking
- Ansible, Desired State Configuration, Terraform
- Python, PowerShell, scripting/automation
- Server/storage platforms, virtualization, and networking fundamentals
- Power/cooling considerations for data center operations
- Monitoring and observability stacks (metrics, logs, tracing)
- Incident response and SLO/SLA-driven operations
Preferred Experience:
- Managing and modernizing enterprise or DoD data center infrastructure.
- Operating large-scale Kubernetes clusters across on-prem and hybrid cloud environments.
- Designing and operating high-availability server and storage platforms.
- Implementing enterprise monitoring frameworks using Prometheus, Grafana, ELK, OpenTelemetry, or similar.
- Supporting data center readiness, facility coordination, and hardware lifecycle management.
Benefits & conditions
We offer competitive compensation and an extraordinary benefits package including health, dental and vision insurance, 401K with company matching, flexible spending accounts, paid holidays, three weeks paid time off, and more.