Data Center Infrastructure Engineer (TS)

Koniag Services, Inc.
Washington, United States of America
13 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Washington, United States of America

Tech stack

Amazon Web Services (AWS)
Amazon Web Services (AWS)
Azure
Cloud Computing
Cloud Engineering
Computer Security
Data Centers
Distributed Data Store
Monitoring of Systems
Python
Network Architecture
Networking Basics
Powershell
Ansible
Prometheus
Shell Script
Virtualization Technology
Ceph
Datadog
Scripting (Bash/Python/Go/Ruby)
Load Balancing
Cloud Platform System
System Availability
Grafana
HybridCloud
Containerization
Kubernetes
Rancher
Terraform
Network Server
New Relic (SaaS)
Dynatrace
Docker

Job description

We are seeking an experienced Data Center Infrastructure Engineer to design, implement, monitor, and manage enterprise-grade data center infrastructure. This engineer will ensure the reliability, scalability, and efficiency of all mission-critical operations across servers, storage systems, networking architecture, and power/cooling systems. The role blends infrastructure engineering, automation, modernization planning, and operational sustainment. The selected candidate will support data center operations at Joint Base Anacostia-Bolling and must maintain an active TS/SCI clearance., * Design, engineer, and implement infrastructure solutions for data center environments, including compute platforms, storage systems, networking, power, and cooling.

  • Monitor and maintain data center systems to ensure high availability, performance, and operational continuity.
  • Manage containerized and cloud-native infrastructure using technologies such as Kubernetes, Rancher, Helm, and Docker.
  • Deploy and manage distributed storage technologies including Rook, Ceph, MinIO, S3-compatible systems, and PortWorx.
  • Implement infrastructure-as-code (IaC) using Terraform, Ansible, and Desired State Configuration (DSC).
  • Engineer and support load balancing, service networking, and cluster-level connectivity.
  • Develop automation scripts and operational tooling using Python, PowerShell, and shell scripting.
  • Support observability and monitoring systems to ensure proactive detection and incident response for infrastructure components.
  • Participate in major incident response efforts and post-incident reviews, implementing corrective and preventive measures.
  • Oversee data center physical operations including rack/stack tasks, cabling, hardware provisioning, and asset management.
  • Collaborate with engineering, cybersecurity, cloud, and application teams to support enterprise modernization efforts.

Required Certifications (at least two):

  • Security +
  • Cloud Associate (AWS Solutions Architect Associate / Azure AZ-104 / Google Associate Cloud Engineer)
  • Terraform Associate
  • Cloud Professional/Architect (AWS Solutions Architect Professional / Azure Architect Expert)
  • CKA (Certified Kubernetes Administrator)

Requirements

  • AWS DevOps Engineer or Azure AZ-400
  • CCSP
  • Advanced observability tool certifications (Datadog, New Relic, Dynatrace, etc.)
  • Incident management or SRE-focused training

Required Technical Knowledge:

Strong understanding of data center and cloud-native technologies including:

  • Kubernetes, Rancher, Helm, Docker
  • Cilium, Rook, Ceph, MinIO, S3, PortWorx
  • Load balancing, ingress, and distributed networking
  • Ansible, Desired State Configuration, Terraform
  • Python, PowerShell, scripting/automation
  • Server/storage platforms, virtualization, and networking fundamentals
  • Power/cooling considerations for data center operations
  • Monitoring and observability stacks (metrics, logs, tracing)
  • Incident response and SLO/SLA-driven operations

Preferred Experience:

  • Managing and modernizing enterprise or DoD data center infrastructure.
  • Operating large-scale Kubernetes clusters across on-prem and hybrid cloud environments.
  • Designing and operating high-availability server and storage platforms.
  • Implementing enterprise monitoring frameworks using Prometheus, Grafana, ELK, OpenTelemetry, or similar.
  • Supporting data center readiness, facility coordination, and hardware lifecycle management.

Benefits & conditions

We offer competitive compensation and an extraordinary benefits package including health, dental and vision insurance, 401K with company matching, flexible spending accounts, paid holidays, three weeks paid time off, and more.

About the company

Koniag Government Services (KGS) is an Alaska Native Owned corporation supporting the values and traditions of our native communities through an agile employee and corporate culture that delivers Enterprise Solutions, Professional Services and Operational Management to Federal Government Agencies. As a wholly owned subsidiary of Koniag, we apply our proven commercial solutions to a deep knowledge of Defense and Civilian missions to provide forward leaning technical, professional, and operational solutions. KGS enables successful mission outcomes for our customers through solution-oriented business partnerships and a commitment to exceptional service delivery. We ensure long-term success with a continuous improvement approach while balancing the collective interests of our customers, employees, and native communities. For more information, please visit www.koniag-gs.com .

Apply for this position