Site Reliability Engineer - TS/SCI with Poly
Role details
Job location
Tech stack
Job description
As a Site Reliability Engineer (SRE) supporting the CIO Infrastructure Services (CIS) program, you will help maintain the reliability, scalability, and performance of enterprise infrastructure services deployed across more than 250 global sites. You will engineer and optimize systems, automate operational workflows, strengthen monitoring capabilities, and ensure the stability and resilience of mission critical- environments.
You will partner closely with Engineering, Operations, Tech Refresh, Cybersecurity, and Data Center teams to ensure seamless integration of new capabilities into a high availability production environment, helping the Defense Intelligence Enterprise remain secure, connected, and -mission ready-., * Ensure the reliability, availability, and performance of enterprise IT systems across global environments
- Develop automation solutions that reduce manual effort, streamline operational tasks, and improve system resiliency
- Build and maintain monitoring, alerting, and observability capabilities supporting 24/7/365 enterprise operations
- Perform root cause analysis (RCA), corrective action planning, and long-term- problem remediation for infrastructure issues
- Partner with engineering teams to validate, test, and integrate new systems, upgrades, baselines, and enhancements into production
- Improve system performance through configuration tuning, capacity planning, and optimization of compute, storage, network, and virtualized environments
- Develop and maintain infrastructure-as-code, scripts, and operational automation to support consistent and repeatable deployments
- Support enterprise incident response, including triage, escalation, and service restoration for high visibility- events
- Maintain operational documentation including SOPs, runbooks, baselines, dashboards, and architectural diagrams
- Ensure compliance with ITIL/ITSM processes-including Incident, Problem, Change, and Configuration Management
- Strengthen the enterprise security posture by supporting patching, vulnerability remediation, and RMF related- configuration updates
- Coordinate with global operations teams to ensure service continuity, readiness, and adherence to SLAs and KPIs
- Leverage analytics, metrics, and monitoring data to identify performance trends and drive continuous service improvement initiatives
Requirements
Automation Tools,Enterprise Infrastructures,Enterprise Operations,Site Reliability Engineering, 5 + years of related experience, * CLEARANCE: Active TS/SCI with CI Polygraph
- EDUCATION: Bachelor's degree in computer science, engineering, IT, or related technical field(Additional experience may substitute for degree)
- 5+ years of experience in site reliability engineering, systems engineering, enterprise operations, or DevOps roles
- Hands-on experience with automation tools (PowerShell, Python, Ansible, Terraform, etc.)
- Strong experience supporting enterprise infrastructure domains including server compute, storage, virtualization, networking, and monitoring
- Experience with enterprise monitoring platforms (e.g., SolarWinds, SCOM, Splunk, Nagios, ELK)
- Strong understanding of ITIL/ITSM workflows and operational governance processes
- Demonstrated ability to troubleshoot complex technical issues across distributed enterprise environments
- Strong communication and collaboration skills working across multidisciplinary technical teams Excellent communication and stakeholder engagement skills
- US citizenship required
- LOCATION: Onsite
Preferred:
- ITIL v4 Foundations certification
- Experience supporting the client, DoDIIS, or Intelligence Community environments
- Familiarity with CMMC, NIST 800-53, policies, and RMF processes
- Experience with ServiceNow/Service Central and automated ticketing workflows
- Experience supporting hybrid cloud, virtual desktop infrastructure (VDI), or hyperconverged platforms
Benefits & conditions
The likely salary range for this position is $111,155 - $150,385. This is not, however, a guarantee of compensation or salary. Rather, salary will be set based on experience, geographic location and possibly contractual requirements and could fall outside of this range.
Our benefits package for all US-based employees includes a variety of medical plan options, some with Health Savings Accounts, dental plan options, a vision plan, and a 401(k) plan offering the ability to contribute both pre and post-tax dollars up to the IRS annual limits and receive a company match. To encourage work/life balance, GDIT offers employees full flex work weeks where possible and a variety of paid time off plans, including vacation, sick and personal time, holidays, paid parental, military, bereavement and jury duty leave. To ensure our employees are able to protect their income, other offerings such as short and long-term disability benefits, life, accidental death and dismemberment, personal accident, critical illness and business travel and accident insurance are provided or available. We regularly review our Total Rewards package to ensure our offerings are competitive and reflect what our employees have told us they value most.