High-Performance Computing (HPC) (SA2) (Government)

AT&T Inc.
Columbia, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 168K

Job location

Columbia, United States of America

Tech stack

Microsoft Windows
Confluence
JIRA
Bash
Unix
Ubuntu (Operating System)
CentOS
Configuration Management
Linux
File Systems
General Parallel File Systems
InfiniBand
Networking Hardware
Nagios
Office Automation
Red Hat Enterprise Linux - RHEL
Ansible
Prometheus
Software Engineering
Transmission Control Protocol (TCP)
Datadog
Computer Networking Systems
High Performance Computing
Grafana
GIT
Information Technology
Slurm

Job description

The scope of this Contract requires specialized expertise in areas such as high-performance computing (HPC), automated processing systems, distributed software design, and secure hosting and networking solutions. The IT infrastructure consists primarily of Linux, with some Windows, and UNIX. The environment includes a variety of network devices, server interconnections, mass storage solutions, and essential supporting infrastructure services. The services provided under this Contract support areas including HPC, infrastructure maintenance for HPC systems, networking, office automation, and the development of specialized software., AT&T has an opening for a High-Performance Computing (HPC) Systems Administrator to support a large client-based IT enterprise installation, configuration and networking of Linux and Windows based platforms. This position requires office presence a minimum of 5 days per week and is only located in the location(s) posted. No relocation is offered. Work to be performed at government customer site., The System Administrator provides HPC sustainment support across two geographically dispersed sites, including:

  • Linux-based HPC clusters (e.g., Red Hat/CentOS/Rocky/Ubuntu) with parallel file systems (e.g., Lustre/GPFS) and high-speed interconnects (InfiniBand/Slingshot).
  • Transition of new systems/capabilities into operations (clusters, SMP/MPP, parallel file systems).
  • Support to HPC and ABS (ABUNDANTSHIELD) SRE teams in accordance with Government policies and procedures.

Proficient with the following (as specific position requires):

  • Operate and maintain systems/services: monitoring, incident response, troubleshooting, and routine maintenance.
  • Install/configure Linux OS, file systems, and TCP/IP networking; troubleshoot OS and application issues.
  • Automate/administer via BASH scripting; compile/install software as required.
  • Use common operations and observability tooling: Jira, Confluence, Grafana, Prometheus, Nagios.
  • Support HPC workload and configuration management tooling: Slurm, git, Salt, Ansible.
  • Provide user support and escalation/status communication to agency management and internal customers.
  • Optimize operations through resource utilization and capacity analysis/planning.
  • Apply in-depth troubleshooting skills across heterogeneous systems (no single fixed solution).
  • Provide detailed analysis and feedback to agency management and internal customers for escalated tickets.
  • Provide support for the dispatch system and hardware problems and remains involved in the resolution process.
  • Harden, patch, and tune Linux/UNIX/Windows systems; implement OS-level enhancements to improve reliability and performance.

Required Clearance: TS/SCI with polygraph. (#ts/sci) (#polygraph)

Requirements

B.S. in a technical discipline and 5 years' experience as a System Administrator in programs and contracts of similar scope, type and complexity or 10 years' experience in lieu of degree.

  • DoD 8570 IAT II level certification required.

Benefits & conditions

Our High-Performance Computing (HPC) (SA2) (Government) earns between $98,100 - $167,830 yearly. Not to mention all the other amazing rewards that working at AT&T offers. Individual starting salary within this range may depend on geography, experience, expertise, and education/training.

Joining our team comes with amazing perks and benefits:

  • Medical/Dental/Vision coverage
  • 401(k) plan
  • Tuition reimbursement program
  • Paid Time Off and Holidays (based on date of hire, at least 23 days of vacation each year and 9 company-designated holidays) *Pro-rated when working less than 40 hrs/wk.
  • Paid Parental Leave
  • Paid Caregiver Leave
  • Additional sick leave beyond what state and local law require may be available but is unprotected · Adoption Reimbursement
  • Disability Benefits (short term and long term)
  • Life and Accidental Death Insurance
  • Supplemental benefit programs: critical illness/accident hospital indemnity/group legal
  • Employee Assistance Programs (EAP)
  • Extensive employee wellness programs
  • Employee discounts up to 50% off on eligible AT&T mobility plans and accessories, AT&T internet (and fiber where available) and AT&T phone

About the company

AT&T Global Public Sector is a trusted provider of secure, IP enabled, cloud-based, network solutions and professional services to the Federal Government. We are dedicated to recruiting, developing and empowering a diverse, high-performing workforce that is passionate about what they do, committed to our shared values and dedicated to our customers' mission.

Apply for this position