E01-B02 HPC Application Manager
Role details
Job location
Tech stack
Job description
As an HPC Application Manager, you will be responsible for the full lifecycle management of a defined portfolio of scientific and engineering software. Your primary focus will be ensuring the stability, performance, and availability of this software for users within a large-scale High-Performance Computing (HPC) environment. This position directly supports the research and development activities of the DoD High Performance Computing Modernization Program (HPCMP). The HPCMP provides large-scale systems and environments to the DoD RDT&E community which supports the warfighter.
The HPC Application Manager is part of a larger team of system administrators and support specialists responsible for operating and maintaining the HPC infrastructure. The team environment values methodical problem-solving, adherence to established procedures, and a strong commitment to user support., The duties for this position include but are not limited to:
-
Software Management: Install, configure, and maintain complex software packages on multiple Linux-based HPC systems.
-
User Support: Provide direct technical support by responding to and resolving user-submitted ServiceNow tickets for the assigned application portfolio.
-
Troubleshooting: Diagnose and resolve complex application failures, including OS-level issues such as library dependencies and driver conflicts.
-
Validation & Testing: Test and validate application functionality through system changes like maintenance, OS upgrades, and new system deployments.
-
Maintenance: Deploy software patches and updates to resolve bugs and address security vulnerabilities.
-
Compliance: Ensure all software management tasks adhere to established Standard Operating Procedures (SOPs), including formal Request for Change (RFC) processes.
-
Collaboration: Collaborate with internal technical teams and external software vendors as necessary to resolve complex application-related issues.
-
Ensure 100% of planned hours are worked and recorded.
-
Identify and forward to your leadership any opportunities that could lead to growth within your work area
-
Participate in growth efforts as requested
-
Ensure all contractual deliverables are met/exceeded to the customer's satisfaction.
-
Completes personal PDP and attend Meetings (with camera on)
-
Execute all contract requirements as assigned in accordance with the contract specific LCAT and requirements
-
Performs other related duties as assigned.
Requirements
Clearance: Secret Clearance
Education and Years of Experience: Bachelor's degree and 2+ years of related experience, OR an equivalent combination of education and experience including familiarity of an HPC environment.
-
Active DoD 8570/IAT-II Certification
-
Demonstrated experience installing and managing software in a Linux/UNIX command-line environment.
-
Experience with shell scripting in a Linux environment.
-
Experience troubleshooting software installation and runtime issues (e.g., library dependencies, environment variables).
-
Experience providing user support via a ticket-based system (e.g., ServiceNow)
-
Strong, methodical problem-solving skills for diagnosing application-level issues.
-
Excellent detail-orientation with the ability to follow complex procedural and administrative protocols.
-
Ability to clearly convey technical information to users of all skill levels.
-
Ability to work effectively as part of a larger technical team within defined role boundaries.
-
Ability to work a designated schedule while maintaining attendance and punctuality.
-
Demonstrated ability to rapidly learn the technical specifics of new and existing software packages.
PREFERRED ADDITIONAL QUALIFICATIONS
-
Experience with MATLAB in an HPC context (installation, configuration, troubleshooting).
-
Experience compiling applications from source code (make, CMake), particularly for GPU-enabled codes using platforms like CUDA or KOKKOS.
-
Familiarity with common HPC technologies, including schedulers (PBS, SLURM), interconnects (InfiniBand), and architectures (Cray EX, Linux clusters).
-
Experience with HPC package managers (Spack) and version control systems (GitLab).
-
Experience supporting other scientific applications such as Tecplot, FieldView, Pointwise, or Gitlab Runner.
Benefits & conditions
The proposed salary for this position is up to $64,000-$76,300 . There are a host of factors that can influence final salary including, but not limited to, Federal Government contract labor categories and contract wage rates, relevant prior work experience, specific skills and competencies, geographic location, education, and certifications. Our employees value the flexibility EXPANSIA allows them to balance quality work and their personal lives. We offer competitive compensation, benefits and learning and development opportunities. Our unique mix of benefits options is designed to support and protect employees and their families. Employment benefits include health and wellness programs, income protection, paid leave and retirement and savings.