System Engineer - NCSA
Role details
Job location
Tech stack
Job description
- Operations:
- Install, update, and troubleshoot hardware and software on HPC enterprise IT and cloud computing systems.
- Detect and investigate system issues and provide solutions.
- Monitor and respond to incoming support requests and alert accordingly.
- Create new solutions within existing configuration management (CM) tools and workload management (e.g., Puppet, Ansible, Slurm, HTCondor, Rancher, Kubernetes).
- Automate tasks using Continuous Integration (CI) tools (e.g., GitLab runners, GitHub action).
- Create new scripts in one or more programming languages to improve the efficiency of routine tasks.
- Technical Investigation:
- Participate in training junior engineers on system engineering best practices.
- Collaborate with other engineers to diagnose and resolve system issues in a controlled environment.
- Test, document, and present new technologies, and make recommendations for NCSA's policies and practices.
- Lead development, investigate small-scale systems and solutions for internal projects, and assist with large ones internally or externally.
- Outreach:
- Provide NCSA training, guidance, and assistance related to team operations and activities.
- Participate in conferences., Lead and delegate tasks for systems and services providing cloud computing. Collaborate with scientific researchers to resolve computing issues and design novel solutions to new research challenges. Coordinate installation and maintenance of cloud computing systems and services. This position is on-site with the possibility of being hybrid. Establish relationships with researchers at other institutions., * Technical Investigations:
-
Lead development of large projects, mentor other engineers in the process.
-
Lead a team to investigate the new systems and solutions by delegating tasks.
-
Plan new efforts for system design, deployment, and stability.
-
Collaborate with researchers to design systems and solutions for current and new project requirements.
-
Test and document new technologies and make technical recommendations for NCSA's policies, practices, system development, and architecture.
-
Evolve and Maintain NCSA support systems with state-of-the-art technology.
-
Maintain currency in the state of the art of assigned technical fields.
-
Operations:
-
Coordinate installation and maintenance activities on HPC, enterprise IT, and cloud computing systems.
-
Plan deployment and daily operations of HPC, enterprise IT, and cloud computing systems, monitor and respond to incoming alerts and support tickets.
-
Coordinate all aspects of planned maintenance operations for divisional systems.
-
Architect and build monitoring solutions to enhance customer satisfaction of supported systems.
-
Create new and validate other's solutions within existing configuration management (CM) tools and workload management (e.g., Puppet, Ansible, Rancher, Kubernetes).
-
Provide technical expertise as part of NCSA/ICI teams.
-
Expand existing NCSA/ICI teams' practices to new areas within the organization.
-
Coordinate with directorate-level teams, project coordinators, and sales vendors.
-
Recommend solutions within the scope of their subject matter expertise to meet project specifications and NCSA operational requirements.
-
Outreach:
-
Create and present training materials for internal and external use.
-
Prepare and present training to division- and directorate-level staff.
-
Represent NCSA at domain-specific events, meetings, and conferences.
-
Mentor Assistants/Staff-level System Engineers and student workers.
Requirements
Do you have experience in IT department experience?, Do you have a High school diploma or GED?, NCSA is seeking a motivated individual for a System Engineer or Senior System Engineer role. The successful candidate will install, maintain, and troubleshoot HPC systems and services, including Kubernetes, OpenStack, on-prem and off-prem cloud environments, and others, with a focus on supporting scientific research needs. They will collaborate with researchers to resolve computational issues and develop innovative solutions to emerging research challenges, as well as engage with the national research community to provide computational services to NSF-funded collaborations.
Candidates must be authorized to work in the United States, as this role is not eligible for visa sponsorship now or in the future. This position follows a hybrid schedule with two to three in-office days each week, and the selected candidate must reside within 90 miles of the National Petascale Computing Facility (NPCF) to support onsite work requirements., * High school diploma or equivalent
- Three (3) years of work experience in IT-related functions such as hardware/software support, programming, network design, network engineering, IT systems integration, or a closely related field. College coursework, which includes Information Technology (IT), IT Management, Programming, IT systems, or a closely related discipline, may be substituted as follows:
o 30 semester hours equals one (1) year, o An associate's degree (60 semester hours) equals eighteen (18) months, o 90 semester hours equals two (2) years, o Bachelor's Degree (120 semester hours) equals three (3) years
- Demonstrated experience supporting IT technologies, including servers.
Preferred Qualifications
- Operational experience with virtual system software frameworks (e.g., VMware, OpenStack).
- Operational experience with container orchestration frameworks (e.g., Kubernetes, Rancher).
- Operational experience with containerized software technologies (e.g., Docker, Apptainer, Podman).
- Operational experience with cloud technologies (e.g., AWS, Jupyter)., * High school diploma or equivalent.
- Three (3) years of work experience in IT-related functions such as hardware/software support, programming, network design, network engineering, IT systems integration, or a closely related field.
- College coursework which includes Information Technology (IT), IT Management, Programming, IT systems, or a closely related discipline may be substituted as follows:
o 30 semester hours equals one (1) year, o Associate's Degree (60 semester hours) equals eighteen (18) months, o 90 semester hours equals two (2) years, o Bachelor's Degree (120 semester hours) equals three (3) years
- Demonstrated experience supporting IT technologies, including servers.
- Demonstrated experience supporting OpenStack and/or Kubernetes clusters.
Preferred Qualifications:
- Eight years of experience in a combination of the following areas:
o Support of IT technologies in a large and/or complex environment. o Experience with computing systems hardware, networking, storage, and operating systems. o Proficient use of one or more programming languages to accomplish large, complex tasks. o Activities associated with the procurement, deployment, and management of Linux systems or other enterprise IT services.
- Experience operating systems in both data center and cloud environments.
- Experience planning complex system deployments.
- Experience or training in project management and delivery.
- Four years of experience researching, presenting, and/or publishing technical projects or papers in a professional or academic setting.
- Operational experience with virtual system software frameworks (e.g., VMware, OpenStack).
- Operational experience with container orchestration frameworks (e.g., Kubernetes, Rancher).
- Operational experience with containerized software technologies (e.g., Docker, Apptainer, Podman).
- Operational experience with cloud technologies (e.g., AWS, Jupyter).
Benefits & conditions
4.24.2 out of 5 stars Urbana, IL Hybrid work $80,000 - $100,000 a year - Full-time, Pulled from the full job description
- Health insurance
- Retirement plan
- Paid time off
- Vision insurance
- Dental insurance
- Life insurance, This is a 100% full-time Civil Service 5031 - Information Technology Technical Associate position, appointed on a 12-month basis. The expected start date is as soon as possible after 11/17/2025. Salary is commensurate with experience. The salary range is $80,000 - $100,000., This position is intended to be eligible for benefits. This includes Health, Dental, Vision, Life Insurance, a Retirement Plan, Paid time Off, and Tuition waivers for employees and dependents.