AI Systems Engineering

StaffRight Associates
13 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Tech stack

Artificial Intelligence
Systems Engineering
Computer Clusters
Computer Programming
Data Transmissions
Linux
File Systems
Distributed Systems
Python
Linux kernel
Machine Learning
Network Protocols
Remote Direct Memory Access
Software Engineering
Supercomputing
Wide Area Networks
High Performance Computing
Parallel Computation
Information Technology

Job description

StaffRight Associates is recruiting to identify a Systems Engineer capable of orchestrating one of the world''s most sophisticated computational research environments. The mission involves the architectural stewardship of massively parallel supercomputers and high-density GPU clusters designed for drug discovery.

You will be responsible for the systemic resilience and optimization of a multi-petabyte, multi-thousand-core infrastructure, ensuring that the intersection of hardware and software enables researchers to decode biologically significant molecules with unprecedented precision., * Engineer and maintain state-of-the-art GPU clusters, managing high-density configurations of hundreds of units for machine learning and simulation workloads.

  • Optimize high-performance RDMA fabrics and wide-area networks to ensure low-latency, high-throughput data transfer across specialized research segments.
  • Formalize the administration of massive Linux clusters, managing tens of thousands of CPU cores and ensuring the integrity of tens of petabytes of storage.
  • Synthesize custom supercomputing assets with commodity hardware, providing a secure, convenient, and high-performance interface for AI and drug discovery agents.
  • Validate system performance through rigorous scripting and programming of distributed systems, utilizing Python to enhance automation and reliability.

Requirements

This search targets elite practitioners in Computational Biochemistry and Distributed Systems Architecture who possess the first-principles mastery required to bridge the gap between theoretical molecular dynamics and large-scale infrastructure execution. The complexity of simulating atomic-level interactions demands an individual with an advanced academic pedigree (Ph.D. or Master's in Computer Science, Engineering, or a related quantitative field) capable of engineering the foundational fabrics that power discovery. Success in this role requires more than administrative proficiency; it necessitates a deep technical synthesis of high-performance computing (HPC) and agentic workflows to ensure that massive-scale GPU clusters and custom supercomputing hardware are seamlessly translated into actionable research environments., * Architectural Philosophy: A commitment to first-principles thinking regarding Linux internals, including deep-tier knowledge of file systems, networking protocols, and process management.

  • Technical Versatility: A track record of solving multi-dimensional problems in HPC systems engineering, software development, or large-installation systems administration.
  • Intellectual Curiosity: A relentless drive to explore the "how" and "why" of system behavior, coupled with the agility to pivot between high-level architectural design and granular troubleshooting.
  • Collaborative Literacy: Excellent communication skills with the ability to articulate complex technical constraints to multidisciplinary teams of scientists and researchers., * Foundational Excellence: An advanced degree (Ph.D. or Master''s) in Computer Science, Systems Engineering, or a related STEM discipline.
  • HPC Domain Expertise: Proven experience in high-performance data-parallel computing or the management of massively parallel systems.
  • Mathematical Rigor: A background that supports the understanding of computational modeling and the technical demands of atomic-level simulations.

Apply for this position