System Engineer - Systems Integrator

Hamilton Barnes
Amsterdam, Netherlands
6 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Amsterdam, Netherlands

Tech stack

Artificial Intelligence
Bash
Cloud Computing
Data Centers
Linux
InfiniBand
Python
Linux kernel
PCI Express
Performance Tuning
Graphics Processing Unit (GPU)
Cloud Platform System
Reliability of Systems

Job description

  • Design, deploy, and maintain high-performance cloud systems that are optimized to support demanding AI workloads in a live data centre environment.
  • Plan and conduct hardware R&D experiments onsite, testing new technologies and approaches to improve system reliability and efficiency.
  • Troubleshoot and resolve complex technical issues involving GPUs, networking (InfiniBand, NVLink), PCIe, and broader server infrastructure.
  • Perform thorough root cause analysis across hardware, software, and networking layers to identify problems and implement long-term fixes.
  • Develop and execute benchmarking methodologies to validate performance, ensuring systems operate at maximum efficiency.
  • Collaborate closely with cross-functional engineering teams to drive innovation, enhance performance, and scale infrastructure effectively.

Requirements

Skills/Must have

  • Strong knowledge of modern server architecture with a particular focus on GPU-based systems and their role in high-performance computing.
  • Hands-on experience working directly with GPUs, high-speed networking technologies such as InfiniBand and NVLink, and PCIe-based infrastructure.
  • Proficiency in Linux environments, with the ability to script and automate tasks using Python and Bash to streamline workflows and improve system performance.
  • Demonstrated ability to investigate, diagnose, and troubleshoot complex technical issues spanning hardware, networking, and software layers.
  • Practical experience in performance optimization for high-performance or cloud-based systems, ensuring stability and efficiency under heavy workloads.
  • Strong analytical and problem-solving skills, with the capacity to work methodically through complex challenges.
  • Basic understanding of electronics, including soldering, wiring, and the use of fundamental diagnostic techniques.
  • Familiarity with the Linux kernel and kernel-level troubleshooting, as well as experience using measurement tools such as oscilloscopes and multimeters.

Benefits & conditions

  • Up to €100,000 and a comprehensive benefits package.

Salary description

€90000 - €100000 monthly

About the company

Are you interested in working for Europe's leading Neocloud provider? You will join a global, Nasdaq-listed AI and cloud powerhouse, where you'll be the ultimate point of escalation, troubleshooting and resolving complex challenges across cutting-edge server infrastructure. You will deploy, integrate, and optimise GPU-based AI infrastructure, perform hardware R&D tests, troubleshoot complex GPU/NVLink/InfiniBand issues, and automate performance tuning with Python/Bash to maximise system efficiency and reliability. Accelerate your career growth at a rapidly expanding company that has grown its headcount by 46% in the last year, where you will gain exposure to world-class infrastructure at the forefront of AI and cloud innovation.
