AI Systems Engineer - LLM Execution

OpenNebula Systems

31 days ago

Role details

Contract type

Permanent contract

Employment type

Part-time / full-time

Working hours

Regular working hours

Languages

English

Job location

Tech stack

Artificial Intelligence

Systems Engineering

Software Documentation

Nvidia CUDA

Computer Programming

Distributed Systems

Memory Management

Python

Open Source Technology

Software Engineering

Large Language Models

Information Technology

HuggingFace

Machine Learning Operations

Nim (Programming Language)

Requirements

Integrate with LLM catalogs and registries (e.g., HuggingFace, NVIDIA NIM, internal repositories). * Collaborate with product and platform teams to shape a modular, portable AI Factory execution layer. * Interact with users to provide systems support, architecture definitions, recommendations, implementation, testing, training, and deployment of open source solutions. * Troubleshoot incidents, identify root causes, implement fixes, and document preventive measures. * Deliver quality performance indicators and maintain project documentation (journals, status reports, etc.). * Engage with international cloud-edge ecosystems and participate in open-source communities; willingness to travel occasionally. * Write and maintain software documentation and project reports. Qualifications * Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field. * Strong hands-on experience deploying and optimizing LLMs in production environments. * Experience with inference frameworks such as vLLM, TensorRT, Triton Inference Server, DeepSpeed-Inference, etc. * Hands-on experience with orchestration tools like Ray, NVIDIA NeMo/Dynamo, or KServe. * Experience deploying LLM workloads on hybrid or sovereign cloud environments. * Contributions to open-source LLM or inference projects. * Deep knowledge of multi-GPU systems and GPU memory management. * Solid understanding of distributed systems and networking bottlenecks in model serving. * Programming experience in Python; knowledge of CUDA and model quantization is a plus. * Familiarity with LLM catalogs (e.g., HuggingFace, NGC, NIM) and open-source MLOps or AI workload orchestration platforms. * Professional English fluency with strong writing and speaking clarity. Soft Skills & Collaboration * Strong customer service mindset with a focus on responsiveness and user satisfaction. * Clear communication and documentation with strong written and verbal English, asynchronous

Benefits & conditions

collaboration, * Excellent problem-solving and proactive issue resolution. * Self-management, accountability, and ability to work independently and meet deadlines. * Technical autonomy with Git, CI/CD, remote collaboration tools (Slack, Zoom, GitHub), and problem-solving without direct supervision. What's in it for me? * Competitive compensation and flexible remuneration options (meals, transport, childcare). * Customized workstation (macOS, Windows, Linux). * Private health insurance. * 6-hour Fridays and August work rhythm. * Paid time off: holidays, personal time, sick time, parental leave. * All-remote company with HQ in Madrid and offices in Boston and Brno. * Healthy work-life balance and support for digital disconnecting. * Flexible hiring options: full-time or part-time; employee (Spain/USA) or contractor (other locations). * Engineering-first culture with openness, collaboration, risk-taking, and continuous growth. * Exposure to a broad technology ecosystem with opportunities to learn and research new technologies. Seniority level * Mid-Senior level Employment type * Full-time Job function * Information Technology Industries * Software Development #J-18808-Ljbffr