AI Systems Engineer - LLM Execution

OpenNebula Systems
31 days ago

Role details

Contract type
Permanent contract
Employment type
Part-time / full-time
Working hours
Regular working hours
Languages
English

Tech stack

Artificial Intelligence
Systems Engineering
Software Documentation
Nvidia CUDA
Computer Programming
Distributed Systems
Memory Management
Python
Open Source Technology
Software Engineering
Large Language Models
Information Technology
HuggingFace
Machine Learning Operations
NVIDIA NIM

Requirements

  • Integrate with LLM catalogs and registries (e.g., HuggingFace, NVIDIA NIM, internal repositories).
  • Collaborate with product and platform teams to shape a modular, portable AI Factory execution layer.
  • Interact with users to provide systems support, architecture definitions, recommendations, implementation, testing, training, and deployment of open source solutions.
  • Troubleshoot incidents, identify root causes, implement fixes, and document preventive measures.
  • Deliver quality performance indicators and maintain project documentation (journals, status reports, etc.).
  • Engage with international cloud-edge ecosystems and participate in open-source communities; willingness to travel occasionally.
  • Write and maintain software documentation and project reports.

Qualifications

  • Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
  • Strong hands-on experience deploying and optimizing LLMs in production environments.
  • Experience with inference frameworks such as vLLM, TensorRT, Triton Inference Server, DeepSpeed-Inference, etc.
  • Hands-on experience with orchestration tools like Ray, NVIDIA NeMo/Dynamo, or KServe.
  • Experience deploying LLM workloads on hybrid or sovereign cloud environments.
  • Contributions to open-source LLM or inference projects.
  • Deep knowledge of multi-GPU systems and GPU memory management.
  • Solid understanding of distributed systems and networking bottlenecks in model serving.
  • Programming experience in Python; knowledge of CUDA and model quantization is a plus.
  • Familiarity with LLM catalogs (e.g., HuggingFace, NGC, NIM) and open-source MLOps or AI workload orchestration platforms.
  • Professional English fluency with strong writing and speaking clarity.

Soft Skills & Collaboration

  • Strong customer service mindset with a focus on responsiveness and user satisfaction.
  • Clear communication and documentation, with strong written and verbal English and asynchronous collaboration.
  • Excellent problem-solving and proactive issue resolution.
  • Self-management, accountability, and ability to work independently and meet deadlines.
  • Technical autonomy with Git, CI/CD, remote collaboration tools (Slack, Zoom, GitHub), and problem-solving without direct supervision.

Benefits & conditions

What's in it for me?

  • Competitive compensation and flexible remuneration options (meals, transport, childcare).
  • Customized workstation (macOS, Windows, Linux).
  • Private health insurance.
  • 6-hour Fridays and August work rhythm.
  • Paid time off: holidays, personal time, sick time, parental leave.
  • All-remote company with HQ in Madrid and offices in Boston and Brno.
  • Healthy work-life balance and support for digital disconnecting.
  • Flexible hiring options: full-time or part-time; employee (Spain/USA) or contractor (other locations).
  • Engineering-first culture with openness, collaboration, risk-taking, and continuous growth.
  • Exposure to a broad technology ecosystem with opportunities to learn and research new technologies.

Seniority level
Mid-Senior level
Employment type
Full-time
Job function
Information Technology
Industries
Software Development
