Software engineer

ITproposal B.V.

Eindhoven, Netherlands

2 days ago

Role details

Contract type

Temporary to permanent

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Eindhoven, Netherlands

Tech stack

Artificial Intelligence

Azure

Bash

Big Data

Cloud Computing

Cloud Engineering

Continuous Integration

DevOps

Python

Ansible

Data Processing

Scripting (Bash/Python/Go/Ruby)

Kubernetes

Machine Learning Operations

Terraform

Job description

Job title: DevOps / AI Infrastructure Engineer - GPU & KubernetesLocation: Eindhoven, Netherlands (TNDL - Eindhoven)Start: ASAP (or as agreed)Duration: 6 months (with possibility to extend)Experience: 8-10 years (including at least 1.5 years in DevOps/cloud/SRE focused on AI/ML)Language: English (fluent) Role summarySenior DevOps / AI Infrastructure Engineer to design, build and operate GPU-accelerated AI/ML infrastructure. You will enable high-performance training and inference workflows by managing cloud/GPU platforms, Kubernetes clusters, IaC, and AI tooling (Triton, Kubeflow, MLflow). The role combines deep platform engineering with automation and close collaboration with ML engineers and R&D teams. Key responsibilities

Design, deploy and operate GPU-enabled Kubernetes clusters and associated platform services for training and inference.

Build and maintain CI/CD, model CI and MLOps pipelines using tools such as Kubeflow, MLflow and Triton.

Implement and manage cloud infrastructure on Azure (and other clouds as needed), with GPU instances and storage for large datasets.

Automate provisioning and configuration using Terraform, Ansible and scripting (Python, Bash).

Optimize container orchestration, scheduling and GPU utilization for high-performance workloads.

Integrate AI inference platforms (NVIDIA Triton) and support model serving at scale.

Work with PLM/simulation and data teams to integrate model training and inference into engineering workflows.

Monitor, troubleshoot and tune platform performance, reliability and cost.

Define and enforce best practices for security, resource governance and data handling in AI pipelines.

Document architectures, runbooks and operational procedures; transfer knowledge to engineering teams.

Requirements

8-10 years industry experience; minimum 1.5 years in DevOps, cloud engineering or SRE with AI/ML focus.

Hands-on experience with major cloud providers (Azure preferred; experience wi.