Kevin Klues

Aug 22, 2024

From foundation model to hosted AI solution in minutes

What if you could build a custom AI on your own data with a single API call? Learn how to deploy powerful foundation models in minutes.

#1about 3 minutes

Introducing the IONOS AI Model Hub for easy inference

The IONOS AI Model Hub provides a simple REST API for accessing open-source foundation models and a vector database for RAG.

#2about 1 minute

Exploring the curated open-source foundation models available

The platform offers leading open-source models like Meta Llama 3 for English, Mistral for European languages, and Stable Diffusion XL for image generation.

#3about 7 minutes

How to implement RAG with a single API call

Retrieval-Augmented Generation (RAG) is simplified by abstracting vector database lookups and prompt augmentation into one API request using collection IDs and queries.

#4about 1 minute

Building end-to-end AI solutions in European data centers

Combine the AI Model Hub with IONOS Managed Kubernetes to build and deploy full AI applications within German data centers for data sovereignty.

#5about 3 minutes

Enabling direct GPU access within managed Kubernetes

The NVIDIA GPU Operator will enable direct consumption of GPU resources within IONOS Managed Kubernetes by automatically installing necessary drivers and components.

#6about 3 minutes

Deploying custom inference workloads with NVIDIA NIMs

Use the GPU Operator to request GPUs in a pod spec and deploy NVIDIA Inference Microservices (NIMs) to run custom, containerized AI models on your own infrastructure.

2 days ago

AI Software Engineer (m/f/d)

Sunhat
Köln, Germany

Remote

Senior

12 days ago

Senior Platform Engineer AI Services (w/m/d)

BWI GmbH
Bonn, Germany

Senior

14 days ago

Senior Machine Learning Engineer (f/m/d)

MARKT-PILOT GmbH
Stuttgart, Germany

Remote

Senior

Featured Partners

Your Next AI Needs 10,000 GPUs. Now What?

Your Next AI Needs 10,000 GPUs. Now What?

Anshul Jindal, Martin Piercy

about 2 months ago • World Congress 2025

Open Source AI, To Foundation Models and Beyond

Open Source AI, To Foundation Models and Beyond

Ankit Patel, Matt White, Philipp Schmid, Lucie-Aimée Kaffee, Andreas Blattmann

about 2 months ago • World Congress 2025

Supercharge your cloud-native applications with Generative AI

Supercharge your cloud-native applications with Generative AI

Cedric Clyburn

about a year ago • World Congress 2024

A Deep Dive on How To Leverage the NVIDIA GB200 for Ultra-Fast Training and Inference on Kubernetes

A Deep Dive on How To Leverage the NVIDIA GB200 for Ultra-Fast Training and Inference on Kubernetes

Kevin Klues

about 2 months ago • World Congress 2025

Efficient deployment and inference of GPU-accelerated LLMs

Efficient deployment and inference of GPU-accelerated LLMs

Adolf Hohl

about a year ago • World Congress 2024

WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA

WWC24 - Ankit Patel - Unlocking the Future Breakthrough Application Performance and Capabilities with NVIDIA

Ankit Patel

about a year ago • World Congress 2024

Developer Experience, Platform Engineering and AI powered Apps

Developer Experience, Platform Engineering and AI powered Apps

Ignacio Riesgo, Natale Vinto

about a year ago • World Congress 2024

Bringing AI Everywhere

Bringing AI Everywhere

Stephan Gillich

about a year ago • World Congress 2024

From learning to earning

Jobs that call for the skills explored in this talk.

Senior Backend Engineer – AI Integration (m/w/x)

1 month ago

Senior Backend Engineer – AI Integration (m/w/x)

chatlyn GmbH
Vienna, Austria

Senior

JavaScript

AI-assisted coding tools

4 days ago

Platform Engineer and AI Systems

AICU GmbH
Heilbronn, Germany

Remote

REST

Redis

Django

Python

+4

3 days ago

Lead AI / GenAI Solution Engineer

N-iX
Barcelona, Spain

Azure

Python

Elasticsearch

Machine Learning

Amazon Web Services (AWS)

6 days ago

Platform Engineer and AI Systems

AICU GmbH
Heilbronn, Germany

Remote

6 days ago

DevOps - Gen AI New

NEORIS
Municipality of Madrid, Spain

Remote

Bash

DevOps

Python

Docker

+5

6 days ago

Senior Solutions Architect, AI Factory

NVIDIA Corporation

Remote

Intermediate

6 days ago

AI Developer

Nvidia
Zürich, Switzerland

Intermediate

C++

Machine Learning

2 days ago

Senior Azure Data Platform Engineer - Infrastructure for Generative AI

Allianz Group
Barcelona, Spain

Remote

GIT

JSON

YAML

Azure

+7