Data Scientist (German speaking)

European Tech Recruit

30 days ago

Role details

Contract type

Temporary contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English, German

Experience level

Intermediate

Job location

Tech stack

Artificial Intelligence

Amazon Web Services (AWS)

Software Quality

Databases

Python

Machine Learning

Quantum Computing

PyTorch

Large Language Models

Multi-Agent Systems

Generative AI

GIT

Pandas

Information Technology

HuggingFace

Software Version Control

Docker

Data Generation

Job description

We're looking for a driven AI Data Scientist to work alongside world-leading experts in quantum computing and AI, contributing to cutting-edge projects that redefine the limits of Generative AI. Location: Madrid or Barcelona (Hybrid) Contract: Fixed Term contract ending 30th June 2026 Shape system design by bringing a data- and evaluation-first perspective to retrieval, orchestration, tool usage, and memory components-solving high-impact, real-world problems. Develop multi-step evaluation frameworks that reflect real-world performance across components such as retrieval, reasoning, and tool use in both cloud and edge environments. Build and maintain reproducible evaluation pipelines, including datasets, test suites, configurations, and automated regression tracking. Curate and generate high-quality datasets, including synthetic and adversarial examples, to strengthen coverage and system robustness. Perform deep error analyses, identifying failure patterns and translating them into

Requirements

actionable insights for engineers and researchers. Collaborate closely with ML teams to create a data flywheel - where evaluation continuously informs prompt design, data generation, training, and deployment. Champion best practices in code quality, documentation, version control, and reproducibility within ML pipelines. Master's or PhD in Computer Science, Machine Learning, Data Science, Physics, Engineering, or a related technical discipline. ~3+ years (mid-level) or 5+ years (senior) of experience as a Data Scientist, ML Engineer, or Research Scientist in applied AI/ML projects in production. ~ Demonstrated expertise in evaluating machine learning systems, ideally in LLMs, RAG pipelines, or multi-agent architectures. ~ Strong background in dataset creation and curation, including synthetic data generation. ~ Hands-on experience with agentic AI, retrievers and vector databases, and orchestration frameworks such as LangGraph or LlamaIndex. ~ Solid engineering foundations with proficiency in Python, Docker, Git, and scalable, modular ML codebases. ~ PyTorch, HuggingFace, LangGraph, LlamaIndex, Pandas, etc. ~ Experience with cloud platforms (preferably AWS). ~ Strong communication and problem-solving skills, with fluency in English. By applying to this role you understand that we may collect your personal data and store and process it on our systems.