Data Scientist (German speaking)
Role details
Job location
Tech stack
Job description
We're looking for a driven AI Data Scientist to work alongside world-leading experts in quantum computing and AI, contributing to cutting-edge projects that redefine the limits of Generative AI. Location: Madrid or Barcelona (Hybrid) Contract: Fixed Term contract ending 30th June 2026 Shape system design by bringing a data- and evaluation-first perspective to retrieval, orchestration, tool usage, and memory components-solving high-impact, real-world problems. Develop multi-step evaluation frameworks that reflect real-world performance across components such as retrieval, reasoning, and tool use in both cloud and edge environments. Build and maintain reproducible evaluation pipelines, including datasets, test suites, configurations, and automated regression tracking. Curate and generate high-quality datasets, including synthetic and adversarial examples, to strengthen coverage and system robustness. Perform deep error analyses, identifying failure patterns and translating them into
Requirements
actionable insights for engineers and researchers. Collaborate closely with ML teams to create a data flywheel - where evaluation continuously informs prompt design, data generation, training, and deployment. Champion best practices in code quality, documentation, version control, and reproducibility within ML pipelines. Master's or PhD in Computer Science, Machine Learning, Data Science, Physics, Engineering, or a related technical discipline. ~3+ years (mid-level) or 5+ years (senior) of experience as a Data Scientist, ML Engineer, or Research Scientist in applied AI/ML projects in production. ~ Demonstrated expertise in evaluating machine learning systems, ideally in LLMs, RAG pipelines, or multi-agent architectures. ~ Strong background in dataset creation and curation, including synthetic data generation. ~ Hands-on experience with agentic AI, retrievers and vector databases, and orchestration frameworks such as LangGraph or LlamaIndex. ~ Solid engineering foundations with proficiency in Python, Docker, Git, and scalable, modular ML codebases. ~ PyTorch, HuggingFace, LangGraph, LlamaIndex, Pandas, etc. ~ Experience with cloud platforms (preferably AWS). ~ Strong communication and problem-solving skills, with fluency in English. By applying to this role you understand that we may collect your personal data and store and process it on our systems.