Data Scientist Lead
Role details
Job location
Tech stack
Job description
- ️Positivity : At Nūmi, we cultivate positivity every day. Our mission drives us to communicate with passion and energy, always keeping the bigger impact in sight. We celebrate quick wins, but never lose track of our ultimate goal.
- Excellence : At Nūmi, excellence starts with dedication and a commitment to constant improvement. We embrace challenges as opportunities to learn and push ourselves to deliver high-quality results. Through effort, attention to detail, and integrity, we grow both individually and collectively as a team.
- Team-Work and Pedagogy : At Nūmi, we believe that teamwork makes us stronger. We encourage knowledge-sharing, open feedback, and mutual respect. Each Nūmiz is here to lift others up, ensuring that the collective succeeds.
- Ambition : At Nūmi, we set ambitious goals and challenge ourselves to think big. Nūmi is driven by bold ideas and relentless determination, always aiming for the stars to create real impact.
Why this role matters
At Nūmi, our ability to identify "golden" cell lines, batches, and conditions depends less on running more experiments and more on how well we collect, structure, and exploit our data.
Today, our experimental data spans:
- ELN entries
- Spreadsheets
- Bioreactor and equipment exports
- Analytical and QC readouts
This role is about turning that fragmented landscape into a coherent, trusted, and decision-ready data layer that powers R&D, guides scale-up choices, and ultimately underpins regulatory and industrialization efforts., * Lead the design of scalable data models and ontologies that reflect experimental workflows, metadata standards, and scientific context.
- Drive harmonization and integration of data from ELNs, spreadsheets, instruments, assays, and analytical systems into a central repository.Define naming conventions, unique identifiers, and metadata requirements to enable reliable cross-experiment comparison and traceability.
Develop Data Infrastructure & Pipelines
- Create ETL/ELT pipelines to import, clean, and validate data from lab hardware, software, and analytic sources.
- Work with IT and scientific software suppliers to operationalize robust and secure data flows across teams.Leverage industry best practices in data governance, FAIR principles (Findable, Accessible, Interoperable, Reusable), and scalable storage solutions.
I Enable Scientific Insight & Decision Support
- Build dashboards, visualizations, and automated analysis workflows to empower scientists to detect trends, identify key signals, and validate hypotheses efficiently.
- Design analytical workflows that support batch comparisons, condition optimization, and early detection of performance deviations.Translate complex scientific questions into data products, models, and operational analytics., * Partner closely with R&D squads (cell line, media, analytical, bioprocess teams) to understand data generation and scientific priorities.
- Provide training and support to colleagues, embedding strong data practices in experimentation and documentation.
- Act as a strategic interface between hard science and analytics, ensuring data outputs are relevant, actionable, and adoption-ready., * First interview with our cofounder and CTO Eugénie
- A technical case study in English to give you a taste of what we are working on at Nūmi
- A last interview on site (when possible) to meet cofounder and CEO Eden, part of the team and visit the labs & office
Requirements
Do you have experience in Laboratory information management systems?, You are a data scientist with a strong affinity for experimental science, who enjoys working close to the lab and building systems that scale. This is a high-autonomy, high-impact role. You will act as the internal reference for scientific data, lead projects end-to-end, make technical and architectural decisions, and ensure that leadership has clear visibility into data-driven insights and trade-offs., * MSc or PhD in Data Science, Bioinformatics, Computational Biology, Engineering, or a related field.
- 3+ years of experience working with complex experimental or scientific datasets.
- Demonstrated ability to lead projects independently and deliver end-to-end outcomes.
- Strong coding and infrastructure layout skills
- Data structuring, cleaning, and integrating from heterogeneous sources.
- Solid understanding of experimental workflows and scientific rigor.
- Comfortable operating in a fast-moving, early-stage startup environment.
- Clear written and verbal communication in English.
Nice to have
- Experience setting up data pipelines or lab data infrastructure.
- Familiarity with ELNs, LIMS, or lab-adjacent software tools.
- Experience in regulated or quality-conscious environments (ISO, HACCP, GMP-adjacent).
Benefits & conditions
Unique opportunity to be one of the first employees at an early stage biotech company, shaping the future of infant nutrition
Growth opportunities: you will have the opportunity to grow and develop within the company
Access to top-quality facilities, research services and infrastructure
Competitive salary, part coverage of health insurance, lunch and transportation costs
- Employee first mentality, we prioritize the well-being of our team members, both in terms of their mental and physical health