Chemical Data Scientist

Dunia Innovations GmbH
Berlin, Germany
16 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Berlin, Germany

Tech stack

Artificial Intelligence
Data analysis
Databases
Python
Machine Learning
MongoDB
NumPy
Pandas
Scikit Learn
Data Pipelines

Job description

Dunia is building AI for one of the hardest unsolved problems in science: turning materials discovery from an academic, trial-and-error process into a programmable, scalable discipline.

We run high-throughput experiments, simulations, and ML-driven optimization loops. The hardest problem is not generating data, and as our lab throughput increases, so does our data output. It is knowing what that data actually means.

As Chemical Data Scientist, you will sit at the center of Dunia's discovery loop. You will be the person who spots the signal from the noise, and whether a trend is real or misleading. You will ensure that the organization, its systems and models are actually learning.

This role is about judgment as much as analysis. You will shape how experimental evidence flows into models, simulations, and decisions, and in doing so, how fast and how well Dunia discovers new materials.

Your tasks will include: Be the scientific sense-maker

  • Interrogate data from ongoing electrocatalyst campaigns
  • Identify patterns, anomalies, and failure modes that others miss
  • Develop intuition for where experiments lie, and where they tell the truth

Close the loop between matter and models

  • Decide which experimental signals should inform ML feature design
  • Decide which experimental signals and discrepancies are important enough to merit computational explanation.
  • Give lab/automation teams concrete feedback on experimental quality and design

Make learning compound, not fragment

  • Create clear, concise digests that align the entire team on what was learned
  • Track how understanding evolves across campaigns, not just within them
  • Raise the bar for how scientific progress is communicated internally

Influence the system, not just the analysis

  • Shape data pipelines and analytical views by using them aggressively
  • Help define what "analysis-ready" dataactually meansin practice
  • Ensure infrastructure evolves aroundreal scientific workflows, not abstractions

Requirements

Do you have experience in Python?, Do you have a Master's degree?, * Master's or PhD in chemistry, electrochemistry, materials science, or a related field

  • 3-6 years of experience working with experimental or computational data in a research or R&D context (post-Master's; fewer years post-PhD is fine)
  • Strong Python skills and comfort with data analysis tooling (pandas,numpy, scikit-learn, etc.)
  • Familiarity with electrochemistry or energymaterials.You should have intuition for what experimental artifacts look like and which trends are chemically meaningful
  • Clear technical communication: you can write a summary that a lab scientist, an ML engineer, and a program manager can all use
  • Bonus: exposure to tools like RDKit, pymatgen, or ASE; familiarity with MongoDB or similar databases
  • English fluency

Apply for this position