AI Scientist - DeepTech

BAILLY NETWORK
Paris, France
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
€ 120K

Job location

Paris, France

Tech stack

Training Data
Artificial Intelligence
C++
Software Debugging
Python
Machine Learning
NoSQL
Open Source Technology
TensorFlow
SQL Databases
Parquet
PyTorch
Transfer Learning
Large Language Models
Deep Learning
GIT
Scikit Learn
Information Technology
Slurm
Data Pipelines
Data Generation

Job description

Join a deeptech startup building Foundation Models for structured data (tabular & time-series) instead of yet another generic LLM.

As an AI Scientist, you will work on pre-training, fine-tuning and synthetic data modelling of large transformer-based models, at the core of a production-grade platform used by major enterprise and digital-native clients. You report directly to the scientific leadership and collaborate closely with a compact team of ML engineers and researchers.

What you will do :

  • Design and improve foundation models for structured data (tables, time-series) based on modern transformer architectures
  • Pre-train, fine-tune and evaluate large-scale deep learning models in the cloud or on private clusters
  • Build and maintain evaluation frameworks and metrics aligned with real customer use cases (classification, regression, forecasting, anomaly detection)
  • Drive active learning strategies and training data optimisation: sample selection, dataset curation, synthetic data generation, robustness and transfer learning
  • Stay on top of the latest ML research and propose model or training improvements that translate into measurable performance gains.
  • Communicate research ideas to the scientific community and experimental design to the internal ML team (papers, talks, internal notes)
  • Collaborate with ML engineers, data scientists and customers to deliver representation algorithms that unlock high-impact downstream applications.
  • Run ad-hoc analyses to understand and debug model behaviour, failure modes and scaling properties

Tech stack & environment :

  • ML / DL: Python, modern deep learning frameworks (PyTorch, DeepSpeed, SLURM-based training), strong focus on transformer architectures.
  • Foundation Models: large-scale pre-training and fine-tuning on tabular and time-series data, with strong emphasis on reproducibility and scalability.
  • Data: large structured datasets (Parquet, SQL, NoSQL), high-volume transactional and behavioural data.
  • Infra: cloud and private clusters for distributed training, experiment tracking, CI for research code.
  • Culture: research-driven, fast-moving, high ownership, small senior team, strong fundamentals over hype

Requirements

  • PhD in Computer Science, Machine Learning or related field with a focus on deep learning.
  • 3+ years of hands-on experience training, fine-tuning and evaluating deep learning algorithms (especially Transformers) at scale (cloud or private clusters).
  • Strong background in machine learning theory and practice, with a rigorous approach to reproducibility and experimental design.
  • Proficient in Python and modern ML frameworks/tools (PyTorch, Sklearn, experiment management, Git).
  • Comfortable working with large structured datasets and data pipelines (Parquet, SQL / NoSQL).
  • Excellent communication skills in English, able to work cross-functionally with researchers, engineers and business stakeholders.
  • Self-starter, autonomous, thriving in a fast-paced early-stage environment with high expectations and a strong excellence mindset.

Strong bonuses:

  • You have a publication record in top-tier ML conferences or journals
  • You have demonstrated experience in designing and running large-scale ML experiments (SLURM, Pytorch, Deepspeed).
  • Demonstrated machine learning experience in one of the following: open-source activity, data science competitions.
  • Track record of translating research into business impact
  • Experience in developing and debugging in C/C++, Python

Benefits & conditions

  • Salary: 80-120 K€ gross / year depending on seniority and track record.
  • Location: Paris
  • Remote: up to 2 days / week after probation
  • Equity: stock options (BSPCE) to reflect your impact and align long-term incentives.
  • Benefits: comprehensive health insurance, paid leave aligned with French standards, dynamic work environment with a focus on work-life balance.

About the company

Why it's interesting : * Work on frontier research in tabular foundation models, an AI segment still largely unsolved and estimated as a massive next frontier for enterprise AI. * Join a well-funded deeptech backed by Tier-1 investors and founders from leading AI and SaaS companies. * Ship research directly into production for high-profile customers in finance, commerce and other data-intensive industries

Apply for this position