Staff Engineer, AI and Data Science

Regeneron
1 month ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$ 207K

Job location

Tech stack

Artificial Intelligence
Data analysis
Software Quality
Computer Programming
Databases
Decision Support Systems
Integer Programming
Python
Knowledge Management
Linear Programming
Matlab
SQL Databases
JMP (Statistical Software)
Large Language Models
Deep Learning
Information Technology
Dataiku
Databricks

Job description

You will design, implement, and operationalize AI and DS models for upstream (cell-culture/bioreactor), downstream (purification) operations, Formulation Development and multiple Analytics teams while partnering closely with process-development, manufacturing-sciences, and digital teams. You will turn data into prescriptive guidance, deploy production-grade models, and build innovative AI solutions that enhance process understanding, optimization, and automation.

A Typical Day in the Role of Staff Engineer Might Look Like:

  • Build and deploy AI/ML-powered solutions to accelerate our digitalization journey.
  • Advance PAPD's broader AI, DS and related digital-maturity initiatives.
  • Collaborate with process engineers, citizen data scientists, IT, and manufacturing colleagues to coordinate AI and Advanced modeling efforts enterprise wide.
  • Explore, prototype and implement GenAI approaches and solutions (e.g., Retrieval-Augmented Generation) to enhance knowledge management, and decision support.
  • Develop, validate, and maintain mechanistic, hybrid, and data-driven models for cell culture, purification, formulation and other processes.
  • Translate complex bioprocess questions into quantitative modeling strategies that inform scale-up, tech transfer, and continuous improvement.
  • Mentor citizen data scientists and champion best practices in model development, method selection, and code quality.

Requirements

  • Analytical rigor and creative problem solving
  • Ability to drive projects autonomously while thriving in cross-functional teams
  • Excellent written and verbal communication
  • Passion for innovation and continuous learning

This role requires a Ph.D. in Chemical/Biochemical Engineering, Biotechnology, Applied Mathematics, Computer Science or related field with 2+ years of industrial experience OR- Master's with 5+ years. Mechanistic understanding of upstream and/or downstream bioprocess unit operations, scale-up/down principles, and critical quality attributes is required. A demonstrated success modeling bioprocesses via first-principles, hybrid, or data-driven (ML) methods is preferred.

A strong foundation in AI/ML algorithms (regression, classification, Bayesian methods, deep learning, time-series, probabilistic modeling) is a plus, along with expertise in multivariate statistics for process modeling, real-time monitoring, and control. Expert programming proficiency in Python and SQL and experience with statistical/computational tools such as JMP, SIMCA, MATLAB is helpful. Proven ability to communicate technical concepts to multidisciplinary stakeholders a must. Experience with GenAI stacks (LLMs, vector databases, RAG pipelines) and multimodal techniques is necessary.

Preferred Qualifications

  • Hands-on experience with cloud analytics platforms (e.g., Dataiku, Databricks).
  • Strong working knowledge of Quality-by-Design (QbD) principles and statistically rigorous Design-of-Experiments (DoE) for defining design space, optimizing critical process parameters, and informing robust control strategies.
  • Familiarity with PAT and chemometric modeling (e.g., Raman spectroscopy) for bioprocess monitoring and control.
  • Understanding of operation research techniques such as combinatorial optimization, linear programming, mixed integer programming is a plus.
  • Strong publication record in bioprocess modeling or AI for biomanufacturing.

Apply for this position