Engineer - MLOps & Scientific Platforms - Data Foundry

The Lilly Company

San Diego, United States of America

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Compensation

$ 165K

Job location

San Diego, United States of America

Tech stack

A/B testing

API

Agile Methodologies

Artificial Intelligence

Amazon Web Services (AWS)

Application Performance Management

Azure

Bioinformatics

C++

Software Documentation

Computational Biology

Computer Simulation

Nvidia CUDA

Data Infrastructure

Data Systems

Software Debugging

Fault Tolerance

Monitoring of Systems

Python

Machine Learning

Molecular Modelling

Performance Tuning

TensorFlow

Management of Software Versions

Data Logging

High Performance Computing

Feature Engineering

PyTorch

Istio

Delivery Pipeline

Multi-Agent Systems

Reliability of Systems

Event Driven Architecture

Scikit Learn

Kubernetes

Information Technology

Machine Learning Operations

Virtual Agents

Api Design

Api Gateway

REST

Software Version Control

Data Pipelines

Api Management

Job description

We are seeking an Engineer - MLOps & Scientific Platforms - Data Foundry to operationalize Data Foundry's scientific tools and analytical methods into actionable-prototypes. You will build the ML deployment pipelines, model serving infrastructure, API layers, and observability guardrails that make our scientific discovery methods and tools reliable, scalable, and consumable, both by discovery scientists and by the Frontier AI group's autonomous agents.

This role sits at the interface between Methods4Insight (which develops analytical methods) and Architecture4Insight (which provides the agile data infrastructure). Your job is to ensure every scientific tool Data Foundry produces are analytics-ready, well-monitored, and exposed through APIs with the response-time guarantees and error handling that both human users and AI agents require.

Responsibilities

MLOps & Model Lifecycle Management

Build and maintain end-to-end ML deployment pipelines: experiment tracking, model versioning (MLflow, Weights & Biases), containerized model serving, and automated retraining triggers.
Develop model registry infrastructure and feature engineering pipelines that enable computational scientists to access models.
Implement monitoring and alerting for data pipelines, APIs, ML models, and agentic systems (LLMOps) to ensure system reliability and performance at scale.
Build dashboards and metrics tracking for pipeline execution, API latency, token usage, model prediction quality, and system health
Establish structured logging and tracing infrastructure for debugging and performance optimization across scientific data systems

Scientific Tool Agile Deployment

Deploy predictive and analytical methods from Methods4Insight (e.g. cheminformatics, structural biology, bioinformatics, reaction informatics) with versioning, structured error handling, and response-time guarantees that enable insight generation in agile manner. Productionize when and where needed in partnerships with Tech@Lilly.
Build serving infrastructure supporting both synchronous (interactive scientist queries) and asynchronous (batch and agent-invoked) workloads in partnership with Tech@Lilly and Frontier AI.
Define and implement API contracts, documentation standards, and testing frameworks that ensure scientific tools are analysis ready, robust and consumable by external teams including Frontier AI.

Platform Engineering & Integration

Build and operate cloud-native model serving infrastructure (AWS, Azure, or GCP) using containers, Kubernetes, and infrastructure-as-code.
Develop CI/CD pipelines for ML models: automated validation, A/B testing, canary deployments, and rollback procedures.
Integrate model serving with Data Foundry's data pipelines, ensuring models have access to properly formatted, versioned training and inference data.

Frontier AI Interface & Collaboration

Partner with the Frontier AI team and Tech@Lilly to ensure Data Foundry's scientific tools are exposed via well-defined interfaces (REST APIs, MCP-compatible endpoints) that agents can invoke programmatically.
Collaborate on API performance requirements: latency targets, throughput guarantees, and graceful degradation under load.
Work with Methods4Insight scientists to ensure deployed models include appropriate uncertainty quantification and confidence metrics.

Requirements

B.S. or M.S. in Computer Science, Data Science, Machine Learning, Bioinformatics, Computational Biology, or related field.
3+ years of experience in MLOps, ML engineering, or scientific platform d evelopment
Qualified applicants must be authorized to work in the United States on a full-time basis. Lilly will not provide support for or sponsor work authorization or visas for this role, including but not limited to F-1 CPT, F-1 OPT, F-1 STEM OPT, J-1, H-1B, TN, O-1, E-3, H-1B1, or L-1., * Pharmaceutical or biotech research industry experience.
Strong Python skills; experience with ML frameworks (PyTorch, TensorFlow, scikit-learn) and ML lifecycle tools (MLflow, W&B, Kubeflow, or similar).
Proven track record building and deploying production model serving infrastructure - containerized endpoints, RESTful/gRPC APIs, and operational monitoring
Working knowledge of cloud platforms (AWS, Azure, or GCP), Kubernetes, and CI/CD automation.
Strong communication skills with ability to collaborate across computational scientists, software engineers, and partner teams.
Experience operationalizing scientific or computational models (cheminformatics, bioinformatics, structural biology, QSAR, molecular simulations, PK/PD, systems biology, or ODE-based models).
Hands-on experience with model monitoring, drift detection, and automated retraining systems.
Familiarity with API gateway patterns, event-driven architectures, and service mesh technologies.
Experience with feature stores, data versioning (DVC), or experiment tracking at scale.
Exposure to AI agent frameworks (MCP, LangChain) or building APIs that AI systems invoke programmatically.
Experience with C, C++, CUDA, or GPU-accelerated computing for optimizing model training/inference performance; familiarity with containerizing HPC workloads (Singularity/Apptainer).

Benefits & conditions

Actual compensation will depend on a candidate's education, experience, skills, and geographic location. The anticipated wage for this position is

$66,000 - $165,000

Full-time equivalent employees also will be eligible for a company bonus (depending, in part, on company and individual performance). In addition, Lilly offers a comprehensive benefit program to eligible employees, including eligibility to participate in a company-sponsored 401(k); pension; vacation benefits; eligibility for medical, dental, vision and prescription drug benefits; flexible benefits (e.g., healthcare and/or dependent day care flexible spending accounts); life insurance and death benefits; certain time off and leave of absence benefits; and well-being benefits (e.g., employee assistance program, fitness benefits, and employee clubs and activities).Lilly reserves the right to amend, modify, or terminate its compensation and benefit programs in its sole discretion and Lilly's compensation practices and guidelines will apply regarding the details of any promotion or transfer of Lilly employees.

#WeAreLilly

About the company

At Lilly, we unite caring with discovery to make life better for people around the world. We are a global healthcare leader headquartered in Indianapolis, Indiana. Our employees around the world work to discover and bring life-changing medicines to those who need them, improve the understanding and management of disease, and give back to our communities through philanthropy and volunteerism. We give our best effort to our work, and we put people first. We're looking for people who are determined to make life better for people around the world. Locations: San Diego, CA; San Francisco, CA; Boston, MA; Louisville, CO; Indianapolis, IN Lilly Small Molecule Discovery is purpose-built to create molecules that make life better for people. Discovery Technology and Platforms (DTP) accelerates molecule discovery by building optimized foundational platforms, streamlining lab operations through advanced technologies and data connectivity, and investing in novel capabilities. Data Foundry is a multidisciplinary team within DTP that enables AI-native drug discovery through four integrated pillars: Architecture4Insight (data infrastructure and scientific software), Methods4Insight (analytical and computational methods), Automation & Scale4Insight (lab automation and agentic workflows), and Preparedness4Insight (data governance and readiness). These pillars empower every Lilly scientist to make optimal decisions by providing seamless access to data, insights, and AI-driven capabilities-serving both human scientists and autonomous AI agents.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all