Machine Learning Engineer I - Large Language Models - AI & Human Health Research

Mount Sinai Health System
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$ 164K

Job location

Tech stack

C
Java
API
Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Azure
Big Data
Bioinformatics
C++
Computational Biology
Databases
Continuous Integration
Information Engineering
Data Infrastructure
DevOps
Hadoop
Monitoring of Systems
Python
Machine Learning
Product Management
Standard Sql
Scala
Software Engineering
SQL Databases
Data Streaming
Google Cloud Platform
Enterprise Software Applications
Cloud Platform System
Chatbots
Large Language Models
Multi-Agent Systems
Prompt Engineering
Spark
Model Validation
Generative AI
GIT
Containerization
Kubernetes
Information Technology
Machine Learning Operations
Software Version Control
Docker
Programming Languages

Job description

We are seeking a skilled LLM Engineer I to join our team in the SinAI Assurance Lab. The LLM Engineer will play a key role in designing, building, and deploying large language model (LLM) applications including retrieval-augmented generation (RAG) systems, agentic platforms, and clinical chatbots and will be responsible for designing, maintaining, and optimizing data infrastructure and model validation pipelines that ensure all AI systems, Generative and Non-Generative, deployed across the Mount Sinai Health System (MSHS) are rigorously validated for compliance, performance, and patient safety.

You will work closely with AI product teams, clinical and technical stakeholders, DevOps engineers, and the AI Governance Committee to engineer scalable data flows that support model validation, real-time monitoring, and Machine Learning Engineer I will be primarily responsible for contributing to the development and enhancement of machine learning applications and systems. They will work closely with other engineers and data scientists to design and implement scalable and efficient machine learning systems.

Requirements

  • Bachelor's degree in Computer Science, Statistics, Mathematics, or related field.
  • Knowledge of at least one programming language among Scala, Python, Java, C, or C++.
  • Knowledge of big data technologies (e.g., Hadoop, Spark)
  • Knowledge of Software Development Lifecycle.
  • Self-motivated with a demonstrated ability to work independently, and to exercise independent judgment in developing complex techniques or programs in a dynamic environment.
  • Act as the major contributor in the development and operationalization of four different applications.
  • Play a key technical role in maintaining deployed products
  • Understanding of machine learning algorithms (Supervised, Unsupervised ML algorithms).
  • Familiarity with SQL or other database languages.

Preferred:

  • Master's degree in a quantitative discipline (e.g., Statistics, Operations Research, Bioinformatics, Economics, Computational Biology, Computer Science, Information Technology, Mathematics, Physics) or equivalent practical experience
  • 2+ years of experience in data engineering, software engineering, or machine learning.
  • Proficient in Python and SQL
  • Proficiency in at least one cloud computing platforms (e.g., AWS, Azure, GCP)
  • Intermediate knowledge of Machine Learning
  • Familiarity with ML lifecycle management tools (e.g., MLflow, Kubeflow, Airflow)
  • Experience on deployment and operationalization of ML Systems
  • Experience with monitoring tools for AI model tracking
  • Understanding of DevOps principles, CI/CD pipelines, and containerization (e.g., Docker, Kubernetes)
  • Experience with version control systems (e.g., Git) Knowledge of big data technologies (e.g., Hadoop, Spark)
  • Hands-on experience building and deploying LLM-based applications in production (chatbots, copilots, summarization, Q&A, or decision-support tools).
  • Experience designing and implementing retrieval-augmented generation (RAG) architectures, including chunking strategies, embedding models, and vector databases (e.g., Pinecone, Weaviate, FAISS, pgvector, Milvus).
  • Experience with agentic frameworks and orchestration libraries (e.g., LangChain, LlamaIndex, LangGraph, CrewAI, AutoGen, Semantic Kernel) including tool/function calling and multi-agent workflows.
  • Experience building conversational AI / chatbot systems, including dialog state management, memory, and integration with enterprise systems.
  • Familiarity with foundation model APIs and SDKs (e.g., OpenAI, Anthropic, Google, Azure OpenAI, AWS Bedrock) and open-weight model families (e.g., Llama, Mistral, Qwen, Gemma).
  • Working knowledge of prompt engineering, prompt evaluation, and LLM observability/evaluation tooling (e.g., LangSmith, Langfuse, Arize, Ragas, DeepEval).
  • Familiarity with fine-tuning and model adaptation techniques (e.g., supervised fine-tuning, LoRA/QLoRA, PEFT, instruction tuning, RLHF/DPO) and serving stacks (e.g., vLLM, TGI, Triton).
  • Awareness of LLM safety, guardrails, and evaluation practices (hallucination, bias, sycophancy, jailbreak resistance) - experience with healthcare-specific evaluation is a plus.
  • Strong problem-solving skills and ability to work in cross-functional teams

Benefits & conditions

The Mount Sinai Health System (MSHS) provides salary ranges that comply with the New York City Law on Salary Transparency in Job Advertisements. The salary range for the role is $109,000.00 - $163,695.00 Annually. Actual salaries depend on a variety of factors, including experience, education, and operational need. The salary range or contractual rate listed does not include bonuses/incentive, differential pay or other forms of compensation or benefits.

Non-Bargaining Unit, J18 - AI and Human Health - ISM, Icahn School of Medicine

Apply for this position