Sr Data Scientist - Gen AI ML - New York / Jersey City

PHOTON, LLC
New York, United States of America
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$249K

Job location

New York, United States of America

Tech stack

Continuous Integration
Data Retrieval
Monitoring of Systems
Python
PostgreSQL
Open Source Technology
TensorFlow
Search Technologies
Workflow Management Systems
PyTorch
Large Language Models
Multi-Agent Systems
Prompt Engineering
Software Security
Generative AI
Backend
FastAPI
Containerization
AI Platforms
HuggingFace
Machine Learning Operations
Asynchronous Programming
Artificial Intelligence Markup Language (AIML)
Automation Anywhere
API Management
Docker

Job description

Develop and orchestrate sophisticated AI workflows using LangGraph and multi-agent architectures.

Build and maintain Advanced RAG systems utilizing LlamaIndex and vector databases for high-accuracy retrieval.

Integrate and swap diverse LLMs (commercial and open-source) based on performance and cost requirements.

Design and deploy high-performance, scalable backend services using FastAPI and Async Python.

Fine-tune large language models (LLMs) using PyTorch/TensorFlow to improve domain-specific performance.

Optimize GenAI workflows for latency, cost, and reliability using advanced prompt engineering and monitoring tools.

Containerize and deploy AI services via Docker to production environments.

Requirements

Do you have experience in Quantization?

LLMs: Gemini, OpenAI, Claude, Llama, and Local Model deployment.

Frameworks: LangChain, LlamaIndex, and Hugging Face.

Orchestration: LangGraph and Multi-Agent Systems (MAS).

Development: Python, FastAPI, and Asynchronous Programming.

RAG & Data: PostgreSQL, Vector Databases, and Advanced Retrieval strategies.

ML/DL: PyTorch, TensorFlow, and Model Fine-tuning.

Deployment: Docker, Production API management, and LLM monitoring.

Tools: Prompt Engineering, Workflow Design, and GenAI Optimization.

Hands-on experience building and deploying GenAI applications in a production setting.

Strong proficiency in Python and the modern AI library ecosystem (LangChain, LlamaIndex, etc.).

Experience with vector search, embedding models, and advanced data retrieval patterns.

Knowledge of model fine-tuning techniques and local LLM quantization/hosting.

Familiarity with production-grade monitoring, API security, and CI/CD for ML.

Benefits & conditions

  • 401(k)
  • Health insurance
  • Paid time off
  • Vision insurance
  • Dental insurance
  • Paid holidays

Minimum Compensation: USD 71,000
Maximum Compensation: USD 249,000

Compensation is based on actual experience and qualifications of the candidate. The above is a reasonable and good-faith estimate for the role. Medical, vision, and dental benefits, 401(k) retirement plan, variable pay/incentives, paid time off, and paid holidays are available for full-time employees. This position is not available for independent contractors. No applications will be considered if received more than 120 days after the date of this post.
