AI Data Scientist- RAG, SLM & Distributed Data...

Insight Global
Hartford, United States of America
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Hartford, United States of America

Tech stack

API
Artificial Intelligence
Automated Storage and Retrieval Systems
Databases
Distributed Data Store
Distributed Systems
Data Flow Control
Python
Language Modeling
Search Technologies
Software Engineering
SQL Databases
Google Cloud Platform
Cloud Platform System
Sql Optimization
Flask
Large Language Models
Prompt Engineering
Generative AI
FastAPI
Api Design

Job description

We are looking for a mid-level AI Engineer with hands-on experience in Retrieval-Augmented Generation (RAG)systems, Small Language Models (SLMs), and distributed databases such as Google Cloud Spanner.

You will work closely with senior engineers and product teams to build scalable AI systems that integrate retrieval pipelines, language models, and distributed transactional infrastructure. This role is ideal for someone who has already built AI features in production and wants to deepen their expertise in applied GenAI systems.

Requirements

AI RAG pipelines, Embeddings, Prompt engineering

Models SLM/LLM integration

Database Spanner schema design, SQL optimization

Backend Python, APIs

Cloud GCP, * 3-5 years of software engineering experience.

  • 1-2 years working with LLM or RAG-based systems.

  • Strong proficiency in Python.

  • Experience with:

o Embedding models and vector search

o LangChain, LlamaIndex, or similar frameworks

o API development (FastAPI/Flask)

  • Experience working with Google Cloud Spanner or similar distributed SQL databases.

  • Solid understanding of distributed systems fundamentals.

  • Comfortable working in cloud environments (GCP preferred). * Experience fine-tuning or quantizing small language models.

  • Familiarity with evaluation metrics for retrieval systems (Recall@K, etc.).

  • Knowledge of:

o Vertex AI

o Pub/Sub

o Dataflow

  • Experience optimizing AI inference for cost and latency.

  • Exposure to CI/CD pipelines.

Apply for this position