AI Data Scientist - RAG, SLM & Distributed Data...

Insight Global

Hartford, United States of America

1 month ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

Hartford, United States of America

Tech stack

API

Artificial Intelligence

Automated Storage and Retrieval Systems

Databases

Distributed Data Store

Distributed Systems

Data Flow Control

Python

Language Modeling

Search Technologies

Software Engineering

SQL Databases

Google Cloud Platform

Cloud Platform System

SQL Optimization

Flask

Large Language Models

Prompt Engineering

Generative AI

FastAPI

API Design

Job description

We are looking for a mid-level AI Engineer with hands-on experience in Retrieval-Augmented Generation (RAG) systems, Small Language Models (SLMs), and distributed databases such as Google Cloud Spanner.

You will work closely with senior engineers and product teams to build scalable AI systems that integrate retrieval pipelines, language models, and distributed transactional infrastructure. This role is ideal for someone who has already built AI features in production and wants to deepen their expertise in applied GenAI systems.

Requirements

AI: RAG pipelines, embeddings, prompt engineering

Models: SLM/LLM integration

Database: Spanner schema design, SQL optimization

Backend: Python, APIs

Cloud: GCP

3-5 years of software engineering experience.

1-2 years working with LLM or RAG-based systems.
Strong proficiency in Python.
Experience with:

o Embedding models and vector search

o LangChain, LlamaIndex, or similar frameworks

o API development (FastAPI/Flask)

Experience working with Google Cloud Spanner or similar distributed SQL databases.
Solid understanding of distributed systems fundamentals.
Comfortable working in cloud environments (GCP preferred).

Experience fine-tuning or quantizing small language models.
Familiarity with evaluation metrics for retrieval systems (Recall@K, etc.).
Knowledge of:

o Vertex AI

o Pub/Sub

o Dataflow

Experience optimizing AI inference for cost and latency.
Exposure to CI/CD pipelines.
