AI Developer

CREW

Germantown, United States of America

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

Germantown, United States of America

Tech stack

HTML

A/B testing

API

Artificial Intelligence

Automated Storage and Retrieval Systems

Encodings

Databases

Data Deduplication

ETL

Data Transformation

Relational Databases

Software Debugging

Graph Database

Python

PostgreSQL

Neo4j

Redis

Markdown

Search Technologies

SQL Databases

Data Logging

Data Processing

Autoscaling

Large Language Models

Generative AI

Indexer

Backend

SC Clearance

Containerization

Kubernetes

Information Technology

Api Design

Data Pipelines

Serverless Computing

Docker

Job description

Responsible for model integration, data pipelines, retrieval infrastructure, and the engineering scaffolding required to ship reliable, secure, and cost-effective Artificial Intelligence (AI) features. This role ensures the delivery of production-grade Large Language Model (LLM) systems that meet real-world demands for performance, cost-efficiency, and governance., * Design and implement end-to-end RAG architectures, including document ingestion, chunking, embedding generation, vector indexing, query planning, retrieval, and response synthesis.

Evaluate and integrate LLMs, embedding models, and vector databases to support efficient and accurate retrieval and generation.
Design and implement scaffolding and orchestration around LLMs, including prompt templating, tool invocation, evaluation harnesses, and safety guards.
Develop data processing pipelines for structured and unstructured content (PDF, DOCX, HTML, Markdown, databases, APIs); implement normalization, deduplication, PII redaction, and metadata enrichment.
Implement and optimize retrieval strategies and context construction (citation, source attribution, grounding).
Adapt retrieval and embedding strategies to domain-specific taxonomies, ontologies, or structured schemas; support contextual retrieval from hierarchical or relational sources.
Productionize LLM-based systems: containerize components (Docker), deploy orchestration via Kubernetes or serverless platforms, implement observability (OpenTelemetry, logging, tracing), and manage configuration.
Measure and improve quality: define offline and online evals, golden datasets, A/B tests, hallucination detection, toxicity filters, and guardrails.
Optimize performance and cost: batching, caching, streaming, and efficient context management.
Implement security, privacy, and compliance best practices including access controls, injection defense, and safe data handling.
Develop solutions that can run entirely on-premise or in air-gapped environments, prioritizing data sovereignty and privacy.
Various other duties in direct support of accomplishment of primary duties listed.

Requirements

Education: Master's degree preferred. Bachelor's in Computer Science, Data Science, AI, or related field with equivalent experience considered, or related field or equivalent practical experience.

Training and Experience: 3-7 years in backend development, AI systems, or related roles, with a focus on LLMs integration or retrieval systems.

General Skills: Must have strong software engineering fundamentals and a deep understanding of working with LLMs in production environments. The ideal candidate brings hands-on experience with Python and modern data tooling and is comfortable building robust pipelines that connect unstructured content, structured data, and retrieval systems to power context-aware LLM workflows. You should demonstrate fluency in the design and reasoning of data movement processes, including ingestion, preprocessing, vector indexing, and query generation. Experience working with both open-weight and API-based large language models is also essential. This role requires a practical mindset, a strong command of SQL and retrieval strategies over relational data, and the ability to experiment, evaluate, and iterate toward scalable, cost-effective, and trustworthy AI features.

Required Skills:

Mastery in Python, including experience with modern practices in structuring, testing, and maintaining codebases.
Orchestrated Retrieval-Augmented Generation (RAG) systems, including document chunking, embedding, vector search, and grounded context construction.
Expertise with PostgreSQL and pgvector, including schema design and structured retrieval over relational data.
Robust operational understanding with SQL query generation, particularly in the context of semantic or hybrid retrieval.
Comprehensive background integrating and orchestrating LLMs, with a focus on prompt templating, tool usage, and response parsing.
Familiarity with Google ADK or equivalent frameworks for LLM scaffolding and orchestration.
Proficient in utilizing unstructured and structured data, including ingestion from PDFs, DOCX, Markdown, HTML, and APIs.
Experience deploying and debugging LLM systems, including containerization (Docker), API-based LLM integration (e.g., Ollama or vLLM), and environment configuration.

Preferred Skills

Background with graph-enhanced retrieval, using tools like Neo4j or ArangoDB, and an understanding of when and how to apply knowledge graphs to improve LLM grounding.
Versed in model adaptation techniques, including LoRA, QLoRA, or PEFT approaches for fine-tuning or personalization.
Expert in designing and implementing advanced prompt optimization frameworks, including developing automated evaluation systems and troubleshooting complex failure modes to enhance AI model performance and reliability.
Proven ability to design end-to-end hybrid search and reranking pipelines, such as ColBERT, BGE rerankers, or commercial tools like Cohere Rerank.
Expertise with infrastructure optimizations, such as autoscaling (KEDA, HPA), Redis caching layers, or efficient streaming and batching.
Demonstrated skill in safe deployment practices, including prompt injection mitigation and handling of sensitive or regulated data.

Clearance:Must be able to obtain/maintain a Secret clearance. Prefer holds an active Secret clearance.

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all