AI/LLM Engineer

Matlen Silver

Charlotte, United States of America

yesterday

Role details

Contract type

Internship / Graduate position

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Compensation

$ 135K

Job location

Remote

Charlotte, United States of America

Tech stack

Java

API

Artificial Intelligence

Cloud Computing

Databases

Information Engineering

Data Files

Data Retrieval

Distributed Systems

Interoperability

Python

Oracle Applications

Performance Tuning

SQL Databases

Systems Architecture

Systems Integration

Management of Software Versions

AI Infrastructure

Enterprise Software Applications

Chatbots

Large Language Models

Multi-Agent Systems

Prompt Engineering

Generative AI

Kubernetes

Information Technology

Low Latency

Machine Learning Operations

Serverless Computing

Web Api

Job description

Job Description - AI/LLM Engineer (RAG & Agentic Systems) We are seeking a highly motivated AI/LLM Engineer to join a growing team focused on building next-generation generative AI solutions within the banking and financial services domain. This role will focus on designing and productionizing Retrieval-Augmented Generation (RAG) pipelines, agentic AI systems, and LLM-powered applications that interact with financial and structured enterprise data. The team is open to strong early-career candidates, including recent graduates with exceptional academic backgrounds, internships, or impactful AI/ML projects., * Design, extend, and optimize RAG pipelines, retrieval strategies, embedding workflows, and semantic search capabilities.

Build and productionize agentic AI architectures that orchestrate across:
RAG workflows
Structured SQL/data retrieval
External APIs and downstream actions (e.g., report/PPT generation)
Fine-tune and evaluate LLMs using model adaptation techniques, prompt engineering, and inference optimization.
Implement model safety mechanisms, guardrails, hallucination mitigation, and response validation strategies.
Develop and maintain APIs, endpoints, and tooling for model serving, observability, monitoring, and versioning.
Partner with SQL/data engineering teams to securely integrate structured enterprise data into LLM workflows.
Design reusable retrieval templates and interfaces for enterprise-scale AI applications.
Implement testing frameworks and monitoring for:
Latency
Accuracy
Hallucination rates
Cost efficiency
Retrieval quality
Participate in architecture and vendor-selection discussions focused on scalability, performance, and cost optimization.

Requirements

Do you have experience in Tooling?, * Hands-on experience building production-grade RAG pipelines and agentic AI systems.

Strong experience with LLM fine-tuning, model adaptation, or custom inference workflows.
Deep understanding of at least one LLM orchestration framework such as:
LangChain
LlamaIndex
Similar orchestration frameworks
Strong Python development and API engineering experience.
Ability to clearly explain:
System architecture decisions
Tool/framework selection
Dataset preparation
Evaluation methodologies
Deployment and productionization strategies
Experience working on AI/ML projects such as:
Chatbots
Financial AI applications
Intelligent document/query systems

Nice-to-Have Skills

Experience integrating LLMs with relational or structured databases.
Familiarity with vector databases and embedding stores such as:
Pinecone
Milvus
Weaviate
Oracle vector/embedding capabilities
Knowledge of:
Prompt engineering
Retrieval augmentation
LLM safety
Hallucination mitigation
Guardrail implementation
Experience with cloud infrastructure, model hosting, and monitoring in distributed environments.
Exposure to Kubernetes, serverless architectures, and AI infrastructure cost optimization.
Java experience is a plus for interoperability with enterprise systems.

Preferred Background

Banking, financial services, fintech, or enterprise AI environments.
Strong academic background in Computer Science, AI/ML, Data Science, or related fields.
Candidates with standout internships, research, or hands-on AI projects are highly encouraged to apply.

Benefits & conditions

3.43.4 out of 5 stars Charlotte, NC Hybrid work $55 - $65 an hour - Contract, 18 Month W2 Contract Hybrid 3 days onsite 2 days remote Charlotte, NC (Local candidates ONLY) Onsite Interview June 2nd One and Done Interviews $55-$65/hour

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

Apply for this position

Good distractions

Moments

Videos View all