AI/LLM Engineer

Matlen Silver
Charlotte, United States of America
yesterday

Role details

Contract type
Internship / Graduate position
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
$135K

Job location

Remote
Charlotte, United States of America

Tech stack

Java
API
Artificial Intelligence
Cloud Computing
Databases
Information Engineering
Data Files
Data Retrieval
Distributed Systems
Interoperability
Python
Oracle Applications
Performance Tuning
SQL Databases
Systems Architecture
Systems Integration
Management of Software Versions
AI Infrastructure
Enterprise Software Applications
Chatbots
Large Language Models
Multi-Agent Systems
Prompt Engineering
Generative AI
Kubernetes
Information Technology
Low Latency
Machine Learning Operations
Serverless Computing
Web API

Job description

AI/LLM Engineer (RAG & Agentic Systems)

We are seeking a highly motivated AI/LLM Engineer to join a growing team focused on building next-generation generative AI solutions within the banking and financial services domain. This role will focus on designing and productionizing Retrieval-Augmented Generation (RAG) pipelines, agentic AI systems, and LLM-powered applications that interact with financial and structured enterprise data. The team is open to strong early-career candidates, including recent graduates with exceptional academic backgrounds, internships, or impactful AI/ML projects.

  • Design, extend, and optimize RAG pipelines, retrieval strategies, embedding workflows, and semantic search capabilities.
  • Build and productionize agentic AI architectures that orchestrate across:
      • RAG workflows
      • Structured SQL/data retrieval
      • External APIs and downstream actions (e.g., report/PPT generation)
  • Fine-tune and evaluate LLMs using model adaptation techniques, prompt engineering, and inference optimization.
  • Implement model safety mechanisms, guardrails, hallucination mitigation, and response validation strategies.
  • Develop and maintain APIs, endpoints, and tooling for model serving, observability, monitoring, and versioning.
  • Partner with SQL/data engineering teams to securely integrate structured enterprise data into LLM workflows.
  • Design reusable retrieval templates and interfaces for enterprise-scale AI applications.
  • Implement testing frameworks and monitoring for:
      • Latency
      • Accuracy
      • Hallucination rates
      • Cost efficiency
      • Retrieval quality
  • Participate in architecture and vendor-selection discussions focused on scalability, performance, and cost optimization.

Requirements

  • Hands-on experience building production-grade RAG pipelines and agentic AI systems.
  • Strong experience with LLM fine-tuning, model adaptation, or custom inference workflows.
  • Deep understanding of at least one LLM orchestration framework such as:
      • LangChain
      • LlamaIndex
      • Similar orchestration frameworks
  • Strong Python development and API engineering experience.
  • Ability to clearly explain:
      • System architecture decisions
      • Tool/framework selection
      • Dataset preparation
      • Evaluation methodologies
      • Deployment and productionization strategies
  • Experience working on AI/ML projects such as:
      • Chatbots
      • Financial AI applications
      • Intelligent document/query systems

Nice-to-Have Skills

  • Experience integrating LLMs with relational or structured databases.
  • Familiarity with vector databases and embedding stores such as:
      • Pinecone
      • Milvus
      • Weaviate
      • Oracle vector/embedding capabilities
  • Knowledge of:
      • Prompt engineering
      • Retrieval augmentation
      • LLM safety
      • Hallucination mitigation
      • Guardrail implementation
  • Experience with cloud infrastructure, model hosting, and monitoring in distributed environments.
  • Exposure to Kubernetes, serverless architectures, and AI infrastructure cost optimization.
  • Java experience is a plus for interoperability with enterprise systems.

Preferred Background

  • Banking, financial services, fintech, or enterprise AI environments.
  • Strong academic background in Computer Science, AI/ML, Data Science, or related fields.
  • Candidates with standout internships, research, or hands-on AI projects are highly encouraged to apply.

Benefits & conditions

  • 18-month W2 contract
  • $55-$65 per hour
  • Hybrid work: 3 days onsite, 2 days remote
  • Charlotte, NC (local candidates only)
  • Onsite interview on June 2nd; one-and-done interviews
