AI System Engineer

Visionsoft International

McLean, United States of America

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

McLean, United States of America

Tech stack

Clean Code Principles

API

Artificial Intelligence

Amazon Web Services (AWS)

Confluence

JIRA

Azure

Cloud Computing

Code Coverage

Code Review

Continuous Integration

Data Cleansing

ETL

Database Queries

DevOps

Django

Amazon DynamoDB

Github

Python

Key Management

PostgreSQL

Machine Learning

MongoDB

MySQL

Multiprocessing

NoSQL

NumPy

OAuth

Open Source Technology

RabbitMQ

Redis

TensorFlow

JSON Web Token

Software Deployment

Software Engineering

SQLAlchemy

Unstructured Data

WebSocket

Multithreading

Data Storage Management

Google Cloud Platform

PyTorch

Flask

Large Language Models

Multi-Agent Systems

Prompt Engineering

Caching

Backend

FastAPI

Pandas

Event Driven Architecture

Pytest

Gitlab-ci

Scikit Learn

Integration Tests

Kubernetes

Information Technology

HuggingFace

Kafka

Machine Learning Operations

Celery

Virtual Agents

Api Design

Serverless Computing

Docker

Microservices

Requirements

Python Fundamentals (Must Have)

Deep expertise in Python 3.10+, including asyncio, multithreading/multiprocessing, decorators, generators, and metaclasses
Proficiency with foundational packages: NumPy, Pandas, Pydantic, httpx/requests, dataclasses, typing
Strong grasp of clean code principles, SOLID design, and Pythonic idioms
Experience writing unit/integration tests with pytest and maintaining high code coverage
Familiarity with linting/formatting toolchains (ruff, black, isort, mypy) and pre-commit hooks
Experience with dependency and environment management (Poetry, uv, pip, venv, conda)

Agentic AI, LangChain & MCP (Core Focus)

Proven hands-on experience with Model Context Protocol (MCP) designing, building, and maintaining MCP servers and clients
Strong working experience with FastMCP for building Python-based MCP servers with tools, resources, and prompts
Expert-level experience with LangChain (chains, agents, memory, retrievers, output parsers, LCEL)
Experience with LangGraph for stateful, multi-agent, and graph-based agentic workflows
Understanding of tool/function calling, structured outputs, and agent-to-agent communication patterns
Experience integrating multiple LLM providers (Anthropic Claude, OpenAI, Azure OpenAI, Gemini, open-source models)
Knowledge of RAG architecture: chunking strategies, embeddings, hybrid search, re-ranking, and evaluation

Backend & API Development

5+ years building production APIs with FastAPI, Flask, or Django REST Framework
Experience with streaming responses (SSE/WebSockets) for real-time LLM output
Solid understanding of authentication/authorization mechanisms (OAuth2, JWT, API key management)
Experience designing scalable microservices and event-driven architectures (Kafka, RabbitMQ, Celery)

Data & Storage

Strong SQL skills (PostgreSQL, MySQL) and experience with ORMs (SQLAlchemy)
Hands-on experience with vector databases: Chroma, Pinecone, Qdrant, Weaviate, pgvector, or FAISS
Experience with caching layers (Redis) and NoSQL stores (MongoDB, DynamoDB)
Data preprocessing, ETL pipeline development, and working with structured/unstructured data

ML/AI Foundations

Working knowledge of machine learning fundamentals: embeddings, similarity metrics, classification, evaluation
Familiarity with PyTorch, TensorFlow, or scikit-learn for model training/inference where needed
Experience with Hugging Face ecosystem (Transformers, datasets, model hub)
Understanding of prompt engineering, few-shot learning, and LLM evaluation frameworks (RAGAS, DeepEval, LangSmith evals)

Cloud, DevOps & MLOps

4+ years deploying applications on AWS, Azure, or Google Cloud Platform (Lambda, ECS/EKS, Cloud Run, Azure Functions)
Proficiency with Docker; working knowledge of Kubernetes and Helm
CI/CD experience with GitHub Actions, GitLab CI, or Azure DevOps
Experience with LLM observability and tracing tools (LangSmith, Langfuse, Arize Phoenix, OpenTelemetry)
Familiarity with secrets management, rate limiting, and cost monitoring for LLM workloads

Security & Responsible AI

Experience implementing guardrails, input/output validation, and PII handling in AI pipelines
Awareness of prompt injection risks and mitigation strategies in agentic/MCP-based systems
Understanding of compliance considerations (SOC 2, GDPR, HIPAA) when handling sensitive data

Collaboration & Leadership

Experience mentoring engineers, conducting code reviews, and setting technical standards
Ability to translate business problems into AI solution architectures
Excellent communication skills with both technical and non-technical stakeholders
Comfortable in Agile/Scrum delivery models with tools like Jira and Confluence

Nice to Have

Contributions to open-source AI/LLM projects (LangChain, MCP servers, etc.)
Experience with fine-tuning (LoRA/QLoRA) or self-hosted model serving (vLLM, Ollama, TGI)
Knowledge of A2A protocols, CrewAI, AutoGen, or other multi-agent frameworks
Experience building Slack/Teams bots or IDE integrations powered by MCP

Education & Experience

Minimum 7-10 years of overall software engineering experience with strong Python expertise
3+ years of hands-on experience building LLM-powered or AI/ML applications in production
Bachelor's/Master's degree in Computer Science, Engineering, AI/ML, or equivalent industry experience
Demonstrated experience owning end-to-end delivery of AI products from design to deployment

Role details

Job location

Tech stack

Requirements

Apply for this position

Good distractions

Moments

Videos View all