AI Systems Architect
Mphasis
Charlotte, United States of America
2 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
EnglishJob location
Charlotte, United States of America
Tech stack
Java
Artificial Intelligence
Amazon Web Services (AWS)
Automated Storage and Retrieval Systems
Azure
Cloud Computing
Cloud Engineering
Distributed Systems
Python
Azure
Data Streaming
Retrieval-Augmented Generation
Large Language Models
Multi-Agent Systems
Caching
Kubernetes
Data Pipelines
Go
Programming Languages
Job description
We are seeking an experienced AI Systems Architect to design, build, and scale high-performance distributed AI systems. The ideal candidate will have deep expertise in GenAI, LLMs, and cloud-native architectures, along with hands-on experience in building enterprise-scale AI/ML platforms and agent-based systems., * Architect and deliver scalable, high-performance distributed systems
- Design and deploy AI/ML and GenAI platforms at enterprise scale
- Build and manage agent-based architectures, including:
- Prompt and context engineering
- MCP servers
- Evaluation frameworks
- Optimize LLM inference pipelines for latency, throughput, and efficiency
- Design and implement agent data & retrieval systems (vector DBs, hybrid search, memory, graph-based reasoning)
- Lead Kubernetes-based, cloud-native deployments
- Provide technical leadership, architecture governance, and hands-on mentoring to engineering teams
Requirements
- Strong experience in designing and implementing high-performance, large-scale distributed systems
- Proven experience in implementing and deploying AI/ML platforms at scale
- Expertise in building agent-based architectures, evaluation frameworks, and prompt/context engineering
- Knowledge of MCP (Model Context Protocol) servers
- Hands-on experience in LLM inference optimization, including batching and caching strategies
- Strong experience with Kubernetes and cloud infrastructure (AWS/Azure/GCP)
- Proficiency in at least one programming language (Python, Java, Go, etc.)
- Expertise in designing agent data stacks & retrieval systems, including:
- Vector databases
- Hybrid search
- Data freshness strategies
- Memory systems
- Graph reasoning
- BM25 and advanced retrieval techniques, * Experience with RAG (Retrieval-Augmented Generation) frameworks
- Familiarity with multi-agent systems and orchestration frameworks
- Exposure to real-time data pipelines and streaming architectures