Senior AI Platform Backend Engineer (LLM)
EPAM
Municipality of Madrid, Spain
4 days ago
Role details
Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Job location
Municipality of Madrid, Spain
Tech stack
Artificial Intelligence
DevOps
GitHub
Python
Machine Learning
Message Broker
Natural Language Processing
NLTK
Recommender Systems
TensorFlow
Test Driven Development
Chatbots
PyTorch
Delivery Pipeline
Large Language Models
Prompt Engineering
Deep Learning
Backend
Keras
FastAPI
HuggingFace
Machine Learning Operations
spaCy
GPT
Jenkins
Microservices
Job description
- Maintain and enhance CI/CD pipelines using tools such as GitHub Actions, AWS CodePipeline, Jenkins or ArgoCD
- Design and develop backend architecture for AI Verification and ChatGPT services utilizing Python and FastAPI
- Build, optimize and scale classifiers and tools leveraging machine learning, encoders and rule-based models
- Architect and implement solutions following Domain-Driven Design (DDD) and Test-Driven Development (TDD) best practices
- Design, develop and maintain a production-grade LLM-as-a-judge service that verifies AI-generated content against source documents, leveraging tools such as HuggingFace Transformers, spaCy, NLTK and BM25
- Build and maintain high-throughput Retrieval-Augmented Generation (RAG) services, including ingestion pipelines and message brokers
- Perform prompt engineering, including techniques such as Chain-of-Thought and Few-Shot Learning, across various LLMs (OpenAI, Anthropic, Google, etc.)
- Provide hands-on expertise and support for one or more leading AI frameworks and models (TensorFlow, Keras, PyTorch, BERT, etc.)
- Demonstrate technical leadership in at least one AI specialization, such as graph recommendation systems, deep learning or natural language processing
Requirements
We are seeking a talented and proactive Senior AI Platform Backend Engineer (LLM) to lead the design, optimization and deployment of machine learning pipelines using MLOps in cloud environments. In this role, you will implement LLM-based solutions for chatbots and Retrieval-Augmented Generation (RAG) systems and develop robust DevOps/MLOps pipelines for production.
- Strong proficiency in Python and experience developing backend services with FastAPI
- Experience building and maintaining CI/CD pipelines using tools such as GitHub Actions, AWS CodePipeline, Jenkins or ArgoCD
- Hands-on experience designing and optimizing scalable machine learning models, including classifiers, encoders and rule-based systems
- Experience working with large language models (LLMs) and tools such as HuggingFace Transformers, spaCy, NLTK and BM25
- Proven ability to design, develop and maintain high-load Retrieval-Augmented Generation (RAG) services, including ingestion pipelines and message brokers
Benefits & conditions
- Private health insurance
- EPAM Employees Stock Purchase Plan
- 100% paid sick leave
- Referral Program
- Professional certification
- Language courses
Why Join EPAM
- WORK AND LIFE BALANCE. Enjoy more of your personal time with flexible work options, 24 working days of annual leave and paid time off for numerous public holidays.
- CONTINUOUS LEARNING CULTURE. Craft your personal Career Development Plan to align with your learning objectives. Take advantage of internal training, mentorship, sponsored certifications and LinkedIn courses.
- CLEAR AND DIFFERENT CAREER PATHS. Grow in an engineering or managerial direction to become a People Manager, in-depth technical specialist, Solution Architect, or Project/Delivery Manager.
- STRONG PROFESSIONAL COMMUNITY. Join a global EPAM community of highly skilled experts and connect with them to solve challenges, exchange ideas, share expertise and make friends.