Graduate AI Engineer

Reply Ltd
Charing Cross, United Kingdom
18 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Junior

Job location

Charing Cross, United Kingdom

Tech stack

Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Confluence
JIRA
Azure
Databases
Python
Machine Learning
Open Source Technology
Performance Tuning
TensorFlow
Tokenization
Google Cloud Platform
Chatbots
PyTorch
Large Language Models
Information Technology
Low Latency
Atlassian Tools
Machine Learning Operations
REST
GPT
Microservices

Job description

models. These solutions are designed for high relevance, low latency, and strict compliance, ensuring maximum impact for our clients. We are recruiting for Autumn 2026. Responsibilities: Design, develop, and train large language models and AI systems. Fine-tune pre-trained LLMs (e.g., GPT, LLaMA, Mistral, Falcon) for specific use cases. Build and optimize prompting strategies, Retrieval-Augmented Generation (RAG), and agent-based systems. Prepare, clean, and manage large-scale datasets for model training. Implement model evaluation, benchmarking, and performance optimization. Deploy models into production using scalable and secure architectures. Collaborate with cross-functional teams to translate business needs into AI solutions. Monitor model performance, manage model drift, iterate improvements, and stay current with the latest research and advancements in AI and LLMs. About the Candidate, Bachelor's or Master's degree (2:1 or higher) in Computer Science, AI, Machine Learning, or a

Requirements

related field (or equivalent experience), is essential. Strong experience with Python and ML frameworks such as PyTorch or TensorFlow, and hands-on experience training, fine-tuning, or deploying LLMs. Solid understanding of NLP, transformers, attention mechanisms, embeddings, and experience with data preprocessing, tokenization, and dataset pipelines. Familiarity with REST APIs, microservices, model serving, and MLOps tools (e.g., MLflow, Kubeflow, Airflow, Weights & Biases). Experience with cloud platforms (AWS, GCP, Azure), distributed training, model parallelism, inference optimization, and GPU/TPU infrastructure. Knowledge of vector databases (e.g., FAISS, Pinecone), security, privacy, and responsible AI practices. Strong problem-solving, analytical, and communication skills, with a positive, team-oriented attitude and a passion for continuous learning. Additional advantages include experience with RLHF, open-source contributions, building AI copilots/chatbots, client and stakeholder management, and use of Atlassian tools like Jira and Confluence. Willingness to travel within the UK and EU for client engagements as required. Reply is an Equal Opportunities Employer and committed to embracing diversity in the workplace. We provide equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type regardless of age, sexual orientation, gender, identity, pregnancy, religion, nationality, ethnic origin, disability, medical history, skin colour, marital status or parental status or any other characteristic protected by the Law. Reply is committed to making sure that our selection methods are fair to everyone. To help you during the recruitment process, please let us know of any Reasonable Adjustments you may need.

About the company

Graduate AI Engineer About Sail Reply: Sail Reply is an AI tech innovation consultancy that delivers experience-led, value-focused solutions for some of the world's most forward-thinking organisations. Our mission is democratising LLMs to any business process by turning proprietary knowledge into competitive advantage with bespoke LLMs built for Clients domain and deployed at scale. We build bespoke LLM solutions tailored to the client's business processes, delivering enterprise-grade performance comparable to leading off-the-shelf model. Providing a solution designed for high relevance, low latency, and compliance. Role Overview: As an AI engineer, you will help deliver experience-led, value-focused solutions for innovative organizations by building bespoke LLMs tailored to client business processes. Your work will focus on turning proprietary knowledge into competitive advantage by deploying custom LLMs at scale, achieving enterprise-grade performance on par with leading off-the-shelf

Apply for this position