AI Engineer
Role details
Job location
Tech stack
Job description
What if the systems you built didn't just optimise engagement, but played a role in determining someone's future career? This is an opportunity to work on AI solutions that directly impact education, certification, and access to skilled professions. Your work will sit at the crossroads of large language models, evaluation frameworks, and high-stakes decision-making, where precision, fairness and transparency are critical. About the Company This UK-based AI organisation is redefining how professional certification and assessment are delivered. Their goal is to make accreditation more accessible, scalable and fair through intelligent automation. You'll join a small, highly skilled team where your voice matters. Engineers here are trusted to take ownership, influence direction and deliver meaningful outcomes without unnecessary red tape. Collaboration is strong, egos are low, and the problems being solved have real societal impact. What you'll gain:
-
A high level of autonomy and ownership over your work
-
The chance to build AI systems with genuine real-world impact
-
End-to-end involvement across the full AI lifecycle
-
A collaborative, pragmatic engineering culture
-
Exposure to advanced areas including LLM evaluation, multimodal AI and human-in-the-loop systems The Role As an AI Engineer, you'll design and scale intelligent assessment systems capable of evaluating knowledge and skills accurately and efficiently. Your work will involve:
-
Developing AI-driven assessment pipelines using structured rubrics, sample answers and historical data
-
Creating evaluation frameworks, golden datasets and robust testing methodologies
-
Experimenting with LLM workflows, optimising for performance, cost and latency
-
Applying statistical techniques to validate model improvements
-
Building multimodal systems that can interpret text, images and video
-
Generating meaningful, transparent feedback for users
-
Implementing observability, monitoring and evaluation pipelines
-
Designing safeguards such as confidence thresholds and human review processes
-
Building APIs and deploying scalable solutions in the cloud
-
Contributing to overall system architecture and technical strategy You'll take ownership across the full lifecycle from experimentation and model tuning through to deployment and iteration.
Requirements
You're an experienced AI/ML engineer who enjoys solving complex, high-impact problems and working with autonomy. You'll likely have:
- Around 4+ years' experience in AI/ML engineering (or a relevant PhD with industry exposure)
- A track record of delivering end-to-end ML systems in production
- Strong Python skills and experience building APIs
- Hands-on experience with LLMs and modern AI workflows
- Expertise in evaluation frameworks and model optimisation
- Familiarity with observability, experimentation and human-in-the-loop systems
- Experience deploying to cloud environments (AWS preferred)
- A solid understanding of fairness, privacy and model integrity
- Confidence working in a fast-paced, small team environment Desirable: Experience in EdTech, Python, multimodal AI or automated assessment systems.