Data Scientist

Stott and May

Edinburgh, United Kingdom

2 days ago

Role details

Contract type

Temporary contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Remote

Edinburgh, United Kingdom

Tech stack

Artificial Intelligence

Data analysis

Big Data

Continuous Integration

Data Governance

Data Integration

Machine Learning

Natural Language Processing

Feature Engineering

Istio

Large Language Models

Prompt Engineering

Machine Learning Operations

Api Design

Api Gateway

Ddos

Software Version Control

Microservices

Job description

In this role, you will uncover insights and intelligence that help customers in dynamic, data-intensive industries operate, scale, and innovate. You will develop robust, future-ready machine learning and analytical models that enable predictive insights, automation, and data-driven decision making across complex digital transformation programmes. Working with modern data platforms, high-quality datasets, and advanced AI frameworks, you will collaborate with engineering, product, and analytics teams to build models that are accurate, explainable, and scalable. The role empowers you to shape end-to-end analytical ecosystems, enhancing decision quality, strengthening operational resilience, and guiding organisations toward an AI-enabled future., * Explore, clean, and analyse large, complex datasets to uncover patterns, trends, and opportunities

Develop, train, and validate machine learning, statistical, and predictive models to solve business problems
Design and run experiments such as A/B tests, hypothesis tests, and simulations to quantify outcomes
Collaborate with data engineers, analysts, product managers, and domain experts to translate business requirements into modelling tasks
Build end-to-end ML pipelines, including feature engineering, preprocessing, and deployment-ready outputs
Apply advanced techniques such as NLP, time series forecasting, anomaly detection, optimisation, or LLM/GenAI methods
Implement model evaluation frameworks using offline metrics, cross-validation, online experiments, and human-in-the-loop feedback
Communicate insights through dashboards, visualisations, written summaries, and presentations for technical and non-technical stakeholders
Ensure models are interpretable and explainable, providing transparency into key drivers and assumptions
Work with engineering teams to deploy models into production and monitor, retrain, or recalibrate as data and conditions evolve

Requirements

5-12 years of experience as a Data Scientist
Hands-on experience with GenAI, Gemini, or open-source LLMs and developing GenAI applications for code translation, text extraction, summarisation, and SDLC optimisation
Experience with AI agents, chatbots, RAG (retrieval-augmented generation), and vector databases (PG vector, Croma DB)
Familiarity with GenAI performance evaluation tools such as Pegasus, Ragas, and DeepEval
Ability to create conversational interfaces with React JS or other frontend components, develop and deploy AI agents using LangGraph, ADK, A2A, MCP
Strong programming skills in Python (LangChain/LangGraph/LangSmith frameworks) and TypeScript (preferred)
Solid understanding of LLMs, prompt engineering, and graph-based workflows
Knowledge and implementation of input/output guardrails for hallucination, PII filtering, HAP, and bias mitigation
Experience implementing security best practices and mitigating spikes, DDoS attacks, and other threat scenarios
Knowledge of API gateways, ISTIO, and diagnosing end-to-end communication failures
Hands-on experience with API development and microservices architecture

Desirable Skills, Knowledge and Experience

Strong experience applying machine learning, statistical modelling, and predictive analytics to real-world business problems
Ability to resolve end-to-end connectivity and data integration issues in cross-functional teams
Experience working with large, complex datasets, including cleaning, feature engineering, and exploratory data analysis
Familiarity with LLMs, NLP techniques, and GenAI frameworks, including embeddings, prompt engineering, or fine-tuning
Experience building end-to-end ML pipelines, including validation, optimisation, deployment, and monitoring
Understanding of MLOps practices, including model versioning, registries, CI/CD for ML, and automated workflows
Ability to translate business problems into analytical tasks and communicate insights clearly
Knowledge of data governance, quality, lineage, ethics, privacy, and responsible AI principles
Comfort working with cloud platforms (GCP preferred) for scalable model training and deployment
Growth-oriented mindset with enthusiasm for exploring new algorithms, tools, and emerging AI/ML techniques