Machine Learning Engineer

Descriptionmy
9 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Tech stack

Cloud Computing
Computer Programming
Software Debugging
Distributed Systems
Python
Machine Learning
Kubernetes

Job description

The RoleProductionise machine learning models from research through validation, staging and live deploymentBuild, maintain and optimise scalable inference infrastructure supporting high-throughput, low-latency workloadsImprove performance and reliability across GPU-based environmentsDesign and implement model serving and deployment workflowsDevelop monitoring and observability tools to track system performance, errors and utilisationSupport data preparation and model integration as part of the wider development lifecycleCollaborate with research, engineering and infrastructure teams to improve deployment efficiency and platform scalabilityEvaluate and integrate third-party infrastructure and inference tooling where appropriate

Requirements

RequirementsProven experience deploying and maintaining ML inference systems in production environmentsStrong programming experience in Python and familiarity with modern machine learning frameworksExperience working with containerisation and orchestration technologies such as Kubernetes or similarExposure to distributed systems and cloud-based infrastructureExperience supporting GPU workloads and performance optimisationStrong troubleshooting skills across performance, scaling and system reliabilityComfortable working cross-functionally within research-led environmentsAbility to operate in fast-paced teams with evolving technical priorities Nice to HaveExperience building or improving model serving infrastructureUnderstanding of distributed training or inference techniquesExperience debugging low-level performance or hardware-related issuesExposure to real-time or latency-sensitive ML applications

Apply for this position