Senior Machine Learning Engineer - AI Systems & Frameworks

10xEngineers

9 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Remote

Tech stack

API

Artificial Intelligence

C++

Code Review

Computer Programming

Continuous Integration

Data Structures

Software Debugging

Software Design Patterns

Memory Management

Python

Machine Learning

OpenCV

Open Source Technology

Performance Tuning

Software Architecture

TensorFlow

Software Engineering

Data Streaming

System Programming

PyTorch

Large Language Models

Concurrency

Caching

Generative AI

HuggingFace

Job description

We are seeking a senior engineer to join our high-performance team, which partners with leading AI chip companies to deliver cutting-edge software that enables end-users to run Vision and Generative AI inference workloads efficiently on custom accelerators.

As a Senior Machine Learning Engineer, you will be a key technical member of the team, working in a hands-on role to contribute to the design and development of frameworks, APIs, and runtime integrations that enable AI models to run seamlessly on next-generation hardware. You will also help shape technical direction, mentor junior engineers, and collaborate closely with the client's compiler/runtime and hardware architecture teams.

This is not a traditional applied ML role. Instead, it requires deep software engineering expertise with strong exposure to AI/ML systems, ideal for someone who thrives in systems programming and performance optimization, and wants to pivot into the AI/ML domain while continuing to build on their proven experience in delivering robust, high-quality software. In this role, you will directly contribute to enabling large-scale AI workloads on custom hardware.

What you'll do

Define and implement software architecture and design patterns to ensure frameworks and applications are scalable, maintainable, and reusable.
Architect and develop Python/C++ frameworks for integrating ML models with custom runtimes.
Design and implement high-performance APIs, bindings, and libraries to support inference of Vision and Generative AI models.
Build and maintain Model Zoo, loaders, and optimization workflows to simplify model deployment.
Profile, debug, and optimize performance-critical paths in runtime and framework layers.
Develop real-time pipelines and applications using GStreamer, OpenCV, and other multimedia/ML frameworks.
Collaborate with compiler/runtime teams to ensure smooth integration of graph-level and operator optimizations.
Mentor and guide junior engineers; enforce software engineering best practices across design, testing, CI/CD, and code reviews.
Own end-to-end delivery of software components: define scope, set timelines, and drive execution.

Requirements

Do you have a Master's degree?, * 5-7+ years of professional experience in software engineering, ideally in systems programming or performance-critical applications.

Strong programming expertise in C++ (C++11/14/17) and Python.
Experience with C++/Python bindings (e.g., pybind11, SWIG).
Strong fundamentals in:
Data structures and algorithms
Concurrency, threading, and synchronization
Memory management, caching, and performance profiling
Networking and streaming systems
Exposure to ML frameworks (PyTorch, TensorFlow, ONNX Runtime) and how they interface with hardware runtimes.
Experience building frameworks, SDKs, or toolchains used by other developers or customers.

Bonus Points

Hands-on experience deploying vision or Generative AI models (transformers, diffusion models, LLM inference).
Experience with multimedia/vision pipelines (e.g., GStreamer plugin development).
Contributions to open-source ML systems projects (e.g., Hugging Face, ONNX Runtime, TensorRT).
Prior experience in a technical lead or client-facing role.