Machine Learning (ML) Engineer

Modelcat

10 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

Tech stack

Artificial Intelligence

Amazon Web Services (AWS)

Computer Vision

C++

Continuous Integration

Linux

Python

Machine Learning

Language Modeling

Node.js

Object Detection

OpenCV

Software Architecture

TensorFlow

Signal Processing

Web Applications

PyTorch

Transfer Learning

Deep Learning

Data Strategy

Pandas

Scikit Learn

Production Code

Web Technologies

Machine Learning Operations

REST

Job description

We're seeking a motivated ML Engineer to help advance our AutoML platform. You'll play a key role in expanding its capabilities, onboarding new ML use-cases across vision, time-series, and beyond, and improving the product as we scale. This role offers meaningful growth potential toward a technical leadership track., * Contribute to the development and enhancement of our AutoML system for Edge AI, including pipelines that combine deep-learning and conventional algorithms for embedded devices

Object tracking, multi-model pipelines, and emerging use-cases

Build and improve platform features across compute clusters and our web application
Define abstractions and contribute to the architecture of cloud, cluster, and embedded components

ML Use-Case Expansion

Integrate new ML use-cases across a broad range of data domains and maintain and improve existing ones, including:

Time-series and audio, object re-identification, segmentation and keypoints
Action recognition (video), radar and point cloud data, multi-modal (vision + audio + sensor)
Small language models (NLP/SLM), classification, and object detection

Work with foundational computer vision and non-CV ML models - train, evaluate, modify, and combine them to unlock new functionality

Edge AI Optimization & Deployment

Optimize AI solutions for edge devices using TinyML frameworks, creating models that fit a range of chip sizes and memory constraints
Deploy ML and non-ML algorithms on embedded targets (MCU and application-class microprocessors)
Productize research-quality code into robust, production-ready systems

Collaboration & Craft

Partner on data strategies, preprocessing pipelines, and model training workflows
Stay current with Edge AI and AutoML advancements
Document your work and contribute to technical reports

Requirements

Master's degree in CS, EE, or a related field (PhD a plus)
4+ years of relevant industry experience in ML (AutoML and Edge AI experience highly valued)
Strong Python skills with the ability to write production-quality code; C/C++ a plus
Solid command of ML frameworks: TensorFlow, PyTorch, ONNX
Proficient with the standard DS toolset: scikit-learn, OpenCV, pandas
Comfortable working in Linux-based development environments
Experience onboarding new ML use-cases and expanding into new data domains
Excellent problem-solving skills and strong written and verbal English communication

Preferred

Experience with cloud platforms (AWS) and web technologies (Node.js, REST APIs)
Familiarity with compute cluster tools such as Ray and Optuna
Knowledge of model compression techniques: pruning, quantization, transfer learning, knowledge distillation
Experience defining software architecture for ML systems
Familiarity with CI/CD practices
Understanding of embedded systems concepts
Experience with non-ML algorithms and signal processing

Benefits & conditions

Market Opportunity - Edge AI is exploding, and we're solving a critical pain point in a massive and growing market
Real Customer Impact - Our platform compresses 12-24 months of model development into 24-48 hours - validated by customers like NXP and Silicon Labs
Technical Depth - Work on hard, meaningful problems at the intersection of AutoML, TinyML, and embedded systems
Growth Trajectory - Join during a pivotal growth phase with significant room to grow into technical leadership
Competitive Compensation - Base salary, performance-based bonus, and meaningful equity stake

ModelCat is an equal opportunity employer committed to building a diverse and inclusive team.

About the company

ModelCat is transforming how companies develop AI models for embedded, edge, and IoT devices. Our innovative platform uses AI to build AI - turning model architecture selection, training, optimization, and validation into a single powerful step. ModelCat takes what was previously a 12-24 month process requiring highly skilled AI professionals and reduces it to a 24-48 hour AI-powered job that can be run by developers, data scientists, and product owners. Trusted by industry leaders like NXP and Silicon Labs, ModelCat is a venture-backed startup headquartered in Sunnyvale, California.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all