ML Data Engineer - Computer Vision, Video & Sensor Data
autonomous-teaming
Canton of Toulouse-5, France
14 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English, French, GermanJob location
Canton of Toulouse-5, France
Tech stack
Amazon Web Services (AWS)
Computer Vision
Automation of Tests
Cloud Storage
Databases
Data Infrastructure
ETL
File Systems
Python
NoSQL
NumPy
Object Detection
SQL Databases
Management of Software Versions
Data Processing
Pandas
Machine Learning Operations
Lidar
Docker
Data Generation
Job description
- Design and maintain high-throughput, scalable pipelines to ingest and organize large volumes of time-series camera and sensor data (RGB, IR, thermal, acoustic, depth, IMU).
- Own, curate, and continuously improve computer vision datasets for object detection and classification, ensuring high-quality, diverse, and statistically representative data.
- Build and operate active learning loops to prioritize high-value samples and accelerate dataset improvements.
- Write robust preprocessing and transformation pipelines using Python, NumPy, Pandas, and Albumentations for large-scale computer vision workloads.
- Manage labeling workflows, including automation, QA validation, annotation consistency checks, and dataset versioning.
- Collaborate with ML Engineers to fine-tune, train, and evaluate detection models, feeding insights back into data generation and selection.
- Analyze model weaknesses, blind spots, bias, and drift to derive actionable data improvements.
- Create internal tools and dashboards to visualize, audit, and analyze dataset quality, diversity, long-tail distributions, and model performance gaps.
Requirements
Do you have experience in Sensors?, * Strong experience in Python and data processing frameworks (Pandas, NumPy, vectorized operations, multiprocessing).
- Hands-on experience building ETL/ELT pipelines for ingesting, transforming, and structuring large video and sensor datasets.
- Experience with data orchestration and lifecycle management for ML and computer vision workflows, including dataset versioning and reproducibility.
- Solid understanding of object detection pipelines (Detectron2, MMDetection, COCO format, bounding-box standards).
- Experience with active learning, uncertainty sampling, or semi-supervised dataset workflows.
- Familiarity with data annotation platforms (CVAT, Label Studio) and automated QA/consistency checks.
- Strong grasp of evaluation metrics for object detection (IoU, mAP, precision-recall curves, class-wise metrics).
- Comfortable with databases (SQL/NoSQL), file systems, and the management of large-scale image, video, and sensor datasets.
- Ability to work cross-functionally with perception, deployment, robotics, and data infrastructure teams.
- Fluent in English, German and/or French are a plus
Nice to Have:
- Experience with cloud storage and MLOps tools (AWS S3, MinIO, ClearML, MLFlow, Weights & Biases).
- Familiarity with ROS / robotics data formats (bag files, TF trees, sensor_msgs), Docker, or embedded ML workflows.
- Prior work with robotics, drones, or multi-sensor perception systems, including IR, LiDAR, radar, or audio datasets.
Meta:
- Outside-the-box creativity with a blend of conceptual and systematic design thinking.
- High intrinsic motivation, attention to detail, and strong problem-solving mindset.
- Structured, methodical, and reliable execution, even under uncertainty.
- Humble, collaborative, and mission-driven - values collective success over ego.
- High ethical standards and disciplined work ethic.
- Extra-curricular achievements, leadership, or unique projects are a plus.
- NATO-aligned nationality or close ally citizenship is required.
- Successful candidates must obtain security clearance., The world is changing. Exponential technologies are enabling new types of security threats. We are committed to staying ahead by building nimble, scalable, and cost-effective defences. We are looking for passionate developers who are eager to create exceptional products, safeguard our freedom, and strengthen the resilience of democracies.
About the company
Who we are: Autonomous Teaming is a defence-tech start-up specializing in machine vision solutions. Driven by cutting-edge innovation, our team works on next-generation technologies designed to meet rapidly evolving security challenges.
What we do: We develop systems that enable computers and sensors to operate as coordinated teams, collaborating in real time to counter AI-powered asymmetric threats at scale - including drone swarms and other UXVs. Our mission is to build resilient, intelligent defence capabilities that perform reliably in the most demanding environments.
Who we are: Based in Munich, Berlin, and Toulouse, we are expanding rapidly across Europe with plans to open additional office hubs. We value close, in-person collaboration as the foundation for building complex, high-impact technology, while maintaining flexibility aligned to role and team needs. Our culture is built on ownership, responsibility, and trust - with a shared commitment to growing and building together.
If you require alternative methods of application or screening, you must approach the employer directly to request this as Indeed is not responsible for the employer's application process.