ML Data Engineer - Computer Vision, Video & Sensor Data

autonomous-teaming
Canton of Toulouse-5, France
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English, French, German

Job location

Remote
Canton of Toulouse-5, France

Tech stack

Amazon Web Services (AWS)
Computer Vision
Automation of Tests
Cloud Storage
Databases
Data Infrastructure
ETL
File Systems
Python
NoSQL
NumPy
Object Detection
SQL Databases
Management of Software Versions
Data Processing
Pandas
Machine Learning Operations
Lidar
Docker
Data Generation

Job description

  • Design and maintain high-throughput, scalable pipelines to ingest and organize large volumes of time-series camera and sensor data (RGB, IR, thermal, acoustic, depth, IMU).
  • Own, curate, and continuously improve computer vision datasets for object detection and classification, ensuring high-quality, diverse, and statistically representative data.
  • Build and operate active learning loops to prioritize high-value samples and accelerate dataset improvements.
  • Write robust preprocessing and transformation pipelines using Python, NumPy, Pandas, and Albumentations for large-scale computer vision workloads.
  • Manage labeling workflows, including automation, QA validation, annotation consistency checks, and dataset versioning.
  • Collaborate with ML Engineers to fine-tune, train, and evaluate detection models, feeding insights back into data generation and selection.
  • Analyze model weaknesses, blind spots, bias, and drift to derive actionable data improvements.
  • Create internal tools and dashboards to visualize, audit, and analyze dataset quality, diversity, long-tail distributions, and model performance gaps.

Requirements

  • Strong experience in Python and data processing frameworks (Pandas, NumPy, vectorized operations, multiprocessing).
  • Hands-on experience building ETL/ELT pipelines for ingesting, transforming, and structuring large video and sensor datasets.
  • Experience with data orchestration and lifecycle management for ML and computer vision workflows, including dataset versioning and reproducibility.
  • Solid understanding of object detection pipelines (Detectron2, MMDetection, COCO format, bounding-box standards).
  • Experience with active learning, uncertainty sampling, or semi-supervised dataset workflows.
  • Familiarity with data annotation platforms (CVAT, Label Studio) and automated QA/consistency checks.
  • Strong grasp of evaluation metrics for object detection (IoU, mAP, precision-recall curves, class-wise metrics).
  • Comfortable with databases (SQL/NoSQL), file systems, and the management of large-scale image, video, and sensor datasets.
  • Ability to work cross-functionally with perception, deployment, robotics, and data infrastructure teams.
  • Fluent in English; German and/or French is a plus.

Nice to Have:

  • Experience with cloud storage and MLOps tools (AWS S3, MinIO, ClearML, MLflow, Weights & Biases).
  • Familiarity with ROS / robotics data formats (bag files, TF trees, sensor_msgs), Docker, or embedded ML workflows.
  • Prior work with robotics, drones, or multi-sensor perception systems, including IR, LiDAR, radar, or audio datasets.

Meta:

  • Outside-the-box creativity with a blend of conceptual and systematic design thinking.
  • High intrinsic motivation, attention to detail, and strong problem-solving mindset.
  • Structured, methodical, and reliable execution, even under uncertainty.
  • Humble, collaborative, and mission-driven - values collective success over ego.
  • High ethical standards and disciplined work ethic.
  • Extra-curricular achievements, leadership, or unique projects are a plus.
  • NATO-aligned nationality or close ally citizenship is required.
  • Successful candidates must obtain security clearance.

The world is changing. Exponential technologies are enabling new types of security threats. We are committed to staying ahead by building nimble, scalable, and cost-effective defences. We are looking for passionate developers who are eager to create exceptional products, safeguard our freedom, and strengthen the resilience of democracies.

About the company

We are a defence-tech start-up specializing in machine vision solutions. If you have a passion for cutting-edge innovation and the drive to use your skills to create next-generation solutions, this is an opportunity for you!

What we do: We are developing solutions that enable computers and sensors to collaborate as teams, working together to address emerging security challenges. Our primary mission is to defend against AI-powered asymmetric threats at scale, such as drone swarms and other UXVs.

Who we are: Based in Munich, Berlin, and Bordeaux/Toulouse, we are rapidly expanding across Europe, with plans to open more office hubs soon. We embrace a hybrid work culture - valuing the collaboration that happens in the office, while also empowering our team members to work remotely with responsibility and autonomy.
