Machine Learning Research Engineer - Perception & Foundation Models
Role details
Job location
Tech stack
Job description
Zendar's "Semantic Spectrum" technology extracts rich scene understanding from radar sensing. As a Senior ML Research Engineer in Paris, your goal is to evolve this technology into a multi-modal foundation model architecture.
You will design and implement the architecture end-to-end. This involves training models from scratch on massive datasets, defining evaluation metrics for long-tail validation, and partnering with platform teams to ensure successful deployment in real-time embedded systems.
Why this role is exciting:
- Ownership: You will drive architectural decisions, making rigorous tradeoffs between approach A vs. B.
- Scale: You will work with a real-world dataset covering tens of thousands of kilometers across multiple continents.
- Impact: You will see your work validated on real vehicles, bridging the gap between research and production.
What You'll Do:
- Architect Multi-Sensor Strategies: Own the technical strategy for multi-sensor perception models. Design fusion architectures for streaming inputs (camera/radar/Lidar) utilizing early fusion and temporal fusion.
- Deliver Production-Ready Models: Build and deploy models for:
- Full-Scene Understanding: Occupancy grids, free-space, and dynamic occupancy.
- 3D Perception: Object detection and tracking.
- Static Environment: Lane line and road structure estimation.
- Drive Reliability: Target "four nines" reliability behavior in defined conditions, focusing on the messy long tail of real-world driving.
- Optimize for Real-Time: Partner with embedded teams to ensure models meet strict constraints (latency, memory, throughput) and integrate cleanly via stable interfaces.
Requirements
Do you have a Doctoral degree?, * Experience: 5+ years of experience (or a PhD) designing and implementing ML systems, with demonstrated ownership of research-to-production outcomes.
- Deep Learning Expertise: Strong background in perception, specifically transformer-based architectures, temporal modeling, and multi-modal learning.
- Training Mastery: Demonstrated experience training large models from scratch (not just fine-tuning) on large-scale datasets.
- Engineering Proficiency: Proficient in Python and a major deep learning framework (PyTorch or TensorFlow).
- Strategic Thinking: Ability to lead architectural discussions, articulate tradeoffs, quantify risks, and set realistic milestones.
Bonus Points:
- Sensor Knowledge: Experience with multi-sensor fusion (camera, radar, Lidar) and the nuances of real-world sensor noise.
- Advanced Education: PhD in Machine Learning, Computer Vision, or Robotics.
- Foundation Models: Experience with multi-modal pretraining, self-supervised learning, and scaling laws/strategies for autonomy.
- Modern Architectures: Familiarity with "Transfusion-style" paradigms (transformer-based fusion across modalities and time) and BEV-centric perception.
- Advanced Perception Tasks: Experience with 3D detection, occupancy networks, tracking, and streaming inference.
Benefits & conditions
- Competitive salary ranging from €75,000 to €95,000 annually depending on experience and equity
- Hybrid work model: in office 3 days per week (Monday, Tuesday, Thursday), the rest… work from wherever!
- Modern Workspace: Fully equipped, modern office in the heart of Paris
- Transportation/Commute: Commuter benefits (e.g., partial reimbursement for public transport or cycling programs, where applicable)
- Subsidized meal vouchers (tickets restaurant
- Wellness Pass (ex Gymlib)