Research Scientist / Engineer - Data & Evaluation

RHODA AI CORPORATION

Palo Alto, United States of America

1 month ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Job location

Palo Alto, United States of America

Tech stack

Computer Vision

Data Deduplication

Language Modeling

Data Pipelines

Data Selection

Job description

Design and implement scalable curation pipelines for web-scale video pretraining data: ingestion, deduplication, quality filtering, and content classification across internet-scale video corpora
Develop video-specific annotation frameworks and quality filters - motion quality, scene diversity, action content, temporal coherence - to improve pretraining signal
Build evaluation frameworks and benchmarks to measure causal video model capabilities: prediction quality, temporal coherence, long-horizon rollout fidelity, and downstream robot task performance
Research and implement data selection, mixing, and weighting strategies that improve video generation quality and transfer to robotic control
Deploy and scale vision-language models (VLMs) and video understanding models for automated annotation, filtering, and content scoring at web scale
Collaborate closely with pre-training and post-training teams to ensure data quality and evaluation methodology drive research decisions
Track model capability trends across training runs, catching regressions and surfacing improvements early, * The video curation and evaluation rigor you build directly determines pretraining quality and research iteration speed for the entire team
Build the benchmark infrastructure that gives the team an honest signal of model progress toward real robot performance
High leverage: improvements to data quality compound across every training run
Work at the intersection of large-scale systems and generative model research with visibility across all model development

Requirements

Do you have experience in Scientific publications?, * Strong understanding of data-centric ML and how web video data quality affects large generative model performance

Experience building large-scale video data pipelines: ingestion, filtering, deduplication, and quality scoring
Familiarity with video-specific data characteristics: temporal structure, motion quality, scene diversity, and action content
Solid ML fundamentals with hands-on experience training or evaluating large generative models
Ability to design evaluations for video generation models that are diagnostic, reproducible, and actionable
Staff-level candidates are expected to define technical direction and drive research strategy independently; senior/MTS candidates execute complex projects with strong fundamentals and growing scope

Nice to Have (But Not Required)

PhD or strong research background in ML, computer vision, or a related field
Experience with large-scale web video dataset curation (e.g., WebVid, HowTo100M, Ego4D, or similar)
Familiarity with video generation quality metrics (FVD, perceptual quality, motion consistency)
Experience running VLM or CLIP-style inference at scale for automated video filtering and annotation
Prior work on evaluation methodology for video generation or world models
Understanding of how web video data properties connect to downstream robotic action prediction
Publication record at NeurIPS, ICML, ICLR, CVPR, or related venues

About the company

At Rhoda AI, we're building the full-stack foundation for the next generation of humanoid robots - from high-performance, software-defined hardware to the foundational models and video world models that control it. Our robots are designed to be generalists capable of operating in complex, real-world environments and handling scenarios unseen in training. We work at the intersection of large-scale learning, robotics, and systems, with a research team that includes researchers from Stanford, Berkeley, Harvard, and beyond. We're not building a feature; we're building a new computing platform for physical work - and with over $400M raised, we're investing aggressively in the R&D, hardware development, and manufacturing scale-up to make that a reality.

Role details

Job location

Tech stack

Job description

Requirements

About the company

Apply for this position

Good distractions

Moments

Videos View all