Senior Software Engineer, Simulator Evaluation
Role details
Job location
Tech stack
Job description
We are looking for aSenior Software Engineer to build the metrics and systems that grade this hybrid environment. You will work at the intersection of software engineering and AI, ensuring that our simulated worlds-whether driven by explicit rules or foundation models-provide a trustworthy representation of reality.
In this hybrid role, you will report to a Senior Staff Software Engineering Manager and define the "Yardstick of Reality" used to validate and train Waymo technology.
You will:
- Architect the Eval Rubrik: You will develop novel methodologies to evaluate the simulator across the stack. You will distinguish between true driving challenges and realism artifacts-whether it's a logic gap, a physics glitch, or a model hallucination.
- Build at Scale: You will design and implement high-throughput pipelines (C++/Python) capable of processing massive datasets of simulation logs. You will turn raw, noisy data into clear, actionable signals.
- The "Critic" for the System: You will partner closely with AI research and other simulation teams, as the eval workflows you build will drive rapid innovation and research roadmaps.
- Strategic Leadership: You will navigate ambiguity to determine what matters most for realism. You will lead the strategy for specific domains, ensuring our evaluation evolves as fast as our simulation technology.
You have:
- Engineering Craftsmanship
Requirements
- 5+ years of software development experience.
- Proficiency in Python or C++, with experience building scalable data processing systems or evaluation frameworks.
- Strong software design principles: you write clean, testable code that is built to last.
Data Intuition & Quantitative Rigor:
- A "Data Detective" mindset: You can look at a distribution of outcomes and intuitively spot anomalies, selection bias, or system errors.
- Experience designing and implementing evaluation frameworks for complex systems or machine learning models.
System & Model Fluency:
- Comfort working with complex, hybrid systems. You understand how to evaluate different types of "black boxes," whether they are heuristic-based, physics-based, or learned models.
We prefer:
- Background in fields that blend code, math, and simulation: Autonomous Vehicles, Algorithmic Trading, AdTech/Search Ranking, Machine Learning, or Robotics.
- Experience with SQL and the Python data stack (Pandas, NumPy, SciPy).
- Familiarity with evaluating Generative AI / LLMs or experience with agent-based modeling and behavioral logic.
- Experience taking a metric from "research concept" to "production pipeline."
Benefits & conditions
The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.
Waymo employees are also eligible to participate in Waymo's discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements. Salary Range $204,000 - $259,000 USD Applied = 0 MORE JOBS LIKE THIS