Research Engineer, Responsibility Engineering, DeepMind
Role details
Job location
Tech stack
Job description
- Prototype and deliver scalable engineering solutions rapidly.
- Architect and optimize training and inference pipelines to evaluate the frontier language models.
- Develop post-training strategies to mitigate adversarial risks including jailbreak and prompt injection attacks.
- Collaborate with Research Scientists to translate safety research into implementations and present results to cross-functional stakeholders.
- Build and maintain evaluation infrastructure to systematically track model safety performance.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form.
Requirements
Do you have experience in Test automation?, Do you have a Master's degree?, * Bachelor's degree in Computer Science, Machine Learning, Mathematics, or a related technical field, or equivalent practical experience.
- 8 years of experience in machine learning engineering or large-scale software systems.
- 3 years of experience in Python programming.
- 3 years of experience with ML frameworks such as JAX, PyTorch, or TensorFlow., * Master's degree or PhD in Computer Science, Engineering, or a related field with a focus on Machine Learning.
- Experience working directly on AI safety, adversarial robustness, jailbreak evaluation, or responsible AI research.
- Experience in Python and C++ for high-performance ML library development.
- Experience with adversarial machine learning, red-teaming, AI safety evaluation, or security research.
- Experience building evaluation frameworks, benchmarks, or automated testing pipelines for ML models.