Software Engineer - Machine Learning III

Spectraforce

Mountain View, United States of America

13 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Job location

Mountain View, United States of America

Tech stack

Training Data

Artificial Intelligence

Data Systems

Distributed Computing Environment

Python

Machine Learning

TensorFlow

Software Engineering

Data Logging

PyTorch

Delivery Pipeline

Large Language Models

Information Technology

Free and Open-Source Software

Machine Learning Operations

Software Version Control

Data Generation

Job description

Design and train prompt injection detection models and prompt safety classifiers that operate on both the inputs to and the outputs from the client's agentic AI systems.
Build hybrid deployment pipelines that split safety inference between on-device (phone, XR/AR) and cloud, optimizing for latency, privacy, and detection coverage.
Apply post-training techniques (e.g. RLHF, reward modeling, policy optimization) to optimize guardrail model performance, calibration, and robustness against adaptive adversaries.
Curate and generate adversarial training data: direct and indirect prompt injections, jailbreaks, tool-use exploits, and unsafe-output cases drawn from red-teaming and production signals.
Build evaluation harnesses that measure attack success rate, false-positive rate, latency, and on-device footprint across model iterations and threat categories.
Partner with agent, device, and platform teams to integrate safety models into mobile-use agents, XR/AR assistants, and cloud agentic workflows, and to close the loop from production incidents back into training data.
Work cross-functionally with security researchers, modeling teams, and product engineers; document methods and, where appropriate, contribute to patents and publications.

Requirements

Experience with on-device or edge ML deployment (ExecuTorch, Core ML, TFLite, MLC-LLM, vendor NPU toolchains) and model compression (quantization, distillation, pruning) for safety models.
Experience with telemetry, logging, or user-facing data systems on mobile, XR/AR, or consumer platforms, including privacy-preserving handling of user data (e.g., anonymization, on-device processing, federated approaches).
Publications at top-tier ML/NLP/security venues (NeurIPS, ICML, ICLR, ACL, EMNLP, USENIX Security, IEEE S&P), patents, or open-source contributions in the safety, alignment, or AI security space.
M.S. or Ph.D. in Computer Science, Machine Learning, Electrical Engineering, or a related field; or B.S. with equivalent industry experience.
3+ years of industry experience in ML engineering or applied AI research, with demonstrated ownership of production ML systems.
2+ years of industry experience in software engineering.
Strong proficiency in Python and PyTorch (or JAX/TensorFlow), with solid software engineering fundamentals (version control, testing, and reproducible experimentation).
Hands-on experience post-training LLMs with RLHF, DPO, RLAIF, or reward modeling, including reward design, preference data curation, and training stability.
Hands-on experience training and deploying classifier or guardrail models for safety, content moderation, abuse detection, or adversarial robustness.
Familiarity with prompt injection, jailbreak, and agentic AI threat models, and with distributed training frameworks (DeepSpeed, FSDP, Accelerate).
Experience building safety or moderation systems for agentic AI: tool-use guardrails, indirect prompt injection defenses, or output filtering for autonomous agents.
Experience with red-teaming, adversarial data generation, or automated attack pipelines (e.g., GCG, PAIR, generator-critic frameworks).
