AI Infrastructure Engineer, Model Optimization & Deployment, Optimus

Tesla Motors
Palo Alto, United States of America
5 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
$176K

Job location

Palo Alto, United States of America

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Artificial Neural Networks
Automation of Tests
Azure
Cloud Computing
Program Optimization
Serialization
Memory Management
Protocol Buffers
Python
Machine Learning
Prometheus
Smart Devices
Software Engineering
AI Infrastructure
Google Cloud Platform
PyTorch
Flask
Grafana
FastAPI
Containerization
Kubernetes
ONNX (Open Neural Network Exchange) Format
Avro
Machine Learning Operations
TensorRT
REST
Serverless Computing
Docker

Job description

Tesla AI is solving robust, real-world AI through humanoid robots. As a Software Engineer for the Optimus team, you will build the tools and infrastructure to make and measure improvements to neural network architecture, visualize data, assist with exporting and deploying neural networks to Tesla's neural network chip with real-time latency constraints on Optimus, and evaluate experimental results. You will help us automate the entire workflow of training, validation, and production of Optimus. Most importantly, you will see your work repeatedly shipped to and utilized by thousands of humanoid robots in real-world applications.

What You'll Do

  • Optimize ML models for latency, memory usage, and inference speed

  • Quantize, prune, and convert models (e.g., to ONNX, TensorRT, TFLite) for deployment on various platforms (cloud, edge, mobile)

  • Benchmark and profile model performance across different environments

  • Package and deploy models as REST APIs, batch jobs, or streaming services using tools like FastAPI, Flask, or gRPC

  • Implement CI/CD pipelines for automated testing and deployment of ML models

  • Ensure scalability and reliability of ML services in production environments

Requirements

  • Strong proficiency in Python and PyTorch

  • Experience with model optimization tools (e.g., ONNX, TensorRT, TFLite, TVM)

  • Experience with model inference optimization and quantization

  • Solid understanding of containerization and orchestration (Docker, Kubernetes)

  • Familiarity with cloud platforms (AWS, GCP, Azure) and serverless deployments

  • Strong grasp of software engineering principles and CI/CD pipelines

  • Experience deploying models to edge devices or mobile platforms

  • Knowledge of data serialization formats (e.g., protobuf, Avro)

  • Exposure to observability tools (e.g., Prometheus, Grafana) for ML monitoring

Benefits & conditions

Along with competitive pay, as a full-time Tesla employee, you are eligible for the following benefits from day 1 of hire:

  • Medical plans, including plan options with a $0 payroll deduction
  • Family-building, fertility, adoption and surrogacy benefits
  • Dental (including orthodontic coverage) and vision plans, both of which have options with a $0 paycheck contribution
  • Company-paid Health Savings Account (HSA) contribution when enrolled in the High-Deductible medical plan with HSA
  • Healthcare and Dependent Care Flexible Spending Accounts (FSA)
  • 401(k) with employer match, Employee Stock Purchase Plans, and other financial benefits
  • Company-paid Basic Life and AD&D insurance
  • Short-term and long-term disability insurance (90-day waiting period)
  • Employee Assistance Program
  • Sick and Vacation time (Flex time for salary positions, Accrued hours for Hourly positions), and Paid Holidays
  • Back-up childcare and parenting support resources
  • Voluntary benefits to include: critical illness, hospital indemnity, accident insurance, theft & legal services, and pet insurance
  • Weight Loss and Tobacco Cessation Programs
  • Tesla Babies program
  • Commuter benefits
  • Employee discounts and perks program

Expected Compensation $176,000 - $420,000/annual salary + cash and stock awards + benefits
