Sr Software Dev Engineer, Stores Foundational AI -SFAI

Amazon.com, Inc.
Seattle, United States of America
2 days ago

Role details

Contract type
Internship / Graduate position
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 227K

Job location

Seattle, United States of America

Tech stack

Artificial Intelligence
Big Data
Code Review
Computer Programming
Software Design Patterns
Distributed Systems
Machine Learning
TensorFlow
Software Engineering
PyTorch
Large Language Models
Prompt Engineering
Build Management
Optimization Algorithms
Machine Learning Operations
TensorRT
Programming Languages

Job description

We're building a foundational LLM for Amazon Stores that fuses general world knowledge with Amazon e-commerce domain knowledge to provide new and improved shopping experiences for our customers. We are searching for pioneers who are passionate about technology, innovation, and customer experience, and are ready to make a lasting impact on the industry. You'll be working with talented scientists and engineers to innovate on behalf of our customers. If you're fired up about being part of a dynamic, driven team, then this is your moment to join us on this exciting journey!, In this role you will leverage your engineering background and expertise to help develop generative AI for shopping. As a Senior Software Development Engineer, you will:

Architect and build scalable ML infrastructure that powers the training and deployment of large language models-directly shaping the future of AI-driven shopping experiences for all Amazon customers Drive technical innovation by designing experimentation frameworks and tooling that accelerate breakthrough insights, enabling scientists and engineers to iterate faster and smarter Lead cross-functional initiatives partnering with applied scientists and engineering teams to translate frontier research into production systems that delight customers Mentor and elevate the team through technical leadership, code reviews, and architectural guidance-raising the bar for engineering excellence across the organization Own impactful projects end-to-end across diverse technologies-from distributed computing and ML operations to prompt engineering-while navigating ambiguity and making strategic trade-offs that balance innovation with delivery

A day in the life On any given day, you may work on: Design and build end-to-end RL post-training pipelines (rollout * reward * optimization) at cluster scale Improve RL training stability (PPO / GRPO / RLOO) by monitoring and tuning key metrics such as reward, KL divergence, and policy stability Optimize RL post-training efficiency (GPU utilization, batching, sequence packing, async rollouts) Partner with research scientists to translate new RL algorithms into scalable, production-ready systems Profile and eliminate bottlenecks across compute, networking, and storage Build observability systems for training dynamics, system health, and experiment tracking Collaborate cross-functionally to run experiments, iterate quickly, and unblock research progress Mentor engineers and contribute to system design and long-term technical roadmap

Requirements

5+ years of non-internship professional software development experience

  • 5+ years of programming with at least one software programming language experience
  • 4+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience
  • Experience as a mentor, tech lead or leading an engineering team
  • Knowledge of Machine Learning and LLM fundamentals, including transformer architecture, training/inference lifecycles, and optimization techniques
  • Demonstrated ability to drive technical direction and influence engineering decisions across teams

Preferred Qualifications

  • Knowledge of ML frameworks including JAX, PyTorch, vLLM, SGLang, Dynamo, TorchXLA, and TensorRT
  • Experience with distributed systems, big data technologies, and machine learning infrastructure

Benefits & conditions

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, WA, Seattle - 168,100.00 - 227,400.00 USD annually

Apply for this position