Senior AI Researcher, On-Device LLM Efficiency

Qualcomm
San Diego, United States of America
15 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$239K

Job location

Remote
San Diego, United States of America

Tech stack

Artificial Intelligence
Systems Engineering
Computer Programming
Computer Engineering
Python
Machine Learning
Smart Devices
Software Deployment
Software Engineering
PyTorch
Large Language Models
Deep Learning
Information Technology

Job description

As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Machine Learning Researcher, you will conduct fundamental research to create innovative machine learning methods that advance beyond state-of-the-art performance. Qualcomm Engineers collaborate with cross-functional teams to enhance the world of mobile, edge, auto, and IoT products through machine learning research.

  • Conduct research and development in LLM inference efficiency algorithms, efficient model architecture design, and/or LLM training
  • Develop creative solutions that account for practical on-device challenges
  • Implement and evaluate candidate solutions in both simulation and on-device environments

Requirements

  • Master's degree in Computer Engineering, Computer Science, Electrical Engineering, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

OR PhD in Computer Engineering, Computer Science, Electrical Engineering, or related field.

  • 6+ months of academic and/or work experience developing and/or optimizing machine learning models, systems, platforms, or methods.

  • Master's degree in Computer Science, Electrical Engineering, or related field
  • 4+ years of AI research experience
  • Strong background in deep learning and Transformers
  • Strong programming skills in Python and PyTorch
  • Experience in LLM reasoning or inference acceleration research

  • PhD in Computer Science, Electrical Engineering, or related field
  • Experience in LLM efficiency research such as efficient attention, inference acceleration, or KV cache compression
  • Experience in on-device AI deployment on mobile or edge devices
  • Publishing research papers at top-tier AI/ML conferences, e.g., NeurIPS, ICML, and ICLR, as a lead author

Benefits & conditions

The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Even more importantly, please note that salary is only one component of total compensation at Qualcomm. We also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants (employees on sales-incentive plans are not eligible for our annual bonus). In addition, our highly competitive benefits package is designed to support your success at work, at home, and at play. Your recruiter will be happy to discuss all that Qualcomm has to offer, and you can review more details about our US benefits at this link.

About the company

Qualcomm Technologies, Inc.

Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.

Pay range and other compensation & benefits: $159,100.00 - $238,700.00
