Senior AI Researcher, On-Device LLM Efficiency

Qualcomm

San Diego, United States of America

15 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

$ 239K

Job location

Remote

San Diego, United States of America

Tech stack

Artificial Intelligence

Systems Engineering

Computer Programming

Computer Engineering

Python

Machine Learning

Smart Devices

Software Deployment

Software Engineering

PyTorch

Large Language Models

Deep Learning

Information Technology

Job description

As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Machine Learning Researcher, you will conduct fundamental research that creates innovative machine learning methodology that achieves beyond state-of-the-art performance. Qualcomm Engineers collaborate with cross-functional teams to enhance the world of mobile, edge, auto, and IOT products through machine learning research., * Research and development in the area of LLM inference efficiency algorithms, efficient model architecture design, and/or LLM training

Develop creative solutions with consideration of practical challenges on devices
Implementation and evaluation of possible solutions in both simulation and on-device environments

Requirements

Master's degree in Computer Engineering, Computer Science, Electrical Engineering, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.

OR PhD in Computer Engineering, Computer Science, Electrical Engineering, or related field.

6+ months of academic and/or work experience developing and/or optimizing machine learning models, systems, platforms, or methods., * Master's degree in Computer Science, Electrical Engineering, or related field
4+ years of AI research experience
Strong background in deep learning and Transformers
Strong programming skills in Python and PyTorch
Experience in LLM reasoning or inference acceleration research, * PhD in Computer Science, Electrical Engineering, or related field
Experience in LLM efficiency research such as efficient attention, inference acceleration, or KV cache compression
Experience in on-device AI deployment on mobile or edge devices
Publishing research papers at top-tier AI/ML conferences, e.g., NeurIPS, ICML, and ICLR, as a lead author

Benefits & conditions

The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Even more importantly, please note that salary is only one component of total compensation at Qualcomm. We also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants (employees on sales-incentive plans are not eligible for our annual bonus). In addition, our highly competitive benefits package is designed to support your success at work, at home, and at play. Your recruiter will be happy to discuss all that Qualcomm has to offer - and you can review more details about our US benefits at this link .

About the company

Qualcomm Technologies, Inc., Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law. Pay range and Other Compensation & Benefits : $159,100.00 - $238,700.00

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all