Principal Machine Learning Engineer, Accelerated Apache Spark

NVIDIA Ltd.

Santa Clara, United States of America

3 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

$ 152K

Job location

Santa Clara, United States of America

Tech stack

Java

Artificial Intelligence

Algorithm Design

Big Data

C++

Nvidia CUDA

Computer Programming

Data Centers

ETL

Python

Machine Learning

NumPy

Open Source Technology

TensorFlow

SciPy

SQL Databases

Reinforcement Learning

Data Processing

Graphics Processing Unit (GPU)

Feature Engineering

PyTorch

Delivery Pipeline

Large Language Models

Spark

Pandas

Scikit Learn

Information Technology

XGBoost

Machine Learning Operations

Job description

NVIDIA is looking for a Machine Learning (ML) Engineer to join the GPU-accelerated Apache Spark team. Apache Spark is the most popular data processing engine in data centers for running large-scale workloads for ETL, SQL, and ML/DL model training and inference pipelines, spanning many domains and use cases. NVIDIA GPUs offer a promising avenue for significantly speeding up and/or lowering the cost of running Apache Spark applications at massive scale. You will work with the open-source community to accelerate Apache Spark with GPUs. You will apply the latest ML/AI methods to empower enterprises to migrate Spark workloads onto GPUs at scale.

What you'll be doing:

Design and implement machine learning solutions for performance prediction and optimization of GPU-accelerated enterprise Apache Spark workloads.
Develop advanced algorithms and adaptive systems to continuously improve the performance of Apache Spark workloads on GPUs.
Develop AI-based agents and tools to assist with fixing system issues and application optimization.
Collaborate with key partners and customers on the deployment of complex machine learning solutions in various environments.
Maintain deep domain expertise by knowing the latest published advances in ML systems and algorithms.
Provide technical mentorship and leadership in data science and machine learning to a team of engineers.

Requirements

BS, MS, or PhD (or equivalent experience) in Machine Learning, Data Science, Computer Science, or a closely related field.
12+ years of professional experience in designing, implementing, and productionizing high-quality ML/DL solutions.
5+ years of experience as a technical lead in ML model development.
Proven hands-on experience (2+ years) with large-scale data processing platforms, such as Apache Spark.
Proven ability to employ modern tooling and sound techniques for all aspects of crafting, deploying, and maintaining machine learning models.
Excellent programming skills in Python and related data science libraries such as NumPy, pandas, scikit-learn, SciPy, PyTorch, and TensorFlow.
Deep experience with sophisticated ML methodologies, including LLM/GenAI, reinforcement learning, and adaptive, online ML systems.
Strong expertise in feature engineering, feature importance assessment, and developing boosted tree model solutions (e.g., XGBoost).

Ways to stand out from the crowd:

Understanding of the internal workings and architecture related to Apache Spark.
Familiarity with NVIDIA GPUs and CUDA.
Experience coding in Scala, Java, and/or C++.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most experienced and dedicated people in the world working for us. If you are passionate about what you do, creative and autonomous, we want to hear from you!

Benefits & conditions

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 431,250 USD.
