Principal Machine Learning Engineer, Accelerated Apache Spark

NVIDIA Ltd.
Santa Clara, United States of America
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 152K

Job location

Santa Clara, United States of America

Tech stack

Java
Artificial Intelligence
Algorithm Design
Big Data
C++
Nvidia CUDA
Computer Programming
Data Centers
ETL
Python
Machine Learning
NumPy
Open Source Technology
TensorFlow
SciPy
SQL Databases
Reinforcement Learning
Data Processing
Graphics Processing Unit (GPU)
Feature Engineering
PyTorch
Delivery Pipeline
Large Language Models
Spark
Pandas
Scikit Learn
Information Technology
XGBoost
Machine Learning Operations

Job description

NVIDIA is looking for a Machine Learning (ML) Engineer to join the GPU accelerated Apache Spark team. Apache Spark is the most popular data processing engine in data centers for running large scale workloads for ETL, SQL, and ML/DL model training and inference pipelines, spanning many domains and use cases. NVIDIA GPUs offer a promising avenue for significantly speeding up and/or lowering the cost of running Apache Spark applications at massive scales. You will work with the open source community to accelerate Apache Spark with GPUs. You will apply the latest ML/AI methods to empower enterprises to migrate Spark workloads onto GPUs at scale.

What you'll be doing:

  • Design and implement machine learning solutions for performance prediction and optimization of GPU accelerated enterprise Apache Spark workloads.
  • Develop advanced algorithms and adaptive systems to continuously improve the performance of Apache Spark workloads on GPUs.
  • Develop AI-based agents and tools to assist with fixing system issues and application optimization.
  • Collaborate with key partners and customers on the deployment of complex machine learning solutions in various environments.
  • Maintain deep domain expertise by knowing the latest published advances in ML systems and algorithms.
  • Provide technical mentorship and leadership in data science and machine learning to a team of engineers.

Requirements

  • BS, MS, or PhD or equivalent experience in Machine Learning, Data Science, Computer Science or a closely related field.
  • 12+ years of professional experience in designing, implementing, and productionizing high-quality ML/DL solutions.
  • 5+ experience as technical lead in ML model development.
  • Proven hands-on experience (2+ years) with large-scale data processing platforms, such as Apache Spark.
  • Proven ability to employ modern tooling and sound techniques for all aspects of crafting, deploying, and maintaining machine learning models.
  • Excellent programming skills in Python and Python data science related libraries like numpy, pandas, scikit-learn, scipy, pytorch, and tensorflow.
  • Deep experience with sophisticated ML methodologies, including LLM/GenAI, reinforcement learning, and adaptive, on-line ML systems.
  • Strong expertise in feature engineering, feature importance assessment, and developing boosted tree model solutions (e.g., XGBoost).

Ways to stand out from the crowd:

  • Understanding of the internal workings and architecture related to Apache Spark.
  • Familiarity with NVIDIA GPUs and CUDA.
  • Experience coding in Scala, Java, and/or C++.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most experienced and dedicated people in the world working for us. If you are passionate about what you do, creative and autonomous, we want to hear from you!

Benefits & conditions

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 431,250 USD.

Apply for this position