Machine Learning Engineer - Kernels

MindBeam LLC

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Compensation

$ 190K

Job location

Remote

Tech stack

C++

Profiling

Nvidia CUDA

Machine Learning

Performance Tuning

TensorFlow

Parallel Computation

Gpu Programming

Information Technology

Production Code

Software Coding

Job description

Design and implement custom GPU/accelerator kernels to maximize performance.
Profile, benchmark, and optimize critical ML workloads.
Collaborate with researchers to translate algorithmic advances into efficient, production-ready code.
Stay current with hardware advancements (CUDA, ROCm, TPU) to inform kernel design.
Document and share best practices for low-level optimization.

Requirements

Do you have experience in System performance optimization?, Do you have a Master's degree?, * Bachelor's, Master's, or PhD in Computer Science, Electrical Engineering, or related field-or equivalent experience.

2+ years of experience in GPU programming, parallel computing, or systems-level optimization.
Strong coding skills in C++, CUDA, or similar languages.
Familiarity with ML frameworks and their low-level backends.
Experience optimizing workloads for distributed and heterogeneous compute environments.
Comfort with profiling tools and performance diagnostics.

About You

You are detail-oriented, performance-obsessed, and excited by the challenge of squeezing out every ounce of compute efficiency. You enjoy working at the intersection of algorithms and hardware, and you thrive in a collaborative environment where bold ideas are encouraged.

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all