Machine Learning Engineer - Kernels
MindBeam LLC
2 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
Intermediate Compensation
$ 190KJob location
Remote
Tech stack
C++
Profiling
Nvidia CUDA
Machine Learning
Performance Tuning
TensorFlow
Parallel Computation
Gpu Programming
Information Technology
Production Code
Software Coding
Job description
- Design and implement custom GPU/accelerator kernels to maximize performance.
- Profile, benchmark, and optimize critical ML workloads.
- Collaborate with researchers to translate algorithmic advances into efficient, production-ready code.
- Stay current with hardware advancements (CUDA, ROCm, TPU) to inform kernel design.
- Document and share best practices for low-level optimization.
Requirements
Do you have experience in System performance optimization?, Do you have a Master's degree?, * Bachelor's, Master's, or PhD in Computer Science, Electrical Engineering, or related field-or equivalent experience.
- 2+ years of experience in GPU programming, parallel computing, or systems-level optimization.
- Strong coding skills in C++, CUDA, or similar languages.
- Familiarity with ML frameworks and their low-level backends.
- Experience optimizing workloads for distributed and heterogeneous compute environments.
- Comfort with profiling tools and performance diagnostics.
About You
You are detail-oriented, performance-obsessed, and excited by the challenge of squeezing out every ounce of compute efficiency. You enjoy working at the intersection of algorithms and hardware, and you thrive in a collaborative environment where bold ideas are encouraged.