Neural Network Performance Engineer

Humanoid

Charing Cross, United Kingdom

4 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Junior

Job location

Charing Cross, United Kingdom

Tech stack

Artificial Neural Networks

Software Debugging

Python

Open Source Technology

Graphics Processing Unit (GPU)

PyTorch

Deep Learning

Job description

Here at Humanoid, we believe in a future where robots amplify human potential. That's why we've set out on a mission to build the world's most capable, commercially-scalable, and safe humanoid robots. We're bringing that mission to life with HMND-01 Alpha - our rapidly developed humanoid platform now running in real industrial pilots - and we're growing the team to take it even further., We're hiring a Neural Network Performance Engineer to join our VLA team based in London. In this role, you will work on all aspects of running capable neural-network based control policies at a high rate with minimal latency, both on cloud hardware and onboard. Your work will be critical to delivering smooth robot motions while reacting to environment changes as quickly as possible., * Analyze performance bottlenecks of a particular model architecture and come up with potential improvements.

Make the model run on a new hardware (e.g. NVIDIA Thor) efficiently.
Implement custom kernels to reduce memory throughput requirements where it matters.
Quantize a model with minimal loss of quality.
Suggest and implement changes of model architecture that will enable better performance characteristics without sacrificing model capabilities.

Requirements

Do you have experience in Python?, * 3+ years building deep-learning systems (industry or research) with shipped models or published artifacts to show for it.

1+ years experience working on performance of neural network inference (analyzing bottlenecks, writing custom kernels, quantizing models, fighting deep learning compilers).
Excellent understanding of GPU architecture and why some models run faster than others.
Strong Python + PyTorch/JAX; you can profile, debug numerics, and write maintainable research code.
You document experiments clearly and communicate trade-offs crisply. Nice to have:
Robotics or autonomous driving experience.
Open source code showcasing your ability to improve inference performance.
Publications at ICLR/ICML/NeurIPS or equivalent open-source contributions.
Familiarity with vision-language (VLM) or vision-language-action (VLA) models.

Benefits & conditions

Meaningful time off to rest and recharge: 23 days of annual leave (accrued), 15 days of paid sick leave, and paid company holidays.
Fully funded private healthcare for UK employees, with broad provider access, virtual and in-person care, and strong mental health and serious illness support.
Equity included-we believe builders should share in what they build.
Pension scheme with a total 8% contribution (5% employee, 3% employer) on full earnings.
Free daily breakfast, catered lunch, and snacks in-office.
Collaboration with top-tier engineers, researchers, and product experts in AI and robotics.
Freedom to influence the product and own key initiatives.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

Apply for this position

Good distractions

Moments

Videos View all