Machine Learning Architect

AYO & AYO INCORPORATED
Boston, United States of America
10 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Boston, United States of America

Tech stack

Artificial Intelligence
Artificial Neural Networks
Big Data
Python
Machine Learning
TensorFlow
Graphics Processing Unit (GPU)
PyTorch
Machine Learning Operations
Software Library

Job description

We are seeking a deep ML practitioner to join as a founding team member - someone with hands-on experience working on or alongside a foundation model at scale, who understands what happens under the hood when splitting jobs across thousands of GPUs, and who is excited to bring that depth to novel hardware. They have a full-stack understanding of machine learning architectures, love to optimize algorithms across disciplinary boundaries, and will deploy and train models directly on our prototype chips to help us prove out what our processor can do - no prior hardware experience required.

What You'll Do:

  • Deploy and run trained models on prototype hardware and digital twins, producing working demonstrations on our chips.
  • Develop and adapt algorithms to train models on novel processing environments, including our prototype hardware.
  • Work with hardware engineers to define and refine processor architecture based on insights learned through model training and experimentation.
  • Maintain a deep curiosity about what makes machine learning systems work - and bring that curiosity to bear on how they run on new hardware.
  • Support go-to-market strategy development

Requirements

  • PhD in machine learning, representation learning, theory of computation, or a related field - or equivalent industry experience working on foundation models at scale.
  • Experience training models at scale - distributed training across many GPUs, working with large datasets and compute.
  • Has built and trained neural networks from scratch
  • Deep knowledge of the structure and internal operation of neural networks - including how and why they behave the way they do (e.g. interpretability or explainability work is a plus).
  • Excitement about applying deep AI expertise to new and novel hardware environments - you don't need prior experience with photonics or silicon, but you want to learn.
  • Fluent knowledge of Python
  • Fluency in PyTorch (preferred), TensorFlow, JAX, or other industry-standard ML software libraries

Benefits & conditions

Pulled from the full job description

  • Health insurance
  • Vision insurance
  • Dental insurance, * We are working on a genuinely hard and interesting problem: unseating the GPU as the dominant AI compute platform.
  • Early-stage means real ownership, real impact, and meaningful equity.
  • Competitive salary. Equity commensurate with stage and seniority. Benefits package including health, dental, and vision.
  • Fully onsite in Boston - we are a collaborative, in-person team.

Apply for this position