Hardware Architect

Acceler8 Talent
Mountain View, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate

Job location

Mountain View, United States of America

Tech stack

Artificial Intelligence
Program Optimization
Encodings
Computer Programming
Software Debugging
Software Architecture
Low Latency
Physical Design

Job description

Acceler8 Talent is seeking an experienced Hardware Architect to join a well funded startup whose hardware promises to drastically change the economics of compute for the worlds' largest models.

With over $600m raised, and a world-class team with a track record of shipping highly successful products, this company abandons legacy chip design assumptions and strives for the best possible solution for every aspect of their chip - there is no such thing as "good enough".

As a Hardware Architect, you will define the ISA and microarchitecture for next-generation compute engines, working at the intersection of research, software, and hardware. You will translate ML/AI workload requirements into architectural specifications and guide the design from concept through first silicon and bring-up.

What You'll Do Here

  • Define ISA and microarchitecture for compute cores, memory subsystems, and interconnects
  • Derive architectural requirements directly from ML/AI use cases and emerging model trends
  • Author detailed architectural specifications and structured interface definitions
  • Specify numeric formats and quantization strategies optimized for AI workloads
  • Estimate area, timing, and power to inform architectural tradeoffs
  • Collaborate closely with Research, Software, RTL, and Physical Design teams to ensure cohesive execution
  • Participate in architecture reviews, design reviews, and test planning
  • Support first-silicon bring-up and post-silicon debug

Requirements

  • Strong background in computer architecture with deep understanding of modern compute systems
  • 3+ years experience translating workloads/algorithms into hardware architectures
  • Proven ability to evaluate hardware cost, area, timing, and power tradeoffs
  • Excellent fundamentals in latency, throughput, and scalability theory
  • Knowledge of numeric formats, quantization techniques, and precision tradeoffs for ML workloads
  • Experience with programming and performance/code optimization
  • Ability to define structured interfaces and bit-level encodings using constructs such as structs and tagged unions
  • Strong verbal and written communication skills, with the ability to clearly document and defend architectural decisions
  • Comfortable working cross-functionally in a fast-paced, execution-driven environment

Bonus Points If You Have

  • Familiarity with parallel execution models such as VLIW, SIMD, or vector architectures
  • Experience designing custom ML silicon or AI systems

Apply for this position