AI Kernel Engineer
Job description
The AI Kernel Engineer at Quadric plays a key role in enabling a large number of AI kernels/operators to run efficiently on the Quadric platform. In this role, you will (1) develop a highly efficient Quadric kernel library for a variety of AI/LLM models and (2) analyze performance and optimize kernels for different hardware configurations. This senior technical role demands deep knowledge of hardware architecture, compiler toolchains, and optimization techniques.
Our preference is for a candidate located in the California Bay Area who can regularly collaborate from our Burlingame office. This role follows a hybrid schedule with at least two in-office days per week expected, but the actual schedule may vary depending on team and business needs. We believe strong technical collaboration, rapid iteration, and shared problem-solving are well supported by working together in person. The team and company also gather periodically for onsite meetings and offsite events to connect, collaborate, and align on priorities.
Responsibilities
- Develop AI/LLM kernels/operators on the Quadric platform for efficient inference
- Optimize kernel performance for different hardware configurations and workloads
- Profile and analyze kernel performance in terms of compute, data, and parallelism; identify micro-architecture and software bottlenecks and provide optimization solutions
- Optimize kernel C/C++ code to maximize hardware utilization
- Collaborate across related areas of the AI inference stack to support team and business priorities
- Make improvements to the Quadric toolchain, compiler, and runtime
- Provide technical support and documentation to customers and the developer community
Requirements
- Bachelor's or Master's degree in Computer Science and/or Electrical Engineering
- 5+ years of experience in AI kernel development and optimization
- Experience with model and kernel inference performance profiling
- Experience with at least one of the following compute development platforms: CUDA, DSP, NEON, Triton-lang
- Proficiency in C/C++ and Python; experience with assembly language is a plus
- Demonstrated strength in problem solving, debugging, and communication
Benefits & conditions
At Quadric, we value Integrity, Humility, and Happiness. What we expect from one another is simple and clear: Initiative, Collaboration, and Completion. We are a collaborative team focused on building something extraordinary in the edge computing space.
- Competitive salary and meaningful equity
- Medical, dental, and vision plan options starting on day one
- 401(k) retirement plan
- Flexible paid time off (unlimited, non-accrual) to support work-life balance
- When working in-office, enjoy company-provided lunches and a stocked kitchen
- Convenient office location within walking distance of the Caltrain station
- Support for commuting, including monthly parking or Caltrain passes
- Downtown Burlingame office location, close to shops, cafes, and local amenities
- A politics-free, highly collaborative environment where talented people can do their best work and make an immediate impact
- The opportunity to build long-term career relationships in a company that values strong personal connections alongside professional excellence
The base salary range for this position is $110,000 to $270,000. This range reflects the full span of levels and geographies at which Quadric hires for this role. The actual base salary offered will depend on a number of factors, including the specific level of the role, years and depth of relevant experience, technical skills and competencies, the criticality of the role to the business, internal equity, and work location. In addition to base salary, this role is eligible for equity and, as applicable to the role and level, a discretionary annual performance bonus.
Quadric also offers the generous benefits package outlined above and other programs designed to support your health and wellbeing.