Gen AI Architect in Santa Clara

Energy Jobline
Santa Clara, United States of America
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Santa Clara, United States of America

Tech stack

Training Data
API
Artificial Intelligence
Computer Vision
Program Optimization
Memory Management
Open Source Technology
Performance Tuning
TensorFlow
Graphics Processing Unit (GPU)
PyTorch
Large Language Models
Deep Learning

Job description

You represent the pinnacle of Applied AI engineering. You are not just using APIs; you are optimizing the models themselves. You understand the mathematics behind the attention mechanism, you know how to squeeze performance out of GPUs, and you can customize models for specific domains. You provide the high-level technical vision and handle the most difficult edge cases.

Model Fine-Tuning: Implement PEFT (Parameter-Efficient Fine-Tuning) methods such as LoRA and QLoRA to adapt open-source models (Llama 3, Mistral) to specific client domains.

Optimization & Quantization: Perform model quantization to reduce inference costs and latency without sacrificing quality. Manage dense vectors and embedding optimizations.

State-of-the-Art Exploration: Continuously research and implement the latest advancements (e.g., State Space Models, Long-Context optimizations) into client deliverables.

Strategic Consulting: Act as a trusted advisor to C-level client executives, defining the "Art of the Possible" and guiding long-term AI roadmaps.

Requirements

Looking for a Gen AI architect with 15+ years of experience and 8+ years of experience focusing on Model Optimization, Fine-Tuning & Strategic AI in San Francisco, CA.

Deep Learning: PyTorch/TensorFlow, Transformer architecture internals, attention mechanisms.

Model Ops: Serving custom models (vLLM, TGI), GPU memory management, Quantization techniques (GGUF, AWQ).

Advanced Data: Training data curation, synthetic data, RLHF concepts.

Tech Leadership: Ability to define the technical culture and set standards for the entire FDE organization.

Soft Skills:

Executive communication and ability to influence C-level leaders.

Thought leadership and industry presence (conferences, playbooks, forums).

Cross-org leadership and conflict resolution.

Ability to define long-term AI vision and cultural standards.

Strategic decision-making balancing cost, risk, and performance.

About the company

Energy Jobline is the largest and fastest-growing global Energy Job Board and Energy Hub. We have an audience reach of over 7 million energy professionals, 400,000+ monthly advertised global energy and engineering jobs, and work with the leading energy companies worldwide. We focus on the Oil & Gas, Renewables, Engineering, Power, and Nuclear markets as well as emerging technologies in EV, Battery, and Fusion. We are committed to ensuring that we offer the most exciting career opportunities from around the world for our jobseekers.
