Principal GenAI Data Engineer

Zscaler, Inc.
San Jose, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
$ 260K

Job location

Remote
San Jose, United States of America

Tech stack

Artificial Intelligence
Software as a Service
Collaborative Software
Databases
Data Architecture
Information Engineering
Data Infrastructure
Graph Database
Python
Knowledge-Based Systems
Metadata
Software Engineering
Unstructured Data
Data Ingestion
Large Language Models
Multi-Agent Systems
Generative AI
Data Strategy
Data Management
Machine Learning Operations
Data Pipelines

Job description

We are looking for a Principal GenAI Data Engineer to join our IT Data Strategy team. This role is fully remote within the US, reporting to the Senior Manager, Enterprise AI Data Platform. We are seeking an experienced technical leader to drive the design and implementation of enterprise-grade Generative AI data ingestion, knowledge preparation, and platform architectures that enable scalable, production-ready GenAI applications. This role focuses on architecting robust pipelines and platforms for ingesting, processing, governing, and serving structured and unstructured enterprise data for AI/LLM workloads. The ideal candidate combines deep expertise in enterprise data architecture, unstructured data pipelines, GenAI platform engineering, and strong software engineering skills in Python., * Architect enterprise-scale GenAI data platforms for ingestion, transformation, enrichment, and serving of structured and unstructured data

  • Design scalable pipelines for enterprise knowledge ingestion from diverse data sources including documents, SaaS platforms, knowledge bases, collaboration tools, and databases
  • Define architecture for metadata extraction, chunking, enrichment, embeddings generation, and knowledge preparation workflows
  • Design AI-ready data models and storage strategies for vector, graph, and hybrid knowledge systems
  • Architect scalable unstructured data processing pipelines for text, images, PDFs, tables, and multimodal content

Who You Are (Success Profile)

  • You act like an owner. Your passion for the mission fuels your bias for action. You operate with integrity because you genuinely care about the outcome. You adapt to what's needed, navigating seamlessly between high-level strategy and hands-on execution.
  • You are a problem-solver. You seek out challenges because you are energized by finding solutions, knowing that solving the hard problems delivers the biggest impact.
  • You lead with integrity. You do the right thing, even when it's hard. You hold yourself and others to a high standard of accountability and build trust by matching your words with consistent, transparent action.
  • You think at scale. You connect your day-to-day work to the larger company mission and think globally. You build solutions, processes, and teams that are not just effective today but are built to last and support a high-growth, global organization.
  • You are a high-trust collaborator. You are ambitious for the team, not just yourself. You embrace our challenge culture by giving and receiving ongoing feedback-knowing that candor delivered with clarity and respect is the truest form of teamwork and the fastest way to earn trust.

Requirements

  • Expert-level Python programming and software engineering capabilities
  • Experience building distributed/scalable data pipelines for AI workloads
  • Strong understanding of unstructured data extraction and processing pipelines
  • Experience with vector databases, graph databases, and metadata/knowledge storage systems
  • Hands-on experience with clustering, entity recognition algorithms, and modern retrieval strategies (including RAG, search, and agentic AI workflows)

What Will Make You Stand Out (Preferred Qualifications)

  • Deep understanding of AI-ready data platform design principles and the ability to bridge platform/data engineering with GenAI/LLM application requirements
  • Experience with LLMOps / GenAIOps frameworks such as LangSmith, Evaluation Framework like Arize Phoenix, Weights & Biases, or MLflow
  • Familiarity with Agent Frameworks like LangGraph, CrewAI, or Google ADK

#LI-Remote #LI-YC2

Benefits & conditions

Zscaler's salary ranges are benchmarked and are determined by role and level. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations and could be higher or lower based on a multitude of factors, including job-related skills, experience, and relevant education or training.

The base salary range listed for this full-time position excludes commission/ bonus/ equity (if applicable) + benefits.

Base Pay Range

$182,000-$260,000 USD

At Zscaler, we are committed to building a team that reflects the communities we serve and the customers we work with. We foster an inclusive environment that values all backgrounds and perspectives, emphasizing collaboration and belonging. Join us in our mission to make doing business seamless and secure.

Our Benefits program is one of the most important ways we support our employees. Zscaler proudly offers comprehensive and inclusive benefits to meet the diverse needs of our employees and their families throughout their life stages, including:

  • Various health plans
  • Time off plans for vacation and sick time
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks, and more!

About the company

Zscaler accelerates digital transformation to ensure our customers can be more agile, efficient, resilient, and secure. As an AI-forward enterprise , we are constantly pushing the envelope, leveraging the world's largest security data lake to power our cloud-native Zero Trust Exchange platform. This innovation protects our customers from cyberattacks and data loss by securely connecting users, devices, and applications in any location. Here, impact in your role matters more than title and trust is built on results. We say, impact over activity. We seek innovators who actively use AI to amplify their impact and who thrive in an environment where we leverage intelligent systems to stay ahead of evolving threats. We believe in transparency and value constructive, honest debate -we're focused on getting to the best ideas, faster. We build high-performing teams that can make an impact quickly and with high quality. To do this, we are building a culture of execution centered on customer obsession , collaboration, ownership, and accountability. We value high-impact, high-accountability with a sense of urgency where you're enabled to do your best work and embrace your potential. If you're driven by purpose, thrive on solving complex challenges, and want to be part of the team that's helping to secure the AI age, we invite you to bring your talents to Zscaler and help shape the future of cybersecurity., Learn more about Zscaler's Future of Work strategy, hybrid working model, and benefits here (https://www.zscaler.com/careers) .

Apply for this position