AI Data Engineer

Apple Inc.
Cupertino, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$ 203K

Job location

Cupertino, United States of America

Tech stack

API
Artificial Intelligence
Airflow
Automated Storage and Retrieval Systems
Google BigQuery
Information Systems
Continuous Integration
Data as a Services
Data Architecture
Data Validation
Information Engineering
Data Infrastructure
ETL
Data Transformation
Data Security
Document Management Systems
Graph Database
MongoDB
Neo4j
Performance Tuning
Cloud Services
SQL Databases
Data Streaming
Unstructured Data
Snowflake
GIT
Information Technology
Kafka
Data Delivery
REST
Software Version Control
Data Pipelines
Databricks

Job description

Imagine what you could do here. At Apple, new ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish.

Are you passionate about building the data pipelines that make AI systems fast, accurate, and reliable?

Do you thrive on engineering clean data flows that connect enterprise systems to intelligent applications?

Can you build infrastructure that's both production-grade and purpose-built for AI consumption?

The Applied Data Science team within Legal Operations is building production-grade AI for a global legal organization - and every AI system is only as good as the data flowing into it. The AI Data Engineer owns the pipelines, data feeds, and integration infrastructure that ensure AI applications have the right data, in the right form, at the right time., The AI Data Engineer builds and maintains the data infrastructure that powers AI applications across Legal Operations. You will design and implement data pipelines that ingest from legal systems, transform data into AI-ready formats, load vector databases and other AI stores, and expose data services through APIs. This role is embedded within the AI team and works in close partnership with AI and data colleagues to ensure AI systems have reliable, high-quality data at every stage.

*\tDesign and implement data pipelines that ingest, transform, and deliver data from legal systems (matter management, eBilling, CLM, document management) to AI applications

*\tBuild and maintain pipelines that load and refresh vector databases, document stores, and graph databases used by AI retrieval systems

*\tEngineer data transformations that prepare legal data for AI consumption - chunking, embedding generation, metadata enrichment, and schema normalization

*\tBuild upstream and downstream integrations with MCP (Model Context Protocol), vector databases, and knowledge graphs to support context engineering and AI retrieval systems

*\tDevelop and maintain APIs that expose structured and unstructured data to AI applications and analytics tools

*\tImplement data quality checks and validation at pipeline ingestion points to ensure AI systems receive reliable, complete data

*\tBuild monitoring and alerting for pipeline health, data freshness, and load failures

*\tUnderstand AI data access patterns and optimize data delivery for AI performance

*\tIntegrate with the semantic layer - consuming entity resolution outputs, taxonomy mappings, and enriched datasets to ground AI applications

*\tImplement ETL/ELT processes using dbt, Fivetran, or similar tools with a focus on reliability and maintainability

*\tDocument pipeline designs, data contracts, and operational runbooks

Requirements

Bachelor's degree in Computer Science, Data Science, Information Systems, or related field (or equivalent experience); Master's degree preferred

4+ years of experience in data engineering related to AI application

Strong proficiency in SQL and Python for data engineering and transformation

Experience with cloud data platforms (Snowflake, Databricks, BigQuery, or similar)

Experience with ETL/ELT tools (dbt, Fivetran, Airflow, or similar)

Experience building and maintaining REST APIs

Understanding of data modeling and data transformation best practices

Experience with version control (Git) and CI/CD practices

Ability to work closely with AI/ML teams and understand their data requirements

Preferred Qualifications

Experience with vector databases (Pinecone, Weaviate, Chroma), embedding generation pipelines, document stores (MongoDB or similar) and their integration patterns

Understanding of RAG, MCP architectures, context engineering principles, and how data quality affects retrieval performance

Experience with semantic layer technologies (dbt Semantic Layer, Cube, AtScale), knowledge graphs (Neo4j), or ontology design

Experience with streaming or event-driven data architectures (Kafka or similar)

Familiarity with legal operations data (matter management, eBilling, CLM, document management)

Benefits & conditions

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $203,300 and $305,600, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses - including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apply for this position