Data Engineer- Full Stack

Ford Motor Company
Santa Fe, United States of America
10 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Junior
Compensation
$ 17K

Job location

Remote
Santa Fe, United States of America

Tech stack

Query Performance
Adaptable Database Systems
Artificial Intelligence
Airflow
Amazon Web Services (AWS)
Google BigQuery
Cloud Storage
Software Documentation
Continuous Integration
Information Engineering
Data Governance
Data Infrastructure
ETL
Data Warehousing
Digital Assets
Data Flow Control
Power BI
Cloudera
Tableau
Workflow Management Systems
Enterprise Data Management
Google Cloud Platform
System Availability
Large Language Models
Grafana
Multi-Agent Systems
Prompt Engineering
Generative AI
Indexer
GIT
Data Layers
Data Lake
AI Platforms
Data Lineage
Data Management
Machine Learning Operations
Terraform
Splunk
Looker Analytics
Data Pipelines

Job description

You will architect and scale end-to-end data and AI pipelines on GCP, transforming complex telemetry and enterprise data into high-quality, analytics-ready assets using Medallion architectures. You will design and integrate Gen AI capabilities - including LLM-powered data enrichment, retrieval-augmented generation (RAG), and intelligent automation - into the data platform. You will lead the implementation of robust CI/CD workflows, rigorous data governance, and security controls while mentoring junior talent and driving engineering best practices. By collaborating with cross-functional stakeholders and optimizing cloud performance, you will ensure the data and AI platform remains secure, cost-effective, and highly available to power critical business insights and next-generation AI experiences.

What you'll do...

  • Design and implement end-to-end data pipelines (ETL/ELT) that ingest, process, and curate large-scale enterprise data, including telemetry/vehicle data and other structured/unstructured sources.
  • Build and maintain Gen AI pipelines - including embedding generation, vector store indexing, retrieval-augmented generation (RAG), and LLM orchestration - to enable intelligent search, summarization, and conversational analytics over enterprise data.
  • Migrate and modernize data assets to a centralized data platform (e.g., BigQuery) using principled data lake/warehouse architectures (Bronze/Silver/Gold or Medallion architecture) to power analytics, reporting, and AI/ML workloads.
  • Architect scalable data models and data warehouses, optimizing for query performance, maintainability, cost efficiency, and downstream AI consumption.
  • Develop and operate robust orchestration pipelines using Airflow/Astronomer or Schedule Query, with secure, reproducible CI/CD workflows (Terraform + Git) for both data and AI artifacts.
  • Integrate LLM APIs and AI services (e.g., Vertex AI, OpenAI, LangChain) into data workflows to automate data enrichment, classification, anomaly narratives, and natural-language interfaces.
  • Build and maintain reliable data and model quality checks, lineage, and monitoring with observability tools (e.g., Splunk, Looker/Grafana/Tableau/Power BI dashboards) to rapidly detect and resolve data and AI pipeline issues.
  • Implement data governance, security, and compliance controls (data lineage, access controls, PII/PHI protection, prompt injection safeguards, responsible AI guardrails) in collaboration with security and privacy teams.
  • Lead the design and delivery of analytics-ready and AI-ready data assets for cross-functional teams, including dashboards, alerts, self-service analytics, and AI-powered insight tools.
  • Evaluate, prototype, and productionize emerging Gen AI capabilities (agents, function calling, fine-tuning, multimodal models) to solve business problems and improve platform intelligence.
  • Mentor and coach junior engineers on data engineering, AI/ML integration patterns, prompt engineering best practices, and documentation standards.
  • Collaborate with data scientists, ML engineers, product managers, and business stakeholders to translate requirements into scalable data and AI solutions and timely insights.
  • Monitor cost and capacity planning for cloud and AI resources; optimize storage, compute, and token usage across GCP services (BigQuery, Dataflow, Dataproc, GCS, Vertex AI).
  • Participate in on-call rotations and incident response to maintain high availability of data and AI services.

Requirements

  • A bachelor's degree
  • 5+ years of experience in data engineering, data platforms, or a similar role.
  • 3+ years of hands-on experience with Google Cloud Platform (BigQuery, Cloud Storage, Dataflow, Dataproc; Schedule Query or equivalent scheduling/orchestration) or AWS.
  • 1+ years of experience working with Generative AI technologies - including LLMs, embeddings, vector databases, RAG architectures, or AI orchestration frameworks (e.g., LangChain, Semantic Kernel, LlamaIndex).
  • 1+ year experience building Semantic Data layer to serve AI agents.

Even better, you may have...

  • Practical experience building and operating data pipelines with orchestration tools (Airflow/Astronomer; Schedule Query).
  • Experience with infrastructure-as-code and CI/CD (Terraform, Git, and related tooling).
  • Demonstrated ability to design and implement analytics-ready data assets and dashboards; familiarity with BI tools (Looker, Tableau, Power BI, Grafana) for monitoring and reporting.
  • Strong communication skills and ability to work effectively with cross-functional teams (engineering, analytics, product, security).

Benefits & conditions

You may not check every box, or your experience may look a little different from what we've outlined, but if you think you can bring value to Ford Motor Company, we encourage you to apply!

As an established global company, we offer the benefit of choice. You can choose what your Ford future will look like: will your story span the globe, or keep you close to home? Will your career be a deep dive into what you love, or a series of new teams and new skills? Will you be a leader, a changemaker, a technical expert, a culture builder...or all of the above? No matter what you choose, we offer a work life that works for you, including:

  • Immediate medical, dental, vision and prescription drug coverage
  • Flexible family care days, paid parental leave, new parent ramp-up programs, subsidized back-up child care and more
  • Family building benefits including adoption and surrogacy expense reimbursement, fertility treatments, and more
  • Vehicle discount program for employees and family members and management leases
  • Tuition assistance
  • Established and active employee resource groups
  • Paid time off for individual and team community service
  • A generous schedule of paid holidays, including the week between Christmas and New Year's Day
  • Paid time off and the option to purchase additional vacation time.

This position is a salary grade 7-8 and ranges $99,600-$198,500.

This position is a salary grade 7-8 and ranges from $138,800-$232,700 (California candidates).

Final determination of salary grade will be based on candidate's skills and experience, and base salary will be set within the applicable range according to job scope, responsibility and competitive market value.

About the company

Ford's Electric Vehicles, Digital and Design (EVDD) team is charged with delivering the company's vision of a fully electric transportation future. EVDD is customer-obsessed, entrepreneurial, and data-driven and is dedicated to delivering industry-leading customer experience for electric vehicle buyers and owners. You'll join an agile team of doers pioneering our EV future by working collaboratively, staying focused on only what matters, and delivering excellence day in and day out. Join us to make positive change by helping build a better world where every person is free to move and pursue their dreams.

Apply for this position