Senior Data Engineer

Ford Motor Company
Dearborn, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Dearborn, United States of America

Tech stack

Java
Airflow
Data analysis
Google BigQuery
Information Engineering
Data Warehousing
Database Development
Data Flow Control
Hypertext Transfer Protocols (HTTP)
Python
Raw Data
Cloudera
Technical Data Management Systems
Google Cloud Platform
Data Ingestion
Build Server
Data Lake
Information Technology
Terraform
Data Pipelines

Job description

Senior Data Engineer - positions offered by Ford Motor Credit Company LLC (Dearborn, Michigan). Note, this is a hybrid position whereby the employee will work both from home and from the Dearborn office. Hence, the employee must live within a reasonable commuting distance from Dearborn, MI. Implement methods for automation of all parts of the pipeline to minimize labor in development and production. Analyze complex data, organizing raw data, and integrating massive datasets from multiple data sources to build analytical domains and reusable data products. Work with architects to evaluate and productionalize data pipelines for data ingestion, curation, and consumption. Communicate with stakeholders to formulate business problems as technical data requirements, identify and implement technical solutions while ensuring key business drivers are captured in collaboration with product management. Develop exceptional analytical data products using both streaming and batch ingestion patterns on Google Cloud Platform with solid data warehouse principles. Serve as the Subject Matter Expert in Data Engineering with a focus on GCP native services and other well integrated third-party technologies.

Requirements

Master's degree or foreign equivalent in Computer Science or related field and 5 years of experience in the job offered or related occupation. 5 years of experience with each of the following skills is required: 1. SQL Development for analyzing complex data, organizing raw data, and integrating massive datasets from multiple data sources into data factory data lake. 2. Analytics and data product development to formulate business problems as technical data requirements, identify and implement technical solutions while ensuring key business drivers are captured in collaboration with product deliverables. 3 years of experience with the following skill is required: 1. Google Cloud Platform (GCP) experience with solutions designed and implemented at production scale. 2 years of experience with each of the following skills is required: 1. GCP native (or equivalent) services including Big Query, Google Cloud Storage, PubSub, Dataflow, Dataproc, and Cloud Build to evaluate and productionalize data pipelines for data ingestion, curation, and consumption. 2. Working with Airflow for scheduling and orchestration of data pipelines. 3. Utilizing Terraform to provision Infrastructure as Code. 4. Using Java or Python for development of cloud run http and event-driven functions which handle http requests and events from cloud environment and use event triggers., Master's degree or foreign equivalent in Computer Science or related field and 5 years of experience in the job offered or related occupation. 5 years of experience with each of the following skills is required: 1. SQL Development for analyzing complex data, organizing raw data, and integrating massive datasets from multiple data sources into data factory data lake. 2. Analytics and data product development to formulate business problems as technical data requirements, identify and implement technical solutions while ensuring key business drivers are captured in collaboration with product deliverables. 3 years of experience with the following skill is required: 1. Google Cloud Platform (GCP) experience with solutions designed and implemented at production scale. 2 years of experience with each of the following skills is required: 1. GCP native (or equivalent) services including Big Query, Google Cloud Storage, PubSub, Dataflow, Dataproc, and Cloud Build to evaluate and productionalize data pipelines for data ingestion, curation, and consumption. 2. Working with Airflow for scheduling and orchestration of data pipelines. 3. Utilizing Terraform to provision Infrastructure as Code. 4. Using Java or Python for development of cloud run http and event-driven functions which handle http requests and events from cloud environment and use event triggers.

Apply for this position