Cloud Data Engineer

Detroit Tigers, Inc.
Detroit, United States of America
6 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Shift work
Languages
English

Job location

Remote
Detroit, United States of America

Tech stack

Airflow
Azure
Cloud Computing
Cloud Database
Cloud Storage
Continuous Integration
Data Architecture
Data Governance
Data Warehousing
DevOps
Issue Tracking Systems
Python
Machine Learning
Regression Testing
Cloud Services
SQL Databases
Management of Software Versions
Spark
Data Strategy
Machine Learning Operations
Software Version Control
Data Pipelines
Databricks

Job description

Job Summary: The Detroit Tigers are seeking a Cloud Data Engineer, Baseball Systems. This role will be responsible for designing, managing, and automating data processes across our data architecture to support Baseball Operations initiatives, including the deployment and operationalization of machine learning models. This position will report to the Manager, Baseball Systems Data., * Design, implement, and maintain our data architecture and processing pipelines at scale.

  • Design, implement, and use data quality assurance frameworks to support the process of identifying inconsistent data patterns.
  • Collaborate with Tigers data engineers and data scientists to implement good data hygiene practices and procedures in our data processes.
  • Work with external data vendors to triage and remedy data quality issues.
  • Automate and execute test cases in data pipelines and manage data issue tracking.
  • Build and maintain MLOps infrastructure to support the deployment, monitoring, and retraining of machine learning models in production.
  • Partner with data scientists to productionize models, ensuring reproducibility, scalability, and reliability across the ML lifecycle.

Requirements

  • Proficiency building data processing pipelines using SQL and Python.
  • Experience with cloud computing, cloud storage, and cloud services.
  • Experience with cloud-based data lakes, data warehouses, and related tooling.
  • Strong understanding of data strategies and practices, such as continuous integration, regression testing, and versioning.
  • Experience building, maintaining, and querying SQL data warehouses built for data science and analytics.
  • Familiarity with MLOps concepts and tooling, including model serving, monitoring, and pipeline orchestration.

Preferred Knowledge, Skills and Abilities:

  • Understanding of data quality frameworks and best practices for implementation.
  • Familiarity with baseball and with current baseball research.
  • Experience using Apache Spark (Databricks on Azure preferred).
  • Experience with Airflow or similar workflow orchestration tools.
  • Effective communication skills with an ability to explain technical concepts to developers and business partners.
  • Experience with DevOps and MLOps practices for CI/CD pipelines, including model versioning and experiment tracking.
  • Experience working with containers and container deployment, including containerized model serving.
  • Familiarity with open-source data quality frameworks.

Benefits & conditions

  • Office environment.
  • The location may be based in Detroit or fully remote.
  • Occasional evening, weekend, and holiday hours are required.

All items listed above are illustrative and not comprehensive. They are not contractual in nature and are subject to change at the discretion of Detroit Tigers.

Apply for this position