Data Engineer
Job description
The New York Mets are seeking a Data Engineer for their Data Engineering Technology team. This role is a unique opportunity to work with data and systems in support of operations in both baseball and the business of baseball. It requires hands-on experience in ingesting, processing, storing, optimizing and distributing structured and unstructured sources of data in areas as diverse as ticketing operations, business strategy, baseball analytics, performance technology and other key areas. You will join a global team (NY and Poland) of data engineering professionals who provide and support best-in-class, end-to-end data and engineering services to Baseball and Business Operations, including, but not limited to, the Analytics and Systems teams. Prior experience in or knowledge of baseball is a plus but is not required.
- Ensure all production data sources are optimally ingested, processed, and distributed to applications, dashboards and users at set frequencies, including real-time in-game data
- Onboard new datasets and technologies for use by analysts, data scientists, players and other stakeholders throughout the organization
- Troubleshoot, mitigate and resolve data ingestion and quality issues, and communicate them to all stakeholders
- Design, code, test and roll out new datasets, as well as enhance and fix bugs in existing datasets
- Support quantitative analysts in Baseball and Business Analytics with production deployments and maintenance of machine learning and other predictive models
- Build and manage the Data Model, Data Domains and Data Catalog to keep data well-organized, accurate and curated for efficient data discovery
- Deploy and manage data quality solutions (e.g. scalable framework, monitoring dashboard) to ensure accuracy, integrity, and proactive monitoring of information
- Help maintain a data catalog for efficient data discovery
- Actively collaborate with the global team of Data Engineering/Support professionals on data engineering and support requests/issues
- Work a flexible schedule during the baseball season to ensure in-game support for users
- Coordinate with the IT and Infrastructure team to ensure a robust cloud development and production environment
The above information is intended to describe the general nature, type, and level of work to be performed. The information is not intended to be an exhaustive or complete list of all responsibilities, duties, and skills required for this position. Nothing in this job description restricts management's right to assign or reassign duties and responsibilities to this job at any time. The individual selected may perform other related duties as assigned or requested.
The New York Mets value the unique qualities individuals with various backgrounds and experiences can offer the organization. Our continued success depends heavily on the quality of our workforce. The Organization is committed to providing employees with the opportunity to develop to their fullest potential.
Requirements
- BS degree in Computer Science or a related field
- 2+ years' experience in data engineering and data operations/support roles
- Comfort with both on-premises and cloud environments (GCP or Azure experience required)
- Technical skills (Python, dbt, SQL, T-SQL, Linux) for ingesting, processing, and distributing large-scale data sets (both structured and unstructured)
- Experience extracting data from FTP and API data sources
- Experience building data structures and data pipelines in the cloud, preferably Azure, Google Cloud Platform and on-prem
- Experience using modern Software Development Life Cycle (SDLC) and DevOps tools from development to production, e.g. Terraform, Cloud Build, Bitbucket
- Ability to provide prompt support and resolution of data issues
- Experience in implementing scalable data quality solutions
- Knowledge of big data frameworks, databases, orchestration and other data tools, such as Google Cloud Composer (Airflow), Google BigQuery, Microsoft SQL Server, Postgres, Google Dataflow, Google Pub/Sub, etc.
- Ability to deliver superior customer experience through continuous process improvement
- Strong analytical skills and ability to work well in a collaborative and fast-paced environment