TELECOMMUTE Lead Data Engineer /Architect
Nbcuniversal Media, LLC
New York, United States of America
yesterday
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
SeniorJob location
Remote
New York, United States of America
Tech stack
API
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Apache HTTP Server
Azure
Data Architecture
Data Governance
Data Integrity
ETL
Data Transformation
Data Systems
Github
Python
Enterprise Messaging Systems
SQL Databases
Data Streaming
Data Logging
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Data Ingestion
Snowflake
Backend
Event Driven Architecture
Data Lake
PySpark
Infrastructure Automation Frameworks
Information Technology
Kafka
Data Lakehouse
Terraform
Stream Analytics
Data Pipelines
Api Management
Redshift
Databricks
Job description
As a Data Engineer, you will play a key role in designing and building scalable, real-time and batch data solutions that power analytics and operational platforms across NBCUniversal Operations & Technology. This role will focus on building and maintaining API connectors, setting up event-driven data pipelines, managing data flow into an event house or lakehouse, and ensuring data quality and reliability throughout. You will collaborate with cross-functional teams to enable the efficient movement and transformation of data for enterprise use.
- API Connector Development: Build, maintain, and document scalable API integrations with internal and external platforms to automate data ingestion and synchronization.
- Event-Driven Architecture: Design and implement data pipelines using event-driven frameworks and messaging systems (e.g., Kafka, Kinesis) to support real-time analytics and operational use cases.
- Data Lakehouse Engineering: Manage data ingestion and transformation pipelines feeding into an event house or lakehouse architecture, ensuring consistency and scalability across datasets.
- Data Transformation & Orchestration: Use ETL/ELT tools and scripting to clean, transform, and enrich data from various sources into usable formats for analysts and stakeholders.
- Data Quality & Validation: Establish automated validation rules, monitor data integrity, and proactively resolve quality issues to ensure trust in data systems.
- Collaboration & Enablement: Partner with analytics, business, and engineering teams to understand data requirements and deliver scalable solutions that meet evolving needs.
Requirements
- Bachelor's degree in Computer Science, Engineering, Data Science, or a related field, or equivalent experience.
- 3-6 years of experience as a data engineer or in a similar backend data role.
- Experience building and maintaining API integrations and event-driven data pipelines.
- Proficiency in SQL and data pipeline scripting (e.g., Python, PySpark).
- Experience with cloud platforms such as AWS, Azure, or Google Cloud (e.g., S3, Redshift, Snowflake, or Databricks).
- Familiarity with event streaming platforms such as Apache Kafka, Kinesis, or similar.
- Strong attention to detail with a commitment to data accuracy and quality assurance.
- Ability to thrive in a fast-paced, collaborative environment., * Experience with CI/CD pipelines and infrastructure-as-code tools (e.g., Terraform, GitHub Actions).
- Familiarity with lakehouse architecture principles and tools like Delta Lake or Apache Iceberg.
- Experience building robust logging, error handling, and monitoring for data pipelines.
- Understanding of data governance, privacy, and compliance principles.
- Strong documentation and communication skills, with the ability to explain technical solutions to non-technical stakeholders.
- Background in media, entertainment, or consumer technology environments.