Data Engineer-L3

Learn Beyond Consulting LLC

Los Angeles, United States of America

3 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Los Angeles, United States of America

Tech stack

Artificial Intelligence

Airflow

Amazon Web Services (AWS)

Azure

BASIC (Programming Language)

Google BigQuery

Cloud Database

Continuous Integration

Data Validation

Information Engineering

Data Integrity

ETL

Data Transformation

Software Debugging

DevOps

Distributed Computing Environment

Distributed Data Store

Hive

Python

Cloud Services

SQL Databases

Data Logging

Cloud Platform System

Azure

Delivery Pipeline

Snowflake

Spark

Electronic Medical Records

GIT

Data Layers

Kubernetes

Amazon Web Services (AWS)

Data Management

GPT

Software Version Control

Data Pipelines

Redshift

Job description

Serve as L3 support: triage high-severity incidents, perform advanced debugging/root-cause analysis, deploy hotfixes, and create runbooks for L2 teams.
Build and maintain batch/streaming data pipelines using ETL/ELT tools (dbt,) to integrate and transform multi-source data.
Implement data quality validation, monitoring, alerting, and documentation; optimize pipelines for performance, cost, and reliability (partitioning, indexing, error handling).
Partner with analytics, data science, and business teams to deliver data requirements, troubleshoot issues, and ensure SLAs for freshness/completeness.

Requirements

8-10+ years data engineering experience building and supporting production pipelines at scale.
Design, build, and maintain data ingestion, transformation, and delivery pipelines across structured and semi-structured data sources.
Develop modular, reusable data transformation logic using Python, SQL, and frameworks such as dbt.
Implement data models and schemas optimized for analytics and reporting (star, snowflake, or dimensional).
Apply Medallion Architecture principles to organize data layers for quality, traceability, and performance.
Use cloud-native data services such as AWS Glue, S3, Redshift, EMR or Azure Data Factory, ADLS, Synapse to manage data workflows.
Set up and manage data pipeline orchestration, scheduling, and monitoring using Airflow, ADF, or equivalent tools.
Apply data quality checks, validation logic, and logging mechanisms to ensure consistency and trust in data assets.
Collaborate with analysts, scientists, and architects to design data models that align with business and analytical needs.
Maintain code versioning, testing, and CI/CD standards for data pipeline development.
Proven cloud data platform + orchestration experience (Snowflake/BigQuery + Airflow/dbt).
L3 support experience: incident management, on-call rotations, debugging distributed data systems.

Core Competencies & Skills

Strong understanding of data engineering fundamentals - ETL/ELT design, data modeling, schema evolution, and data integrity.
Proficient in Python and SQL for data transformation, automation, and workflow scripting.
Hands-on experience with cloud-based data services in AWS (S3, Glue, Redshift, EMR) or Azure (ADLS, ADF, Synapse).
Working knowledge of distributed data processing concepts (Spark, Hive, or equivalent).
Familiarity with dbt for transformation design, testing, and data documentation.
Awareness of Medallion Architecture and data layering concepts for scalable data management.
Understanding of orchestration frameworks like Airflow or Data Factory for scheduling and monitoring pipelines.
Knowledge of Git-based version control, CI/CD, and basic DevOps practices in data workflows.
Have an AI skill set, a little bit on Claude, ChatGPT, and other tool supports, or at least who can pick up those skills.

Role details

Job location

Tech stack

Job description

Requirements

Apply for this position

Good distractions

Moments

Videos View all