Matthias Niehoff

Enjoying SQL data pipelines with dbt

Treat your analytics like code. Build modular, testable, and documented SQL data pipelines with dbt.

Enjoying SQL data pipelines with dbt
#1about 1 minute

The challenge of managing traditional SQL data pipelines

Traditional data pipelines often rely on unstructured Python glue code and notebooks, making them difficult to maintain and extend.

#2about 4 minutes

Introducing dbt for structured SQL data transformations

dbt is a command-line tool that brings software engineering principles to the transformation layer of ELT, allowing you to build data pipelines with just SQL.

#3about 1 minute

Setting up a dbt project and defining data sources

A walkthrough of a dbt project structure shows how to define raw data sources and their associated tests using a `sources.yml` file.

#4about 2 minutes

Tracking data changes over time with dbt snapshots

The `dbt snapshot` command provides a simple way to capture historical changes in your source data by creating slowly changing dimension tables.

#5about 3 minutes

Using seeds for static data and running models

Use `dbt seed` to load small, static datasets like country codes and `dbt run` to execute SQL models, which can be modularized with Jinja macros.

#6about 3 minutes

Generating documentation and visualizing data lineage

dbt automatically generates a web-based documentation site from your project's metadata, including a complete, interactive data lineage graph.

#7about 1 minute

Implementing and running data quality tests

The `dbt test` command executes predefined and custom SQL-based tests to ensure data integrity and quality throughout your pipeline.

#8about 4 minutes

Applying software engineering practices to data pipelines

dbt integrates with standard developer tools like pre-commit hooks for linting, CI/CD for automated testing, and profiles for managing environments.

#9about 4 minutes

Exploring the dbt ecosystem and key integrations

The dbt ecosystem includes packages for extended testing, orchestration tools like Airflow, visualization layers like Lightdash, and integrations with analytical databases like DuckDB.

#10about 1 minute

Addressing the extraction and loading phases of ELT

While dbt focuses on transformation, tools like Airbyte, Fivetran, or custom scripts are used to handle the initial extraction and loading of data into the warehouse.

#11about 2 minutes

Understanding dbt's core benefits and limitations

dbt excels at simplifying data transformation with code-based practices but is not a tool for data ingestion, a full data catalog, or a no-code solution.

#12about 1 minute

Q&A: Raw data formats and comparing dbt to Spark

Answering audience questions clarifies the strategy of loading raw data as-is and positions dbt as a simpler, SQL-focused alternative to complex systems like Apache Spark.

Related jobs
Jobs that call for the skills explored in this talk.

Featured Partners

Related Articles

View all articles
AG
Andre Braun, GitLab
Now is the time for industrialized software development
Now is the time for industrialized software development Recently, I received a letter from my car’s manufacturer alerting me to a recall. They had discovered a defective part and wanted to replace it. It was easily fixed, and I might have forgotten a...
Now is the time for industrialized software development
CH
Chris Heilmann
With AIs wide open - WeAreDevelopers at All Things Open 2025
Last week our VP of Developer Relations, Chris Heilmann, flew to Raleigh, North Carolina to present at All Things Open . An excellent event he had spoken at a few times in the past and this being the “Lucky 13” edition, he didn’t hesitate to come and...
With AIs wide open - WeAreDevelopers at All Things Open 2025
BB
Benedikt Bischof
Making Data Warehouses Fast: A Developer’s Story
Welcome to this issue of the WeAreDevelopers Live Talk series. This article recaps an interesting talk by Adnan Rahic who teaches the audience how to make data warehouses.About the Speaker: Adnan is senior developers advocate at Cube. His passion lie...
Making Data Warehouses Fast: A Developer’s Story

From learning to earning

Jobs that call for the skills explored in this talk.

Data Engineer DBT

Data Engineer DBT

ATLANSE
Canton of Rueil-Malmaison, France

Remote
45-55K
Senior
Linux
Scrum
Python
+3