Databricks Expert / Data Lakehouse Consultant

BILINK CORP.

Chicago, United States of America

1 month ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

$ 160K

Job location

Remote

Chicago, United States of America

Tech stack

Unity

SAP Cloud

API

Agile Methodologies

Artificial Intelligence

Airflow

Amazon Web Services (AWS)

Data analysis

Business Logic

Computing Platforms

Azure

Cloud Computing

Cloud Database

Cloud Storage

Continuous Integration

Data Validation

Data Discovery

Information Engineering

Data Infrastructure

ETL

Data Vault Modeling

Data Warehousing

Software Design Patterns

GitHub

Hive

Python

Query Optimization

Power BI

Standard SQL

SAP Applications

SAP HANA

SAP NetWeaver Data Management

Data Streaming

Tableau

Spark

SAP Business Technology Platform

Data Layers

Build Management

Data Lake

PySpark

Git Flow

Star Schema

SAP S/4HANA

Machine Learning Operations

Data Lakehouse

Data Pipelines

Databricks

Job description

We help clients move from legacy reporting to future-proof analytics and planning architectures, leveraging SAP S/4HANA, SAP Analytics Cloud, SAP Datasphere, and SAP BTP. We also help them modernize their data infrastructure by migrating from legacy data warehouses and ETL pipelines to scalable, cloud-native Lakehouse architectures powered by Databricks, Delta Lake, and the broader Azure/AWS data ecosystem.

We are seeking an experienced Databricks Expert to lead and support enterprise data lakehouse initiatives. This role spans data engineering, data modeling, and platform architecture across Databricks, Delta Lake, Unity Catalog, and cloud data ecosystems (Azure / AWS).

You will work closely with data engineering, analytics, and business stakeholders to design scalable ingestion pipelines, curated data layers, and consumption-ready datasets - delivering solutions aligned with enterprise architecture and governance standards.

This role requires a strong technical foundation in Spark and Python-based data engineering, combined with a solid understanding of data modeling and lakehouse design patterns.

Data Engineering & Pipeline Development (Primary)

Design and build scalable data ingestion pipelines using Databricks (Spark, PySpark, Spark SQL)
Implement Medallion Architecture (Bronze / Silver / Gold) patterns for raw, curated, and consumption layers
Develop and optimize Delta Lake tables:
- Schema evolution and enforcement
- MERGE / UPSERT patterns for CDC and incremental loads
- Z-Ordering, compaction, and vacuuming for performance
Build and orchestrate workflows using Databricks Workflows / Apache Airflow / Azure Data Factory
Implement data quality checks and validation frameworks (e.g., Great Expectations, custom DQ layers)

Data Modeling & Lakehouse Design

Design semantic and analytical data models (Star Schema, Data Vault, OBT) for BI and ML consumption
Implement and govern Unity Catalog for data discovery, lineage, and access control
Model business logic: KPI frameworks (actuals vs. plan vs. forecast), Slowly Changing Dimensions (SCD Type 1/2), aggregated fact tables and pre-computed summary layers
Ensure semantic consistency and reusability across data consumers (BI, ML, APIs)

Platform & Infrastructure

Configure and manage Databricks workspaces, clusters, and job compute
Implement cost optimization strategies (auto-scaling, spot instances, cluster policies)
Support CI/CD for data pipelines using Git-based workflows (GitHub / Azure DevOps) and Databricks Asset Bundles or dbx
Work with cloud infrastructure across Azure (ADLS Gen2, Azure Data Factory, Synapse) and AWS (S3, Glue, Redshift)

Analytics & ML Enablement

Prepare and expose datasets for BI tools (Power BI, Tableau, SAP Analytics Cloud)
Support MLflow for experiment tracking and model registry
Collaborate with data scientists and analysts to ensure data readiness for advanced analytics
Design Feature Store tables and serve them for ML pipelines where applicable

Client & Project Delivery

Gather business and data requirements and translate them into Databricks platform solutions
Work in Agile or hybrid delivery models
Support UAT, training, go-live, and post-go-live enhancements
Collaborate closely with Bilink architects and client stakeholders

Requirements

5+ years of data engineering experience
Strong hands-on experience with Databricks (notebooks, jobs, workspace administration)
Proficiency in PySpark / Spark SQL and Python (data engineering patterns, testing, packaging)
Delta Lake in production: ACID transactions, time travel, schema management, MERGE/CDC patterns
Experience with Medallion Architecture and lakehouse design patterns
Working knowledge of Unity Catalog (governance, permissions, lineage)
Experience with cloud storage and integration (Azure ADLS / AWS S3)
Solid SQL skills for data modeling and query optimization
Excellent communication skills (client-facing role)

Nice-to-Have

Delta Live Tables (DLT) pipelines - declarative, streaming, and batch
Databricks Asset Bundles for CI/CD across multi-environment deployments (dev / QA / prod)
Databricks Account Administration (SCIM / IdP integration, Unity Catalog metastore management, workspace provisioning)
Databricks GenAI features: Genie AI/BI and Agent Bricks
Background in financial data, supply chain, or CPG data domains
Apache Airflow or Azure Data Factory orchestration; dbt for transformation
Great Expectations or similar DQ frameworks; MLflow and Databricks Feature Store
Databricks certifications (Data Engineer Associate/Professional, Spark Developer)
Familiarity with SAP data extraction patterns (SAP HANA, CDS views, ODP) - a strong plus

Profiles We Are Looking For

Hands-on data engineer (not just functional or purely architectural)
Comfortable working across Data Engineering, Analytics, and IT
Curious, structured, and quality-driven
Able to challenge requirements and propose better solutions
Interested in growing toward Solution Architect or Lead Data Engineer roles

Benefits & conditions

515 North State Street, Chicago, IL 60654
Hybrid work
$110,000 - $160,000 a year
Full-time, Contract

401(k) with matching
Health insurance
Dental insurance
Paid time off

Bilink Corp. is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.

Note: We prefer to hire directly - applications via staffing agencies or C2C arrangements may be considered with a lower priority.

Job Types: Full-time, Contract

Pay: $110,000.00 - $160,000.00 per year
Application Question(s):

How many years of hands-on Databricks (Spark/PySpark) experience do you have?
Have you built and maintained Delta Lake tables in a production environment (MERGE/CDC, schema evolution)?
Have you worked hands-on with Unity Catalog (permissions, lineage, governance)?
This role is hybrid in Chicago, IL (3 days/week onsite). Can you reliably work onsite 3 days per week?
Are you applying as an individual candidate (not through a staffing agency or C2C arrangement)?
Will you now or in the future require visa sponsorship? If yes, what is your current status?
What is your expected annual salary (or expected W2 hourly rate, if contract)?
Briefly describe the largest Databricks/lakehouse project you delivered: your role, scope, and approximate data volume. (3-5 sentences)

About the company

Bilink Corp. is a data & analytics consulting firm specializing in SAP-centric transformations for mid-to-large enterprises across manufacturing, CPG, and distribution industries.

Why Join Bilink Corp.

Work on high-impact SAP analytics & planning transformations as well as data lakehouse and modernization projects
Exposure to modern SAP architectures (S/4, SAC, Datasphere, BTP) and cloud data architectures (Databricks, Delta Lake, Azure, AWS)
Lean, expert-driven teams (no body-shopping)
Strong culture of technical excellence & pragmatism
Opportunity to grow with Bilink's expansion in the US market
