Data Engineer

MatchPoint Solutions

Maryland Heights, United States of America

1 month ago

Role details

Contract type

Temporary contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Intermediate

Compensation

$146K

Job location

Remote

Maryland Heights, United States of America

Tech stack

Azure

Code Review

Continuous Integration

Data Cleansing

Data Deduplication

Data Governance

Data Infrastructure

ETL

Data Security

Database Queries

Distributed Computing Environment

Integrated Development Environments

Python

Query Optimization

SQL Databases

Data Streaming

Data Logging

Spark

Backend

Git

PySpark

Infrastructure Automation Frameworks

Data Lineage

Bicep

Real Time Data

Terraform

Software Version Control

Data Pipelines

Serverless Computing

Databricks

Job description

We are looking for an experienced Data Engineer to build and maintain robust, scalable data infrastructure for a high-priority customer project. You will architect and deliver the data pipelines that power data science model development and business intelligence, working closely with the Data Scientists and Backend Developer to ensure reliable, high-quality data flows across the Azure ecosystem.

Design, build, and maintain production-grade data pipelines ingesting data from multiple enterprise source systems into Azure Databricks and Azure Synapse Analytics.
Implement and manage the Medallion Architecture (Bronze, Silver, Gold layers) within Databricks to ensure structured, traceable data progression from raw ingestion to analytics-ready datasets.
Develop ELT/ETL workflows using Azure Data Factory, Databricks Notebooks, and PySpark, including data cleaning, deduplication, and transformation logic.
Build and optimize end-to-end data pipelines to prepare clean, feature-engineered datasets for modelling.
Architect and manage Delta tables within Databricks, enforcing schema evolution, time travel, and data quality constraints.
Administer and leverage Unity Catalog for data governance, access control, and lineage tracking across all ingested datasets.
Configure and manage Azure Data Lake Storage Gen2 (ADLS Gen2) as the central data repository.
Collaborate with Data Scientists to optimise data schemas and provide clean, feature-ready datasets for modelling.
Develop and maintain data models within Azure Synapse Analytics (dedicated and serverless SQL pools).
Use Azure DevOps (ADO) for source control, CI/CD pipeline management, and collaborative code review on all pipeline assets.
Implement monitoring, alerting, and logging for all data pipeline processes to ensure operational reliability.
Apply data security, governance, and compliance best practices aligned to Unity Catalog and customer requirements.
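
For candidates less familiar with the terminology, the Bronze/Silver/Gold progression and the cleaning and deduplication work described above can be sketched roughly as follows. This is a minimal illustration in plain Python rather than PySpark or Delta tables; the record fields (`customer_id`, `email`, `updated_at`) and the functions `to_silver` and `to_gold` are hypothetical and are not taken from the actual project.

```python
from datetime import datetime

# Hypothetical raw records as they might land, untouched, in a Bronze layer.
bronze = [
    {"customer_id": " 42 ", "email": "A@Example.com", "updated_at": "2024-01-01T10:00:00"},
    {"customer_id": "42", "email": "a@example.com ", "updated_at": "2024-03-01T09:30:00"},
    {"customer_id": "7", "email": "b@example.com", "updated_at": "2024-02-15T12:00:00"},
    {"customer_id": "", "email": "orphan@example.com", "updated_at": "2024-02-20T08:00:00"},
]


def to_silver(rows):
    """Clean each record, drop rows with no key, keep the latest row per customer_id."""
    latest = {}
    for row in rows:
        key = row["customer_id"].strip()
        if not key:
            continue  # data quality rule (assumed): a record must have a key
        cleaned = {
            "customer_id": int(key),
            "email": row["email"].strip().lower(),
            "updated_at": datetime.fromisoformat(row["updated_at"]),
        }
        current = latest.get(cleaned["customer_id"])
        # Deduplicate: the most recently updated record wins.
        if current is None or cleaned["updated_at"] > current["updated_at"]:
            latest[cleaned["customer_id"]] = cleaned
    return sorted(latest.values(), key=lambda r: r["customer_id"])


def to_gold(rows):
    """Aggregate Silver rows into an analytics-ready summary: customers per email domain."""
    counts = {}
    for row in rows:
        domain = row["email"].split("@")[1]
        counts[domain] = counts.get(domain, 0) + 1
    return counts


silver = to_silver(bronze)
gold = to_gold(silver)
```

In the role itself, the same steps would typically be expressed as PySpark transformations writing to Delta tables, with each layer persisted separately so that data can be traced back to its raw form.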

Requirements

4+ years of professional experience as a Data Engineer with a strong Azure focus.
Databricks + Medallion Architecture: hands-on experience designing and implementing ingestion layers using the Medallion Architecture (Bronze/Silver/Gold) within Azure Databricks, including Delta Live Tables or structured notebook workflows.
Python / PySpark Notebooks: primary development environment for pipeline and transformation logic; strong command of PySpark for distributed data processing and Python for utility scripting and orchestration.
Azure Data Services - Synapse, Delta Tables, Unity Catalog: demonstrable experience with Azure Synapse Analytics (SQL and Spark pools), Delta table management (schema evolution, VACUUM, OPTIMIZE), and Unity Catalog for governance and lineage. These are named source systems on this engagement.
ETL/ELT Pipeline Development: proven track record building robust cleaning, deduplication, and multi-source transformation pipelines that handle messy, real-world enterprise data at scale.
Azure DevOps (ADO): use of ADO for Git-based source control, pull request workflows, and CI/CD pipeline deployment of Databricks jobs and ADF pipelines.
Solid experience with Azure Data Factory for orchestrating complex data pipelines.
Strong SQL skills including complex query optimisation.

Desirable Skills

Microsoft Certified: Azure Data Engineer Associate (DP-203) certification.
Familiarity with infrastructure-as-code tools (Terraform, Bicep) for Azure resource deployment.
Experience with data cataloguing and governance using Microsoft Purview.
Knowledge of streaming architectures and near-real-time data ingestion patterns.

Personal Attributes

Detail-oriented with a strong commitment to data quality and pipeline reliability.
Proactive self-starter able to operate with a high degree of autonomy.
Strong collaborator with the ability to align engineering decisions to data science and business requirements.
Comfortable navigating ambiguity and adapting to evolving project needs.

About the company

MatchPoint Solutions is a fast-growing, young, energetic global IT-Engineering services company with clients across the US. We provide technology solutions to various clients like Uber, Robinhood, Netflix, Airbnb, Google, Sephora, and more! More recently, we have expanded to working internationally in Canada, China, Ireland, UK, Brazil, and India. Through our culture of innovation, we inspire, build, and deliver business results, from idea to outcome. We keep our clients on the cutting edge of the latest technologies and provide solutions by using industry-specific best practices and expertise. We are excited to be continuously expanding our team. If you are interested in this position, please send over your updated resume. We look forward to hearing from you!