Data Engineer

MatchPoint Solutions
Maryland Heights, United States of America
3 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$ 146K

Job location

Remote
Maryland Heights, United States of America

Tech stack

Azure
Code Review
Continuous Integration
Data Cleansing
Data Deduplication
Data Governance
Data Infrastructure
ETL
Data Security
Database Queries
Distributed Computing Environment
Integrated Development Environments
Python
Query Optimization
Azure
SQL Databases
Data Streaming
Data Logging
Azure
Spark
Backend
GIT
PySpark
Infrastructure Automation Frameworks
Data Lineage
Bicep
Real Time Data
Terraform
Azure
Software Version Control
Data Pipelines
Serverless Computing
Databricks

Job description

We are looking for an experienced Data Engineer to build and maintain robust, scalable data infrastructure for a high-priority customer project. You will architect and deliver the data pipelines that power data science model development and business intelligence, working closely with the Data Scientists and Backend Developer to ensure reliable, high-quality data flows across the Azure ecosystem., * Design, build, and maintain production-grade data pipelines ingesting data from multiple enterprise source systems into Azure Databricks and Azure Synapse Analytics.

  • Implement and manage the Medallion Architecture (Bronze, Silver, Gold layers) within Databricks to ensure structured, traceable data progression from raw ingestion to analytics-ready datasets.
  • Develop ELT/ETL workflows using Azure Data Factory, Databricks Notebooks, and PySpark - including data cleaning, deduplication, and transformation logic.
  • Build and optimize end-to-end data pipelines to prepare clean, feature-engineered datasets for modelling.
  • Architect and manage Delta tables within Databricks, enforcing schema evolution, time travel, and data quality constraints.
  • Administer and leverage Unity Catalog for data governance, access control, and lineage tracking across all ingested datasets.
  • Configure and manage Azure Data Lake Storage Gen2 (ADLS Gen2) as the central data repository.
  • Collaborate with Data Scientists to optimise data schemas and provide clean, feature-ready datasets for modelling.
  • Develop and maintain data models within Azure Synapse Analytics (dedicated and serverless SQL pools).
  • Use Azure DevOps (ADO) for source control, CI/CD pipeline management, and collaborative code review on all pipeline assets.
  • Implement monitoring, alerting, and logging for all data pipeline processes to ensure operational reliability.
  • Apply data security, governance, and compliance best practices aligned to Unity Catalog and customer requirements.

Requirements

  • 4+ years of professional experience as a Data Engineer with a strong Azure focus.
  • Databricks + Medallion Architecture: hands-on experience designing and implementing ingestion layers using the Medallion Architecture (Bronze/Silver/Gold) within Azure Databricks, including Delta Live Tables or structured notebook workflows.
  • Python / PySpark Notebooks: primary development environment for pipeline and transformation logic; strong command of PySpark for distributed data processing and Python for utility scripting and orchestration.
  • Azure Data Services - Synapse, Delta Tables, Unity Catalog: demonstrable experience with Azure Synapse Analytics (SQL and Spark pools), Delta table management (schema evolution, VACUUM, OPTIMIZE), and Unity Catalog for governance and lineage. These are named source systems on this engagement.
  • ETL/ELT Pipeline Development: proven track record building robust cleaning, deduplication, and multi-source transformation pipelines that handle messy, real-world enterprise data at scale.
  • Azure DevOps (ADO): use of ADO for Git-based source control, pull request workflows, and CI/CD pipeline deployment of Databricks jobs and ADF pipelines.
  • Solid experience with Azure Data Factory for orchestrating complex data pipelines.
  • Strong SQL skills including complex query optimisation.

Desirable Skills

  • Microsoft Certified: Azure Data Engineer Associate (DP-203) certification.
  • Familiarity with infrastructure-as-code tools (Terraform, Bicep) for Azure resource deployment.
  • Experience with data cataloguing and governance using Microsoft Purview.
  • Knowledge of streaming architectures and near-real-time data ingestion patterns.

Personal Attributes

  • Detail-oriented with a strong commitment to data quality and pipeline reliability.
  • Proactive self-starter able to operate with a high degree of autonomy.
  • Strong collaborator with the ability to align engineering decisions to data science and business requirements.
  • Comfortable navigating ambiguity and adapting to evolving project needs.

About the company

MatchPoint Solutions is a fast-growing, young, energetic global IT-Engineering services company with clients across the US. We provide technology solutions to various clients like Uber, Robinhood, Netflix, Airbnb, Google, Sephora, and more! More recently, we have expanded to working internationally in Canada, China, Ireland, UK, Brazil, and India. Through our culture of innovation, we inspire, build, and deliver business results, from idea to outcome. We keep our clients on the cutting edge of the latest technologies and provide solutions by using industry-specific best practices and expertise. We are excited to be continuously expanding our team. If you are interested in this position, please send over your updated resume. We look forward to hearing from you!, Jones Lang LaSalle + Saint Louis, MO JLL empowers you to shape a brighter way. Our people at JLL are shaping the future of real estate for a better world by combining world class services, advisory and technology fo…

Apply for this position