Lead Data Engineer -Databricks

Alphanumeric Systems, Inc.

Richmond, United States of America

1 month ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

$ 208K

Job location

Remote

Richmond, United States of America

Tech stack

Agile Methodologies

Artificial Intelligence

Amazon Web Services (AWS)

Unit Testing

Azure

Big Data

Continuous Integration

Information Engineering

Data Governance

Data Systems

Software Debugging

Job Scheduling

Systems Development Life Cycle

Standard Sql

Data Streaming

Data Ingestion

Spark

Gitlab

Data Lake

PySpark

Deployment Automation

Data Pipelines

Databricks

Job description

Alphanumeric is hiring a DATA ENGINEER IV LEAD to work remotely (EST hours preferred) with an established leader in the financial and insurance industries. Candidates located in or near Richmond, VA are strongly preferred. This is a contract-to-hire opportunity with an approximate conversion salary of $135,000 annually. Pay Range: $93.00 - $100.00/hr. W2 No third-party agencies please. Sponsorship is not available for this position. As a member of the Data Solutions and Data Engineering team, you will play a key role in transforming enterprise data capabilities using Databricks and modern cloud-based data engineering practices. This position focuses heavily on enhancing out-of-the-box Databricks functionality, developing scalable frameworks, optimizing Delta Lake infrastructure, and supporting large-scale data transformation initiatives utilizing medallion architecture principles. You will help design and build robust data ingestion, standardization, and curation pipelines across bronze, silver, and gold layers while improving the usability and accessibility of enterprise data. This includes supporting initiatives involving AI-powered extraction of underwriting and medical data from images and PDFs to improve policy cost analysis and business insights. What You'll Be Doing:

Partner with business users to gather and define data requirements
Collaborate with architects and technical leads to design scalable Databricks solutions
Build and support data engineering solutions throughout the full SDLC lifecycle
Develop frameworks and reusable components for pipeline standardization, SLA monitoring, and data quality management
Create and optimize Delta tables, Databricks dashboards, and orchestration workflows
Design and maintain dimensional and ER-based data models utilizing medallion architecture
Implement batch and streaming data pipelines within Databricks
Create unit tests, perform SIT testing, and support UAT troubleshooting efforts
Debug and resolve data defects and performance issues
Implement and maintain data security policies and compliance standards
Work closely with upstream/downstream teams, including offshore and vendor partners
Produce technical documentation, training materials, and knowledge transfer sessions
Research and evaluate emerging Databricks capabilities including Intelligent Document Processing, Genie AI coding, and self-service workspace features

Requirements

Minimum 3 years of hands-on Databricks experience in Azure or AWS environments
Strong experience utilizing medallion architecture and modern data modeling techniques
Expertise creating reusable data engineering frameworks and standardization processes
Experience developing dimensions and fact tables with SCD2 tracking
Strong Databricks orchestration experience with both batch and streaming pipelines
Experience performing testing, debugging, and production support
Strong understanding of Databricks compute and storage optimization
Experience with CI/CD, GitLab, and deployment automation tools
Strong understanding of Agile methodologies, story creation, and effort estimation

Hands-On Technical Skills:

Expert-level SQL skills
Intermediate or higher proficiency in PySpark
Experience with Lakeflow and data quality expectations coding
Strong understanding of Databricks features including Unity Catalog, Spark UI, Job Scheduling, and related capabilities
Ability to quickly learn and implement newer Databricks technologies and AI-driven capabilities, * 5+ years of Databricks experience building enterprise-scale capabilities from scratch
Experience working within insurance or financial services environments
Experience with data governance and observability platforms
Understanding of enterprise data models and schema evolution strategies

Soft Skills:

Excellent communication and presentation skills
Ability to create training materials and technical documentation
Strong problem-solving and analytical thinking
Collaborative mindset with strong teamwork skills
Design-thinking approach to solution development

Lead Data Engineer -Databricks

Role details

Job location

Tech stack

Job description

Requirements

About the company

Apply for this position

Role details

Job location

Tech stack

Job description

Requirements

About the company

Apply for this position

Good distractions

Moments

Videos View all