DataFlux Solution Architect Databricks Modernization Engagement
Info Dinamica Inc
Rochester, United States of America
3 days ago
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
SeniorJob location
Rochester, United States of America
Tech stack
Unity
Third Normal Form
Airflow
Amazon Web Services (AWS)
Azure
Data Architecture
Data Deduplication
Data Governance
Data Integration
Data Vault Modeling
Hive
Metadata
Meta-Data Management
Role-Based Access Control
Reference Data
Migration Manager
SAS (Software)
Google Cloud Platform
Data Lake
PySpark
DataFlux
Data Management
Databricks
Job description
Our client is driving a strategic modernization initiative to migrate enterprise Data Quality, MDM, and data integration workloads from SAS DataFlux to the Databricks Lakehouse Platform. We are seeking a Senior Onshore DataFlux Solution Architect to lead architecture, migration strategy, and target-state design for this multi-phase transformation program. Key Responsibilities
- Lead end-to-end architecture for SAS DataFlux to Databricks migration programs
- Perform current-state assessment, gap analysis, target-state design, and migration roadmap planning
- Define scalable solutions using Databricks, Delta Lake, Unity Catalog, DLT, and Medallion Architecture
- Partner with Enterprise Architecture, Data Governance, and Security teams for RBAC, lineage, metadata, and PII compliance
- Provide technical leadership to onshore/offshore teams, conduct design reviews, and enforce engineering standards
- Design high-performance data quality, matching, deduplication, and MDM solutions using PySpark and Spark SQL
Requirements
- 10+ years of enterprise data architecture experience
- 5+ years of hands-on expertise with SAS DataFlux (dfPower Studio / Data Management Studio / Data Management Server)
- Strong experience in MDM domains: Customer, Product, Vendor, Employee, Location, and Reference Data
- Proven experience with Databricks, Delta Lake, Unity Catalog, Delta Live Tables, and Workflows
- Strong PySpark and Spark SQL development skills
- Experience with cloud platforms: Azure, AWS, or Google Cloud Platform
- Knowledge of modern data tools such as Fivetran, ADF, Airflow, and dbt
- Strong understanding of Data Quality, Data Governance, Data Modeling (3NF, Dimensional, Data Vault), and Metadata Management, * Experience leading DataFlux modernization or sunset programs
- Databricks certifications preferred
- Experience in Healthcare, Insurance, or Financial Services domains