Data Pipeline Engineer
Dexian DISYS
Washington, United States of America
Role details
Contract type: Permanent contract
Employment type: Full-time (> 32 hours)
Working hours: Regular working hours
Languages: English
Experience level: Senior
Job location: Washington, United States of America
Tech stack
Amazon Web Services (AWS)
Azure
Databases
Data Cleansing
Information Engineering
Data Integrity
ETL
Data Loss
Data Transformation
Data Mining
Relational Databases
Identity and Access Management
Python
Key Management
Oracle Applications
Scrum
Role-Based Access Control
SharePoint
SQL Databases
Transport Layer Security
Database Migration
Information Technology
Physical Data Models
Data Pipelines
Databricks
Job description
The Data Engineer in this role will support one-time and phased mass migration efforts, with an explicit focus on zero data loss, strict adherence to mapping rules, and the successful execution of technical cutover protocols.
Scope of Work
2.1 Pipeline Development and Implementation
- Build automated ETL/ELT pipelines to extract legacy data (Oracle, SharePoint) and load it into AWS RDS and Azure ADLS Gen2.
- Program data cleansing, standardization, and validation logic into the pipelines based on the Data Quality Rulebook.
- Execute the physical data extraction and load during pre-production and production cutovers.
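For candidates gauging the day-to-day work, the cleansing and validation logic mentioned above might resemble the following minimal sketch. The rules shown (trimming whitespace, treating empty strings as missing, checking required fields) and the names `cleanse_record` and `validate_record` are hypothetical illustrations, not entries from the actual Data Quality Rulebook.

```python
def cleanse_record(record):
    """Trim whitespace on string fields and convert empty strings to None.

    Hypothetical standardization rule, used only for illustration.
    """
    cleaned = {}
    for field, value in record.items():
        if isinstance(value, str):
            value = value.strip()
            if value == "":
                value = None
        cleaned[field] = value
    return cleaned


def validate_record(record, required_fields):
    """Return a list of rule violations; an empty list means the record passes."""
    errors = []
    for field in required_fields:
        if record.get(field) is None:
            errors.append(f"missing required field: {field}")
    return errors
```

In a real pipeline, records that fail validation would typically be routed to a reject table or log rather than silently dropped, so that the zero-data-loss requirement can still be audited.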
2.2 Solution Design and Optimization
- Tune extraction queries to minimize performance impacts on legacy production systems during sync operations.
- Implement physical data models for the target databases.
- Code and automate the technical rollback mechanisms and data reconciliation scripts.
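As an illustration of the reconciliation scripts described above, the sketch below compares a source and a target copy of a table by row count and by a hash over the ordered rows. It is a simplified example under stated assumptions: it uses Python's built-in `sqlite3` as a stand-in for the real Oracle and AWS RDS connections, and the function names `table_fingerprint` and `reconcile` are hypothetical.

```python
import hashlib
import sqlite3


def table_fingerprint(conn, table, key_column):
    """Return (row_count, sha256 hex digest) for a table, read in key order."""
    # Table and column names are interpolated directly into the SQL, so they
    # must come from trusted configuration, never from user input.
    cursor = conn.execute(f"SELECT * FROM {table} ORDER BY {key_column}")
    digest = hashlib.sha256()
    row_count = 0
    for row in cursor:
        digest.update(repr(row).encode("utf-8"))
        row_count += 1
    return row_count, digest.hexdigest()


def reconcile(source_conn, target_conn, table, key_column):
    """Compare the source and target copies of one table."""
    src_count, src_hash = table_fingerprint(source_conn, table, key_column)
    tgt_count, tgt_hash = table_fingerprint(target_conn, table, key_column)
    return {
        "table": table,
        "source_rows": src_count,
        "target_rows": tgt_count,
        "match": src_count == tgt_count and src_hash == tgt_hash,
    }
```

Hashing the full row set is practical only for modest table sizes. For large migrations, the same idea is usually applied per partition or per key range, and type differences between source and target engines need to be normalized before hashing.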
2.3 Stakeholder Engagement and Change Management
- Collaborate directly with the Business Analyst to interpret and implement source-to-target mapping (STTM) documents.
- Provide feedback on technical feasibility and performance implications of proposed data transformation rules.
- Participate in daily agile ceremonies and sprint planning.
2.4 Governance, Ethics, and Risk
- Embed automated pre- and post-migration data quality checks directly into the ETL scripts.
- Implement encryption standards at rest (AWS KMS) and in transit (TLS 1.2+).
- Apply role-based access control (RBAC) schemas within the database layers via IAM and Microsoft Entra ID.
2.5 Documentation and Reporting
- Document migration code repositories, operational runbooks, and cutover execution scripts.
- Generate automated migration success/failure logs for auditing.
Requirements
3.1 Education
- Bachelor's or Master's in Computer Science, Data Engineering, or a related quantitative field.
3.2 Certifications (Preferred)
- AWS Certified Data Engineer - Associate or Microsoft Certified: Azure Data Engineer Associate.
3.3 Mandatory Experience
- 5+ years building ETL pipelines, with heavy emphasis on one-time mass data migrations from legacy relational databases.
3.4 Technical Knowledge
- Expert SQL and Python.
- Hands-on experience with database migration tools (e.g., AWS DMS), Databricks, and legacy Oracle ecosystems.
3.5 Core Competencies
- Rigorous attention to detail regarding data integrity, strong problem-solving skills for schema mismatches, and ability to work under strict cutover deadlines.
About the company
Dexian stands at the forefront of Talent + Technology solutions with a presence spanning more than 70 locations worldwide and a team exceeding 10,000 professionals. As one of the largest technology and professional staffing companies and one of the largest minority-owned staffing companies in the United States, Dexian combines over 30 years of industry expertise with cutting-edge technologies to deliver comprehensive global services and support.