Data Engineer with Ab Initio

Randstad
Phoenix, United States of America
10 days ago

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$ 145K

Job location

Phoenix, United States of America

Tech stack

Agile Methodologies
Airflow
Data analysis
CA Workload Automation Ae
Google BigQuery
Bioinformatics
Cloud Storage
Information Engineering
Data Governance
Data Security
Data Vault Modeling
Software Debugging
Data Flow Control
Python
Shell
Operational Databases
Performance Tuning
Scrum
Cloudera
SQL Databases
Test Case Design
Freeform SQL
Google Cloud Platform
Spark
Parallel Computation
Ab Initio
Build Server
GIT
Data Layers
Build Management
Production Code
Google BigQuery
Data Pipelines

Job description

job summary: Randstad Digital is hiring and we're looking for someone like YOU to join our team! If you are seeking a new opportunity, looking to grow in your career, or you know someone who is - we want to hear from you! Take a look at the below opportunity, or feel free to visit RandstadUSA.com to view and apply.

location: Phoenix, Arizona job type: Contract to Perm salary: $56.71 - 69.53 per hour work hours: 8am to 5pm education: Bachelors

responsibilities:

  • Pipeline Modernization: Lead the translation and migration of complex enterprise data graphs from Ab Initio (Co>Operating System, EME) into modern GCP-native frameworks using Python, SQL, and Cloud Composer (Apache Airflow).
  • BigQuery Architecture: Design and build optimized, highly secure data warehouses and semantic layers within Google BigQuery. Implement strict partitioning, clustering, and slot management strategies to ensure low-latency performance and cost optimization.
  • Framework Development: Build reusable, config-driven ingestion and transformation libraries (using Python and Dataflow/Dataproc) to minimize "one-off" pipelines and accelerate enterprise-wide cloud onboarding.
  • Data Governance & Risk Mitigation: Enforce data quality rules, automated reconciliation metrics, and robust end-to-end lineage mapping using GCP Dataplex and Data Catalog to adhere to strict enterprise risk and compliance frameworks.
  • Automation & GenAI Adoption: Evaluate and utilize modern automation and GenAI-assisted tooling to accelerate schema mapping, legacy code translation (e.g., Ab Initio/SQL to Python/Beam), and test case generation.
  • Agile Collaboration: Partner closely with Data Governance, Security, Infrastructure, and downstream Analytics teams in an Agile/Scrum environment to deliver production-ready code via CI/CD pipelines (Git, Cloud Build).

qualifications: 4+ years of Data Engineering experience or equivalent.

3+ years of hands-on experience building, debugging, and maintaining enterprise-scale data pipelines using Ab Initio (and/or parallel architectures like Spark/Informatica) alongside Unix Shell Scripting and Autosys.

2+ years of direct experience deploying production data workloads on Google Cloud Platform, specifically utilizing BigQuery, Cloud Storage (GCS), and Cloud Composer/Airflow.

Advanced expertise in writing and tuning highly complex SQL queries and designing robust data models (Dimensional, Data Vault, or Star Schemas).

Equal Opportunity Employer: Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.

At Randstad Digital, we welcome people of all abilities and want to ensure that our hiring and interview process meets the needs of all applicants. If you require a reasonable accommodation to make your application or interview experience a great one, please contact HRsupport@randstadusa.com.

Pay offered to a successful candidate will be based on several factors including the candidate's education, work experience, work location, specific job duties, certifications, etc. In addition, Randstad Digital offers a comprehensive benefits package, including: medical, prescription, dental, vision, AD&D, and life insurance offerings, short-term disability, and a 401K plan (all benefits are based on eligibility).

This posting is open for thirty (30) days.

,

  • Pipeline Modernization: Lead the translation and migration of complex enterprise data graphs from Ab Initio (Co>Operating System, EME) into modern GCP-native frameworks using Python, SQL, and Cloud Composer (Apache Airflow).
  • BigQuery Architecture: Design and build optimized, highly secure data warehouses and semantic layers within Google BigQuery. Implement strict partitioning, clustering, and slot management strategies to ensure low-latency performance and cost optimization.
  • Framework Development: Build reusable, config-driven ingestion and transformation libraries (using Python and Dataflow/Dataproc) to minimize "one-off" pipelines and accelerate enterprise-wide cloud onboarding.
  • Data Governance & Risk Mitigation: Enforce data quality rules, automated reconciliation metrics, and robust end-to-end lineage mapping using GCP Dataplex and Data Catalog to adhere to strict enterprise risk and compliance frameworks.
  • Automation & GenAI Adoption: Evaluate and utilize modern automation and GenAI-assisted tooling to accelerate schema mapping, legacy code translation (e.g., Ab Initio/SQL to Python/Beam), and test case generation.
  • Agile Collaboration: Partner closely with Data Governance, Security, Infrastructure, and downstream Analytics teams in an Agile/Scrum environment to deliver production-ready code via CI/CD pipelines (Git, Cloud Build).

Requirements

4+ years of Data Engineering experience or equivalent. 3+ years of hands-on experience building, debugging, and maintaining enterprise-scale data pipelines using Ab Initio (and/or parallel architectures like Spark/Informatica) alongside Unix Shell Scripting and Autosys. 2+ years of direct experience deploying production data workloads on Google Cloud Platform, specifically utilizing BigQuery, Cloud Storage (GCS), and Cloud Composer/Airflow. Advanced expertise in writing and tuning highly complex SQL queries and designing robust data models (Dimensional, Data Vault, or Star Schemas).

Apply for this position