Data Engineer - Ab initio

Mphasis
Hanover, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Hanover, United States of America

Tech stack

Java
Big Data
Business Process Modeling
Cloud Computing
Databases
Data Validation
Information Engineering
ETL
Data Security
Data Systems
Integrated Development Environments
Python
Oracle Applications
Queueing Systems
Scala
Shell Script
Software Engineering
SQL Databases
Teradata
Data Processing
Rollup
System Availability
Ab Initio
Documentation System
Containerization
Kubernetes
Information Technology
Real Time Data
Kafka
Data Pipelines
Docker

Job description

We are seeking a highly skilled resource to design and implement high-performance, event-driven data pipelines, ensuring low-latency data processing and high availability system for the large credit card processing system. The ideal candidate will work with the Ab Initio ecosystem (GDE, EME, Conduct>It) to build stateful services that ingest, filter, and transform data from sources like Kafka or message queues, pushing updates to dashboards or downstream databases in near-real-time., * Create complex Ab Initio continuous flow graphs, including stateful joins, sliding time windows, and aggregations.

  • Implement event-driven data pipelines using Kafka, MQ, and file streams.
  • Ensure the resilience of continuous flows, including checkpointing and recovery, to guarantee "exactly-once" processing.
  • Apply advanced Ab Initio components (e.g., Reformat, Rollup, Join, Partition) to ensure low-latency performance.
  • Proactively monitor live production streams to ensure 24/7 reliability and troubleshooting data issues
  • Develop ETL pipelines for batch and real-time data ingestion and transformation.
  • Implement and ensure data validation, data security, integrity, and compliance across big data platforms.
  • Monitor and troubleshoot performance issues in large-scale clusters.
  • Collaborate with data scientists, analysts, and application teams to deliver high-quality data solutions.
  • Automate workflows and improve operational efficiency using scripting and orchestration tools.

Requirements

  • Deep understanding of credit card process system
  • Deep knowledge of GDE (Graphical Development Environment), EME (Enterprise Meta>Environment), Conduct>It, and Continuous Flows.
  • Understanding of Kafka, message queues, and real-time stateful services.
  • Proficiency in Unix/Linux shell scripting, SQL, and database technologies (e.g., Oracle, Teradata).
  • Experience in Java , Scala, Python or Kafka is plus.
  • Familiarity with Linux/Unix environments and shell scripting.
  • Understanding of data security, governance, and compliance standards.
  • Experience with cloud-based big data platforms
  • Exposure to containerization (Docker, Kubernetes) for big data workloads.
  • Knowledge of CI/CD pipelines for data engineering projects.

Behavioral Skills:

  • Good Communication skills
  • 5 days Work from Office at Berkley Heights, NJ
  • Team Player
  • Ability to work in a changing environment
  • Strong problem solving and analytical skills
  • Ability to work independently or within a team
  • Manage day-to-day challenges and communicate developmental risks with the technical team

Qualifications:

  • Bachelor's degree in computer science, Software Engineering, or a related field.
  • Proficiency in business process modeling and documentation tools.
  • Product implementation experience is preferred

Apply for this position