MarTech Data Engineering POD supporting a Client Facing Java

Factspan Inc
Atlanta, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Atlanta, United States of America

Tech stack

Agile Methodologies
Airflow
Google BigQuery
Continuous Integration
Data Validation
Information Engineering
Data Governance
ETL
Data Profiling
Data Warehousing
DevOps
Data Flow Control
Python
MongoDB
Performance Tuning
Scrum
SQL Databases
Data Streaming
Data Logging
Scripting (Bash/Python/Go/Ruby)
Google Cloud Platform
Data Ingestion
Delivery Pipeline
Spring-boot
Information Technology
SentinelOne Expertise
Data Pipelines

Job description

This role is part of the MarTech Data Engineering POD supporting a Client Facing Java based Spring Boot application. The initiative aims to build a scalable data warehouse solution which aggregates, transforms and loads sales data into MongoDB from Big Query with the help of Airflow and Dataflow. Finally, publish data with the help of Spring Boot application., * Design and develop data ingestion, transformation, and orchestration pipelines using Google Cloud Platform (BigQuery, Dataflow, Cloud Composer) and Airflow.

  • Implement and maintain data integrations between MongoDB and Big Query, ensuring high performance and scalability.
  • Build and optimize ETL/ELT workflows to support data orchestration, transformation and warehousing.
  • Implement data quality checks, monitoring, and exception handling frameworks to ensure clean and reliable data.
  • Work closely with Lead DE and SME to translate Requirements into data pipelines and orchestration rules.
  • Support performance tuning, data reconciliation, and defect resolution during testing and hypercare phases.
  • Collaborate with QA team for end-to-end test validation and ensure production readiness.
  • Participate in daily Agile ceremonies, sprint reviews, and deployment planning.

Requirements

  • Must have expertise in Java based Spring Boot Application.
  • Strong hands-on experience in Google Cloud Platform services: BigQuery, Dataflow, GCS, Cloud Composer (Airflow).
  • Proficiency in SQL for data profiling, transformation, and performance optimization.
  • Proficiency in Python for pipeline automation, scripting, and orchestration.
  • Experience building batch and streaming data pipelines.
  • Exposure to working with marketing or customer engagement platforms is a plus.
  • Understanding data quality, logging, monitoring, and alerting best practices.

Preferred Skills:

  • Experience in Airflow DAG development for complex data pipelines.
  • Experience with CI/CD and DevOps principles for data deployments.
  • Experience in SentinelOne for Data Governance.
  • Strong troubleshooting and performance tuning skills.

Soft Skills & Attributes:

  • Strong problem-solving skills and attention to detail.
  • Ability to work effectively in a distributed global POD model (onsite + offshore).
  • Should be able to work in client facing environment.
  • Proactive communication and ownership mindset.
  • Comfortable working in fast-paced agile environments with tight timelines.
  • Excellent verbal and written communication skills.

Education: Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.

Apply for this position