Data Engineer - PySpark

SSI People

Pittsburgh, United States of America

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Job location

Pittsburgh, United States of America

Tech stack

Agile Methodologies

Data Validation

Information Engineering

Data Transformation

Dataspaces

Software Debugging

Linux

Hadoop

Python

Object-Oriented Software Development

Performance Tuning

Standard SQL

SQL Databases

Data Processing

SQL Optimization

Spark

Git

Pandas

PySpark

Information Technology

Data Pipelines

Jenkins

Job description

We are seeking an experienced Data Engineer with strong PySpark and SQL expertise to join our Banking client in Pittsburgh, PA. This is a fully onsite role supporting data engineering initiatives, pipeline development, and modernization efforts within a structured and governed data environment.

Participate in daily Scrum calls and Agile ceremonies.
Develop, enhance, and maintain data pipelines and transformation processes.
Perform coding, testing, debugging, and implementation activities.
Collaborate with business and technical teams to translate requirements into scalable data solutions.
Ensure data quality through rigorous validation, reconciliation, and testing.
Optimize existing data processing workflows and improve performance.
Support modernization initiatives and upcoming change release implementations.

Requirements

Visa/Sponsorship: Candidates must be authorized to work on W2 without current or future sponsorship requirements.

8+ years of overall IT experience.
Strong hands-on experience with PySpark and Spark-based data processing.
Advanced SQL skills with expertise in complex data transformations.
Experience building and maintaining large-scale data pipelines.
Strong data validation, reconciliation, and troubleshooting skills.
Ability to translate business requirements and source-to-target mappings into technical solutions.
Experience working in structured, governed data ecosystems.
Strong Python, SQL, Git, and Linux skills.
Experience with object-oriented programming principles.
Practical debugging and performance optimization experience.

Preferred Skills

Jenkins
Apache Spark
Hadoop Ecosystem
Pandas
Banking or Financial Services experience
