Data Engineer - PySpark

SSI People
Pittsburgh, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Pittsburgh, United States of America

Tech stack

Agile Methodologies
Data Validation
Information Engineering
Data Transformation
Dataspaces
Software Debugging
Linux
Hadoop
Python
Object-Oriented Software Development
Performance Tuning
Standard Sql
SQL Databases
Data Processing
Sql Optimization
Spark
GIT
Pandas
PySpark
Information Technology
Data Pipelines
Jenkins

Job description

We are seeking an experienced Data Engineer with strong PySpark and SQL expertise to join our Banking client in Pittsburgh, PA. This is a fully onsite role supporting data engineering initiatives, pipeline development, and modernization efforts within a structured and governed data environment., * Participate in daily Scrum calls and Agile ceremonies.

  • Develop, enhance, and maintain data pipelines and transformation processes.
  • Perform coding, testing, debugging, and implementation activities.
  • Collaborate with business and technical teams to translate requirements into scalable data solutions.
  • Ensure data quality through rigorous validation, reconciliation, and testing.
  • Optimize existing data processing workflows and improve performance.
  • Support modernization initiatives and upcoming change release implementations.

Requirements

Visa/Sponsorship: Candidates must be authorized to work on W2 without current or future sponsorship requirements., * 8+ years of overall IT experience.

  • Strong hands-on experience with PySpark and Spark-based data processing.
  • Advanced SQL skills with expertise in complex data transformations.
  • Experience building and maintaining large-scale data pipelines.
  • Strong data validation, reconciliation, and troubleshooting skills.
  • Ability to translate business requirements and source-to-target mappings into technical solutions.
  • Experience working in structured, governed data ecosystems.
  • Strong Python, SQL, Git, and Linux skills.
  • Experience with object-oriented programming principles.
  • Practical debugging and performance optimization experience.

Preferred Skills

  • Jenkins
  • Apache Spark
  • Hadoop Ecosystem
  • Pandas
  • Banking or Financial Services experience

Apply for this position