Back

TELECOMMUTE Data Engineer

Innovative IT Solutions Inc

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Job location

Remote

Tech stack

Java

Big Data

Cloudera Impala

Computer Programming

Data Integration

Distributed Systems

Hadoop

HBase

Hive

Performance Tuning

Data Processing

Data Ingestion

Spark

Backend

PySpark

Job description

Develop and maintain data pipelines using Spark (PySpark/Scala/Java)
Work with HBase and Hadoop ecosystem tools (Hive, Impala, Hue)
Process large datasets in distributed environments
Collaborate with backend teams for data integration
Optimize performance of data processing jobs
Support data ingestion, transformation, and storage solutions
Participate in testing and performance tuning

Requirements

Strong experience with Apache Spark (must-have)
Hands-on with HBase (must-have)
Good programming skills in Java or PySpark
Experience with Hadoop ecosystem (Hive, Impala, Hue)
Strong understanding of data processing concepts

Apply for this position