TELECOMMUTE Data Engineer

Innovative IT Solutions Inc
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Remote

Tech stack

Java
Big Data
Cloudera Impala
Computer Programming
Data Integration
Distributed Systems
Hadoop
HBase
Hive
Performance Tuning
Data Processing
Data Ingestion
Spark
Backend
PySpark

Job description

  • Develop and maintain data pipelines using Spark (PySpark/Scala/Java)
  • Work with HBase and Hadoop ecosystem tools (Hive, Impala, Hue)
  • Process large datasets in distributed environments
  • Collaborate with backend teams for data integration
  • Optimize performance of data processing jobs
  • Support data ingestion, transformation, and storage solutions
  • Participate in testing and performance tuning

Requirements

  • Strong experience with Apache Spark (must-have)
  • Hands-on with HBase (must-have)
  • Good programming skills in Java or PySpark
  • Experience with Hadoop ecosystem (Hive, Impala, Hue)
  • Strong understanding of data processing concepts

Apply for this position