TELECOMMUTE Data Engineer
Innovative IT Solutions Inc
yesterday
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
EnglishJob location
Remote
Tech stack
Java
Big Data
Cloudera Impala
Computer Programming
Data Integration
Distributed Systems
Hadoop
HBase
Hive
Performance Tuning
Data Processing
Data Ingestion
Spark
Backend
PySpark
Job description
- Develop and maintain data pipelines using Spark (PySpark/Scala/Java)
- Work with HBase and Hadoop ecosystem tools (Hive, Impala, Hue)
- Process large datasets in distributed environments
- Collaborate with backend teams for data integration
- Optimize performance of data processing jobs
- Support data ingestion, transformation, and storage solutions
- Participate in testing and performance tuning
Requirements
- Strong experience with Apache Spark (must-have)
- Hands-on with HBase (must-have)
- Good programming skills in Java or PySpark
- Experience with Hadoop ecosystem (Hive, Impala, Hue)
- Strong understanding of data processing concepts