Data Engineer

Collabera

Chicago, United States of America

yesterday

Role details

Contract type

Temporary contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Compensation

$ 152K

Job location

Chicago, United States of America

Tech stack

PHP

API

Big Data

Cloudera Impala

Relational Databases

File Systems

Distributed Computing Environment

Elasticsearch

Hadoop

MapReduce

HBase

Hive

Python

MySQL

Performance Tuning

Queueing Systems

Cloudera

Solr

SQL Databases

Sqoop

Software Vulnerability Management

Data Processing

Data Ingestion

Sql Optimization

Spark

Semi-structured Data

Data Analytics

Real Time Data

Kafka

Spark Streaming

Kibana

Data Pipelines

Sql Tuning

Programming Languages

Job description

You will work closely with cross-functional teams to design, develop, and optimize scalable data pipelines that process large volumes of structured and semi-structured data from multiple enterprise sources. The ideal candidate brings strong experience in Hadoop ecosystems, Spark Structured Streaming, SQL optimization, and distributed data processing technologies. This is a high-impact role where you will contribute directly to critical security and vulnerability management initiatives by enabling reliable, high-quality data processing and analytics capabilities. Top Three skills: o Advanced SQL & Distributed Data Processing (Spark SQL, Hive, Impala) o Spark Structured Streaming & Real-Time Data Pipeline Development o Hadoop Ecosystem & Big Data Technologies (Kafka, Hive, HBase, MapReduce, Cloudera), o Sql o Distributed Data Processing o SQL Performance Tuning o Advanced SQL o Real-Time Data Pipeline Development o Big Data Technologies o Hadoop Ecosystem o MapReduce o Spark o Spark SQL o SOLR o ElasticSearch o Hortonworks o hive o impala o hbase o hadoop o cloudera o big data

Requirements

o Strong SQL expertise with technologies such as Hive, Impala, MySQL, or Spark SQL o Hands-on experience with Spark Structured Streaming and distributed data processing o Experience working within Hadoop/Big Data ecosystems o Strong experience with tools and technologies such as Spark, Kafka, Sqoop, MapReduce, HBase, SOLR, ElasticSearch, Kibana, Cloudera/CDP/Hortonworks o Experience building ingestion pipelines from APIs, message queues, relational databases, and file systems o Proficiency in at least one programming language: Scala, Python, or PHP, o Recent experience supporting U.S.-based clients or enterprise environments within the last 5 years is highly preferred.

Benefits & conditions

Hourly Rate: $72 - $73 per hour, The Company offers the following benefits for this position, subject to applicable eligibility requirements: medical insurance, dental insurance, vision insurance, 401(k) retirement plan, life insurance, long-term disability insurance, short-term disability insurance, paid parking/public transportation, (paid time , paid sick and safe time , hours of paid vacation time, weeks of paid parental leave, paid holidays annually - AS Applicable)

About the company

At Collabera, we don't just offer jobs-we build careers. As a global leader in talent solutions, we provide opportunities to work with top organizations, cutting-edge technologies, and dynamic teams. Our culture thrives on innovation, collaboration, and a commitment to excellence. With continuous learning, career growth, and a people-first approach, we empower you to achieve your full potential. Join us and be part of a company that values passion, integrity, and making an impact.

Role details

Job location

Tech stack

Job description

Requirements

Benefits & conditions

About the company

Apply for this position

Good distractions

Moments

Videos View all