Data Engineer

Collabera
Chicago, United States of America
yesterday

Role details

Contract type
Temporary contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Compensation
$ 152K

Job location

Chicago, United States of America

Tech stack

PHP
API
Big Data
Cloudera Impala
Relational Databases
File Systems
Distributed Computing Environment
Elasticsearch
Hadoop
MapReduce
HBase
Hive
Python
MySQL
Performance Tuning
Queueing Systems
Cloudera
Solr
SQL Databases
Sqoop
Software Vulnerability Management
Data Processing
Data Ingestion
Sql Optimization
Spark
Semi-structured Data
Data Analytics
Real Time Data
Kafka
Spark Streaming
Kibana
Data Pipelines
Sql Tuning
Programming Languages

Job description

You will work closely with cross-functional teams to design, develop, and optimize scalable data pipelines that process large volumes of structured and semi-structured data from multiple enterprise sources. The ideal candidate brings strong experience in Hadoop ecosystems, Spark Structured Streaming, SQL optimization, and distributed data processing technologies. This is a high-impact role where you will contribute directly to critical security and vulnerability management initiatives by enabling reliable, high-quality data processing and analytics capabilities. Top Three skills: o Advanced SQL & Distributed Data Processing (Spark SQL, Hive, Impala) o Spark Structured Streaming & Real-Time Data Pipeline Development o Hadoop Ecosystem & Big Data Technologies (Kafka, Hive, HBase, MapReduce, Cloudera), o Sql o Distributed Data Processing o SQL Performance Tuning o Advanced SQL o Real-Time Data Pipeline Development o Big Data Technologies o Hadoop Ecosystem o MapReduce o Spark o Spark SQL o SOLR o ElasticSearch o Hortonworks o hive o impala o hbase o hadoop o cloudera o big data

Requirements

o Strong SQL expertise with technologies such as Hive, Impala, MySQL, or Spark SQL o Hands-on experience with Spark Structured Streaming and distributed data processing o Experience working within Hadoop/Big Data ecosystems o Strong experience with tools and technologies such as Spark, Kafka, Sqoop, MapReduce, HBase, SOLR, ElasticSearch, Kibana, Cloudera/CDP/Hortonworks o Experience building ingestion pipelines from APIs, message queues, relational databases, and file systems o Proficiency in at least one programming language: Scala, Python, or PHP, o Recent experience supporting U.S.-based clients or enterprise environments within the last 5 years is highly preferred.

Benefits & conditions

Hourly Rate: $72 - $73 per hour, The Company offers the following benefits for this position, subject to applicable eligibility requirements: medical insurance, dental insurance, vision insurance, 401(k) retirement plan, life insurance, long-term disability insurance, short-term disability insurance, paid parking/public transportation, (paid time , paid sick and safe time , hours of paid vacation time, weeks of paid parental leave, paid holidays annually - AS Applicable)

About the company

At Collabera, we don't just offer jobs-we build careers. As a global leader in talent solutions, we provide opportunities to work with top organizations, cutting-edge technologies, and dynamic teams. Our culture thrives on innovation, collaboration, and a commitment to excellence. With continuous learning, career growth, and a people-first approach, we empower you to achieve your full potential. Join us and be part of a company that values passion, integrity, and making an impact.

Apply for this position