Principal Software Engineer - Spark

Cloudera, Inc.

San Jose, United States of America

yesterday

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

$ 320K

Job location

Remote

San Jose, United States of America

Tech stack

Java

Airflow

Amazon Web Services (AWS)

Azure

C++

Cloud Engineering

Data as a Service

Information Engineering

Data Infrastructure

Distributed Computing Environment

Distributed Systems

Python

Open Source Technology

Openshift

Cloud Services

Cloudera

Scala

Private Cloud Environment

Google Cloud Platform

Spark

Containerization

Kubernetes

Information Technology

Rancher

Docker

Job description

Drive the multi-year technical roadmap and architectural vision for Cloudera Data Engineering.
Gain deep technical knowledge across the data services technical stack, with a focus on Spark, Airflow, and Iceberg, and apply this expertise in your daily work.
Foster engineering excellence through technical mentorship, design reviews, and architectural guidance.
Collaborate with product, engineering, and cross-functional partners, leading the delivery of several large, critical features in Cloudera's data engineering experience.
Work on large-scale distributed systems, ranging from hundreds to thousands of nodes in production clusters.
Bring passion for programming, clean coding practices, attention to detail, and a strong focus on quality.

Requirements

Relevant studies / BS or MS in Computer Science or related field
10+ years of experience as a Software Engineer in the data infrastructure space
Strong understanding of at least one of the following languages: Java, Scala, C++, Python, or Go, and an interest in learning the languages we use.
Passionate about programming, clean coding habits, attention to detail, and focus on quality
Deep expertise in distributed data processing systems and cloud-native architectures.
Excellent communication and collaboration skills
Experience with containerization (Kubernetes, Docker).
Experience with using/developing Apache Spark/Airflow or other related technologies.
Experience with public cloud (AWS/Azure/GCP) and/or private cloud (OpenShift/Rancher)
(Most importantly) An open-minded attitude, desire to learn new things and build great products

You might also have…

Contributed to open-source projects.
Strong understanding of modern Lakehouse architectures, open table formats, and metadata/catalog services.
Experience with large-scale, distributed systems design and development with an understanding of scaling, performance, and scheduling.
Solid experience with at least one cloud service (AWS, Azure, GCP, OpenShift)

Benefits & conditions

The expected base salary range for this role in California is $270k - $320k

The salary will vary depending on your job-related skills, experience and location

What you can expect from us:

Generous PTO Policy
Support for work-life balance with Unplugged Days
Flexible WFH Policy
Mental & Physical Wellness programs
Phone and Internet Reimbursement program
Access to Continued Career Development
Comprehensive Benefits and Competitive Packages
Paid Volunteer Time
Employee Resource Groups

About the company

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world's largest enterprises.

Cloudera Data Engineering is a next-generation cloud-native service that enables customers to run large-scale data engineering workflows using industry-standard big data technologies such as Apache Spark, Apache Airflow, and Apache Iceberg with just a few clicks across both on-premises and public cloud environments.

We are seeking a Principal Staff Engineer with a strong technical background in the data infrastructure space to lead the Cloudera Data Engineering experience for all customers using Cloudera Data Engineering Spark, Airflow, and Lakehouse. This high-impact IC role offers the opportunity to shape the future of Cloudera's Data Engineering and Lakehouse products across multiple cloud environments, impacting thousands of customers worldwide.
