Principal Software Engineer - Spark
Cloudera, Inc.
San Jose, United States of America
yesterday
Role details
Contract type
Permanent contract Employment type
Full-time (> 32 hours) Working hours
Regular working hours Languages
English Experience level
Senior Compensation
$ 320KJob location
Remote
San Jose, United States of America
Tech stack
Java
Airflow
Amazon Web Services (AWS)
Azure
C++
Cloud Engineering
Data as a Services
Information Engineering
Data Infrastructure
Distributed Computing Environment
Distributed Systems
Python
Open Source Technology
Openshift
Cloud Services
Cloudera
Scala
Private Cloud Environment
Google Cloud Platform
Spark
Containerization
Kubernetes
Information Technology
Rancher
Docker
Go
Job description
- Drive the multi-year technical roadmap and architectural vision for Cloudera Data Engineering.
- Gain deep technical knowledge across the data services technical stack, with a focus on Spark, Airflow, Iceberg, and apply this expertise in your daily work.
- Foster engineering excellence through technical mentorship, design reviews, and architectural guidance.
- Collaborate with product, engineering, and cross-functional partners, leading the delivery of several large, critical features in Cloudera's data engineering experience.
- Work on large-scale distributed systems, ranging from hundreds to thousands of nodes in production clusters.
- Bring passion for programming, clean coding practices, attention to detail, and a strong focus on quality.
Requirements
- Relevant studies / BS or MS in Computer Science or related field
- 10+ years of experience as a Software Engineer in the data infrastructure space
- Strong understanding of at least one of the following languages: Java, Scala, C++, Python, GoLang. And interested to learn the languages we're using.
- Passionate about programming, clean coding habits, attention to detail, and focus on quality
- Deep expertise in distributed data processing systems and cloud-native architectures.
- Excellent communication and collaboration skills
- Experience with containerization (Kubernetes, Docker).
- Experience with using/developing Apache Spark/Airflow or other related technologies.
- Experience with public cloud (AWS/Azure/GCP) and/or private cloud (OpenShift/Rancher)
- (Most importantly) An open-minded attitude, desire to learn new things and build great products
You might also have…
- Contributed to open-source projects.
- Strong understanding of modern Lakehouse architectures, open table formats, and metadata/catalog services.
- Experience with large-scale, distributed systems design and development with an understanding of scaling, performance, and scheduling.
- Solid experience with at least one cloud service (AWS, Azure, GCP, OpenShift)
Benefits & conditions
The expected base salary range for this role in California is $270 - $320k
The salary will vary depending on your job-related skills, experience and location
What you can expect from us:
- Generous PTO Policy
- Support work life balance with Unplugged Days
- Flexible WFH Policy
- Mental & Physical Wellness programs
- Phone and Internet Reimbursement program
- Access to Continued Career Development
- Comprehensive Benefits and Competitive Packages
- Paid Volunteer Time
- Employee Resource Groups
About the company
At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world's largest enterprises.
Cloudera Data Engineering is the next-generation cloud-native service that helps our customers run large-scale data engineering workflows made up of industry-standard big data processing frameworks like Apache Spark, Apache Airflow, Iceberg with just a few clicks, across both on-premises and public cloud environments.
Cloudera Data Engineering is a next-generation cloud-native service that enables customers to run large-scale data engineering workflows using industry-standard big data technologies such as Apache Spark, Apache Airflow, and Apache Iceberg with just a few clicks across both on-premises and public cloud environments.
We are seeking a Principal Staff Engineer with a strong technical background in the data infrastructure space to lead the Cloudera Data Engineering experience for all customers using Cloudera Data Engineering Spark, Airflow, and Lakehouse. This high-impact IC role offers the opportunity to shape the future of Cloudera's Data Engineering and Lakehouse products across multiple cloud environments, impacting thousands of customers worldwide.