Senior Data Platform Engineer
Role details
Job location
Tech stack
Job description
Stack is developing revolutionary AI and advanced autonomous systems designed to enhance safety, reliability, and efficiency of modern operations. Stack's autonomous technology incorporates cutting-edge advancements in artificial intelligence, robotics, machine learning, and cloud technologies, empowering us to create innovative solutions that address the needs and challenges of the dynamic trucking transportation industry. With decades of experience creating and deploying real world systems for demanding environments, the Stack team is dedicated to developing an autonomous solution ecosystem tailored to the trucking industry's unique demands.
About the Role:
In the Compute Platform team, our mission is to provide the foundational compute platform that powers large-scale autonomous systems development. The team is responsible for enabling engineers and researchers to efficiently run compute and data intensive workloads on Stack AV infrastructure.
The Data Platform team is responsible for designing, implementing and maintaining the Stack AV on-premises data platform. The team supports large scale OLAP/OLTP and feature engineering workloads for multiple Product Development groups across the company. You will work at the intersection of infrastructure, distributed systems, and developer experience-ensuring that our critical services and pipelines are reliable, efficient, and easy to run.
As a Senior Data Platform Engineer, you will design and operate high scale data systems that power engineers across the company.
Responsibilities:
- Design and operate distributed storage systems for scheduling and executing large-scale batch workloads.
- Build and maintain an open source, modern data platform.
- Optimize utilization of storage resourcesImprove reliability and fault tolerance of large-scale storage systems and data platform components.
- Collaborate with teams across the company to understand workload requirements and improve platform capabilities.
- Contribute to platform tooling, automation, and CI/CD workflows.
Requirements
Do you have experience in System troubleshooting?, * 7+ years of experience building and operating distributed storage systems or modern data platforms.
- Experience operating streaming platforms such as Kafka or Pulsar.
- Fluent in Python, and SQL, with experience writing and maintaining highly available data applications using Trino and Apache Spark.
- Knowledge of table formats (Iceberg, Delta Lake, Hudi, Xtable).
- Experience operating and optimizing at least one RDBMS (Postgres, MySQL).
- Strong debugging and problem-solving skills in complex distributed systems.
- Ability to collaborate across teams and communicate technical concepts clearly.