Senior Staff Software Engineer: Data & Storage Platform

Uber
Sunnyvale, United States of America
27 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 297K

Job location

Sunnyvale, United States of America

Tech stack

Java
Artificial Intelligence
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Apache HTTP Server
C++
Cloud Computing
Cloud Engineering
ETL
Data Systems
Distributed Data Store
Hadoop Distributed File System
Hive
Data Intelligence
Meta-Data Management
MySQL
NoSQL
Open Source Technology
Performance Tuning
Redis
System Programming
Unstructured Data
AI Infrastructure
Graphics Processing Unit (GPU)
Data Storage Management
Cloud Platform System
Retrieval-Augmented Generation
Spark
Multi-Cloud
Apache Flink
Cassandra
Presto
Oracle Cloud Infrastructure

Job description

As a Senior Staff Engineer , you will be the architect behind Uber's Elastic Compute , Unified Semantic Layer , and Self-Evolving Knowledge Platform . You will lead the transition from legacy on-premise systems to a Modern, Cloud-Native Ecosystem , designing solutions that handle exabytes of data. Whether it's optimizing Transactional Distributed Storage , or operationalizing Agentic Data Intelligence , your work will define the industry standard for high-performance infrastructure.

What You Will Do

  • Architect the Multi-Modal Fabric: Unify batch, streaming, and AI compute into one intelligent fabric, enabling real-time insights and trustworthy AI agents at a global scale.
  • Revolutionize Storage & Catalog: Drive the architecture for a unified catalog and metadata management service for unstructured data, leveraging native cloud object store capabilities.
  • Operationalize AI Intelligence: Partner with teams like QueryCopilot and DataIQ to bridge human validation with autonomous reasoning through agentic workflows.
  • Lead Storage Modernization: Evolve our massive-scale persistence layers-including Docstore (Transactional Distributed Storage) and Distributed MySQL-to increase resiliency and reduce operational overhead.
  • Open Source & Act as a force multiplier by contributing to the community (Hudi, Iceberg, Presto).

Requirements

  • 14+ Years of Engineering Excellence: Proven experience designing and operating world-class distributed data and storage systems.
  • Mastery of Storage Internals: Extensive storage experience is a must. You have deep expertise in:
  • Batch & Object Storage: HDFS, Cloud Object Storage (S3/GCS/OCI), and Blobstore metadata management.
  • Storage Optimization: Practical experience with Apache Hudi or Apache Iceberg for lakehouse architectures.
  • Transactional Systems: Experience with distributed transactional storage (e.g., Docstore, Google Spanner, TiDB).
  • NoSQL & Cache: Cassandra, Redis, and high-throughput Key-Value stores.
  • Data + AI Convergence: Deep understanding of how compute fabrics (Spark, Flink, Ray) integrate with vector databases and model-serving platforms.
  • Query Engine Proficiency: Architect-level knowledge of Presto, Trino, or Hive for large-scale analytical processing.
  • Systems Programming: Expert-level command of Java, Go, Scala, or C++ with a focus on performance tuning and distributed consensus., * Infrastructure at Scale: Experience designing AI infrastructure, including RAG (Retrieval-Augmented Generation) systems and high-bandwidth data loading for GPUs.
  • Distributed Relational Mastery: Hands-on experience with Sharded/Distributed MySQL (Vitess) and managing large-scale tabular data.
  • Multi-Cloud Vision: Demonstrated ability to build portable data solutions across OCI and GCP, optimizing for resource efficiency and intelligent scheduling.
  • SLA-Driven Reliability: Expertise in building observability, data freshness, and quality frameworks for Tier-0 mission-critical services.Strategic Influence: Proven ability to lead platform modernization, mentor Staff-level engineers, and influence long-term technical strategy across multiple organizations.

About the company

Uber's Data Platform is the heart of the company's critical decision-making and AI-driven intelligence. We are evolving from traditional data processing toward a Next-Generation Data Intelligence Platform -a unified, elastic fabric that collapses the silos between batch, streaming, and multimodal data.

Apply for this position