Senior Software Engineer - Storage Infrastructure

Yahoo
Sunnyvale, United States of America
16 days ago

Role details

Contract type
Permanent contract
Employment type
Part-time (≤ 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 267K

Job location

Remote
Sunnyvale, United States of America

Tech stack

Query Performance
Java
API
Amazon Web Services (AWS)
Cloud Computing
Databases
Data Recovery
Data Security
Database Schema
Shard (Database Architecture)
Disaster Recovery
Distributed Data Store
Distributed Systems
Amazon DynamoDB
Python
Memcached
Open Source Technology
Operational Databases
Performance Tuning
Query Optimization
Redis
Distributed Caching
Software Engineering
SQL Databases
Transaction Processing (Computing)
Google Cloud Platform
Sql Optimization
Cloud Monitoring
Amazon Web Services (AWS)
Database Optimization
Caching
Reliability of Systems
Database Migration
Storage Technologies
Information Technology
Data Management

Job description

As a Senior Software Engineer, you will design and optimize the foundational storage layer powering our 2.5B+ profile dataset. Your work on Cloud Spanner schema design, Valkey (Redis-compatible) caching strategies, and multi-region replication ensures sub-10ms data access for APIs serving millions of requests per second, directly enabling hundreds of millions in annual advertising revenue.

You will build and maintain petabyte-scale storage infrastructure with 99.99% availability, implementing efficient read/write patterns, multi-region replication, and disaster recovery mechanisms. Your expertise in distributed databases and caching systems is critical to balancing performance, cost, and reliability at massive scale.

This role demands deep knowledge of Cloud Spanner internals, distributed caching architectures, and production database operations at scale. You will collaborate closely with API, Ingestion, and SRE teams to ensure optimal data access patterns while maintaining data durability and system reliability., * Design and optimize Cloud Spanner schemas for efficient profile storage, query patterns, and write throughput at 2.5B+ profile scale

  • Implement Valkey (Redis-compatible) caching strategies achieving sub-10ms read latency for hot data access patterns
  • Build multi-region Spanner replication and automated failover mechanisms ensuring 99.99% availability and disaster recovery
  • Optimize Spanner read/write throughput, reduce hot-spotting, and improve query performance through index design and query optimization
  • Implement comprehensive monitoring and alerting systems tracking storage health, latency percentiles (p50, p95, p99), capacity utilization, and cost
  • Collaborate with API team on efficient data access patterns, query optimization, and caching strategies for activation endpoints
  • Partner with Ingestion team on high-throughput write patterns, batch loading strategies, and schema evolution without downtime
  • Design backup, point-in-time recovery, and disaster recovery procedures for critical user profile data
  • Troubleshoot production storage issues including performance degradation, hot-spotting, lock contention, and capacity constraints
  • Work with SRE teams on capacity planning, autoscaling strategies, cost optimization, and infrastructure efficiency
  • Implement cache invalidation strategies, cache warming, and distributed caching patterns for consistent data access
  • Create comprehensive documentation for storage architecture, operational runbooks, disaster recovery procedures, and on-call playbooks

Requirements

  • Bachelor's degree in Computer Science, Engineering, or related technical field, * 5+ years software engineering experience building production systems
  • 3+ years hands-on experience with distributed databases or large-scale storage systems
  • 2+ years with GCP infrastructure (Spanner, Memorystore, Cloud Monitoring) or AWS equivalents (DynamoDB, ElastiCache)

Technical Skills

  • Strong proficiency in Java, Go, or Python for infrastructure and database tooling development
  • Hands-on experience with Cloud Spanner, CockroachDB, TiDB, or other distributed SQL databases
  • Experience with Redis, Valkey, Memcached, or other distributed caching systems in production
  • Deep understanding of distributed systems: consistency models (strong vs. eventual), replication strategies, consensus algorithms (Paxos, Raft)
  • SQL optimization skills and database schema design expertise including indexing strategies, partitioning, and query tuning
  • Familiarity with database performance tuning: profiling slow queries, analyzing execution plans, optimizing hot-spotting

Competencies

  • Strong performance tuning and troubleshooting abilities in distributed database environments
  • Demonstrated ability delivering reliable infrastructure solutions on schedule with minimal guidance
  • Excellent collaboration with infrastructure, application, and SRE teams
  • Team-level impact with ability to influence technical decisions within immediate team
  • Understanding of data durability, consistency guarantees, and operational excellence, * Experience with multi-region Cloud Spanner deployments at petabyte scale
  • Knowledge of cache invalidation strategies, cache coherence protocols, and distributed caching patterns
  • Prior experience in large-scale user data platforms, identity systems, or adtech storage infrastructure
  • Familiarity with database migration tools (gh-ost, pt-online-schema-change) and zero-downtime schema evolution
  • Understanding of data partitioning strategies, sharding, horizontal scaling, and distributed transaction processing
  • Experience with database backup and recovery tools, point-in-time recovery, and disaster recovery testing
  • Contributions to database or distributed systems open-source projects (Spanner clients, Redis modules, CockroachDB)
  • Self-driven, detail-oriented, excellent multitasking abilities in fast-paced environments

The material job duties and responsibilities of this role include those listed above as well as adhering to Yahoo policies; exercising sound judgment; working effectively, safely and inclusively with others; exhibiting trustworthiness and meeting expectations; and safeguarding business operations and brand integrity.

Benefits & conditions

The compensation for this position ranges from $128,250.00 - $266,875.00/yr and will vary depending on factors such as your location, skills and experience.The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions. Our comprehensive benefits include healthcare, a great 401k, backup childcare, education stipends and much (much) more.

About the company

Yahoo serves as a trusted guide for hundreds of millions of people globally, helping them achieve their goals online through our portfolio of iconic products. For advertisers, Yahoo Advertising offers omnichannel solutions and powerful data to engage with our brands and deliver results. About The Team Our platform is the foundational identity and data layer for 900M+ monthly active users, serving 2.5B+ profiles at massive scale. We are building a predictive, identity-centric insights engine-ensuring our audience is understood with precision to deliver hyper-personalized experiences and advertising solutions across all our digital properties. Our mission centers on first-party data strategy: capturing, enriching, and activating audience signals to build a 360-degree view of every user. We operate under a Privacy-by-Design philosophy, adhering to global regulations (GDPR, CCPA) and industry security standards, while leveraging a cloud-native stack across GCP (BigQuery, Spanner, Dataflow, Composer, GKE) and AWS, with modern MLOps practices to deliver measurable business impact.

Apply for this position