Software Engineer - Observability Platform (Golang / Kubernetes)

Roku's Cloud Technology Infrastructure
1 month ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Tech stack

Amazon Web Services (AWS)
Azure
Databases
Distributed Data Store
Distributed Systems
Open Source Technology
Prometheus
Software Engineering
Parquet
Google Cloud Platform
Grafana
Kubernetes
Data Pipelines
Go

Job description

You will work on core observability systems (metrics, logs, traces) while also developing robust data pipelines and storage solutions optimized for high throughput, performance, and cost. You'll leverage technologies such as time-series databases, columnar storage formats (e.g., Parquet), and distributed data processing frameworks to advance the platform's capabilities. Collaboration with cross-functional teams is critical, as you'll integrate observability into Roku's cloud-native stack and contribute improvements back to the open-source community.

  • Extend and integrate open-source observability systems, and when needed, structurally overhaul core components, such as storage layers and query paths, to improve performance, reliability, and usability of these tools at scale.
  • Build services to improve performance, usability, reliability, and cost efficiency.
  • Implement features like pre-aggregation, downsampling, and sampling to reduce load and accelerate queries.
  • Create developer-facing capabilities covering usage, data quality, and cost management for metrics, logs, and traces.
  • Automate onboarding, dashboards, alerting, and tracing.
  • Collaborate across platform and infrastructure teams to integrate observability into Roku's cloud-native stack.

Requirements

  • Extensive experience in software engineering building distributed, high-throughput systems or observability platforms.
  • Hands-on Go experience; our observability ecosystem is Go-based, making it the most effective language for this role.
  • Experience with, or strong interest in, observability tools (Prometheus, Grafana, Loki, Tempo, ELK/OpenSearch, ClickHouse) and standards (OpenTelemetry, OpenTracing, OpenMetrics).
  • Deep understanding of distributed systems and data models.
  • Hands-on experience with Kubernetes and cloud platforms (AWS, GCP, Azure).

Benefits & conditions

Roku is committed to offering a diverse range of benefits as part of our compensation package to support our employees and their families. Our comprehensive benefits include global access to mental health and financial wellness support and resources. Local benefits include statutory and voluntary benefits which may include healthcare (medical, dental, and vision), life, accident, disability, commuter, and retirement options (401(k)/pension). Our employees can take time off work for vacation and other personal reasons to balance their evolving work and life needs. It's important to note that not every benefit is available in all locations or for every role. For details specific to your location, please consult with your recruiter.

About the company

Teamwork makes the stream work. Roku is changing how the world watches TV. Roku is the TV streaming platform in the U.S., Canada, and Mexico, and we aim to power every television in the world. Our mission is to be the TV streaming platform that connects the entire TV ecosystem, linking consumers to content, enabling publishers to build and monetize large audiences, and providing advertisers unique capabilities to engage consumers.

From your first day at Roku, you'll make a valuable - and valued - contribution. We are a fast-growing public company where no one is a bystander. You will have the opportunity to delight millions of TV streamers worldwide while gaining meaningful experience across a variety of disciplines.

Are you passionate about building scalable, high-performance systems that process massive amounts of data? Do you thrive on designing and implementing innovative solutions to empower engineering teams with actionable insights? Are you excited to advance open-source observability at a massive scale? Join us to extend (open source) observability tools and build new capabilities that help teams manage data better and get actionable insights.

About the Team

The Observability team is part of Roku's Cloud Technology Infrastructure organisation and plays a critical role in our platform. We are a high-performing, fast-moving international team that thrives on ownership, effective communication, and delivering impactful engineering solutions. Our mission is to advance Roku's observability platform, which operates at an impressive scale by ingesting terabytes of data daily and managing hundreds of millions of active series. We focus on building scalable, performant systems to meet Roku's unique needs. By leveraging open-source tools, CNCF-supported projects, and custom solutions, we enable engineering teams across Roku to monitor, debug, and optimise their applications with ease.

We prioritise a results-driven culture rooted in ownership, accountability, and fast iteration, and we emphasise solving meaningful engineering problems, collaboration, and continuous improvement of how we work as a team.

Roku is a fast-paced place where everyone focuses on the company's success. We value people who are great at their jobs, easy to work with, and humble. We appreciate a sense of humor. We believe a few very talented folks can do more for less cost than a larger number of less talented teams. We are independent thinkers with big ideas who act boldly, move fast, and accomplish extraordinary things through collaboration and trust. Roku is a company changing how the world watches TV.
