Senior Software Engineer, Infrastructure

OpenAI Inc.
Seattle, United States of America
1 month ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 325K

Job location

Seattle, United States of America

Tech stack

Query Performance
Distributed Systems
Data Ingestion
Reliability of Systems
Performance Monitor
Data Management

Job description

As a Senior Infrastructure Engineer on the Statsig team, you will build and scale the foundational infrastructure that powers OpenAI's experimentation and rollout tools.

You'll work on the distributed systems that deliver configuration decisions in real time, ingest massive volumes of experimentation data, and power analytics that help teams understand how product and model changes perform in production.

This role is deeply technical and focuses on performance, scalability, and reliability at extreme scale. You will design systems that support billions of feature evaluations, operate with strict latency requirements, and provide the data foundation for experimentation across OpenAI's product ecosystem.

In this role, you will:

Design and operate low-latency configuration delivery systems powering progressive rollouts across OpenAI's product suites.

Build highly scalable data ingestion and analytics infrastructure to support experimentation, product analytics, and feature performance monitoring.

Improve the performance, efficiency, and reliability of Statsig's core infrastructure, ensuring systems remain fast and stable as OpenAI's products scale globally.

Optimize query performance and data availability for experimentation and analytics workflows used by teams across OpenAI.

Lead large technical initiatives and shape the architecture of experimentation and rollout infrastructure used across the company.

Requirements

Have experience building large-scale distributed systems with strict performance and reliability requirements.

Enjoy solving low-latency systems problems, such as real-time configuration delivery, high-throughput ingestion pipelines, or large-scale analytics systems.

Have experience building and optimizing large-scale data platforms, event pipelines, etc.

Care deeply about system reliability, observability, and operational excellence in production environments.

Take ownership of complex technical problems end-to-end and enjoy building infrastructure that enables other teams to move faster.

Location: This role is based in Bellevue, WA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.

About the company

The Statsig team within OpenAI is responsible for the core experimentation, rollout, and analytics infrastructure that powers every layer of OpenAI's product development stack. Our systems enable teams across OpenAI to safely launch features, run experiments, and understand how product and model changes perform in the real world. Based out of OpenAI's Bellevue office, we are a tight-knit team that values in-person collaboration, moving quickly with a strong bias toward impact, and building systems that empower other builders across the company., OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity., At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Apply for this position