ChatGPT Performance Engineer

OpenAI Inc.
Glastonbury, United States of America
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 325K

Job location

Remote
Glastonbury, United States of America

Tech stack

API
Artificial Intelligence
Profiling
Databases
Distributed Systems
Memory Management
Python
Performance Tuning
Software Engineering
Performance Testing
GPT
Go

Job description

We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth., OpenAI is looking for an experienced Performance Engineer to help us scale the performance, reliability, and efficiency of our systems. In this role, you'll apply deep technical expertise to optimize infrastructure and application-level performance across mission-critical products like ChatGPT and our developer API. You'll work cross-functionally with teams building core services, training models, and developing real-time user experiences to push our latency, throughput, and cost-efficiency to the next level., * Develop tooling and metrics that provide deep observability into system performance.

  • Collaborate closely with infra, platform, training, and product teams to identify key performance goals and drive systemic improvements.
  • Influence architecture and design decisions to prioritize latency, throughput, and efficiency at scale.
  • Lead investigations into high-impact performance regressions or scalability issues in production.
  • Drive performance testing strategies and help define SLAs/SLOs around latency and throughput for critical systems.

Requirements

We are looking for engineers who thrive in ambiguous environments, value deep systems understanding, and are motivated by delivering measurable impact. This is a highly technical, individual contributor role focused on root-cause analysis, profiling, instrumentation, and architecture-level performance improvements across our stack., * Have 7+ years of experience in software engineering with a strong track record in performance or reliability of high-scale distributed systems.

  • Are deeply comfortable with performance profiling tools and tracing systems.
  • Have experience optimizing performance across one or more layers of the stack (e.g., database, networking, storage, application runtime, GC tuning, Python/Golang internals, GPU utilization).
  • Have a strong understanding of OS internals, scheduling, memory management, and IO patterns.
  • Have contributed to observability, benchmarking, or performance-focused infrastructure at scale.
  • Have demonstrated success navigating ambiguity and aligning stakeholders around performance goals.
  • Value simplicity, rigor, and collaboration when solving complex systems problems.

About the company

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity., At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Apply for this position