Staff Site Reliability Engineer, Core AI Infrastructure

Coinbase, Inc.
Charlotte, United States of America
2 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior
Compensation
$ 257K

Job location

Charlotte, United States of America

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Amazon Web Services (AWS)
Systems Engineering
Bash
Cloud Computing
Cloud Computing Security
Configuration Management
Continuous Integration
Linux
Disaster Recovery
Python
Network Security
Network Architecture
Reliability Engineering
Ansible
Ruby
Scripting (Bash/Python/Go/Ruby)
Delivery Pipeline
GIT
Kubernetes
Information Technology
Build Tools
Puppet
Terraform
Software Version Control
Devsecops
Docker
Go

Job description

  • *AI-Driven Innovation: *Join a high-performing team of skilled engineers driving AI transformation at Coinbase. This role involves leading the development of scalable AI products with direct exposure to high-level executives, focusing on rapid ideation, execution, and delivering impactful solutions in a dynamic, incubator-style environment.
  • Partner with the Coinbase Infrastructure team to support and extend existing ci/cd frameworks to support IT services, including enterprise network platforms
  • Partner with security and compliance to build surveillance tooling into deployment pipelines
  • Design and implement automation to streamline overall operational IT support workflows
  • Action Kubernetes deployment, implementation, and support
  • Build a technological roadmap based on product requirements
  • Participate in on-call to support the AWS service deployment pipeline
  • Promote DevSecOps mentality and establish best practices to ensure top-tier cloud security
  • Set and maintain a standard of excellence for technical documentation across IT engineering
  • Participate in an operational environment with strict SLAs and managed incident response and disaster recovery strategies
  • Facilitate incident response, conduct root cause analysis and blameless retrospectives
  • Define metrics and design/implement automation opportunities based on monitoring/observability
  • Developing and maintaining integrations with other systems, such as source control and build systems
  • Troubleshooting and resolving technical issues with internal toolings, Depending on your location, the General Data Protection Regulation (GDPR) and California Consumer Privacy Act (CCPA) may regulate the way we manage the data of job applicants. Our full notice outlining how data will be processed as part of the application procedure for applicable locations is available here. By submitting your application, you are agreeing to our use and processing of your data as required. For US applicants only, by submitting your application you are agreeing to arbitration of disputes as outlined here.

Requirements

  • 10+ years experience supporting network infrastructure
  • 10+ years experience automating cloud infrastructure
  • Proficient in at least one scripting languages (Bash, python, Ruby, Go, etc)
  • Proficiency with version control using CI/CD (Git)
  • Strong experience supporting AWS services and CI/CD workflows using terraform or equivalent framework
  • Strong experience with configuration management systems like Terraform, Ansible, Chef, Puppet, or Salt
  • Strong experience with containers and containers orchestration like Docker and Kubernetes
  • Demonstrated ability to responsibly use generative AI tools and copilots (e.g., LibreChat, Gemini, Glean) in daily workflows, continuously learn as tools evolve, and apply human-in-the-loop practices to deliver business-ready outputs and drive measurable improvements in efficiency, cost, and quality

Nice to haves:

  • Expertise with linux, bash, ruby, python and/or go
  • Expertise automating EC2 or containers deployment with terraform
  • Strong network security fundamentals
  • Experience managing and leveraging log aggregation
  • Experience working in a highly regulated environment
  • Experience in a fast-paced, high-growth company
  • Experience in a Remote-first IT environment

About the company

At Coinbase, our mission is to increase economic freedom in the world. It's a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform - and with it, the future global financial system. To achieve our mission, we're seeking a very specific candidate. We want someone who is passionate about our mission and who believes in the power of crypto and blockchain technology to update the financial system. We want someone who is eager to leave their mark on the world, who relishes the pressure and privilege of working with high caliber colleagues, and who actively seeks feedback to keep leveling up. We want someone who will run towards, not away from, solving the company's hardest problems. Our work culture is intense and isn't for everyone. But if you want to build the future alongside others who excel in their disciplines and expect the same from you, there's no better place to be. While many roles at Coinbase are remote-first, we are not remote-only. In-person participation is required throughout the year. Team and company-wide offsites are held multiple times annually to foster collaboration, connection, and alignment. Attendance is expected and fully supported.

Apply for this position