Technical Operations Engineer, Web3 Core Platforms

QuickNode
1 month ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English

Job location

Remote

Tech stack

Bash
Border Gateway Protocol
Continuous Integration
Software Debugging
Linux
Distributed Systems
DNS
JSON
Python
Log Analysis
Ansible
Blockchain
Prometheus
Grafana
Kubernetes
Web3.js
Terraform

Job description

Make mainnet boring. We launch chains, keep them fast, and fix the stuff that shouldn't break. You'll run K8s at scale, ship IaC, lead incidents, and write the tooling that keeps production sane. If you can occasionally patch a client or upstream a small fix, even better.

What We Actually Need

  • Linux + Kubernetes: ****you debug real production-networking, storage, rollouts, perf.
  • IaC (Terraform/Helm/Ansible): you ship repeatable infra, not snowflakes.
  • Go or Python (plus Bash): ****you automate noise away and build small, sharp tools.
  • Blockchain operations: JSON-RPC internals; running/tuning RPC/validator nodes; log analysis.
  • Observability: SLOs/error budgets, Prometheus/Grafana instrumentation that people use.
  • Networking: strong fundamentals (DNS/TLS/LB); you can reason about anycast/BGP when needed.

What You'll Do

  • Launch & upgrade chains: standups, hard-forks, snapshots, pruning, clean rollbacks.
  • Automate everything: modules, golden images, CI/CD, zero-touch deploys across regions.
  • Run K8s at scale: safe rollouts, HPA/VS/ingress tuning, capacity & cost plans that hold up.
  • Own incidents: lead SEV0-2, publish RCAs, and ship the change that prevents a replay.
  • Build signal, not noise: SLOs/error budgets, useful dashboards, alerts that only page when users hurt.
  • Code where it counts: write/extend tools (snapshots, replay/load, state sync checks); occasionally patch client bugs that bite production and upstream when it's worth it.

Why This Role Stands Out

  • Impact: your work lights up production; chain launches, reliability wins, performance gains.
  • Pace > Ceremony: tight reviews, practical standards, minimal meetings, async-first.
  • Growth: own big surfaces; learn protocol internals while leveling up distributed systems chops.
  • Remote-first: follow-the-sun coverage; humane on-call.
  • Compensation & benefits: region-aligned, bonus-eligible and shared early; no bait-and-switch.

Requirements

Do you have experience in Terraform?

Benefits & conditions

  • SLO thinking with before/after on alert noise, latency, or MTTR.
  • Real RPC/validator ops experience (beyond a laptop demo).
  • Tooling/code that made ops better (Go/Python, IaC modules, or a small upstream fix).
  • You explain complex failures simply, and leave systems simpler.
  • Production experience with one or more of EVM (Geth/Erigon/Nethermind/Besu), Cosmos SDK/CometBFT, Solana (Agave/QUIC), or Substrate.
  • Multi-cloud experience plus capacity & cost modeling that survived contact with reality.

Process

Steps may vary slightly by region/seniority; we keep it lean.

  • 30-min intro + context with Talent Acquisition
  • 60-min technical deep dive with the Hiring Manager (incidents you led, K8s/IaC trade-offs, tooling you built)
  • 60-min hands-on with Team Members (pair on a small plan/code review or targeted tool fix)
  • 30-min meet a Founder

New Hire Onboarding: Week 1-12

  • Week 1-2: ship a small IaC change + a dashboard someone actually uses.
  • Week 3-6: own 2-3 chains/subsystems, kill a flaky alert, close a recurring papercut.
  • Week 7-12: lead one upgrade/cutover with clean rollback; publish an RCA and the code/infrastructure change that prevents recurrence.

The QuickNode compensation philosophy includes pillars to ensure fair and unbiased compensation for all employees. To design and deliver total reward offerings that are employee-centric. To offer a competitive benefit package in all locations where we operate. To prioritize attracting and retaining the best talent globally. To maintain a high-performing and flexible way of working.

About the company

QuickNode is a cloud-based infrastructure company that powers the blockchain ecosystem. Our mission is to be the indispensable utility that empowers companies and innovators globally to build next-generation, Web3 enabled businesses & applications using blockchain technology. QuickNode is backed by some of the world's best investors including Tiger Global, Y Combinator, SoftBank, and the Seven Seven Six Fund. The QuickNode team has over 120 people maintaining high performance global data infrastructure for amazing customers serving billions of requests daily. We are a global remote company with an HQ in Miami, Florida. Location Remote (with regional coverage for 24hr operations). Limited travel may be required for conferences or offsites, generally less than 10 days per year.

Apply for this position