GPU & Runtime Systems Engineer
Job description
As a GPU & Runtime Systems Engineer, you will design, build, and evolve secure sandboxed runtime environments for Kubernetes workloads, focusing on runtime isolation, performance, and security. You will integrate container runtimes, lightweight VMs, and virtualization technologies to support GPU-accelerated workloads in multi-tenant environments. Day-to-day, you'll develop GPU-aware sandboxing and scheduling strategies, optimize containerization and I/O performance for latency-sensitive workloads, and influence architectural decisions across Linux internals, container runtimes, virtualization layers, and GPU drivers.
Some of what you'll work on:
- Design and implement secure execution environments for containerized and virtualized workloads.
- Build GPU-aware scheduling, isolation, and resource management strategies for multi-tenant workloads.
- Optimize container, VM, and I/O performance across GPU-accelerated workloads.
- Conduct profiling, benchmarking, and performance tuning for runtime, virtualization, and GPU stacks.
- Contribute to architectural decisions across Linux internals, container runtimes, virtualization layers, and GPU drivers.
- Collaborate with security, platform, and infrastructure teams to define and implement runtime isolation and performance standards.
Requirements
- 3+ years of experience in systems, platform, infrastructure, or production engineering at scale.
- Strong hands-on experience with Kubernetes, container orchestration, and cloud-native architectures, including controllers, operators, or scheduling extensions.
- Experience designing, implementing, or operating secure execution environments (container runtimes, sandboxed workloads, or virtualized systems).
- Practical experience with lightweight virtualization and sandboxing technologies (e.g., Kata Containers, gVisor, KubeVirt, QEMU).
- Experience supporting GPU-accelerated workloads in multi-tenant environments, including GPU scheduling, isolation, device passthrough, mediated devices, or virtualization.
- Proficient in systems-oriented programming (Go, C/C++, Rust, Bash) with strong Linux internals knowledge.
- Skilled at diagnosing and resolving complex performance, reliability, or isolation issues across containers, VMs, and infrastructure.
- Experienced in profiling, benchmarking, and tuning performance across runtime, virtualization, and GPU stacks.
Preferred:
- Experience building systems for safely executing untrusted or sensitive workloads in shared environments.
- Familiarity with GPU drivers and low-level virtualization or I/O optimization techniques.
- Experience defining threat models and implementing runtime security policies in multi-tenant systems.
Wondering if you're a good fit? We believe in investing in our people, and value candidates who bring their own diverse experiences to our teams, even if you aren't a 100% skill or experience match. Here are a few qualities we've found compatible with our team. If some of this describes you, we'd love to talk.
- You love building high-performance systems that operate reliably under extreme scale and demand.
- You're curious about the intersection of security, virtualization, Kubernetes, and GPU infrastructure.
- You're an expert in reasoning about trade-offs between isolation, performance, and operability.
Benefits & conditions
The base salary range for this role is $139,000 to $242,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a comprehensive benefits program (all based on eligibility).
In addition to a competitive salary, we offer a variety of benefits to support your needs, including:
- Medical, dental, and vision insurance - 100% paid for by CoreWeave
- Company-paid Life Insurance
- Voluntary supplemental life insurance
- Short- and long-term disability insurance
- Flexible Spending Account
- Health Savings Account
- Tuition Reimbursement
- Ability to Participate in Employee Stock Purchase Program (ESPP)
- Mental Wellness Benefits through Spring Health
- Family-Forming support provided by Carrot
- Paid Parental Leave
- Flexible, full-service childcare support with Kinside
- 401(k) with a generous employer match
- Flexible PTO
- Catered lunch each day in our office and data center locations
- A casual work environment
- A work culture focused on innovative disruption
Our Workplace
While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration.