Software Engineer - Linux Kernel / GPU Monitoring
Role details
Job location
Tech stack
Job description
The eBPF Platform team owns the shared eBPF infrastructure inside the Datadog Agent, and is responsible for its reliability, performance, and evolution across a wide variety of Linux distributions and kernel versions. We build tooling and agent functionality for product teams utilizing eBPF (Network Performance Monitoring, Universal Service Monitoring, Cloud Workload Security, GPU Monitoring), enable new teams exploring eBPF, and centralize deep kernel expertise across the organization. The team contributes to open source projects such as btfhub and cilium/ebpf.
Datadog is investing heavily in GPU Monitoring to give customers deep visibility into GPU utilization, health, and performance across their infrastructure. The eBPF Platform team builds the agent-side foundation that makes this possible, from eBPF programs that capture GPU activity at the kernel level, to the metrics pipelines and validation infrastructure that ensure data quality at scale.
In this role, you will work at the intersection of eBPF, the Linux kernel, and GPU infrastructure. You'll contribute to GPU Monitoring capabilities within the Datadog Agent while also working across the broader eBPF platform, investigating production incidents, improving reliability, and helping shape the architecture of one of the most widely deployed eBPF solutions in the industry.
At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them.
What Youâll Do:
- Contribute to GPU Monitoring feature development end-to-end, from ideation to implementation within the Datadog Agent
- Build and maintain shared eBPF functionality for product teams to use in their eBPF-based products
- Investigate and debug complex production issues that span the kernel, eBPF programs, and agent runtime
- Research, prototype, develop, and document solutions to hard problems in the eBPF and GPU monitoring space
- Provide technical input to product and architecture decisions
Requirements
- You have experience working with and developing in the Linux kernel, either using eBPF, as a kernel developer, or building low level programs in C
- You have significant experience building high throughput/low latency systems
- You have been building applications and tooling for 5+ years and know the systems youâve worked on in depth
- You are motivated by solving problems for our customers first and foremost and building beautiful products that our customers love
- You have ability to work effectively in a team
Benefits & conditions
- New hire stock equity (RSUs) and employee stock purchase plan (ESPP)
- Continuous professional development, product training, and career pathing
- Intra-departmental mentor and buddy program for in-house networking
- An inclusive company culture, ability to join our Community Guilds
- Access to Inclusion Talks, our Internal panel discussions
- Free, global Spring Health benefits for employees and dependents age 6+
- Competitive global benefits
Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog.