Software Engineer - Linux Kernel / GPU Monitoring

Datadog
Municipality of Madrid, Spain
17 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Municipality of Madrid, Spain

Tech stack

Artificial Intelligence
Software Debugging
Linux kernel
Datadog
Software Application Programming
Low Latency

Job description

The eBPF Platform team owns the shared eBPF infrastructure inside the Datadog Agent, and is responsible for its reliability, performance, and evolution across a wide variety of Linux distributions and kernel versions. We build tooling and agent functionality for product teams utilizing eBPF (Network Performance Monitoring, Universal Service Monitoring, Cloud Workload Security, GPU Monitoring), enable new teams exploring eBPF, and centralize deep kernel expertise across the organization. The team contributes to open source projects such as btfhub and cilium/ebpf.

Datadog is investing heavily in GPU Monitoring to give customers deep visibility into GPU utilization, health, and performance across their infrastructure. The eBPF Platform team builds the agent-side foundation that makes this possible, from eBPF programs that capture GPU activity at the kernel level, to the metrics pipelines and validation infrastructure that ensure data quality at scale.

In this role, you will work at the intersection of eBPF, the Linux kernel, and GPU infrastructure. You'll contribute to GPU Monitoring capabilities within the Datadog Agent while also working across the broader eBPF platform, investigating production incidents, improving reliability, and helping shape the architecture of one of the most widely deployed eBPF solutions in the industry.

At Datadog, we place value in our office culture - the relationships and collaboration it builds and the creativity it brings to the table. We operate as a hybrid workplace to ensure our Datadogs can create a work-life harmony that best fits them.

What You’ll Do:

  • Contribute to GPU Monitoring feature development end-to-end, from ideation to implementation within the Datadog Agent
  • Build and maintain shared eBPF functionality for product teams to use in their eBPF-based products
  • Investigate and debug complex production issues that span the kernel, eBPF programs, and agent runtime
  • Research, prototype, develop, and document solutions to hard problems in the eBPF and GPU monitoring space
  • Provide technical input to product and architecture decisions

Requirements

  • You have experience working with and developing in the Linux kernel, either using eBPF, as a kernel developer, or building low level programs in C
  • You have significant experience building high throughput/low latency systems
  • You have been building applications and tooling for 5+ years and know the systems you’ve worked on in depth
  • You are motivated by solving problems for our customers first and foremost and building beautiful products that our customers love
  • You have ability to work effectively in a team

Benefits & conditions

  • New hire stock equity (RSUs) and employee stock purchase plan (ESPP)
  • Continuous professional development, product training, and career pathing
  • Intra-departmental mentor and buddy program for in-house networking
  • An inclusive company culture, ability to join our Community Guilds
  • Access to Inclusion Talks, our Internal panel discussions
  • Free, global Spring Health benefits for employees and dependents age 6+
  • Competitive global benefits

Benefits and Growth listed above may vary based on the country of your employment and the nature of your employment with Datadog.

Apply for this position