SRE/Dev Ops Engineer (Hybrid, Sunnyvale)

CrowdStrike
Sunnyvale, United States of America
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Sunnyvale, United States of America

Tech stack

API
Application Release Automation
Software as a Service
Computer Security
Computer Networks
Continuous Integration
Data as a Services
DevOps
Disaster Recovery
Distributed Systems
Github
NoSQL
Open Source Technology
Performance Tuning
Reliability Engineering
Prometheus
Software Engineering
Workflow Management Systems
Pulumi
Istio
Grafana
Multi-Cloud
Reliability of Systems
Kubernetes
Linkerd (Service Mesh)
Terraform
Dynatrace
Jenkins

Job description

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn't changed - we're here to stop breaches, and we've redefined modern security with the world's most advanced AI-native platform. We work on large scale distributed systems, processing almost 3 trillion events per day and this traffic is growing daily. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We're also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We're always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.

About the Role:

CrowdStrike's engineering organization depends on shared infrastructure platforms that power critical product capabilities. These platforms need dedicated engineering ownership to operate reliably, scale safely, harden for security, and mature into self-service capabilities that teams can depend on.

You'll own production infrastructure that spans multiple cloud providers and regions, serving engineering teams across the organization. The work is equal parts platform engineering and operational excellence - building automation, hardening security, establishing governance, and enabling consuming teams to adopt these platforms effectively. You'll also help shape what comes next as the team's scope grows.

We're hiring at all seniority levels - scope and compensation adjust accordingly.

What You'll Do:

  • Run production infrastructure - Deploy, upgrade, and maintain platform services across multiple clouds and regions on Kubernetes.
  • Build and maintain CI/CD pipelines - Make it safe and fast to ship infrastructure changes using GitOps workflows and release automation.
  • Build control planes - Create the APIs and tooling that make provisioning and scaling repeatable and self-service.
  • Own capacity planning - Track usage, forecast growth, right-size clusters, and keep infrastructure costs in check.
  • Build observability - Set up metrics, dashboards, and alerts using Prometheus and Grafana. Write runbooks that make on-call clear and actionable.
  • Own on-call and incidents - Join the on-call rotation, resolve issues, write postmortems, and turn repeat problems into automation.
  • Automate everything - Deployments, upgrades, certificate rotations, failover. If you do it by hand more than once, automate it.
  • Driving system reliability by blending software engineering principles with AI-driven automation, moving from reactive firefighting to proactive, automated operations.
  • Harden security - set up auth, encryption, secret rotation, and network policies. Keep dependencies patched and CVEs resolved.
  • Own disaster recovery - Build backup strategies, test failover, and make sure platforms can survive infrastructure failures.
  • Enable other teams - Provide templates, patterns, and direct support to help engineering teams use platforms reliably.
  • Collaborate across teams - Collaborate with Infrastructure, SRE, and Data Services on shared operational problems.

Requirements

  • 8+ years in DevOps, SRE, or platform engineering.
  • Hands-on experience running stateful distributed systems on Kubernetes in production.
  • CI/CD experience - Building and owning pipelines using GitHub Actions, Jenkins, Tekton, or similar tools.
  • Infrastructure-as-code skills - Terraform, Pulumi, or Crossplane, no manual configuration.
  • GitOps experience - ArgoCD or Flux for managing infrastructure deployments.
  • Observability skills - Prometheus, Grafana, and distributed tracing tools like Jaeger or OpenTelemetry.
  • Database operations - Backup, restore, schema management, and performance tuning for relational and NoSQL databases.
  • Security mindset - You implement auth, encryption, secret management, and network policies as part of normal work.
  • Multi-cloud or multi-region experience - you have managed infrastructure across providers or regions.
  • Able to work in our Sunnyvale office 2+ days per week

Bonus Points:

  • Experience running security platforms or telemetry pipelines at large scale.
  • Experience building internal developer platforms and self-service tooling.
  • Familiarity with service mesh tools like Istio or Linkerd.
  • Experience running workflow orchestration platforms like Temporal or Argo Workflows.
  • Experience running distributed tracing or telemetry infrastructure at scale.
  • Experience with disaster recovery automation for stateful systems.
  • Background at a cybersecurity or high-availability SaaS company.
  • Contributions to open-source projects or the broader tech community.
  • Go proficiency - our platforms and services are primarily written in Go.

About the company

Benefits of Working at CrowdStrike: * Market leader in compensation and equity awards * Comprehensive physical and mental wellness programs * Competitive vacation and holidays for recharge * Paid parental and adoption leaves * Professional development opportunities for all employees regardless of level or role * Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections * Vibrant office culture with world class amenities * Great Place to Work Certified across the globe CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program.

Apply for this position