Site Reliability Engineer - Tactical Reconnaissance & Strike

Anduril Industries
Costa Mesa, United States of America
4 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Intermediate
Compensation
$167K

Job location

Costa Mesa, United States of America

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Confluence
JIRA
Azure
Cloud Computing
Configuration Management
Collaborative Software
Computer Security
Nvidia CUDA
Continuous Integration
Ghost (Backup Software)
Github
Hardware Design
Monitoring of Systems
Python
OpenCL
Reliability Engineering
Ansible
Software Engineering
Systems Integration
Data Logging
Google Cloud Platform
System Availability
Grafana
Parallel Computation
GIT
Containerization
Kubernetes
Infrastructure Automation Frameworks
Information Technology
Data Analytics
Terraform
Software Version Control
Data Pipelines
Docker
Vulnerability Analysis
Artifactory

Job description

Anduril's Tactical Recon & Strike (TRS) is a division with two missions: 1) build highly capable autonomous drones, and 2) build solid rocket motors at scale. We transform products like Ghost, Anvil, Bolt, and Altius from early concepts into fully operational capabilities by partnering closely with specialist engineering, operations, and production teams. Through our Anduril Rocket Motor Systems (RMS) team, we design and manufacture solid rocket motors using advanced materials, proprietary formulations, and high-volume production methods, delivering safe, reliable propulsion systems that support a wide range of mission requirements. TRS hires software and hardware engineers who are excited to build across a diverse and powerful portfolio, from autonomous aerial systems to high-performance solid rocket motors. Our teams contribute to highly capable autonomous robotics systems and propulsion products that operate reliably in the most demanding mission environments.

About The Job

As a Site Reliability Engineer you will be responsible for deploying, integrating, and managing customer and developmental cloud environments across TRS. This role requires a systems-thinking engineer who can bridge software development, platform engineering, and mission operations to ensure seamless integration of new capabilities while improving production scalability and maintaining reliability. The ideal candidate will own the end-to-end lifecycle of cloud-based deployments, drive continuous improvement of data pipelines and observability infrastructure for TRS's growing drone fleets, and identify opportunities to leverage emerging platform services to enhance system performance and data quality. This position will also play a critical role in scaling integration best practices and building out functional capabilities across additional TRS product lines.

What You Will Do

  • Cloud Deployment & Environment Management: Own and execute customer and developmental cloud deployments across TRS product lines, ensuring reliable configuration management, version control, and seamless promotion of releases from development through production environments.
  • Anduril Platform Services Integration: Evaluate, prototype, and integrate emerging platform capabilities (such as RDF and MissionSim) and third-party services (such as Arena AI and AFATDS/AXS) to improve data discoverability, consistency, and analytical capabilities across TRS systems.
  • Data Pipeline & Observability Infrastructure: Maintain and enhance existing data pipelines, metrics frameworks, and monitoring solutions including Grafana and Nominal; ensure high availability, data quality, and actionable insights for engineering and operations teams.
  • Field Support & Operational Testing: Collaborate directly with field operation teams during feature rollouts to conduct real-world testing, troubleshoot issues in operational environments, gather actionable feedback to inform system improvements and ensure mission success, and enable customer self-serve provisioning of environments.
  • Cross-Product Line Expansion: Partner with leadership to establish integration engineering functions and best practices across all TRS product lines, developing reusable patterns, documentation, and tooling that accelerate deployment capabilities and operational maturity.

To ensure your safety and help you navigate your job search with confidence, please keep the following critical points in mind:

  • No Financial Requests: Anduril will never solicit payment or demand personal financial details (such as banking information, credit card numbers, or social security numbers) at any stage of our hiring process. Our legitimate recruitment is entirely free for candidates.

Requirements

  • Bachelor's degree in Computer Science or another STEM-focused field.
  • Advanced proficiency in programming languages (Python for scripting and integration).
  • 3+ years of experience with CI/CD tools like GitHub Actions, JFrog Artifactory, and Git.
  • Proficiency with IaC tools (Terraform, Ansible).
  • 3+ years of experience with cloud platforms (Azure, AWS, GCP).
  • Proficiency in containerization (Docker) and container orchestration (Kubernetes).
  • Experience with logging and monitoring tools (Nominal and Grafana).
  • Understanding of parallel computing frameworks (CUDA, OpenCL).
  • Strong collaboration skills and proficiency with collaborative tools (JIRA, Confluence).
  • Eligible to obtain and maintain an active U.S. Secret security clearance.

  • Master's or other advanced STEM degree
  • Technical expertise and demonstrated performance in one or more of the following areas: networking, cloud technologies, application development, hardware design, and/or cybersecurity
  • Minimum of 5 years of operations and engineering experience

Benefits & conditions

The salary range for this role is an estimate based on a wide range of compensation factors, inclusive of base salary only. The actual salary offer may vary based on (but not limited to) work experience, education and/or training, critical skills, and/or business considerations. Highly competitive equity grants are included in the majority of full-time offers and are considered part of Anduril's total compensation package. Additionally, Anduril offers top-tier benefits for full-time employees. At Anduril, we invest in our people. Our comprehensive, competitive benefits package (available at little to no cost to employees) ensures you're supported in health, recovery, and whatever comes next.

About the company

Anduril Industries is a defense technology company with a mission to transform U.S. and allied military capabilities with advanced technology. By bringing the expertise, technology, and business model of the 21st century's most innovative companies to the defense industry, Anduril is changing how military systems are designed, built, and sold. Anduril's family of systems is powered by Lattice OS, an AI-powered operating system that turns thousands of data streams into a real-time, 3D command and control center. As the world enters an era of strategic competition, Anduril is committed to bringing cutting-edge autonomy, AI, computer vision, sensor fusion, and networking technology to the military in months, not years.
