Product Manager - HPC Software (Infrastructure & Storage)

The Salt Company Inc
Piedmont, United States of America
yesterday

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Piedmont, United States of America

Tech stack

Artificial Intelligence
BIOS
Firmware
Uptime
Product Management
Software Architecture
Project Management
Software Systems
AI Infrastructure
Extensible Firmware Interface
Information Technology

Job description

AI & HPC Software & Hardware Product Manager focused on defining and delivering software capabilities for firmware lifecycle management, rack-scale operations, and power/thermal optimization across HPC and AI infrastructure., * Lead and drive the end-to-end product strategy and roadmap for AI and HPC software management capabilities, including firmware, rack, and power management domains.

  • Define customer value propositions and business outcomes for reliability, uptime, energy efficiency, and operational simplicity across Cray EX, ProLiant, and AI Factory deployments.
  • Translate market and field requirements into product requirements for firmware lifecycle workflows (upgrade orchestration, rollback, policy controls, compliance tracking).
  • Own rack management software requirements spanning rack-level health, telemetry, inventory state, fault isolation, and automated remediation workflows.
  • Define and prioritize power management capabilities, including power capping, thermal-aware scheduling, workload-to-power policy alignment, and datacenter efficiency reporting.
  • Partner with engineering to align software architecture and release sequencing across iLO, system firmware, rack management controllers, and cluster management platforms.
  • Collaborate with PMM, sales, and solutions teams to create technical enablement content, reference architectures, and customer-facing narratives for power-efficient AI/HPC operations.
  • Drive lifecycle governance across plan, build, launch, sustain, and end-of-life phases, ensuring roadmap clarity and measurable adoption outcomes.
  • Establish KPI frameworks for product success, including firmware compliance rates, mean time to recovery, rack-level incident reduction, and power utilization effectiveness indicators.

Requirements

  • Bachelor's degree or equivalent in computer science, engineering, or related field of study.
  • MBA or advanced degree in computer science or engineering preferred.
  • 8+ years of work experience in software product management, systems management, infrastructure software, or a related field

Knowledge & Skills

  • Deep knowledge of firmware and platform software stacks (BMC, BIOS/UEFI, device firmware update flows, and lifecycle policy controls).
  • Strong understanding of rack-scale infrastructure management, including telemetry pipelines, alerting systems, inventory models, and remediation operations.
  • Expertise in power and thermal management concepts for AI and HPC systems, including performance-per-watt trade-offs and operational guardrails.
  • Strong cross-functional leadership skills to drive alignment across engineering, operations, support, sales, and marketing teams.
  • Ability to convert complex technical capabilities into clear product requirements, release plans, and customer value narratives.
  • Strong analytical and commercial acumen, including KPI design, business case development, and prioritization under constrained resources.

Apply for this position