HPC Solutions Architect in Dallas
Role details
Job location
Tech stack
Job description
We are seeking an HPC Solutions Architect to drive the technical design, integration, and delivery of high-performance computing (HPC) solutions supporting advanced compute workloads.
This is a highly technical, customer-facing role focused on designing scalable, high-performance architectures across compute, storage, networking, Kubernetes, and security domains. The position spans the full solution lifecycle-from requirements discovery and workload analysis through proof-of-concept, deployment, and ongoing optimization.
The ideal candidate brings deep expertise in HPC architectures, strong hands-on experience with performance tuning and system design, and the ability to translate complex customer requirements into scalable, production-ready solutions.
Key Responsibilities
Customer Engagement & Technical Discovery
-
Work directly with customers to understand HPC workload requirements, performance targets, and technical objectives
-
Lead technical discovery sessions to assess application characteristics, bottlenecks, and scalability challenges
-
Serve as a trusted technical advisor throughout the solution lifecycle
Solution Architecture & Design
-
Design and document end-to-end HPC architectures across compute (CPU/GPU), storage, networking, orchestration, and security
-
Recommend hardware and software solutions aligned with performance, scalability, and efficiency goals
-
Develop architecture blueprints, integration guides, and implementation plans
Performance Optimization & Workload Engineering
-
Support proof-of-concept and benchmarking initiatives to validate solution performance
-
Perform workload profiling, system tuning, and performance optimization
-
Conduct workload reviews to improve scalability, resilience, and efficiency
Implementation & Delivery
-
Provide technical guidance during deployment to ensure seamless integration into customer environments
-
Act as a liaison between customers and internal engineering, product, and operations teams
-
Support solution delivery from design through implementation and optimization
Cross-Functional Collaboration
-
Partner with engineering and product teams to incorporate customer feedback into platform improvements
-
Build relationships with vendors across GPU, networking, and storage ecosystems
-
Collaborate with internal teams to refine HPC solution offerings
Innovation & Technical Leadership
-
Stay current on HPC technologies including GPUs, accelerators, interconnects, and orchestration frameworks
-
Represent the organization in customer workshops, solution reviews, and technical presentations
-
Contribute to best practices, reference architectures, and reusable design patterns
Requirements
-
Proven experience in HPC solution architecture, systems integration, or large-scale distributed systems design
-
Strong expertise across:
-
GPU and CPU architectures (CUDA, NVIDIA ecosystem)
-
Workload schedulers such as Slurm and Kubernetes
-
High-performance networking (InfiniBand, RDMA, RoCE)
-
Distributed storage systems (Lustre, GPFS, Ceph, VAST)
-
Kubernetes and container orchestration for HPC workloads
-
Security integration including , encryption, and compliance
-
Strong Linux systems knowledge including tuning, performance profiling, and system-level optimization
-
Ability to translate customer requirements into detailed architecture and integration plans
-
Strong communication skills with experience leading workshops, technical reviews, and customer engagements
-
Experience working cross-functionally with engineering, product, and operations teams
-
Ability to present complex technical solutions to both technical and executive audiences
Experience
-
Experience delivering HPC or AI/ML workloads from design through deployment and optimization
-
Familiarity with containerized HPC environments (e.g., Kubernetes, Singularity)
-
Experience with automation and infrastructure deployment practices
-
Background in proof-of-concept delivery, benchmarking, and workload migration
-
Awareness of emerging HPC technologies including next- GPUs and interconnects
-
Bachelor's or Master's degree in Computer Science, Engineering, Physics, or related field
-
Relevant certifications such as AWS Solutions Architect, Azure Solutions Architect Expert, GCP Professional Cloud Architect, CCNP, RHCE, CKA, or CKS