Principal Solutions Architect

Nebius Group
3 days ago

Role details

Contract type
Permanent contract
Employment type
Full-time (> 32 hours)
Working hours
Regular working hours
Languages
English
Experience level
Senior

Job location

Remote

Tech stack

Artificial Intelligence
Amazon Web Services (AWS)
Azure
Cloud Computing
Configuration Management
Nvidia CUDA
Distributed Systems
General-Purpose Computing on Graphics Processing Units
Python
Machine Learning
Performance Tuning
Ansible
TensorFlow
Google Cloud Platform
PyTorch
Deep Learning
Kubernetes
Slurm
Machine Learning Operations
Terraform

Job description

We are seeking an experienced, customer-obsessed Principal Solutions Architect to lead technical engagement with Nebius' most strategic customers across multiple industry segments. This is a senior, high-impact role focused on complex presales, large-scale ML workloads, and end-to-end customer solution ownership - from early discovery through production adoption. As a Principal Solutions Architect, you will act as a trusted technical partner for executive stakeholders, AI/ML leaders, and platform teams. You will shape solution strategy, drive complex PoCs, influence product direction, and ensure successful adoption of Nebius AI cloud for mission-critical ML/AI workloads., * Own end-to-end technical presales and solution delivery for strategic and enterprise customers, from initial discovery through PoC, architecture design, and production readiness.

  • Serve as a trusted technical advisor to senior customer stakeholders (CTO, Head of ML, Platform Engineering, AI Infrastructure teams).
  • Lead complex ML/AI infrastructure and MLOps architectures, including large-scale training and inference workloads on GPU-accelerated cloud platforms.
  • Design, validate, and document reference architectures, Infrastructure-as-Code solutions, and best-practice deployment patterns using Nebius AI.
  • Drive and execute advanced PoCs, workshops, architecture reviews, and executive-level presentations to demonstrate value and accelerate customer adoption.
  • Partner closely with Sales, Product, Engineering, and Support to represent real-world customer requirements and influence product roadmap decisions.
  • Act as a single point of technical authority for key customer scenarios across internal teams (product, support, marketing).
  • Support strategic marketing initiatives including conferences, hackathons, webinars, customer case studies, and technical thought leadership.
  • Mentor and provide technical leadership to other Solutions Architects, helping raise the overall technical bar of the organization.

Requirements

  • 10+ years of experience in senior technical roles such as Solutions Architect, Systems Architect, ML Platform Engineer, or similar - with significant customer-facing responsibilities.
  • Proven track record of end-to-end delivery of complex presales engagements, including discovery, solution design, PoCs, and production transition.
  • Deep hands-on experience with large-scale ML-based workloads, including GPU training and inference at scale.
  • Strong expertise in cloud infrastructure and MLOps, including Kubernetes-based platforms and distributed systems.
  • Solid hands-on experience with Infrastructure as Code and configuration management (Terraform, Ansible), and strong Python skills.
  • Deep understanding of GPU computing stacks for ML/AI workloads (drivers, CUDA, libraries, performance optimization).
  • Exceptional communication skills - able to clearly articulate complex technical concepts to both technical and executive audiences.
  • A highly customer-centric mindset with the ability to balance customer needs, technical feasibility, and product strategy.

It will be an added bonus if you have:

  • Prior experience as a Senior AI/ML Specialist Solutions Architect, Technical ML Product Manager, or in a similar senior AI/ML-focused role.
  • Hands-on experience with HPC and ML orchestration frameworks (e.g., Slurm, Kubeflow).
  • Practical experience with deep learning frameworks such as PyTorch and TensorFlow.
  • Strong understanding of the cloud ML ecosystem across major providers (NVIDIA, AWS, Azure, Google Cloud).
  • Experience influencing product direction based on customer feedback and real-world usage patterns.Experience leading and contributing to end-to-end large scale ML deployment projects on all layers of the stack.

Benefits & conditions

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.

We're growing and expanding our products every day. If you're up to the challenge and are excited about AI and ML as much as we are, join us!

About the company

Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field. Where we work Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 800 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team.

Apply for this position