Senior GPU Compiler Software Development Engineer
Job description
Work on GPU support for OpenAI/Triton, a language and compiler for writing highly efficient custom deep-learning primitives. Work with the open-source community to analyze, develop, test, and deploy performance improvements for neural networks implemented with Triton on GPUs with ROCm. Develop new features for the OpenAI/Triton project on GPUs, and support and optimize it. Communicate with other developers, customers, and project managers. Implement tests, write project documentation, and verify the system with unit, component, and functional tests.
Requirements
Must have
Strong C/C++ programming skills
Experience with compiler internals (LLVM, GCC, or any other)
Basic Python programming skills
Experience in performance analysis
Nice to have
Basic understanding of ML technologies
Experience with GPGPU (general-purpose GPU) computing (HIP, CUDA, OpenCL, etc.)
Experience with PyTorch
Experience with LLVM and MLIR compiler infrastructure, including implementing analyses or optimizations
Knowledge of ROCm infrastructure
Experience with CMake and make/ninja build systems
Understanding of GEMM performance fundamentals
Experience with Docker
Other
Languages
English: B2 Upper Intermediate
About the company
Luxoft, a DXC Technology Company (NYSE: DXC), is a digital strategy and software engineering firm providing bespoke technology solutions that drive business change for customers the world over. Luxoft uses technology to enable business transformation, enhance customer experiences, and boost operational efficiency through its strategy, consulting, and engineering services. Luxoft combines engineering excellence with deep industry expertise, specializing in automotive, financial services, travel and hospitality, healthcare, life sciences, media, and telecommunications.