AIML - Staff ML Infrastructure Engineer
Role details
Job location
Tech stack
Requirements
Bachelor's degree in Computer Science, engineering, or a related field
6+ years of hands-on experience building scalable backend systems for training and evaluating machine learning models
Proficient in relevant programming languages, such as Python or Go
Strong expertise in distributed systems, reliability and scalability, containerization, and cloud platforms
Proficient in cloud computing infrastructure and tools: Kubernetes, Ray, PySpark
Ability to clearly and concisely communicate technical and architectural problems, while working with partners to iteratively find
Advanced degree in Computer Science, engineering, or a related field
Proficient in working with and debugging accelerators, such as GPUs, TPUs, and AWS Trainium
Proficient in ML training and deployment frameworks, such as JAX, TensorFlow, PyTorch, TensorRT, and vLLM