Sr AI Engineer - MLOps
Job description
The primary purpose of this role is to develop an artificial intelligence (AI) platform that supports a wide array of machine learning (ML) models, including sophisticated deep learning frameworks and large language models (LLMs). The role covers scaling model performance, building essential tools and frameworks, and managing compute and storage resources. It involves close collaboration with cross-functional teams to identify new opportunities that use the AI platform's capabilities across different domains to accelerate AI-infused product development.
Roles & Responsibilities:
- Scales the platform for high performance and integrates new AI capabilities as APIs to ensure the platform remains adaptable and efficient in hosting a variety of ML models.
- Designs, develops, and implements tools and frameworks that support ML experimentation and deployment.
- Manages GPU and CPU resources to optimize the execution of AI models, so that the platform runs efficiently while balancing performance with cost-effectiveness.
- Works closely with data scientists to integrate AI models smoothly into the platform.
- Creates and manages efficient data movement and pipelines so that the AI platform operates smoothly. Optimizes data flows to support the demands of high-velocity AI model training and inference.
- Analyzes platform performance metrics and user feedback to drive continuous improvement initiatives. Utilizes insights to guide platform enhancements, ensuring the AI platform remains at the forefront of technological advancements and user satisfaction.
- Collaborates effectively with diverse teams, integrating technical expertise with business insights and user needs.
- Implements security protocols and governance measures for the AI platform, ensuring data integrity and compliance with industry standards and best practices.
Requirements
Years of Experience:
6-8 years of experience
Education Qualification & Certifications:
Bachelor's Degree (Science, Technology, Engineering, Math or related field)
Skill Set Required: Primary Skills (must have):
- Experience in AI/ML Platform Engineering, Data, and ML Operations tools and frameworks.
- Experience working with GPU and CPU infrastructure, optimizing ML models for performance.
- Programming experience in Python or equivalent. Experience working with Continuous Integration/Continuous Deployment (CI/CD) tools.
- Experience in defining technical requirements and performing high-level design for complex solutions.
- Experience in SQL and NoSQL databases, Hadoop ecosystem, Druid, Trino, BigQuery, Google Vertex AI.