Senior AI Scientist: Enterprise AI Systems
Role details
Job location
Tech stack
Job description
We are seeking a Sr. AI Scientist to design, experiment with, and productionize AI systems for enterprise use cases. This role focuses on building reliable AI workflows, developing strong experimentation and evaluation frameworks, and optimizing language models (including small and mid-sized models) for real world deployment. Experience with agentic systems is a plus., As a Sr. AI Scientist, y ou will operate at the intersection of research and applied engineering- owning scoped, end-to-end AI workflows where appropriate-while partnering closely with platform and product teams to ensure scientific rigor, reliability, and measurable business impact., AI System Design & Implementation
- Design and implement AI systems capable of handling complex workflows, including multi step reasoning, planning, and decision making.
- Explore and apply various orchestration patterns and architectures to solve business problems effectively.
- Translate abstract requirements into reliable AI behaviors under real operational constraints such as latency, cost, and safety.
Experimentation & Evaluation
- Lead systematic experimentation across prompts, agents, model variants, and tool configurations.
- Define and implement evaluation strategies for agentic systems , including task success, robustness, failure recovery, and hallucination analysis.
- Develop lightweight benchmarks, simulations, and offline/online evaluation loops to guide rapid iteration and decision-making.
Model Optimization & SLMs
- Fine-tune and adapt small and medium language models using techniques such as PEFT, SFT, and distillation.
- Balance performance, latency, and cost for enterprise-grade workloads.
- Contribute to model selection decisions (SLMs vs. larger models) based on use-case requirements and empirical evidence.
Applied End-to-End Delivery
- Own bounded, end-to-end AI workflows from problem framing through deployment where this accelerates impact.
- Partner with engineering teams on integration, monitoring, and lifecycle management.
- Contribute to production readiness without acting as a full-time platform or infrastructure owner.
Technical Leadership
- Act as a technical reference point for agentic AI best practices across the team.
- Mentor junior scientists on experimentation methodology, evaluation design, and practical GenAI system development.
- Influence roadmap and technical direction through evidence-backed recommendations .
Requirements
- Education : MS or PhD in Artificial Intelligence, Computer Science, or a related quantitative field.
- Applied ML / GenAI Experience : 5+ years building and iterating on ML systems, with hands-on experience using Transformer-based models.
- Model Development & Deployment : Strong experience in one or more areas: model training, fine tuning, optimization, or production deployment of ML systems.
- Experimentation Mindset : Strong intuition for experimental design, evaluation metrics, error analysis, and iterative improvement.
- Model Adaptation : Experience with techniques such as fine tuning, PEFT, SFT, distillation, or other model optimization approaches.
- Programming : Strong Python skills and solid software engineering fundamentals.
- Production Awareness : Familiarity with deploying and maintaining AI systems in production environments (cloud, CI/CD, monitoring), without requiring deep infrastructure specialization.
- Bonus : Experience building agentic workflows, tool using systems, or autonomous AI agents (e.g., LangGraph, custom agent frameworks); RLHF experience is also a plus.