Cutting LLM Costs Without Cutting Quality: How to Beat Proprietary LLMs with Fine-Tuned Open Source

About This Session

Let's cut through the hype: most AI agents never make it past the demo stage. The gap between a working prototype and a production-grade system comes down to one thing: evaluation. Without reliable metrics, you're guessing at what's working, what needs fixing, and whether your agents are actually improving.

You'll learn how to:

- Define custom metrics tailored to your use case
- Calibrate LLM judges for cost-effective assessments
- Track evaluation results over time to measure real progress

Whether you're building LLM-powered apps or leading AI teams, you'll leave with actionable tools to move from proof-of-concept to production, with the transparency and reliability enterprises demand.

Speaker

Viktoria Semaan

Principal AI Evangelist · Databricks

Viktoria Semaan is a Principal AI Evangelist at Databricks, specializing in AI agents, data engineering, and enterprise AI implementation. She has spent years helping organizations build production-ready AI systems, previously as a Senior Developer Advocate at AWS. With 15+ years of experience spanning technology, engineering, and management consulting, Viktoria has led Build and Go-to-Market efforts with top software companies. She holds an MBA from UCLA Anderson and completed an entrepreneurship program at HEC Paris. Named an AI/ML Top Voice on LinkedIn, Viktoria is deeply passionate about coaching, talent development, and mentoring. She regularly shares educational content with her audience of over 630,000 tech professionals, fostering a collaborative and inclusive community across her platforms.