AI Engineering

Compress, Cut, and Distill: The Latest Gen AI Model Compression Techniques in Practice

with Sergio Perez

Friday 10 July 14:15 – 16:15 Room M2 (40 Seats)

About This Session

This training lab explores the art and engineering of compressing large language models to make them cheaper, faster, and easier to deploy while preserving practical capability. Designed for a broad audience, from beginners to advanced practitioners, the workshop will introduce foundational concepts for newcomers, share implementation patterns and pitfalls for experienced engineers, and highlight cutting-edge research directions for specialists.

Topics

  • AI Models
  • Large Language Models (LLMs)