Dora Petrella
How We Built a Machine Learning-Based Recommendation System (And Survived to Tell the Tale)
#1about 5 minutes
Defining the business need for product recommendations
A recommendation system for substitute products is needed across multiple touchpoints to prevent lost sales from out-of-stock items.
#2about 2 minutes
Analyzing the limitations of the existing recommender
The previous system, based on the Jaccard coefficient, produced low-quality recommendations, particularly for new or unpopular items.
#3about 5 minutes
Using the Prod2Vec algorithm for recommendations
The Prod2Vec algorithm, adapted from Word2Vec, learns product relationships by analyzing co-occurrence within user session context windows.
#4about 2 minutes
Improving predictions with Meta-Prod2Vec and metadata
Incorporating product metadata like category and brand into the model (Meta-Prod2Vec) significantly improves recommendation quality for long-tail items.
#5about 2 minutes
Implementing the end-to-end MLOps pipeline
The production system uses dbt for data transformation, a Vertex AI pipeline for model training, and Elasticsearch for efficient vector similarity search.
#6about 3 minutes
Evaluating model performance with offline and online metrics
Offline metrics like NDCG confirmed model quality, while mirror traffic analysis showed a 45% increase in product recommendation coverage.
#7about 3 minutes
Visualizing product relationships with embedding projector
Using TensorFlow's Embedding Projector tool reveals how the model groups similar products into distinct clusters in a high-dimensional space.
#8about 3 minutes
Adopting pragmatic baselines and automated data analysis
Key project takeaways include using simple business-logic baselines for benchmarking and automating exploratory data analysis within the ML pipeline itself.
#9about 1 minute
Understanding the project team and final timeline
The project was completed in nine months by a cross-functional team of data engineers, data scientists, and software developers.
Related jobs
Jobs that call for the skills explored in this talk.
Picnic Technologies B.V.
Amsterdam, Netherlands
Intermediate
Senior
Python
Structured Query Language (SQL)
+1
WALTER GROUP
Wiener Neudorf, Austria
Intermediate
Senior
Python
Data Vizualization
+1
Wilken GmbH
Ulm, Germany
Senior
Kubernetes
AI Frameworks
+3
Matching moments
04:57 MIN
Increasing the value of talk recordings post-event
Cat Herding with Lions and Tigers - Christian Heilmann
01:32 MIN
Organizing a developer conference for 15,000 attendees
Cat Herding with Lions and Tigers - Christian Heilmann
04:27 MIN
Moving beyond headcount to solve business problems
What 2025 Taught Us: A Year-End Special with Hung Lee
03:28 MIN
Why corporate AI adoption lags behind the hype
What 2025 Taught Us: A Year-End Special with Hung Lee
04:22 MIN
Why HR struggles with technology implementation and adoption
What 2025 Taught Us: A Year-End Special with Hung Lee
03:38 MIN
Balancing the trade-off between efficiency and resilience
What 2025 Taught Us: A Year-End Special with Hung Lee
05:18 MIN
Incentivizing automation with a 'keep what you kill' policy
What 2025 Taught Us: A Year-End Special with Hung Lee
03:15 MIN
The future of recruiting beyond talent acquisition
What 2025 Taught Us: A Year-End Special with Hung Lee
Featured Partners
Related Videos
Data Science in Retail
Julian Joseph
How AI Models Get Smarter
Ankit Patel
Design as an algorithm, not as a feature
Kaya Sohyun Kim
Empowering Retail Through Applied Machine Learning
Christoph Fassbach & Daniel Rohr
Hybrid AI: Next Generation Natural Language Processing
Jan Schweiger
Deployed ML models need your feedback too
David Mosen
Make it simple, using generative AI to accelerate learning
Duan Lightfoot
Machine learning 101: Where to begin?
Lutske De Leeuw
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Amazon.com, Inc
Intermediate
R
Python
Matlab
Terraform
Machine Learning
+2

Amazon.com, Inc
Intermediate
Python
Terraform
Machine Learning
Amazon Web Services (AWS)
Scripting (Bash/Python/Go/Ruby)

Tecdata
Barcelona, Spain
Intermediate
Machine Learning
Amazon Web Services (AWS)

UL Solutions
Barcelona, Spain
Python
Machine Learning

Amazon.com, Inc
Senior
Spark
Hadoop
PyTorch
Machine Learning
Amazon Web Services (AWS)


Amazon.com, Inc.
Municipality of Zaragoza, Spain
Intermediate
API
Java
DevOps
Python
Machine Learning
+1

AWS EMEA SARL (Spain Branch)
Municipality of Madrid, Spain
Keras
Python
PyTorch
TensorFlow
Machine Learning
+1

Barcelona Supercomputing Center
Barcelona, Spain
Unix
Python
PyTorch
TensorFlow
Machine Learning