Humera Minhas & Parinitha Hirehal
Shoot for the moon - machine learning for automated online ad detection
#1about 4 minutes
The challenge of manual ad filtering and the moonshot project
Manual ad filter lists are slow and resource-intensive, prompting the "Project Moonshot" initiative to automate ad detection using AI and machine learning.
#2about 2 minutes
Choosing the right data source for ad detection
The team pivoted from inefficient computer vision models for perceptual ad detection to analyzing HTML structure, which provided richer data for machine learning.
#3about 3 minutes
Generating labeled training data at scale
A custom crawler combined with a modified Adblock Plus was used to automatically label HTML nodes on 250,000 web pages, creating a large-scale ground truth dataset.
#4about 4 minutes
Pre-processing HTML data and overcoming key challenges
The data pipeline converted raw HTML into adjacency and feature matrices while solving challenges like severely unbalanced data and slow processing speeds.
#5about 6 minutes
Experimenting with different machine learning model approaches
Several models were tested for ad classification, including graph neural networks, traditional classifiers with node embeddings, and tree-based models like XGBoost.
#6about 3 minutes
Comparing model performance and planning future improvements
Tree-based models significantly outperformed graph neural networks in F1 score, and future work will explore self-supervised learning and more diverse data.
#7about 3 minutes
Deploying machine learning models in a JavaScript environment
The team tackled deployment challenges by converting Python models to JavaScript, optimizing for latency by moving the model to a background script, and using TensorFlow.js.
#8about 5 minutes
Answering questions on model circumvention and design choices
The speakers address audience questions regarding how ad companies might circumvent the model and the rationale behind their model experimentation process.
Related jobs
Jobs that call for the skills explored in this talk.
Matching moments
04:02 MIN
How a broken frontend culture impacts the web
WeAreDevelopers LIVE – Frontend Inspirations, Web Standards and more
05:03 MIN
Building and iterating on an LLM-powered product
Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2
08:18 MIN
The impact of AI on tech recruitment and resumes
Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2
05:24 MIN
Empathizing with users over chasing engagement metrics
Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2
05:55 MIN
The security risks of AI-generated code and slopsquatting
Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2
04:59 MIN
Unlocking LLM potential with creative prompting techniques
WeAreDevelopers LIVE – Frontend Inspirations, Web Standards and more
03:17 MIN
Why tech companies are laying off engineers for AI
WeAreDevelopers LIVE – Frontend Inspirations, Web Standards and more
03:45 MIN
Preventing exposed API keys in AI-assisted development
Slopquatting, API Keys, Fun with Fonts, Recruiters vs AI and more - The Best of LIVE 2025 - Part 2
Featured Partners
Related Videos
Multimodal Generative AI Demystified
Ekaterina Sirazitdinova
How We Built a Machine Learning-Based Recommendation System (And Survived to Tell the Tale)
Dora Petrella
How to scrape modern websites to feed AI agents
Jan Curn
Hacking AI - how attackers impose their will on AI
Mirko Ross
The AI Elections: How Technology Could Shape Public Sentiment
Martin Förtsch & Thomas Endres
How E.On productionizes its AI model & Implementation of Secure Generative AI.
Kapil Gupta
Machine Learning: Promising, but Perilous
Nura Kawa
Smart, Connected, Unexpected: The Wild Side of IoT and AI
Pawel Skiba
Related Articles
View all articles



From learning to earning
Jobs that call for the skills explored in this talk.

Forschungszentrum Jülich GmbH
Jülich, Germany
Intermediate
Senior
Linux
Docker
AI Frameworks
Machine Learning



ROSEN Technology and Research Center GmbH
Osnabrück, Germany
Senior
React
DevOps
Next.js
TypeScript
Cloud (AWS/Google/Azure)

autonomous-teaming
Berlin, Germany
Remote
ETL
NoSQL
NumPy
Python
+3

autonomous-teaming
München, Germany
Remote
ETL
NoSQL
NumPy
Python
+3

autonomous-teaming
München, Germany
Remote
GIT
Python
Docker
PyTorch
+1

Integral Ad Science
Paris, France
Senior
ETL
Java
DevOps
Python
Docker
+7
