Semi-Supervised Learning. How to overcome the lack of labels

What if you could achieve 95% accuracy with only a handful of labeled examples? Discover how semi-supervised learning makes it possible.

#1about 4 minutes

Understanding the high cost of data labeling

The need for semi-supervised learning arises from the significant expertise, time, and resources required to label data for medical research, malware detection, and industrial inspection.

#2about 5 minutes

Locating semi-supervised learning in the ML landscape

Semi-supervised learning bridges the gap between supervised and unsupervised methods by leveraging a small amount of labeled data alongside a large pool of unlabeled data.

#3about 2 minutes

Using entropy minimization and pseudo-labeling techniques

Initial approaches involve training a classifier to be confident in its predictions by minimizing entropy or by iteratively using the model's own high-confidence predictions as new labels.

#4about 6 minutes

Applying consistency training with data augmentation

A core technique involves applying augmentations like rotation or back-translation and training the model to produce consistent predictions for the original and augmented data.

#5about 1 minute

Improving model robustness with virtual adversarial training

Virtual Adversarial Training (VAT) improves consistency by finding and training against small, adversarial perturbations that are most likely to change the model's prediction.

#6about 2 minutes

Leveraging generative models to understand data structure

Generative models can learn the underlying structure of unlabeled data, which helps create more accurate decision boundaries when combined with a few labeled examples.

#7about 4 minutes

Implementing semi-supervised learning with variational autoencoders

A variational autoencoder (VAE) can be adapted for semi-supervised classification by adding a classification loss term, significantly boosting accuracy with unlabeled data.

#8about 1 minute

Recapping the core principles of semi-supervised learning

The main idea is to combine a standard supervised loss on labeled data with an additional loss function that leverages the structure of unlabeled data.

Alex Timashov

Semi-Supervised Learning. How to overcome the lack of labels

Understanding the high cost of data labeling

Locating semi-supervised learning in the ML landscape

Using entropy minimization and pseudo-labeling techniques

Applying consistency training with data augmentation

Improving model robustness with virtual adversarial training

Leveraging generative models to understand data structure

Implementing semi-supervised learning with variational autoencoders

Recapping the core principles of semi-supervised learning

Matching moments

Featured Partners

Related Videos

Related Articles

From learning to earning