Skip to content

What is Supervised Learning?

Supervised learning is a machine learning method where a model is trained on data that already has correct answers attached, so it can learn to predict those answers for new data.

In supervised learning, the training data consists of input examples paired with known output labels. The algorithm adjusts its internal parameters to minimize the difference between its predictions and the true labels.

This process typically involves splitting data into training and test sets, choosing a loss function to measure errors, and using optimization techniques like gradient descent to improve performance over many iterations.

The two main tasks are classification, which predicts discrete categories, and regression, which predicts continuous values. The goal is to build a model that generalizes well to unseen data rather than just memorizing the training examples.

Example

A classic example is training a model on thousands of emails labeled as 'spam' or 'not spam' so that it can automatically classify new incoming emails correctly.

Why it matters

Supervised learning powers many everyday AI systems such as image recognition, medical diagnosis tools, fraud detection, and recommendation engines, making it one of the most widely used approaches in practical machine learning today.

Frequently asked questions

Supervised learning uses labeled data with known correct answers, while unsupervised learning works with unlabeled data to find hidden patterns.