Skip to content
Sign in

What is Bias-Variance Tradeoff?

The bias-variance tradeoff describes the balance between two sources of error in a machine learning model: bias (error from overly simple assumptions) and variance (error from sensitivity to small fluctuations in the training data).

Every model has a total expected error that can be decomposed into bias squared, variance, and irreducible noise. High bias occurs when the model is too simple and systematically misses patterns, while high variance occurs when the model is too complex and fits noise in the training set.

Increasing model complexity typically lowers bias but raises variance, and vice versa. The goal of training is to find the sweet spot on this curve that minimizes overall prediction error on unseen data.

Techniques such as regularization, cross-validation, and ensemble methods are used to navigate this tradeoff without needing to know the exact bias and variance values.

Example

A linear regression model fit to a curved dataset will have high bias and consistently under-predict in some regions, while a high-degree polynomial will have low bias on the training points but high variance, swinging wildly between them.

Why it matters

Understanding the tradeoff guides choices about model complexity, regularization strength, and data size, directly affecting how well modern AI systems generalize beyond their training data.

Frequently asked questions

Bias is the error from wrong assumptions that cause the model to miss relevant patterns; variance is the error from the model being too sensitive to the training data.