Bias-Variance Tradeoff — Explain Like I'm 5

The Archer Who Always Misses Left

Imagine two archers.

Archer A always shoots to the left. Every single arrow — 5 centimeters left of center. It’s consistent, predictable, wrong in the same direction every time. This is bias — a systematic error, always in the same direction.

Archer B is different. Their arrows land all over the place — sometimes left, sometimes right, sometimes high, sometimes low. No pattern, just randomness. This is variance — inconsistent, scattered.

In machine learning, your model is the archer. Getting predictions right means minimizing both types of error.

Two Ways to Get Things Wrong

High bias (underfitting): The model is too simple. It doesn’t capture the real patterns in the data. Imagine using a single straight line to fit data that clearly curves. The line consistently misses in systematic ways.

High variance (overfitting): The model is too complex. It memorizes the training data, including the random noise. When shown new data, it fails because the noise doesn’t repeat. A very wiggly curve that passes through every single training point would fit training data perfectly but fail on new data.

The Tension

Making a model more complex usually:

  • Reduces bias (it can capture more real patterns)
  • Increases variance (it starts memorizing noise)

Making a model simpler usually:

  • Increases bias (it can’t capture complex patterns)
  • Reduces variance (it doesn’t memorize noise)

This is the tradeoff. Finding the right complexity — complex enough to capture real patterns, simple enough not to memorize noise — is one of machine learning’s most fundamental challenges.

The Good News

Modern machine learning (deep neural networks) has somewhat broken this classical tradeoff for large models. Very large neural networks trained on enormous datasets seem to get both lower bias AND lower variance through a phenomenon called “double descent” — they’re so large that they effectively don’t memorize in the harmful way. But the underlying tension still appears in smaller models and limited data settings.

One thing to remember: Bias is being consistently wrong; variance is being inconsistently wrong. Good models minimize both — by being complex enough to see real patterns but simple enough not to chase noise.

bias-varianceoverfittingunderfittinggeneralizationmachine-learningstatistics

See Also

  • Ai Hallucinations ChatGPT sometimes makes up facts with total confidence. Here's the weird reason why — and why it's not as simple as 'the AI lied.'
  • Artificial Intelligence What is AI really? Think of it as a dog that learned tricks — impressive, but it doesn't know why it's doing them.
  • Deep Learning Why your phone can spot your face in a messy photo album — and why that trick comes from practice, not magic.
  • Embeddings How do computers know that 'dog' and 'puppy' mean almost the same thing? They don't read definitions — they turn words into secret map coordinates, and nearby coordinates mean nearby meanings.
  • Generative Ai Generative AI doesn't look things up — it makes things up. Here's why that's either impressive or terrifying, depending on what you ask it to make.