ROC and AUC Curves in Python — ELI5

Understand how one picture and one number tell you whether a computer's predictions are trustworthy or just lucky guesses.

Imagine you are playing a game where you have to sort marbles into two buckets: red ones and blue ones. But the marbles are dusty and hard to tell apart. You decide on a rule: “if it looks more red than blue, put it in the red bucket.”

Now, you can be strict (“only if I am really, really sure it is red”) or relaxed (“if there is even a tiny chance it might be red, call it red”). Being strict means you miss some red marbles, but the ones you pick are almost always correct. Being relaxed means you catch all the red marbles, but you also grab a bunch of blue ones by mistake.

A ROC curve is a picture that shows what happens at every level of strictness, from super relaxed on one end to ultra strict on the other. It plots “how many real reds did I catch?” against “how many blues did I accidentally grab?” for each setting.

If the curve bows up toward the top-left corner, the sorter is doing great — catching reds without grabbing blues. If the curve is a straight diagonal line from corner to corner, the sorter is no better than flipping a coin.

The AUC is a single number that measures how far the curve bows. A score of 1.0 means perfect sorting. A score of 0.5 means coin-flip random. Anything above 0.5 means the model is at least somewhat useful.

One thing to remember: The ROC-AUC score tells you how good a model is at separating two groups, no matter where you set the strictness dial.

pythonroc-aucmachine-learningclassification

ROC and AUC Curves in Python — ELI5

See Also

Related Topics