Brier Score

The Brier Score is a performance metric used to evaluate the accuracy of probabilistic forecasts. Originally developed for weather forecasting, it has become a standard tool in machine learning to assess how well a model’s predicted probabilities match actual outcomes.

1. Calculation and Formula

The Brier score is essentially the mean squared error of predicted distributions compared to the true one-hot encoded labels.

where qnc​ is the predicted probability for class c and ync​ is the actual indicator (1 if class c is true, 0 otherwise).

2. How to Interpret the Brier Score

3. Key Property: Proper Scoring Rule

The Brier score is a proper scoring rule, which is a vital mathematical property for any metric assessing uncertainty.

4. Comparison to Log Loss (Cross-Entropy)

While both are proper scoring rules, they behave differently:

5. Advanced Variant: Brier Skill Score (BSS)

Because absolute Brier score values can be difficult to interpret across different datasets, practitioners often use the Brier Skill Score to measure relative performance against a baseline.

6. Applications in Generative Modeling

Beyond simple classification, the Brier scoring rule has been used as an objective function to train generative models like Generative Adversarial Networks (specifically LS-GAN). In this context, minimizing the Brier score is mathematically linked to minimizing the Pearson χ2 divergence between the generated and real data distributions.

Powered by Forestry.md