Brier Score

Brier Score

The Brier Score is a performance metric used to evaluate the Accuracy of probabilistic forecasts.

📊 Related metrics: Log Loss (Cross-Entropy), Accuracy, Precision, F1-Score, Area Under the ROC Curve (AUC) Originally developed for weather forecasting, it has become a standard tool in machine learning to assess how well a model’s predicted probabilities match actual outcomes.

1. Calculation and Formula

The Brier score is essentially the mean squared error of predicted distributions compared to the true one-hot encoded labels.

where qnc​ is the predicted probability for class c and ync​ is the actual indicator (1 if class c is true, 0 otherwise).

2. How to Interpret the Brier Score

3. Key Property: Proper Scoring Rule

The Brier score is a proper scoring rule, which is a vital mathematical property for any metric assessing uncertainty.

4. Comparison to Log Loss (Cross-Entropy)

While both are proper scoring rules, they behave differently:

5. Advanced Variant: Brier Skill Score (BSS)

Because absolute Brier score values can be difficult to interpret across different datasets, practitioners often use the Brier Skill Score to measure relative performance against a baseline.

6. Applications in Generative Modeling

Beyond simple Classification, the Brier scoring rule has been used as an objective function to train generative models like Generative Adversarial Networks (specifically LS-GAN). In this context, minimizing the Brier score is mathematically linked to minimizing the Pearson χ2 divergence between the generated and real data distributions.

Powered by Forestry.md