F1-Score

The F1-score (also known as the F-measure) is a single-number performance metric that provides a balanced summary of a Classification model's success by combining Precision and recall.

📊 See also: Accuracy, Confusion Matrix, Area Under the ROC Curve (AUC), Cohen's Kappa statistic It is especially valuable for evaluating models on imbalanced datasets where Accuracy can be highly misleading.

  1. Calculation and Formula

The F1-score is defined as the harmonic mean of Precision and recall.

Pasted image 20260319194123.png

  1. Logic of the Harmonic Mean

Unlike a standard arithmetic average, the harmonic mean gives much more weight to low values.

  1. Why the F1-Score is Important
  1. Interpretability and Limitations
  1. Multiclass Variations

To evaluate multiclass problems, binary F1-scores are computed for each class and then averaged using different strategies:

  1. The Generalized F-Measure (Fβ​)

In many contexts, Precision and recall are not equally important. The generalized Fβ​ score allows practitioners to weight them differently.

Powered by Forestry.md