F1-Score

The F1-score (also known as the F-measure) is a single-number performance metric that provides a balanced summary of a classification model’s success by combining precision and recall. It is especially valuable for evaluating models on imbalanced datasets where accuracy can be highly misleading.

  1. Calculation and Formula

The F1-score is defined as the harmonic mean of precision and recall.

Pasted image 20260319194123.png

  1. Logic of the Harmonic Mean

Unlike a standard arithmetic average, the harmonic mean gives much more weight to low values.

  1. Why the F1-Score is Important
  1. Interpretability and Limitations
  1. Multiclass Variations

To evaluate multiclass problems, binary F1-scores are computed for each class and then averaged using different strategies:

  1. The Generalized F-Measure (Fβ​)

In many contexts, precision and recall are not equally important. The generalized Fβ​ score allows practitioners to weight them differently.

Powered by Forestry.md