Accuracy

Accuracy is the most common quantitative performance measure used to evaluate classification systems. It provides a single-number summary of a model’s success by measuring the fraction of instances it correctly identified out of the total population.

1. Calculation and Building Accuracy

In the context of a Confusion Matrix, accuracy is built by summing the correct predictions (the main diagonal) and dividing by the total number of examples.

2. How to Interpret Accuracy

Accuracy is most effectively interpreted when errors in all classes are judging to be equally important. For example, in an object recognition task for a domestic robot, misidentifying a chair may be no more or less critical than misidentifying a table.

However, interpreting a high accuracy score requires context:

3. The Critical Flaw: Imbalanced Datasets

The most significant limitation of accuracy is its tendency to be highly misleading on skewed or imbalanced datasets. In many real-world scenarios, one class is much more frequent than others.

4. Advanced Accuracy Metrics

To address the limitations of basic accuracy, practitioners use more nuanced versions:

5. Accuracy in the Model Lifecycle

Powered by Forestry.md