Introduction
In multiclass classification, where each sample belongs to exactly one of three or more classes, evaluating model performance goes beyond simple accuracy. Metrics such as precision, recall, and the F1-score, along with more nuanced evaluation tools, are crucial.
Metrics for Multiclass Classification
Scikit-learn provides tools to calculate various performance metrics in multiclass classification. Here’s a breakdown:
1. Precision
Precision measures the proportion of correctly predicted positive instances out of all instances predicted as positive. In the multiclass setting it is calculated separately for each class, treating that class as positive and all other classes as negative.
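As a minimal sketch, the per-class values can be inspected by passing average=None to precision_score; the label arrays below are purely hypothetical and exist only to illustrate the call:
<pre>
from sklearn.metrics import precision_score

# Hypothetical true and predicted labels for a 3-class problem
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]

# average=None returns one precision value per class, in label order
print(precision_score(y_true, y_pred, average=None))
</pre>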
2. Recall
Recall measures the proportion of correctly predicted positive instances out of all actual positive instances. It too is calculated individually for each class in multiclass settings.
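The same hypothetical labels can be reused to look at recall per class; note how the values differ from the precision array above:
<pre>
from sklearn.metrics import recall_score

# Same hypothetical labels as in the precision sketch
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]

# Per-class recall: correct predictions for a class / actual members of that class
print(recall_score(y_true, y_pred, average=None))
</pre>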
3. Accuracy
Accuracy represents the overall proportion of correctly classified samples. For multiclass, it is calculated as the total number of correct predictions divided by the total number of samples.
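As a quick sanity check (again with hypothetical labels), accuracy_score matches the fraction of positions where the true and predicted labels agree:
<pre>
import numpy as np
from sklearn.metrics import accuracy_score

y_true = np.array([0, 0, 1, 1, 2, 2])
y_pred = np.array([0, 1, 1, 1, 2, 0])

print(accuracy_score(y_true, y_pred))  # 4 correct out of 6 samples
print((y_true == y_pred).mean())       # same value computed by hand
</pre>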
4. F1-Score
The F1-score is the harmonic mean of precision and recall, providing a single balanced measure. It is useful when both precision and recall are important.
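This relationship can be checked per class: combining the per-class precision and recall arrays with the harmonic mean formula reproduces the output of f1_score. The labels are again hypothetical:
<pre>
from sklearn.metrics import precision_score, recall_score, f1_score

y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]

# Per-class precision and recall (returned as NumPy arrays)...
p = precision_score(y_true, y_pred, average=None)
r = recall_score(y_true, y_pred, average=None)

# ...combined with the harmonic mean reproduce the per-class F1 scores
print(2 * p * r / (p + r))
print(f1_score(y_true, y_pred, average=None))
</pre>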
Using Scikit-learn for Multiclass Evaluation
Let’s illustrate with an example:
<pre>
from sklearn.metrics import precision_score, recall_score, accuracy_score, f1_score
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
# Load the Iris dataset
iris = load_iris()
X = iris.data
y = iris.target
# Split into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=42)
# Train a logistic regression model
model = LogisticRegression()
model.fit(X_train, y_train)
# Make predictions
y_pred = model.predict(X_test)
# Calculate metrics
precision = precision_score(y_test, y_pred, average='macro')
recall = recall_score(y_test, y_pred, average='macro')
accuracy = accuracy_score(y_test, y_pred)
f1 = f1_score(y_test, y_pred, average='macro')
# Print the results
print('Precision:', precision)
print('Recall:', recall)
print('Accuracy:', accuracy)
print('F1-Score:', f1)
</pre>
Output:
<pre>
Precision: 0.9736842105263158
Recall: 0.9736842105263158
Accuracy: 0.9736842105263158
F1-Score: 0.9736842105263158
</pre>
Key Considerations
- **Average:** The ‘average’ parameter in the metric functions controls how per-class scores are aggregated: ‘macro’ takes an unweighted mean over classes, ‘micro’ pools true positives, false positives, and false negatives globally across classes, and ‘weighted’ averages per-class scores weighted by class support (see the sketch after this list).
- **Interpretation:** Understand the specific needs of your application. High precision is desirable when false positives are costly, while high recall matters when false negatives (missed positives) are costly.
- **Other Metrics:** Explore other relevant multiclass metrics such as Cohen’s kappa or the Matthews correlation coefficient, both of which appear in the sketch below.
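As a rough sketch of the last two points, the snippet below uses hypothetical, slightly imbalanced labels to compare the ‘macro’, ‘micro’, and ‘weighted’ averaging strategies for the F1-score and to compute the two chance-corrected metrics just mentioned:
<pre>
from sklearn.metrics import f1_score, cohen_kappa_score, matthews_corrcoef

# Hypothetical, slightly imbalanced 3-class labels
y_true = [0, 0, 0, 0, 1, 1, 2, 2, 2, 2]
y_pred = [0, 0, 0, 1, 1, 2, 2, 2, 2, 2]

# 'macro' averages per-class F1 equally, 'micro' pools TP/FP/FN globally,
# 'weighted' averages per-class F1 weighted by class support
print(f1_score(y_true, y_pred, average='macro'))
print(f1_score(y_true, y_pred, average='micro'))
print(f1_score(y_true, y_pred, average='weighted'))

# Chance-corrected agreement measures mentioned above
print(cohen_kappa_score(y_true, y_pred))
print(matthews_corrcoef(y_true, y_pred))
</pre>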
Conclusion
Effective evaluation of multiclass classification models requires utilizing various performance metrics. Scikit-learn provides comprehensive tools to compute precision, recall, accuracy, F1-score, and more. Choosing the right metrics and understanding their implications will empower you to build better-performing models.