How to compute precision, recall, accuracy and f1-score for the multiclass case with scikit learn?

By jacksparrow August 30, 2024

Computing Precision, Recall, Accuracy, and F1-Score for Multiclass Classification

Introduction

This article delves into the computation of fundamental evaluation metrics – precision, recall, accuracy, and F1-score – in the context of multiclass classification using Scikit-learn, a popular Python machine learning library.

Multiclass Classification

Multiclass classification problems involve predicting one of multiple possible classes, often with no inherent order. Examples include:

Image classification (cat, dog, bird)
Sentiment analysis (positive, negative, neutral)
Spam detection (spam, not spam)

Evaluation Metrics

Precision

Precision measures the proportion of correctly predicted positive instances among all instances predicted as positive. In multiclass settings, it’s calculated for each class separately.

Recall

Recall measures the proportion of correctly predicted positive instances among all actual positive instances. Like precision, it’s computed per class.

Accuracy

Accuracy reflects the overall proportion of correctly classified instances. It’s a simple and often used metric but can be misleading when class distributions are imbalanced.

F1-Score

The F1-score represents the harmonic mean of precision and recall. It provides a balanced metric, particularly useful when considering both false positives and false negatives.

Scikit-learn Implementation

Scikit-learn offers convenient tools to compute these metrics. Here’s a comprehensive example using a multiclass classification problem, followed by code explanations.

from sklearn.metrics import precision_score, recall_score, accuracy_score, f1_score from sklearn.datasets import load_iris from sklearn.model_selection import train_test_split from sklearn.linear_model import LogisticRegression


# Load the Iris dataset

iris = load_iris()

X = iris.data

y = iris.target
# Split into train and test sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# Train a logistic regression model

model = LogisticRegression()

model.fit(X_train, y_train)
# Make predictions

y_pred = model.predict(X_test)
# Compute evaluation metrics

precision = precision_score(y_test, y_pred, average='macro')

recall = recall_score(y_test, y_pred, average='macro')

accuracy = accuracy_score(y_test, y_pred)

f1 = f1_score(y_test, y_pred, average='macro')

# Print results print("Precision:", precision) print("Recall:", recall) print("Accuracy:", accuracy) print("F1-score:", f1)

Explanation

Import necessary modules (precision_score, recall_score, etc.)
Load the Iris dataset (you can replace this with your own data)
Split data into training and testing sets
Train a logistic regression model
Make predictions on the test data
Compute each metric using Scikit-learn functions. Note: average='macro' calculates the mean metric across all classes.
Print the results.

Output

Precision: 0.9666666666666667
Recall: 0.9333333333333333
Accuracy: 0.9666666666666667
F1-score: 0.9499999999999999

Conclusion

Understanding and effectively calculating precision, recall, accuracy, and F1-score is crucial for assessing the performance of multiclass classification models. Scikit-learn provides convenient tools to streamline this process, empowering data scientists to make informed decisions about model selection and optimization.

Post Views: 8

How to compute precision, recall, accuracy and f1-score for the multiclass case with scikit learn?

Introduction

Multiclass Classification

Evaluation Metrics

Precision

Recall

Accuracy

F1-Score

Scikit-learn Implementation

Explanation

Output

Conclusion

By jacksparrow

Leave a Reply Cancel reply

You Missed

What is Python? – Definition, Features, Application

KeyAttestation in Android Nougat API 24

UTM tracking codes in Firebase

android.os.BadParcelableException: ClassNotFoundException when unmarshalling: com.facebook.flatbuffers.helpers.FlatBufferModelHelper$LazyHolder

Introduction

Multiclass Classification

Evaluation Metrics

Precision

Recall

Accuracy

F1-Score

Scikit-learn Implementation

Explanation

Output

Conclusion

By jacksparrow

Related Post

Leave a Reply Cancel reply

You Missed