Comparison of different methods for multiclass classification problems

A beautiful picture

The project is divided into three sections. Section 1 discusses the One-vs-All and One-vs-One method which are techniques to transform a multiclass classification problem into multiple binary classification problems. Section 2 outlines the multinomial logistic regression and discusses why it is not sufficient in many cases. Section 3 applies different multiclass classification methods to the Letter Image Recognition dataset, and compares the model performance as well as provides some insights on each method. Programming in this project are done using Python’s scikit-learn library.


Transform a multiclass classification problem into multiple binary classification problems

One-vs-All method

One-vs-One method


Extend binary classification techniques to multiclass classification problems

Statistical Method - Multinomial logistic regression

Machine Learning Algorithms - Non-linear classifiers


Letter Image Recognition dataset

The data contains 20000 rectangular pixel images where each image (observation) is classified as one of the 26 capital letters (therefore this is a 26-class classification problem) in the English alphabet. In this data, each observation has 16 attributes or features, where each feature is either a statistical moment or edge counts that has already been scaled into a range of integer values from 0 to 15.

A beautiful picture

Exploratory Data Analysis (EDA)

A beautiful picture A beautiful picture A beautiful picture

Summary of model performance for all multiclass classification techniques

A beautiful picture

Last updated on Jan 1, 2019