One-Hot and Ordinal Encoding for Features and Labels

Features and labels are central to machine learning. Before a model can use categorical features or labels, they usually have to be converted into a numerical format that the algorithm can process. Two common techniques for this conversion are one-hot encoding and ordinal encoding. Knowing when and how to use each one is an important part of preparing data. In this post, we will get an overview of both encoding methods and see how they work.

Let’s begin!

One-Hot Encoding

One-hot encoding is a common technique for converting categorical data into a numerical format that machine learning algorithms can use.

How Does It Work?

Each category is represented by a binary vector.

Suppose you have a feature with N unique categories. One-hot encoding creates N binary features, one for each category.

Each row in the dataset will have one of these features set to 1 (indicating the presence of that category) and the rest set to 0.

Example 1

Consider a dataset with a qualitative feature “Color” that has three possible values: “Red,” “Blue,” and “Green.”

Car ID | Color
1      | Red
2      | Blue
3      | Green
4      | Blue
5      | Red

Qualitative data example (nominal)

Here is the same data after one-hot encoding:

Car ID | Color_Red | Color_Blue | Color_Green
1      | 1         | 0          | 0
2      | 0         | 1          | 0
3      | 0         | 0          | 1
4      | 0         | 1          | 0
5      | 1         | 0          | 0
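The Color table above can be reproduced with a few lines of plain Python. This is a minimal sketch rather than a reference implementation: the helper name `one_hot_encode` and the column order Red, Blue, Green are assumptions chosen to match the table.

```python
def one_hot_encode(values, categories):
    """Return one row per value, with a 1 in the matching category's column."""
    return [
        [1 if value == category else 0 for category in categories]
        for value in values
    ]


# Colors for Car IDs 1 to 5, taken from the example table.
colors = ["Red", "Blue", "Green", "Blue", "Red"]
# Assumed column order: Color_Red, Color_Blue, Color_Green.
categories = ["Red", "Blue", "Green"]

encoded = one_hot_encode(colors, categories)

column_names = [f"Color_{category}" for category in categories]
for car_id, row in enumerate(encoded, start=1):
    print(car_id, dict(zip(column_names, row)))
```

Each row contains exactly one 1, which is the property that gives the technique its name. In practice a library routine such as `pandas.get_dummies` or scikit-learn's `OneHotEncoder` is the more usual choice for this step.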

Example 2

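Consider a feature “Education Level” with four possible values.

Person ID | Education Level
1         | High School
2         | Bachelor’s
3         | Master’s
4         | PhD

Here is the one-hot encoded data:

Person ID | High School | Bachelor’s | Master’s | PhD
1         | 1           | 0          | 0        | 0
2         | 0           | 1          | 0        | 0
3         | 0           | 0          | 1        | 0
4         | 0           | 0          | 0        | 1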

Side Note: In classification problems, one-hot encoding is often applied to the labels as well as the features. Each class label becomes a binary vector with a single 1 in the position of that class, which matches the shape of a model that outputs one score per class.
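To make the side note concrete, here is a small sketch of one-hot encoding class labels and mapping a vector of per-class scores back to a label. The labels ("cat", "dog", "bird"), the score vector, and the helper names are all hypothetical, and sorting the classes alphabetically is just one possible convention.

```python
def encode_labels(labels):
    """One-hot encode class labels; returns (classes, vectors)."""
    classes = sorted(set(labels))  # assumed convention: alphabetical class order
    position = {label: i for i, label in enumerate(classes)}
    vectors = [
        [1 if position[label] == i else 0 for i in range(len(classes))]
        for label in labels
    ]
    return classes, vectors


def decode_label(vector, classes):
    """Map a one-hot (or per-class score) vector back to its class label."""
    return classes[vector.index(max(vector))]


labels = ["cat", "dog", "bird", "dog"]  # hypothetical training labels
classes, targets = encode_labels(labels)

print(classes)   # class order used for the columns
print(targets)   # one binary vector per label

# Hypothetical per-class scores from a model, decoded to the highest-scoring class.
predicted = decode_label([0.1, 0.2, 0.7], classes)
print(predicted)
```

Keeping the `classes` list alongside the encoded targets matters, because it is the only record of which column belongs to which label.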

Ordinal Encoding

Ordinal encoding is a method used to convert ordinal (ordered categorical) features into numerical values while preserving the inherent order of the categories. Unlike one-hot encoding, which creates a binary column for each category, ordinal encoding assigns a single integer to each category based on its position in that order.

How Does It Work?

In ordinal encoding, each category is mapped to an integer that reflects its rank or position in the sequence. This encoding is useful when the categorical data has a meaningful order, but the intervals between the values are not necessarily equal.

Let’s revisit Example 2 from the one-hot encoding section and see how ordinal encoding treats the same data.

Person ID | Education Level
1         | High School
2         | Bachelor’s
3         | Master’s
4         | PhD

Ordinal Encoding Steps:

  1. Identify the Order:
    • High School < Bachelor’s < Master’s < PhD
  2. Assign Integer Values:
    • High School = 1
    • Bachelor’s = 2
    • Master’s = 3
    • PhD = 4

Person ID | Education Level
1         | 1
2         | 2
3         | 3
4         | 4

In this encoded data, the numerical values reflect the inherent order of the education levels.
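The two steps above (identify the order, then assign integers) can be sketched in plain Python as follows. The helper name `ordinal_encode` and the `start` parameter are assumptions made for illustration; the order and the integer values come from the example.

```python
# Step 1: identify the order, lowest to highest.
EDUCATION_ORDER = ["High School", "Bachelor's", "Master's", "PhD"]


def ordinal_encode(values, ordered_categories, start=1):
    """Step 2: replace each category with its rank in ordered_categories."""
    rank = {
        category: position
        for position, category in enumerate(ordered_categories, start=start)
    }
    # An unknown category raises KeyError instead of silently getting a rank.
    return [rank[value] for value in values]


# Education levels for Person IDs 1 to 4, from the example table.
education = ["High School", "Bachelor's", "Master's", "PhD"]
codes = ordinal_encode(education, EDUCATION_ORDER)

for person_id, code in enumerate(codes, start=1):
    print(person_id, code)
```

The order has to be supplied explicitly, because sorting the category names alphabetically would put Bachelor's before High School only by coincidence and would place PhD before nothing meaningful. Library encoders such as scikit-learn's `OrdinalEncoder` accept an explicit category order for the same reason, and they number categories from 0 rather than 1.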

When to Use Ordinal Encoding

Ordinal encoding is suitable for ordinal features where the order matters, but the exact differences between the categories are not known or are not equal. Examples include:

  • Education levels (High School, Bachelor’s, Master’s, PhD)
  • Customer satisfaction ratings (Poor, Fair, Good, Excellent)
  • T-shirt sizes (Small, Medium, Large, Extra Large)

When to Use Which?

  • One-Hot Encoding: Use it when the categorical variable does not have an inherent order. For example, if the variable is “Color” (Red, Green, Blue), there is no natural order.
  • Ordinal Encoding: Use it when the categorical variable has a clear, meaningful order. For example, if the variable is “Size” (Small, Medium, Large), there is a natural progression from Small to Large.
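As a rough illustration of applying both rules to the same dataset, the sketch below one-hot encodes a nominal “Color” column and ordinal encodes an ordered “Size” column. The sample rows and the helper name `encode_row` are hypothetical; the category lists come from the two bullets above.

```python
COLOR_CATEGORIES = ["Red", "Green", "Blue"]   # nominal: no inherent order
SIZE_ORDER = ["Small", "Medium", "Large"]     # ordinal: Small < Medium < Large


def encode_row(color, size):
    """Return [Color_Red, Color_Green, Color_Blue, Size] for one record."""
    color_flags = [1 if color == category else 0 for category in COLOR_CATEGORIES]
    size_rank = SIZE_ORDER.index(size) + 1    # ranks start at 1
    return color_flags + [size_rank]


# Hypothetical records: (Color, Size).
rows = [("Red", "Small"), ("Blue", "Large"), ("Green", "Medium")]
encoded_rows = [encode_row(color, size) for color, size in rows]

for original, encoded in zip(rows, encoded_rows):
    print(original, "->", encoded)
```

Giving Color the integers 1, 2, 3 instead would suggest to a model that Blue is somehow “greater than” Red, which is the situation the one-hot rule is meant to avoid.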

Summing Up

Choosing the appropriate encoding method depends on the nature of the categorical data and the specific requirements of the machine learning algorithm being used.
