Data Normalization -
What is Data Normalization - Data normalization is a technique used in data mining to transform the values of a dataset into a common scale. This is important because many machine learning algorithms are sensitive to the scale of the input features and can produce better results when the data is normalized.
Data Normalization Techniques -
Min-Max Normalization - This technique performs a linear transformation on the original data. It scales the values of a feature to a range between 0 and 1. This is done by subtracting the minimum value of the feature from each value, and then dividing by the range of the feature. The formula for min-max normalization is:
Z-Score Normalization - Also known as Zero mean normalization or standardization, this technique normalizes values based on the mean and standard deviation of the data. Each value is replaced by a score that indicates how many standard deviations it is from the mean. You can apply Z-score normalization using the following formula:
Decimal Scaling Normalization - This technique normalizes by moving the decimal point of values of the data. Each value of the data is divided by the maximum absolute value of the data, resulting in values typically in the range of -1 to 1. The formula for this simple normalization technique is:
Recommended by LinkedIn
Advantages:
Disadvantages:
Interesting! Sanchit Aggarwal