Feature Scaling and Mean Normalization

Feature Scaling

Idea: Make sure features are on a similar scale

Example: below data, the contour will be very skewed hence gradient descent might take a long time to complete. We should convert them into similar scale, ie. similar range of values (in picture below they're now in the range of [0, 1] (if possible in the range of [-1, 1])
x1 = size (0-2000 feet2)
x2 = number of bedrooms (1-5)
Goal: get every feature into approximately [-1, 1] or [-0.5, 0.5] range
Note: denominator is actually a range (= max - min), ie. 2000 - 0 = 2000

Mean normalization:

Idea: replace xi with xiμi where μi is the average of all the values for feature, si is the range of values (max - min) or si is the standard deviation.
Formula:
xi:=xiμisi

Resources:

https://www.coursera.org/learn/machine-learning/lecture/xx3Da/gradient-descent-in-practice-i-feature-scaling

Comments

Post a Comment