(Choose 1 answer)
A. (1)
B. (II)
C. (III)
D. (IV)
Which of the following are reasons for using feature scaling?
(1)It is necessary to prevent gradient descent from getting stuck in local optima.
(II)It prevents the matrix XTX (used in the normal equation) from being non-invertable (singular/degenerate).
(Ⅲ)It speeds up gradient descent by making it require fewer iterations to get to a good solution
(IV)It speeds up solving for using the normal equation.
El 43