The cost function contains a squared term and is divided by 2*m, where m is the number of training examples. What is in the denominator of the gradient descent update?

Correct Answer: m

Gradient descent uses the partial derivative of the cost function with respect to each parameter. Differentiating the squared term brings down a factor of 2, which cancels the 2 in the 2*m denominator, leaving only “m” there.
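The cancellation can be checked numerically. The sketch below is only an illustration: it assumes a one-feature linear hypothesis h(x) = theta0 + theta1*x, which the question does not specify, and the function names and sample data are made up for the example. It compares the hand-derived gradient, which divides by m, against a finite-difference estimate taken from the cost function, which divides by 2*m.

```python
def cost(theta0, theta1, xs, ys):
    """Squared-error cost, divided by 2*m as in the question."""
    m = len(xs)
    return sum((theta0 + theta1 * x - y) ** 2 for x, y in zip(xs, ys)) / (2 * m)


def gradient(theta0, theta1, xs, ys):
    """Partial derivatives of cost(); the 2 has cancelled, so we divide by m."""
    m = len(xs)
    errors = [theta0 + theta1 * x - y for x, y in zip(xs, ys)]
    d_theta0 = sum(errors) / m
    d_theta1 = sum(e * x for e, x in zip(errors, xs)) / m
    return d_theta0, d_theta1


def numerical_gradient(theta0, theta1, xs, ys, eps=1e-6):
    """Central-difference estimate of the same partial derivatives."""
    d_theta0 = (cost(theta0 + eps, theta1, xs, ys)
                - cost(theta0 - eps, theta1, xs, ys)) / (2 * eps)
    d_theta1 = (cost(theta0, theta1 + eps, xs, ys)
                - cost(theta0, theta1 - eps, xs, ys)) / (2 * eps)
    return d_theta0, d_theta1


# Hypothetical training data, chosen only for illustration.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 2.5, 3.5, 5.0]

analytic = gradient(0.5, 0.5, xs, ys)
numeric = numerical_gradient(0.5, 0.5, xs, ys)
print("analytic:", analytic)
print("numeric: ", numeric)
```

If the gradient were divided by 2*m instead of m, the analytic values would come out at half the finite-difference estimates, so the two printed pairs would no longer agree.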
