Stochastic Gradient Descent

Stochastic gradient descent samples the gradient.

Stochastic gradient descent is a specific case of Mini-Batch Gradient Descent.

CS294

SGD minimizes expectations, for a differentiable function of , SGD solves We can use this with Maximum Likelihood Estimation