Stochastic Gradient Descent
Stochastic gradient descent samples the gradient.
Stochastic gradient descent is a specific case of Mini-Batch Gradient Descent.
CS294
SGD minimizes expectations, for a differentiable function of , SGD solves We can use this with Maximum Likelihood Estimation