Label Smoothing

This is a trick to make your model less confident. Article here

Andrej Karpathy presents the idea in one of his lectures.

Basically, you add 1 to every count before turning the counts into probabilities. That guarantees no probability is ever exactly 0, so when you take the negative log likelihood you never compute log(0), and the loss never comes out as infinity.
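A minimal sketch of the add-1 idea, assuming a simple count-based model. The counts and the helper names (`smoothed_probs`, `nll`, `alpha`) are made up for illustration and are not taken from the lecture. It compares the negative log likelihood of an outcome that was never observed, with and without adding 1.

```python
import math

def smoothed_probs(counts, alpha=1.0):
    """Turn raw counts into probabilities after adding `alpha` to each count."""
    total = sum(counts) + alpha * len(counts)
    return [(c + alpha) / total for c in counts]

def nll(prob):
    """Negative log likelihood of one outcome; infinite when prob is 0."""
    return math.inf if prob == 0 else -math.log(prob)

# Hypothetical counts: the last outcome never appeared in the data.
counts = [3, 1, 0]

raw = smoothed_probs(counts, alpha=0.0)     # no smoothing
smooth = smoothed_probs(counts, alpha=1.0)  # add 1 to every count

print(raw, nll(raw[2]))        # the unseen outcome has probability 0, loss is inf
print(smooth, nll(smooth[2]))  # every probability is above 0, loss is finite
```

Note that adding 1 also pulls the largest probability down (from 3/4 to 4/7 here), which is the "less confident" part. A larger `alpha` would push the probabilities even closer to uniform.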