Constrained Optimization
Ohh is this what they are doing with the Sinkhorn Divergence to solve the EMD??
They add a constraint to make it convex?
Learning this in the context of CS287.
Original problem: Find s.t. (subject to)
Original problem formulated as Lagrangian problem: Find , where .
Our objective is: This Lagrangian equation respects the two conditions: We can then solve it.
Why are these two equivalent?
Trying to convince myself that these two are equivalent. Pieter Abbeel was explaining that even in the the Lagrangian formulation, is forced to be intuitively, else you will get punished.
I still don’t get it…