Cross-Entropy Method
https://en.wikipedia.org/wiki/Cross-entropy_method
“Avoid variance collapse in the cross-entropy method.” What are they talking about?
For Policy Gradient Methods
Cross-Entropy Method
- Very simple and can work surprisingly well
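- Very scalable (candidate evaluations are independent, so they parallelize easily)
- Does not take advantage of any temporal structure (each candidate is scored only by a single number, e.g. total episode return)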
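
To make the bullets above concrete, here is a minimal sketch of the cross-entropy method, assuming a diagonal Gaussian sampling distribution over a parameter vector. The names `cem`, `elite_frac`, and `extra_std` are illustrative and not taken from any particular library or paper. The "variance collapse" in the quote above plausibly refers to the fitted standard deviation shrinking toward zero after repeated elite refits, which stops exploration; adding a small constant to the standard deviation (`extra_std` here) is one simple, assumed remedy.

```python
import numpy as np


def cem(f, dim, n_iters=50, pop_size=100, elite_frac=0.2, extra_std=0.0, seed=0):
    """Maximize f over R^dim with a diagonal-Gaussian cross-entropy method.

    f scores a whole parameter vector with one scalar (for a policy this
    could be the total episode return), so no per-timestep structure is used.
    extra_std is a hypothetical knob: a constant added to the refit standard
    deviation so the search distribution cannot shrink all the way to zero.
    """
    rng = np.random.default_rng(seed)
    mean = np.zeros(dim)
    std = np.ones(dim)
    n_elite = max(1, int(pop_size * elite_frac))

    for _ in range(n_iters):
        # Sample a population of candidates from the current Gaussian.
        samples = mean + std * rng.standard_normal((pop_size, dim))
        # Score each candidate independently (this loop is trivially parallel).
        scores = np.array([f(x) for x in samples])
        # Keep the highest-scoring fraction of candidates.
        elite = samples[np.argsort(scores)[-n_elite:]]
        # Refit the Gaussian to the elites.
        mean = elite.mean(axis=0)
        std = elite.std(axis=0) + extra_std

    return mean, std


if __name__ == "__main__":
    # Toy objective with its maximum at (0.5, 0.5).
    target = np.array([0.5, 0.5])
    objective = lambda x: -np.sum((x - target) ** 2)

    mean, std = cem(objective, dim=2)
    print("no floor:  ", mean, std)

    mean, std = cem(objective, dim=2, extra_std=0.05)
    print("with floor:", mean, std)
```

Without the floor, the standard deviation shrinks at every refit, which is harmless on this toy problem but can freeze the search early when the optimum lies far from the initial distribution.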