Cross-Entropy Method
https://en.wikipedia.org/wiki/Cross-entropy_method
For Policy Gradient Methods
Cross-Entropy Method
- Very simple and can work surprisingly well
- Very scalable
- Does not take advantage of any temporal structure
https://en.wikipedia.org/wiki/Cross-entropy_method
Cross-Entropy Method