🛠️ Steven Gong

Search

SearchSearch
  • Cross-Entropy Method
  • For Policy Gradient Methods

Jan 01, 2023, 1 min read

Cross-Entropy

Cross-Entropy Method

https://en.wikipedia.org/wiki/Cross-entropy_method

maxx​f(x)

For Policy Gradient Methods

Cross-Entropy Method

  • Very simple and can work surprisingly well
  • Very scalable
  • Does not take advantage of any temporal structure

Graph View

Backlinks

  • CS287 - Advanced Robotics

Created with Quartz, © 2025

  • Blog
  • LinkedIn
  • Twitter
  • GitHub