First encountered this in the BayesRace paper.
- This video by a researcher at DeepMind: https://www.youtube.com/watch?v=C5nqEHpdyoE&ab_channel=UAI2018
Ahh it’s actually a great talk. There is this close parallel with Multi-Armed Bandit problems.
They have the same exploration vs. exploitation problem: sometimes you sample points you expect to be bad just to confirm that they are bad.
- What is my model?
- What is the exploration strategy?
Bayesian optimization is a sequential design strategy for global optimization of black-box functions that does not assume any functional forms. It is usually employed to optimize expensive-to-evaluate functions.
Goal: maximize f(x) subject to x ∈ X, where f is some unknown function
- Since we don’t observe derivatives, first-order and second-order optimization methods cannot be used
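To make the loop concrete, here is a minimal sketch of Bayesian optimization with a Gaussian process surrogate (RBF kernel) and an upper-confidence-bound acquisition rule. The objective `f`, the length scale, the UCB coefficient, and the budget are all made-up illustration choices, not anything from the paper or talk:

```python
import numpy as np

def rbf_kernel(a, b, length_scale=0.2):
    # Squared-exponential kernel between two sets of 1-D points.
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / length_scale) ** 2)

def gp_posterior(x_obs, y_obs, x_query, noise=1e-6):
    # Standard GP regression posterior mean and variance at the query points.
    K = rbf_kernel(x_obs, x_obs) + noise * np.eye(len(x_obs))
    K_s = rbf_kernel(x_obs, x_query)
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_obs))
    mean = K_s.T @ alpha
    v = np.linalg.solve(L, K_s)
    var = 1.0 - np.sum(v ** 2, axis=0)  # prior variance is 1 for this kernel
    return mean, np.maximum(var, 0.0)

def f(x):
    # Hypothetical expensive black-box objective; we only ever see its values.
    return np.sin(3 * x) + x

rng = np.random.default_rng(0)
grid = np.linspace(0.0, 2.0, 200)       # candidate search space
x_obs = rng.uniform(0.0, 2.0, size=3)   # a few initial random evaluations
y_obs = f(x_obs)

for _ in range(10):
    mean, var = gp_posterior(x_obs, y_obs, grid)
    ucb = mean + 2.0 * np.sqrt(var)     # upper confidence bound acquisition
    x_next = grid[np.argmax(ucb)]       # most promising point under the model
    x_obs = np.append(x_obs, x_next)
    y_obs = np.append(y_obs, f(x_next))

best_x = x_obs[np.argmax(y_obs)]
```

The two design questions above map directly onto the two functions: the GP posterior is "my model", and the UCB rule is "the exploration strategy" (the `+ 2.0 * sqrt(var)` term is what pushes it toward uncertain regions).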
BayesOpt is known for data-efficiency and is widely used in diverse applications such as:
- tuning hyperparameters of complex deep neural networks
- learning data-efficient reinforcement learning (RL) policies for robotic manipulation tasks
- tuning controller parameters in robotics
- optimal experiment design for designing functional tests in buildings
- recommender systems
- There are so many parallels with this being a bandit problem. They talk about Thompson Sampling.
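For the bandit parallel, a minimal Thompson Sampling sketch for a Bernoulli bandit with Beta posteriors. The arm probabilities and round count are made up for illustration; the point is that sampling from the posterior and acting greedily on the sample handles exploration and exploitation in one step:

```python
import numpy as np

rng = np.random.default_rng(0)
true_probs = [0.3, 0.5, 0.8]       # hypothetical (unknown to the agent) reward rates
n_arms = len(true_probs)
successes = np.ones(n_arms)        # Beta(1, 1) uniform prior per arm
failures = np.ones(n_arms)
pulls = np.zeros(n_arms, dtype=int)

for _ in range(2000):
    # Draw one plausible reward rate per arm from its posterior,
    # then play the arm whose draw is highest.
    theta = rng.beta(successes, failures)
    arm = int(np.argmax(theta))
    reward = rng.random() < true_probs[arm]
    successes[arm] += reward
    failures[arm] += 1 - reward
    pulls[arm] += 1
```

As an arm's posterior concentrates on a low rate, its samples rarely win the argmax, so the "confirm that bad points are bad" sampling naturally dies off over time.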