History and State
Definition
The history is the sequence of observations, actions, rewards: $H_t = O_1, R_1, A_1, \ldots, A_{t-1}, O_t, R_t$
- i.e. all observable variables up to time $t$
- i.e. the sensorimotor stream of a robot or embodied agent
State is the information used to determine what happens next. Formally, state is a function of the history: $S_t = f(H_t)$
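As a minimal sketch (the `Step` type and function names below are illustrative, not from any particular library), the history can be stored as a sequence of steps and the state computed as some function of it:

```python
from dataclasses import dataclass
from typing import Callable, List

# A minimal sketch of the history H_t as a sequence of
# (observation, reward, action) steps, and the state as a function f of it.
@dataclass
class Step:
    observation: float  # O_t
    reward: float       # R_t
    action: int         # A_t

History = List[Step]
StateFn = Callable[[History], object]

def f(history: History) -> object:
    """The simplest choice: the full history is itself a valid state."""
    return history

history: History = [Step(observation=0.1, reward=0.0, action=1),
                    Step(observation=0.4, reward=1.0, action=0)]
state = f(history)  # S_t = f(H_t)
```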
Information State / Markov State
An information state (a.k.a. Markov state) contains all useful information from the history. A state $S_t$ is Markov if and only if $\mathbb{P}[S_{t+1} \mid S_t] = \mathbb{P}[S_{t+1} \mid S_1, \ldots, S_t]$
- “The future is independent of the past given the present”
- Once the state is known, the history may be thrown away
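A minimal sketch of this property on a small Markov chain (the transition matrix below is made up for illustration): two trajectories with different pasts but the same current state share the same next-state distribution.

```python
import numpy as np

# Illustrative 3-state chain: P[s] is the distribution of S_{t+1} given S_t = s.
P = np.array([[0.9, 0.1, 0.0],
              [0.2, 0.5, 0.3],
              [0.0, 0.4, 0.6]])

# Two trajectories with different pasts that both end in state 1:
traj_a = [0, 0, 1]
traj_b = [2, 2, 2, 1]

# Given the present (state 1), the future has the same distribution in both
# cases -- the extra past in traj_b adds nothing:
# P[S_{t+1} | S_t] = P[S_{t+1} | S_1, ..., S_t].
assert np.allclose(P[traj_a[-1]], P[traj_b[-1]])
```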
Environment State
The environment state $S_t^e$ is the environment’s private representation
- i.e. whatever data the environment uses to pick the next observation/reward
- The environment state is not usually visible to the agent
- Even if $S_t^e$ is visible, it may contain irrelevant information
Agent State
The agent state $S_t^a$ is the agent’s internal representation
- i.e. whatever information the agent uses to pick the next action
- i.e. it is the information used by reinforcement learning algorithms
It can be any function of history: $S_t^a = f(H_t)$ (see the sketch below)
What we think happens next really depends on our representation of state. Our job is to build a state representation that is useful.
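A sketch of a few possible agent-state functions over the same history (illustrative functions; the history is reduced to its observations for brevity):

```python
from typing import List

# A few possible agent states S_t^a = f(H_t) over the same observation
# history; any of them is a legal choice of f.
history: List[float] = [0.1, 0.4, 0.3, 0.9]  # observations O_1..O_t

def full_history(h: List[float]) -> List[float]:
    """Use the entire history as the state (always valid, but grows with t)."""
    return list(h)

def last_observation(h: List[float]) -> float:
    """Use only the most recent observation."""
    return h[-1]

def last_k(h: List[float], k: int = 2) -> List[float]:
    """Use a sliding window of the last k observations."""
    return h[-k:]

states = full_history(history), last_observation(history), last_k(history)
```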
Fully Observable Environments
This is the best case.
Full observability: the agent directly observes the environment state, $O_t = S_t^a = S_t^e$
- Agent state = environment state = information state
- Formally, this is a Markov Decision Process
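A minimal sketch of full observability (an illustrative toy environment, not a real library API): the observation returned to the agent is exactly the environment's internal state.

```python
# In a fully observable environment, nothing is hidden:
# the observation handed to the agent is the environment state itself.
class FullyObservableChain:
    def __init__(self):
        self.state = 0  # S_t^e: the environment's private state

    def step(self, action: int):
        self.state = max(0, min(4, self.state + (1 if action == 1 else -1)))
        reward = 1.0 if self.state == 4 else 0.0
        observation = self.state  # O_t = S_t^e
        return observation, reward

env = FullyObservableChain()
obs, reward = env.step(1)
agent_state = obs  # the agent can simply use the observation as its state
```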
Partially Observable Environments
These are more difficult scenarios. Partial observability: the agent indirectly observes the environment:
- A robot with camera vision isn’t told its absolute location
- A trading agent only observes current prices
- A poker playing agent only observes public cards
In this case, agent state $\neq$ environment state, i.e. $S_t^a \neq S_t^e$. Formally this is a partially observable Markov decision process (POMDP), and the agent must construct its own state representation $S_t^a$, e.g. from the complete history, a belief over environment states, or a recurrent network.
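One common way to build the agent state in this setting is a belief, a probability distribution over the hidden environment state, updated with Bayes' rule. A minimal sketch with made-up numbers:

```python
import numpy as np

# Illustrative 2-state hidden chain with noisy observations.
P = np.array([[0.8, 0.2],    # hidden-state transition probabilities
              [0.3, 0.7]])
O = np.array([[0.9, 0.1],    # P(observation | hidden state)
              [0.2, 0.8]])

def update_belief(belief, observation):
    """One filtering step: predict with P, then correct with the observation."""
    predicted = belief @ P
    corrected = predicted * O[:, observation]
    return corrected / corrected.sum()

belief = np.array([0.5, 0.5])      # initial agent state S_0^a
for obs in [0, 0, 1]:              # a stream of noisy observations
    belief = update_belief(belief, obs)
# `belief` is the agent state S_t^a: a function of the whole history.
```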