Electrical Engineering Systems Seminar

Thursday, March 11, 2021

12:00pm to 1:00pm

Online Event

Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent State

Benjamin Van Roy, Professor, Electrical Engineer and Management Science & Engineering, Stanford University,

I will describe a reinforcement learning agent that, with specification only of agent state dynamics and a reward function, can operate with some degree of competence in any environment. The agent applies an optimistic version of Q-learning to update value predictions that are based on the agent's actions and aleatoric states. We establish a regret bound demonstrating convergence to near-optimal per-period performance, where the time required is polynomial in the number of actions and aleatoric states, as well as the reward mixing time of the best policy among those for which actions depend on history only through aleatoric state. Notably, there is no further dependence on the number of environment states or mixing times associated with other policies or statistics of history.

For more information, please contact Caroline Murphy by email at [email protected].

Event Series

Electrical Engineering Systems Seminar Series

Event Sponsors

Computing and Mathematical Sciences (CMS) More Events from this Sponsor

Electrical Engineering More Events from this Sponsor