IST Lunch Bunch

Tuesday, February 4, 2020

12:00pm to 1:00pm

Annenberg 105

Online Optimization with Preference Feedback, Structured Decision Spaces, and Safety Constraints

Yanan Sui, Assistant Professor, Tsinghua University,

Bandit algorithms tackle the fundamental challenge of balancing exploration (collecting data for learning better models) and exploitation (using the estimates to make decisions). In this talk, I will formalize bandit problems with preference feedback, with structured decision spaces, and with safety constraints (when bad samples are not allowed). These constraints commonly exist in many applications. In particular, we are motivated by online decision-making for clinical treatment and robotic control. This talk will exhibit several algorithms for these constrained optimization problems. Theoretical guarantees and empirical efficiencies of our algorithms will be presented. I will also show our clinical practices of online decision-making for neuromodulation.

For more information, please contact Diane by email at [email protected].

Event Series

IST Lunch Bunch

Event Sponsors

Computing and Mathematical Sciences (CMS) More Events from this Sponsor