Control Meets Learning Seminar
Work in controls and AI tends to focus on how to optimize a specified cost function, but costs that lead to the desired behavior consistently are not so easy to specify. Rather than optimizing specified cost, which is already hard, robots have the much harder job of optimizing intended cost. While the specified cost does not have as much information as we make our robots pretend, the good news is that humans constantly leak information about what the robot should optimize. In this talk, we will explore how to read the right amount of information from different types of human behavior -- and even the lack thereof.