RESEARCH AREA
Learning through interaction with environments to maximize cumulative reward.
Demis Hassabis
Google DeepMind