Oliver Richter: Catalogue data in Spring Semester 2020

Name: Dr. Oliver Richter
Department: Information Technology and Electrical Engineering

227-0559-00L Seminar in Deep Reinforcement Learning
Restricted registration: number of participants limited to 25.
2 credits, 2S, R. Wattenhofer, O. Richter
Abstract: In this seminar, participating students present and discuss recent research papers in the area of deep reinforcement learning. The seminar starts with two introductory lessons covering the basic concepts. Alongside the seminar, a programming challenge is posed; students can take part in it to improve their grade.
Learning objective: Since Google DeepMind presented the Deep Q-Network (DQN) algorithm in 2015, which could play Atari 2600 games at a superhuman level, the field of deep reinforcement learning has gained a lot of traction. It attracted media attention with AlphaGo and AlphaZero and is now one of the most prominent research areas. Yet many research papers in the area come from one of two sources: Google DeepMind or OpenAI. In this seminar we aim to give students an in-depth view of current advances in the area by discussing recent papers as well as the current issues and difficulties surrounding deep reinforcement learning.
Content: Two introductory lessons cover Q-learning and policy gradient methods. Afterwards, participating students present recent papers. For details see: www.disco.ethz.ch/courses.html
Lecture notes: Slides of the presentations will be made available.
Literature: OpenAI course (https://spinningup.openai.com/en/latest/) plus selected papers.
The paper selection can be found at www.disco.ethz.ch/courses.html.
Prerequisites / Notice: Students are expected to have prior knowledge of and interest in machine learning and deep learning, for instance from having attended appropriate courses.