Reinforcement Learning
Prof. Dr. rer. nat. Marius Lindauer
Übungsbetreuung:
Background
In recent years, Reinforcement Learning (RL) has produced some of
the most impressive results in the realm of Machine Learning (ML), especially in game playing
(as with the game of Go) and robotics (e.g. RoboCup or autonomously navigating robots).
Its view of the ML model as an agent acting within an environment, allows for
learning by trial and error and therefore reasoning beyond human expert
knowledge.
RL is a quickly evolving field with new algorithms and applications being
developed constantly.
Therefore this course will begin with teaching the mathematical foundations of RL
and give an overview over the field's development up until today.
At the end of the lecture you will be able to understand the current state of RL
research as well as reason about the theoretical foundations of different RL
approaches.
Via the accompanying exercises you will be introduced to implementing several RL
algorithms as well as the general RL pipeline, including learning environments,
agent evaluation and hyperparameter settings.
At the end of the semester you will apply your new skills to an interesting RL project of your choice.
Requirements
We strongly recommend that you know the foundations of
- AI
- Machine Learning
- Deep Learning
in order to attend the course. You should have attended at least one other course for ML and DL in the past.
Topics
Lecture topics include:
- Markov-Decision Processes
- Value-function Approximation
- Policy Search
- Model-based RL
- Deep RL
Literature
Reinforcement Learning by Richard S. Sutton and Andrew G. Barto
Dynamics
This course as well as the exercises will be in English only.
The course will be held online with live sessions for both lecture and exercises. There will be opportunities for questions and discussion during the lecture, additionally there will be an online forum and the exercise sessions with extra examples for the current topics. Each week there will be a programming task to be completed at home in groups of up to three students. These tasks will also be discussed during the exercises. At the end of the semester, you will be free to choose a task to tackle using RL.