Study-Unit Description

CODE

ARI3212

TITLE

Advanced Reinforcement Learning

UM LEVEL

03 - Years 2, 3, 4 in Modular Undergraduate Course

MQF LEVEL

ECTS CREDITS

DEPARTMENT

Artificial Intelligence

DESCRIPTION

Reinforcement Learning is one of the major AI paradigms which in the last years lead to major milestones in the advancement of AI. From the ability to beat the world champion in the game of Go, or winning professional players in Dota, the immense potential of Reinforcement Learning is also being applied to various industry applications such as manufacturing plant control, self driving cars, algorithmic trading and portfolio management, route management and many more.

The study-unit brings together advanced techniques from Reinforcement Learning in conjunction with Deep Neural Networks and Evolutionary Algorithms, the latter two being core components of Advanced Reinforcement Learning models. Topics covered in this unit include:
- Mathematical framework using Markov Decision Processes
- Model-Free Prediction and Control
- Value Function Approximation
- Neural Networks and Deep Neural Networks
- Deep Reinforcement Learning
- Policy Gradient Methods
- Evolutionary Algorithms
- Evolutionary Reinforcement Learning.

Study-unit Aims:

- Teach the core mathematical and algorithmic components that students would require to understand the latest advancements in Reinforcement Learning;
- Cover deep and evolutionary learning techniques as applied in a Reinforcement Learning framework;
- Demonstrate how complex problems can be framed as a Reinforcement Learning problem.

Learning Outcomes:

1. Knowledge & Understanding
By the end of the study-unit the student will be able to:

- Define a mathematical and algorithmic framework to control and predict sequential decision processes;
- Frame complex problems in the context of a Reinforcement Learning technique;
- Identify the strengths and weaknesses, and applicability, of the different techniques.

2. Skills
By the end of the study-unit the student will be able to:

- Determine how to address complex problems that are sequential in nature by automating sequential decisions with the aim of maximizing long term rewards;
- Implement a solution using a Reinforcement Learning approach.

Main Text/s and any supplementary readings:

Main text:
- Richard S Sutton, Andrew G Barto “Reinforcement Learning: An Introduction”, A Bradford Book, 5 Hayward Street, Cambridge, MA, United States, 2018.

Supplementary readings:
- Miguel Morales “Grokking Deep Reinforcement Learning”, Manning Publications, 2020.
- Maxim Lapan “Deep Reinforcement Learning Hands-On”, Pact Publishing, 2nd Edition, 2020.

ADDITIONAL NOTES

Pre-Requisites: Reinforcement Learning (being proposed at Level 2)

Pre-Requisite Study-Unit: ARI2204

STUDY-UNIT TYPE

Lecture, Independent Study and Project

METHOD OF ASSESSMENT

Assessment Component/s	Assessment Due	Sept. Asst Session	Weighting
Project	SEM1	Yes	100%

LECTURER/S

Vincent Vella

The University makes every effort to ensure that the published Courses Plans, Programmes of Study and Study-Unit information are complete and up-to-date at the time of publication. The University reserves the right to make changes in case errors are detected after publication.
The availability of optional units may be subject to timetabling constraints.
Units not attracting a sufficient number of registrations may be withdrawn without notice.
It should be noted that all the information in the description above applies to study-units available during the academic year 2024/5. It may be subject to change in subsequent years.

Study-Unit Description

Study-Unit Description

Study at UM