Master AIC - RL courses - Bandits (2018 - 2019)

Organized by herilalaina


Dec. 10, 2018, midnight UTC


Jan. 6, 2019, 11 p.m. UTC


Master AIC - RL courses - Bandits

Brought to you by University of Paris-Saclay

In this first practical, you are asked to put what you just learnt about bandits to good use.

Master AIC - RL courses - Bandits: Evaluation

The final performance of your agent will be evaluated on a random testbed of 2000 instances. The metric is the average reward.

You can assess your algorithm locally with the following command:

python --niter 1000 --batch 2000
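To make the metric concrete, here is a minimal sketch of how an average reward over a random testbed could be computed locally. It is not the competition's scoring program. The `EpsilonGreedyAgent` class, its `act`/`update` interface, the 10 Gaussian arms, and the reading of `--niter` as steps per problem and `--batch` as number of random problems are all assumptions made for illustration.

```python
import numpy as np


class EpsilonGreedyAgent:
    """Hypothetical agent with an act/update interface (not the official API)."""

    def __init__(self, n_arms, epsilon=0.1, rng=None):
        self.n_arms = n_arms
        self.epsilon = epsilon
        self.rng = rng if rng is not None else np.random.default_rng()
        self.counts = np.zeros(n_arms)   # number of pulls per arm
        self.values = np.zeros(n_arms)   # running mean reward per arm

    def act(self):
        # Explore with probability epsilon, otherwise pull the best-looking arm.
        if self.rng.random() < self.epsilon:
            return int(self.rng.integers(self.n_arms))
        return int(np.argmax(self.values))

    def update(self, arm, reward):
        # Incremental update of the sample mean for the pulled arm.
        self.counts[arm] += 1
        self.values[arm] += (reward - self.values[arm]) / self.counts[arm]


def evaluate(n_iter=1000, batch=2000, n_arms=10, epsilon=0.1, seed=0):
    """Average reward over `batch` random bandit problems of `n_iter` steps each.

    Assumed testbed: arm means drawn from N(0, 1), rewards from N(mean, 1).
    """
    rng = np.random.default_rng(seed)
    total = 0.0
    for _ in range(batch):
        true_means = rng.normal(0.0, 1.0, size=n_arms)
        agent = EpsilonGreedyAgent(n_arms, epsilon, rng)
        for _ in range(n_iter):
            arm = agent.act()
            reward = rng.normal(true_means[arm], 1.0)
            agent.update(arm, reward)
            total += reward
    return total / (batch * n_iter)


if __name__ == "__main__":
    # Small sizes so the sketch runs quickly; the page mentions 1000 and 2000.
    print(evaluate(n_iter=200, batch=50))
```

Under these assumptions, an agent that pulls arms uniformly at random scores close to 0 on average, so any useful strategy should score clearly above that.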

Master AIC - RL courses - Bandits: Rules

Submissions must be made before 2018-12-17 01:42:00+00:00. You may make up to 5 submissions per day and 10 in total.


Start: Dec. 10, 2018, midnight

Description: Development phase: create models and submit them, or directly submit results on validation and/or test data; feedback is provided on the validation set only.


Start: Jan. 6, 2019, 11 p.m.

Description: Final phase: submissions from the previous phase are automatically cloned and used to compute the final score. The results on the test set will be revealed when the organizers make them available.

