In this first practical, you are asked to put what you just learnt about bandits to good use.
The final performance of your agent will be evaluated on a random testbed of 2000 problems. The metric is the average reward.
You can assess your algorithm locally with the following command:
python main.py --niter 1000 --batch 2000
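To get a feel for what such an evaluation measures, here is a minimal, self-contained sketch of an epsilon-greedy agent run on a batch of random bandit problems. It does not use main.py or the competition's agent interface, which are not described here. The function name run_testbed, the 10 arms, the Gaussian rewards and the value of epsilon are all assumptions made for illustration, as is the reading of --niter as the number of steps and --batch as the number of problems.

```python
import numpy as np


def run_testbed(n_problems=2000, n_steps=1000, n_arms=10, epsilon=0.1, seed=0):
    """Average per-step reward of epsilon-greedy over a batch of random bandits.

    Hypothetical setup: each problem has n_arms arms whose true means are drawn
    from N(0, 1), and each pull returns the arm's mean plus N(0, 1) noise.
    """
    rng = np.random.default_rng(seed)
    true_means = rng.normal(0.0, 1.0, size=(n_problems, n_arms))
    estimates = np.zeros((n_problems, n_arms))  # sample-average value estimates
    counts = np.zeros((n_problems, n_arms))     # number of pulls per arm
    rows = np.arange(n_problems)
    total_reward = 0.0

    for _ in range(n_steps):
        # Pick the greedy arm, or a uniformly random arm with probability epsilon.
        greedy = estimates.argmax(axis=1)
        random_arm = rng.integers(n_arms, size=n_problems)
        explore = rng.random(n_problems) < epsilon
        arms = np.where(explore, random_arm, greedy)

        # Draw noisy rewards and update the running mean of the pulled arms.
        rewards = rng.normal(true_means[rows, arms], 1.0)
        counts[rows, arms] += 1
        estimates[rows, arms] += (rewards - estimates[rows, arms]) / counts[rows, arms]

        total_reward += rewards.mean()

    # Mean reward per step, averaged over all problems.
    return total_reward / n_steps


if __name__ == "__main__":
    print(f"average reward: {run_testbed():.3f}")
```

Averaging over many independent problems smooths out the luck of any single run, which is presumably why the evaluation uses a large testbed rather than one bandit.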
Submissions must be made before 2018-12-17 01:42:00+00:00. You may make up to 5 submissions per day and 10 in total.
Start: Dec. 10, 2018, midnight
Description: Development phase: create models and submit them, or directly submit results on validation and/or test data; feedback is provided on the validation set only.
Start: Jan. 6, 2019, 11 p.m.
Description: Final phase: submissions from the previous phase are automatically cloned and used to compute the final score. The results on the test set will be revealed when the organizers make them available.