Master AIC - RL courses - Function approximation

Organized by herilalaina

First phase

Jan. 7, 2019, midnight UTC


Competition Ends
Jan. 18, 2019, 11 p.m. UTC

Brought to you by University of Paris-Saclay

In this third practical, you are asked to put what you just learnt
about function approximation to good use. You are provided with the `` file. Use `python -h` to check how you are supposed to use this file.

In this project, you are asked to solve the classic Pendulum problem.
Unlike the previous environments, the state and action spaces are both continuous, so you need to approximate
the Q-values Q(s, a). For more details about the action and observation spaces, please refer to the OpenAI
documentation.
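As an illustration of what approximating Q(s, a) can look like, here is a minimal sketch of semi-gradient Q-learning with a linear approximator. It is not the provided file or the expected solution. It assumes the observation is the triple (cos(theta), sin(theta), theta_dot) with angular velocity bounded by 8, and that the continuous torque is replaced by a few discrete levels in [-2, 2]. The names `ACTIONS`, `features` and `LinearQ` are hypothetical.

```python
import random

# Assumption: a coarse discretization of the torque range [-2, 2].
ACTIONS = [-2.0, -1.0, 0.0, 1.0, 2.0]


def features(obs):
    """Map an observation (cos(theta), sin(theta), theta_dot) to a feature vector."""
    c, s, w = obs
    w = w / 8.0  # assumption: angular velocity lies in [-8, 8]
    # Bias, raw inputs, and second-order interaction terms.
    return [1.0, c, s, w, c * s, c * w, s * w, w * w]


class LinearQ:
    """Q(s, a) = w_a . phi(s), with one weight vector per discrete action."""

    def __init__(self, n_features=8, actions=ACTIONS, alpha=0.05, gamma=0.99):
        self.actions = actions
        self.alpha = alpha  # learning rate
        self.gamma = gamma  # discount factor
        self.w = [[0.0] * n_features for _ in actions]

    def q(self, obs, a_idx):
        phi = features(obs)
        return sum(wi * xi for wi, xi in zip(self.w[a_idx], phi))

    def greedy(self, obs):
        # Index of the action with the highest estimated value.
        return max(range(len(self.actions)), key=lambda i: self.q(obs, i))

    def act(self, obs, epsilon=0.1):
        # Epsilon-greedy exploration over the discrete torque levels.
        if random.random() < epsilon:
            return random.randrange(len(self.actions))
        return self.greedy(obs)

    def update(self, obs, a_idx, reward, next_obs, done):
        # Bootstrapped target: r + gamma * max_a' Q(s', a') unless terminal.
        target = reward
        if not done:
            target += self.gamma * max(
                self.q(next_obs, i) for i in range(len(self.actions))
            )
        td_error = target - self.q(obs, a_idx)
        # For a linear model, the gradient of Q w.r.t. w_a is phi(s).
        for j, x in enumerate(features(obs)):
            self.w[a_idx][j] += self.alpha * td_error * x
        return td_error
```

In a training loop, the torque sent to the environment would be `ACTIONS[agent.act(obs)]`, and `update` would be called once per transition. Discretizing the action is only one way to handle a continuous action space; richer features (tile coding, radial basis functions) or a neural network could replace `features`.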

Master AIC - RL courses - Bandits: Evaluation

To submit, zip the `` and `metadata` files, then upload the zipped file to Codalab.
See `` for an example submission.
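If you prefer to build the archive from Python, the sketch below shows one way to do it with the standard `zipfile` module. The function name `make_submission` and the default archive name `submission.zip` are hypothetical, and the files to include are passed as arguments. It assumes the files are expected at the root of the archive, with no enclosing folder.

```python
import os
import zipfile


def make_submission(file_paths, zip_path="submission.zip"):
    """Bundle the given files at the root of a zip archive."""
    with zipfile.ZipFile(zip_path, "w", zipfile.ZIP_DEFLATED) as archive:
        for path in file_paths:
            # arcname drops the directory part so each file sits at the root.
            archive.write(path, arcname=os.path.basename(path))
    return zip_path
```

Call it with the path to your agent file and the path to the `metadata` file, then upload the resulting archive.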

To reproduce your local score on Codalab, use the Docker image and do not change the seed.
Then run `python --ngames 1000 --niter 100 --batch 10`.

For further questions, please use the Codalab forum.


Master AIC - RL courses - Bandits: Rules

Submissions must be made before 2018-12-17 01:42:00+00:00. You may make 20 submissions per day and 10 in total.


Start: Jan. 7, 2019, midnight

Description: Development phase: create models and submit them, or directly submit results on validation and/or test data; feedback is provided on the validation set only.


Start: Jan. 18, 2019, 11 p.m.

Description: Final phase: submissions from the previous phase are automatically cloned and used to compute the final score. The results on the test set will be revealed when the organizers make them available.

