Have you ever tried "Pendulum-V0"? #3

Open
yanpanlau opened this issue Sep 4, 2016 · 1 comment

Comments

@yanpanlau

Thanks for the nice code. I am trying to reproduce the result on "Pendulum-v0" using a3c_cont.py, but the model fails to converge. I have tried various methods like experience replay, but it still doesn't work. It would be nice if you could test it out so we can discuss it together. Cheers.

@originholic
Owner

Hi @yanpanlau, thanks for trying out the code. Unfortunately, I didn't actually test it with gym's Pendulum-v0, since this repo is quite experimental and mainly meant for trying out my "batch" method.

If you are interested in getting continuous actions to work, it is better to use other frameworks like miyosuda/async_deep_reinforce or coreylynch/async-rl, and just change the policy loss function and the models as described in DeepMind's async (A3C) paper. Many thanks.
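
Roughly, for continuous actions the policy head outputs the mean and variance of a Gaussian over actions, and the per-step loss becomes the negative log-density of the sampled action weighted by the advantage, minus an entropy bonus. Below is a minimal numpy sketch of that loss, assuming a diagonal Gaussian policy as in the A3C paper; the function name, shapes, and the beta value are illustrative and not code from this repo.

```python
import numpy as np

def gaussian_policy_loss(mu, sigma_sq, action, advantage, beta=1e-4):
    """Negative of the continuous-action policy objective for one step.

    mu, sigma_sq, action: arrays of shape (action_dim,) for a diagonal
    Gaussian policy N(mu, sigma_sq); advantage: scalar estimate of A(s, a);
    beta: weight on the entropy bonus that encourages exploration.
    """
    # Log-density of the sampled action under N(mu, sigma_sq).
    log_prob = -0.5 * np.sum(
        np.log(2.0 * np.pi * sigma_sq) + (action - mu) ** 2 / sigma_sq
    )
    # Differential entropy of the diagonal Gaussian.
    entropy = 0.5 * np.sum(np.log(2.0 * np.pi * np.e * sigma_sq))
    # Gradient ascent on (log_prob * advantage + beta * entropy)
    # == gradient descent on its negative.
    return -(log_prob * advantage + beta * entropy)
```

In a framework like TensorFlow or PyTorch, the same expression would be written over the network's mu/sigma outputs so gradients flow back into the policy parameters, while the value head keeps the usual squared-error loss.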
