The actions shape can be (0,) i.e. empty and there will be a ValueError when
I see this error most of the time, and am not sure what to do
World Perf: Episode 106.000000. Reward 15.333333. action: 0.000000. mean reward 24.393341.
While below will give you your error:
I wold reshape actions to (-1,1) or initialize it as np.empty(0).reshape(0,1) and append using np.vstack
Thank you very much for making these tutorials! They are awesome!
However there seems to be a number of incompatibilities/bugs in this notebook. I had to make the following modifications to get the notebook running on Tensorflow 1.0.0:
And everything executed as expected :)