You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The state of the environment after the action is applied
float
The reward achieved by the previous action
done
True if the environment has entered a terminal state and should be reset
info
Diagnostic information useful for debugging
In the abstract Environment class it says that the step() methods should return the state, the reward, an indicator whether the episode ends and some information.
However, in the GymEnvironment only the reward and the state are returned:
autonomous-learning-library/all/environments/abstract.py
Lines 42 to 51 in 68d355a
In the abstract Environment class it says that the step() methods should return the state, the reward, an indicator whether the episode ends and some information.
However, in the GymEnvironment only the reward and the state are returned:
autonomous-learning-library/all/environments/gym.py
Line 44 in 68d355a
Which usage should be the correct one?
The text was updated successfully, but these errors were encountered: