Should env.step() return done and info? #149

mctigger · 2020-06-05T17:52:06Z

autonomous-learning-library/all/environments/abstract.py

Lines 42 to 51 in 68d355a

    
                   Returns 
        
                   ------- 
        
                   State 
        
                       The state of the environment after the action is applied 
        
                   float 
        
                       The reward achieved by the previous action 
        
                   done 
        
                       True if the environment has entered a terminal state and should be reset 
        
                   info 
        
                       Diagnostic information useful for debugging

In the abstract Environment class it says that the step() methods should return the state, the reward, an indicator whether the episode ends and some information.

However, in the GymEnvironment only the reward and the state are returned:

autonomous-learning-library/all/environments/gym.py

Line 44 in 68d355a

return self._state, self._reward

Which usage should be the correct one?

cpnota · 2020-06-07T18:48:24Z

Ah, looks like the documentation is out of date. The State objective includes done and info now.

cpnota added the documentation needed There is something missing from the documentation label Jun 7, 2020

cpnota mentioned this issue Jun 7, 2020

documentation/policy #151

Merged

cpnota closed this as completed Jun 7, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Should env.step() return done and info? #149

Should env.step() return done and info? #149

mctigger commented Jun 5, 2020 •

edited

Loading

cpnota commented Jun 7, 2020

Should env.step() return done and info? #149

Should env.step() return done and info? #149

Comments

mctigger commented Jun 5, 2020 • edited Loading

cpnota commented Jun 7, 2020

mctigger commented Jun 5, 2020 •

edited

Loading