In some Atari games, I observe that well-trained models sometimes get very large negative scores, such as -969000.
So far, I've seen this issue for BattleZoneNoFrameskip-v4 and UpNDownNoFrameskip-v4.
For example, in UpNDownNoFrameskip-v4, if I evaluate my model on five test episodes, I get the following scores: 310760.0, 26270.0, -919890.0, 364960.0, 156270.0. The score of -919890.0 looks buggy to me.
Is such a score possible at all? Or is this some bug?
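To help narrow this down, here is a minimal sketch of how the per-step rewards of one episode could be checked for a single large negative step, as opposed to many small penalties. It assumes the raw (unclipped) rewards of the episode have been collected into a list. The function name `find_suspicious_steps`, the threshold value, and the reward values are made up for illustration and are not actual game output.

```python
from typing import List, Tuple


def find_suspicious_steps(
    rewards: List[float], threshold: float = -10_000.0
) -> List[Tuple[int, float]]:
    """Return (step_index, reward) for every step whose reward is below `threshold`.

    `threshold` is a hypothetical cutoff; pick one that is far below any
    penalty the game could plausibly give in a single step.
    """
    return [(i, r) for i, r in enumerate(rewards) if r < threshold]


# Made-up per-step rewards for illustration only (not real game output).
example_rewards = [100.0, 200.0, -999_990.0, 50.0]

episode_return = sum(example_rewards)
suspicious = find_suspicious_steps(example_rewards)

print("episode return:", episode_return)
print("suspicious steps:", suspicious)
```

If a negative episode turns out to contain exactly one such step, the step index would show where in the episode to look more closely.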
Thank you.