Very high negative scores in some of Atari environments #2233

Closed
kazuki-irie opened this issue May 23, 2021 · 2 comments

@kazuki-irie

Hello,

In some Atari games, I observe that well-trained models sometimes achieve very large negative scores, such as -969000.

So far, I've seen this issue for BattleZoneNoFrameskip-v4 and UpNDownNoFrameskip-v4.

For example, in UpNDownNoFrameskip-v4, when I evaluate my model on five test episodes, I get the following scores:
310760.0, 26270.0, -919890.0, 364960.0, 156270.0
The third score, -919890.0, looks buggy to me.

Is such a score possible at all, or is this a bug?

Thank you.

@jkterry1
Collaborator

@JesseFarebro

@JesseFarebro
Contributor

It would be really helpful if you could post the list of actions and the random seed that reproduce this behaviour. This is most likely an issue with score wrapping. If you can provide more information to help debug this, feel free to post it in this thread: Farama-Foundation/Arcade-Learning-Environment#262 or create a new issue at https://github.com/mgbellemare/Arcade-Learning-Environment
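
For anyone who wants to collect that information, here is a minimal sketch of one way to log the seed and the action sequence of an evaluation episode. It assumes the classic Gym API (`env.seed(seed)`, `env.reset()` returning an observation, and `env.step(action)` returning `(obs, reward, done, info)`); newer Gym/Gymnasium releases use different signatures. The names `record_episode`, `save_trace`, and the `suspicious_reward` threshold are hypothetical and chosen only for illustration.

```python
import json


def record_episode(env, policy, seed, suspicious_reward=-100000.0):
    """Run one episode and log what is needed to replay it.

    Assumption: classic Gym API, i.e. env.seed(seed), env.reset() -> obs,
    and env.step(action) -> (obs, reward, done, info).
    """
    env.seed(seed)
    obs = env.reset()
    actions, suspicious_steps, total = [], [], 0.0
    done = False
    while not done:
        action = int(policy(obs))
        obs, reward, done, _info = env.step(action)
        if reward <= suspicious_reward:
            # Remember the index of the action that produced the large
            # negative reward (arbitrary threshold, adjust as needed).
            suspicious_steps.append(
                {"step": len(actions), "reward": float(reward)}
            )
        actions.append(action)
        total += float(reward)
    return {
        "seed": seed,
        "actions": actions,
        "episode_return": total,
        "suspicious_steps": suspicious_steps,
    }


def save_trace(trace, path):
    """Write the trace as JSON so it can be attached to a bug report."""
    with open(path, "w") as f:
        json.dump(trace, f)
```

Here `policy` is any callable that maps an observation to an integer action, for example a wrapper around the trained model. If the environment is built with additional wrappers (frame stacking, episodic-life resets, and so on), the same wrappers would need to be listed alongside the trace so the episode can be replayed.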
