-
Notifications
You must be signed in to change notification settings - Fork 945
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RNaD: Possible Error in calculation of Neurd Loss #1156
Comments
@perolat can you take a look? |
Hi @spktrm , I spoke to Julien. He said you're correct about the first one, can you submit a PR? The second one could go either way: it's just a matter of knowing what works. It is not clear whether one works better than the other and it might end up being similar behavior but require different hyper-parameters. Maybe you can try it and let us know? |
I have submitted a PR regarding the first point here: #1157, thank you for the opportunity to contribute :). With regards to the second point, I will experiment further with the fix I am suggesting and let you know how it goes. Meanwhile, is it possible to provide clarity on these other issues? Namely: |
Hi @spktrm, Yeah I will make Julien aware of those (sorry, I thought they were resolved already). I think it may be useful to also try contacting him directly by email, though... because I'm mostly just relaying messages from here to him and back :) |
Thank you. What is his best email? |
Still the same one from the Mastering Stratego paper. |
Fixed by #1157, which has now been merged into master. |
In this line of the RNaD algorithm
open_spiel/open_spiel/python/algorithms/rnad/rnad.py
Line 574 in 7c58b6c
Should the line instead be this? This is so we only subtract the mean calculated from the valid logits.
As a result, should the line below be an average over actions rather than a sum?
open_spiel/open_spiel/python/algorithms/rnad/rnad.py
Line 579 in 7c58b6c
i.e.
This is particularly relevant in games where there is frequently a number of invalid actions.
The text was updated successfully, but these errors were encountered: