Difference between revisions of "Talk:Gaff/Targeting"
Jump to navigation
Jump to search
(A hunch) |
Darkcanuck (talk | contribs) (tolerance) |
||
Line 4: | Line 4: | ||
Ooh.. very interesting things here... The sentance that most intrigues me right now is ''"Waves that already give the correct solution are not retrained."''. I have a hunch that this could be rather important in how well Gaff hits surfers, and may be useful outside of the world of NN guns too --[[User:Rednaxela|Rednaxela]] 00:05, 17 July 2009 (UTC) | Ooh.. very interesting things here... The sentance that most intrigues me right now is ''"Waves that already give the correct solution are not retrained."''. I have a hunch that this could be rather important in how well Gaff hits surfers, and may be useful outside of the world of NN guns too --[[User:Rednaxela|Rednaxela]] 00:05, 17 July 2009 (UTC) | ||
+ | |||
+ | : A common problem with NN is overfitting -- you want to get as good an approximator as possible without losing the ability to generalize. By training waves only to within a certain tolerance, Gaff tries not to overfit. The tolerance margin is pretty small though: +/- 1/4 effective bot width at that distance. A while ago I did some tests and it seemed to help having it set fairly tight. Also note that the in-tolerance waves are rechecked every training interval (if still in the buffer), so if the net starts to forget them they do get re-trained. | ||
+ | : I guess you're thinking of not updating a VCS buffer if it's already producing the right answer? It would be interesting to see how that worked, or if it just flattened the buffer peaks too much... --[[User:Darkcanuck|Darkcanuck]] 01:46, 17 July 2009 (UTC) |
Revision as of 02:46, 17 July 2009
This is really cool, thanks for writing it up. Makes me want to tinker. =) I'm curious, does the "anti-surfer" network alone do even better against surfers? --Voidious 19:24, 16 July 2009 (UTC)
- It was long overdue. Yes, the AS network scored 82.93 on the surfer portion of the TC2K7 (15 seasons) but only got 80.45 against the others. I continue to tweak it every once in a while but have yet to break the 83 barrier... --Darkcanuck 20:09, 16 July 2009 (UTC)
Ooh.. very interesting things here... The sentance that most intrigues me right now is "Waves that already give the correct solution are not retrained.". I have a hunch that this could be rather important in how well Gaff hits surfers, and may be useful outside of the world of NN guns too --Rednaxela 00:05, 17 July 2009 (UTC)
- A common problem with NN is overfitting -- you want to get as good an approximator as possible without losing the ability to generalize. By training waves only to within a certain tolerance, Gaff tries not to overfit. The tolerance margin is pretty small though: +/- 1/4 effective bot width at that distance. A while ago I did some tests and it seemed to help having it set fairly tight. Also note that the in-tolerance waves are rechecked every training interval (if still in the buffer), so if the net starts to forget them they do get re-trained.
- I guess you're thinking of not updating a VCS buffer if it's already producing the right answer? It would be interesting to see how that worked, or if it just flattened the buffer peaks too much... --Darkcanuck 01:46, 17 July 2009 (UTC)