Talk:Gaff/Targeting

From Robowiki
Revision as of 03:46, 17 July 2009 by Darkcanuck (talk | contribs) (tolerance)

This is really cool, thanks for writing it up. Makes me want to tinker. =) I'm curious, does the "anti-surfer" network alone do even better against surfers? --Voidious 19:24, 16 July 2009 (UTC)

It was long overdue. Yes, the AS network scored 82.93 on the surfer portion of the TC2K7 (15 seasons) but only got 80.45 against the others. I continue to tweak it every once in a while but have yet to break the 83 barrier... --Darkcanuck 20:09, 16 July 2009 (UTC)

Ooh... very interesting things here. The sentence that most intrigues me right now is "Waves that already give the correct solution are not retrained." I have a hunch that this could be rather important in how well Gaff hits surfers, and may be useful outside the world of NN guns too. --Rednaxela 00:05, 17 July 2009 (UTC)

A common problem with NNs is overfitting: you want as good an approximator as possible without losing the ability to generalize. By training waves only to within a certain tolerance, Gaff tries not to overfit. The tolerance margin is pretty small though: +/- 1/4 of the effective bot width at that distance. A while ago I did some tests and setting it fairly tight seemed to help. Also note that the in-tolerance waves are rechecked every training interval (if still in the buffer), so if the net starts to forget them they do get retrained.
I guess you're thinking of not updating a VCS buffer if it's already producing the right answer? It would be interesting to see how that worked, or if it just flattened the buffer peaks too much... --Darkcanuck 01:46, 17 July 2009 (UTC)
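
For anyone who wants to tinker with the tolerance idea, here is a rough sketch of the selection step in Java. It is not Gaff's actual code. The class `ToleranceTraining`, the `Wave` and `Predictor` types, and the method names are all made up for illustration, and it assumes that firing angles are stored as small offsets in radians and that "effective bot width" means the angle subtended by a 36-pixel-wide bot at the wave's distance.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: pick only the waves whose current prediction
// misses by more than 1/4 of the bot's angular width.
class ToleranceTraining {
    // Robocode bots occupy a 36x36 pixel bounding box.
    static final double BOT_WIDTH = 36.0;

    // Assumed wave record: distance to the target and the firing-angle
    // offset (radians) that would have hit.
    static final class Wave {
        final double distance;
        final double actualAngle;

        Wave(double distance, double actualAngle) {
            this.distance = distance;
            this.actualAngle = actualAngle;
        }
    }

    // Stand-in for the network: maps a wave to a predicted angle offset.
    interface Predictor {
        double predict(Wave wave);
    }

    // Angle (radians) subtended by the bot at the given distance.
    static double angularBotWidth(double distance) {
        return 2.0 * Math.atan((BOT_WIDTH / 2.0) / distance);
    }

    // True when the prediction is within +/- 1/4 bot width of the answer.
    static boolean withinTolerance(double predicted, double actual, double distance) {
        return Math.abs(predicted - actual) <= 0.25 * angularBotWidth(distance);
    }

    // Re-evaluates every buffered wave and returns those that still need training.
    static List<Wave> selectForTraining(List<Wave> buffer, Predictor net) {
        List<Wave> needsTraining = new ArrayList<>();
        for (Wave wave : buffer) {
            if (!withinTolerance(net.predict(wave), wave.actualAngle, wave.distance)) {
                needsTraining.add(wave);
            }
        }
        return needsTraining;
    }
}
```

Calling `selectForTraining` once per training interval on the whole buffer gives the recheck behaviour described above: a wave that was inside the tolerance last time is picked up again as soon as the net drifts away from it.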
