Thread history

From Talk:Darkcanuck/RRServer
Viewing a history listing
Jump to navigation Jump to search
Time User Activity Comment
15:55, 17 November 2011 Jdev (talk | contribs) New reply created (Reply to Feature Request: average APS diff in bots compare)
15:51, 17 November 2011 Darkcanuck (talk | contribs) New reply created (Reply to Feature Request: average APS diff in bots compare)
12:53, 17 November 2011 Jdev (talk | contribs) New reply created (Reply to Feature Request: average APS diff in bots compare)
12:46, 17 November 2011 GrubbmGait (talk | contribs) New reply created (Reply to Feature Request: average APS diff in bots compare)
12:32, 17 November 2011 Jdev (talk | contribs) New reply created (Reply to Feature Request: average APS diff in bots compare)
12:10, 17 November 2011 GrubbmGait (talk | contribs) New reply created (Reply to Feature Request: average APS diff in bots compare)
10:12, 17 November 2011 Jdev (talk | contribs) New thread created  

Feature Request: average APS diff in bots compare

I find that until all pairings is done it's very useful to know current avarage difference in APS between two versions - after about 100 random battles this number says fairly exactly is newer version better, than older.
Darkcanuck, can you schedule to add row for columns "% Score", "% Survival" in section "+/- Difference" in bots compare page with avarage value of corresponds columns? I think, there're work for 1-2 hours maximum

Jdev10:12, 17 November 2011

I think this is already covered by the 'Common % Score (APS)' and 'Common % Survival', the lowest two lines in the top-table. At least I use it to check if my changes have a positive (or negative) result when the pairings are not complete yet.

GrubbmGait12:10, 17 November 2011
 

No. May be i wrote not clear.
I mean, that i want to know average difference in pairings between 2 versions. According to my tests, this number stabilizes mach faster, than APS. And more, Common % Score does not make sense, because while there only 1 battle in every pairing it's exactly equals to APS and in another case, there may be 10 battles against Walls and 1 battle against Druss.

Jdev12:32, 17 November 2011
 

As far as I know, when your new version has for example 100 pairings, you will see the average APS for that 100 pairings. AND for your older version you will also see the APS for that 100 pairings. And you are right, this indicates much more reliable what your final score will be (relative to your older version) than plain APS. The one who can really answer this question is Darkcanuck.

GrubbmGait12:46, 17 November 2011
 

Wow, if things is like you say, it's really what i want, thanks:)

Jdev12:53, 17 November 2011
 

The common %score is calculated just like APS, but only for pairings that the old and new versions have in common. That makes it easier to compare two versions when the new one is still missing many pairings, or in the case where the old bot may have pairings against a lot of retired bots (and may be missing scores vs newer bots). I think that's what you're looking for...

Darkcanuck15:51, 17 November 2011
 

Yes, thank you:)

Jdev15:55, 17 November 2011