Difference between revisions of "Archived talk:Locke 20040719"

Latest revision as of 00:47, 29 September 2017

Locke Sub-pages:: Locke - Version History - Archived Talk 20040719

This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page.

I may have a bug in the projection from 2D to 1D. Found this after chatting with PEZ and ABC about GuessFactors. Am chasing it right now, hoping it will cause Locke to jump in the ranking.

No, the projection works fine. So no potential points to be gained here. It looks like a bug because of the weighting system. Observations with different ETA's than the current situation could drag the aiming a little off. I tried to solve this in 0.5.1 by increasing the weight of the ETA dimension, meaning the segment borders will depend more on ETA accuracy. --Vic
Well, that does not work. Lost 8 rating points...

Actually I found another bug after that chat: my Waves were not properly aligned. I failed to take into account that bullets may have different bulletpowers. Thanks to jim for providing the answer to that one. Strangely, this fix seems not to improve the rating at all!

In fact I lost 7 rating points with this bug fix! And I used RobocodeGLV014 to verify that the waves are now perfectly aligned! This blows my mind.... --Vic

It seems I was still using an older version of Raiko's movement. As I am comparing the two guns through the RoboRumble, I should have the same movement. Also, I found out that in all my previous versions I had broken Axe's MusashiTrick. That has been repaired now. I expect a big jump in rating points! --Vic

YEAH! Locke jumped 50 rating points to 1870, entering the top 20 in the RoboRumble! That means that there is a difference of 60 points with Raiko, who shares the same movement. So, if I can gain 60 points with improvements on my SOOL gun, this gun would be Raiko's equal. And then beyond :-)

Question to you top guns out there: This high in the rankings, where points are hard to find, can 60 rating points be gained by only tweaking a gun? I should mention that this guntype is brand new, and has some (probably small) known bugs, and hasn't been tweaked very much. --Vic

Maybe the question is old. But I just saw it. The answer is a definite YES. -- PEZ

Hmm, interesting. Watch this space ;-) --Vic

In version 0.5.3 I now fire waves every tick. I'm curious how that will translate to rating points. --Vic

very disappointing. It dropped two points. Have all you people who at one stage implemented this had good results with firing waves every tick? Maybe there are some special tips and tricks on this subject? please advise :-/ --Vic
In theory when you fire waves every tick you lose some accuracy against bots that react to fire. If you run TGrapher against these bots you'll notice that it draws different graphs for "real" waves and for "virtual" ones. However, if your statistics are heavily segmented the learning speed drops dramatically if only real waves are collected. And it seems that the right mix of heavy segmentations can compensate some for the fire-reaction of these bots. Particularly the time-since-velocity-change segmentation helps in this. So, it might be that your stats are not segmented enough for you to need firing waves every tick. It could also be that you lack the right mix of segmentations to compensate for the "virtual" wave inaccuracy against some bots. A boring alternative is that you have a major bug in your gun code somewhere. Then it won't matter much how you collect waves. I have had those situations many times. And, 2 points drop is probably within the margin of error anyway. -- PEZ
If the major advantage of this to segmented GF guns is to increase learning speed that might explain it. Locke's gun has Dynamic Segmentation. Simply put, it starts with one segment, and when it has enough data it kinda splits it into more segments etc. If I quickly fill this first segment with data collected every tick I basically have a bad quality (large spike) segment against flattener movement bots (but not against 'spike only' bots like walls and spinbot... too bad there aren't more of those in the rumble ;-). That segment will eventually split into slightly better segments but that would take probably as much time as only collecting real fire waves. With a GF gun, the fixed segments are an advantage in this case, because they can never have bad quality data (I assume you use fixed segments without stealing from neighbouring segments when you have little data). Then again, you may be right and I may have a major bug in there yet. Thanx! --Vic

v0.5.6 discussion:

Does anyone have experience with increasing firepower for weaker opponents? My theory is that if it is weak enough, you can trade a little accuracy for more bullet damage points. This also shortens the matches which will give the opponent less time to hit you and get damage points. I was convinced this would work. But in practise, it doesn't seem to gain any points for me. After some thinking I have found that maybe the higher firepower bullets expose some of the guns weaknesses which counterbalance the score gain, such as less data in higher power segments and maybe buggy BulletETA interpolation, when using less power segments to fill up the missing data. So, my question: who can tell me if this is a dead end street, or a definite rating point gainer (if I get it bugfree)? --Vic

Very interesting idea... Please let me know if u succeed. -- Axe

The first results are disappointing. Lost 7 rating points. The question above still stands: has anyone experimented with this technique? --Vic
I would think that the problem with this is to accurately find out that you are dealing with a weak opponent. You could be facing a very strong opponent and just have some very good luck in the first couple of rounds. Or am I missing something? -- PEZ
I think that was not so much the problem, but I may be wrong. I compensated for the luck factor you mentioned by gradually boosting power as the match progresses. So say in round 5, even if Locke wins all first four matches, there is hardly any bullet power increase. But in round 35, when Locke has won at least 75% of the rounds, it will fire power 3 every shot. While writing this down, I'm thinking that 35 rounds per match is probably too short to reasonably assume that one bot is better than the other. But maybe I'll try again later with much more conservative parameter values (never increase power in the first 10/15 rounds, only increase power if winning 80/85%, max power 2.5, not 3.0). --Vic

v0.5.8 discussion:

I'm beginning to suspect that I'm on the wrong track. If this version (with two true bugfixes) again performs worse then there must be a major issue with Locke's gun which must be resolved first. --Vic

Well, fortunately it didn't perform any worse this time. But not better either. :-| I've spend the good part of a day aligning waves and getting ETA right (see BulletETA page) and it didn't change a thing. The whole 0.5 branch has basically been about better accuracy. It seems that Locke's gun has bigger problems than that. Still have 60 points to try and gain before catching up with Raiko's gun. --Vic

I think you cant compare your gun with that of Raiko's in the RR. R used a really smart segmentation that works amazing straight after a few rounds. Of cause a SOOG should perform better but remember that it needs lots of ronds to find the "perfect" segmentation for each enemy. To find out the true performance of your gun you should compare your and Raikos gun over 500, 1k ... rounds, imho --deathcon

Actually I designed this gun to be a quick learner as well. And against Walls and Spinbot it does learn very fast :-p. I have observed in a lot of battles that Locke outperforms the opponent in the early rounds, and as the opponents gun comes up to speed the match becomes more equal or even turns around. But I do think you are right that the difference in Segmentation is very important, and may be a weak spot of Locke at this time. Fortunately I have hardly worked on tuning that part of the gun, so I think there is a lot of progress to be made there.

I'm definitely not sure that this gun should in theory perform better then Raiko's gun. The SelfOrganizingObservationLog is just an idea that I came up with, which in theory has some advantages over GF targeting but may also have yet undiscovered weak points. And in practise, GuessFactor targeting has evolved over many years now, with many people working on it. Raiko's gun has hand-tuned, fixed Segmentation that has proven to be very, very, very efficient as you point out. It remains to be seen if Dynamic Segmentation the SOOL way can converge to that level. But I sure as hell will give it a go ;-)

One thing I will address soon about Segmentation: Raiko's gun is heavily segmented. But not all segments are equal. Some segments will get much less data, but to compensate these segments are much more predictable (for example a close to the wall segment, or a segment where time_since_last_decelleration has a high value). Basically these segments result in Situational targeting when used, and play a very important role in the first stages of a match. Other segments are not so predictable and need lots of data to reveal patterns in the opponents behavior. These segments start playing a role later on in the match when there is enough data, and when used they classify as Statistical targeting. In Locke's gun this distinction is not yet made as all Segments are of the some size (same number of observations). This probably results in Situational segments polluted with too much irrelevant data and Statistical segments that have much to little data. The trick will be how to determine how big a segment must be. I'm thinking along the lines of calculating the Entropy of a segment and grow or shrink it accordingly. Any thoughts on this?

@@ Line 30: / Line 30: @@
 * If the major advantage of this to segmented GF guns is to increase learning speed that might explain it. [[Locke]]'s gun has Dynamic Segmentation. Simply put, it starts with one segment, and when it has enough data it kinda splits it into more segments etc. If I quickly fill this first segment with data collected every tick I basically have a bad quality (large spike) segment against flattener movement bots (but not against 'spike only' bots like walls and spinbot... too bad there aren't more of those in the rumble ;-). That segment will eventually split into slightly better segments but that would take probably as much time as only collecting real fire waves. With a GF gun, the fixed segments are an advantage in this case, because they can never have bad quality data (I assume you use fixed segments without stealing from neighbouring segments when you have little data). Then again, you may be right and I may have a major bug in there yet. Thanx! --[[Vic]]
-<p>
 ----
@@ Line 42: / Line 41: @@
 * I think that was not so much the problem, but I may be wrong. I compensated for the luck factor you mentioned by gradually boosting power as the match progresses. So say in round 5, even if [[Locke]] wins all first four matches, there is hardly any bullet power increase. But in round 35, when [[Locke]] has won at least 75% of the rounds, it will fire power 3 every shot. While writing this down, I'm thinking that 35 rounds per match is probably too short to reasonably assume that one bot is better than the other. But maybe I'll try again later with much more conservative parameter values (never increase power in the first 10/15 rounds, only increase power if winning 80/85%, max power 2.5, not 3.0). --[[Vic]]
-<p>
 ----
@@ Line 373: / Line 371: @@
 Haha! What a great reason for using 2.0 :-) 1841 at 512 matches.... still 6 points below the 2.0 version. Something isn't right. I think i'll run the 2.0 version a second time and see what happens. Maybe it was that ScruchiPu fluke or something in the previous 2.0 version... --[[Vic]]
+----
+I guess summer has struck again. Why robocoding when you can do fun stuff outside in the sun, right? Me too, but also I've been a little busy with work lately. Mind you, I have by no means retired.... I have some stuff up my sleeve yet! :-) --[[Vic]]
+Version 1.7.5.2 is going good. Are you nervous? -- [[PEZ]]
+No, not really. I am expecting at most a slight improvement. Like you, I also noticed it started out real good, but I saw that it caught a Scruchi-pu match and many top bot matches early on, which always boost my ratings. 58% against the current king does feel good I must admit :-) --[[Vic]]
+Can you describe some how the Observations are compared? Are some situational parameters more important than others? How "deep" is a situation? And such. =) -- [[PEZ]]
+Ok, I'll give you a sneak preview (I had planned to write a detailed desciption when I am happy with the gun):
+The gun can basically be split in two major parts:
+# filling the Log with data
+# aiming
+Filling the Log goes like this:
+* a wave is fired every tick
+** along with the wave situational parameters are stored
+*** an array of Situational parameters is called a Situation
+*** one situational parameter is called a Dimension
+* waves that hit the enemy are processed and removed from the wave manager
+** the path the enemy travelled since firing the wave is stored in a pair of doubles:
+*** distance travelled
+*** angle travelled (0: straight forward, +-180: fully reversed, 90: 90 degreees clockwise, etc.)
+** the Situation and the distance/angle data are combined into an Observation
+** the Log searches for the index where the corresponding Situations match the Observation most closely
+** the Log inserts the Observation using that index
+Aiming goes like this:
+* the current Situation is assessed
+* the Log searches for the index where the corresponding Situations match current Situation most closely
+* the Log determines the positive and negative offset boundaries (resulting in a small subset of the Log)
+* a GuessFactor array is prepared
+* every Observation in this subset is processed for Multiple Choice using the GF bin:
+** if the distance/angle travelled takes the Observation outside the battlefield it is discarded
+** the distance/angle travelled is converted into a GuessFactor
+** the GF is added to the corresponding GuessFactor bin (if the botwidth spans multiple GF bins, these will be filled as well)
+*** closer (more similar) Observations get a higher weight in the multiple choice
+* the winning GF bin is converted back into a firing angle and returned
+Searching the Log goes like this:
+* an outer loop iterates through the Log in a maximum (currently 50) number of iterations
+** in each iteration the Observation is compared with the given Situation
+*** the compare function returns a value: lower is more similar
+* the closest Observation found by the outer loop is used for a finer search
+** this finer search is done by an inner loop that iterates through +- 300 (currently) adjacent Observations
+Comparing Situations goes like this:
+* weights for all Dimensions are determined
+* a loop iterates over each dimension
+** the difference between the comparer and comparee (squared) is multiplied by the Dimension weight and added to the grand total
+* the grand total is returned
+Currently I am using 10 Dimensions, and I'm planning to add at least three more very soon. One of the advantages of this system is that there is practically no limit to the number of Dimensions you can use.
+--[[Vic]]
+Thanks! I had a hunch about the basic scheme it seems. I think my next gun will be in this style. [[Resin]] failed, but it has some similarities and might work as a start. Now, I guess I'll have some reading up to do on the MultipleChoice issue. I take it the Situations are built on state at the time of fire? -- [[PEZ]]
+* Yes. About your next gun: it's great that you may build something similar. That way we can share experiences. --[[Vic]]
+<p>
+----
+What are blind man's sticks? --[[Ph]]
+It's a BlindMansStick. -- [[PEZ]]
+----
+Tomorrow morning I'm going on a three week holiday :-) To the sunny country of Italy! Hopefully I'll return relaxed and with a few good ideas to help [[Locke]] get to 1880+ points ;-). Secondly, I hope to really get started on my new movement system after this vacation. Thirdly, I'll be thinking about the 'other' targeting algorithm we've been busy with in the last few weeks and hopefully come up with something useful. See you then! --[[Vic]]
+Fijne vakantie! -- [[Jonathan]]

Difference between revisions of "Archived talk:Locke 20040719"

Latest revision as of 00:47, 29 September 2017

Navigation menu

Search