Difference between revisions of "Talk:Segmentation/Autoselected Segmentation"

From Robowiki
Jump to navigation Jump to search
(→‎Thoughts/Suggestions: good suggestion Nat - I've moved the page.)
 
(20 intermediate revisions by 4 users not shown)
Line 1: Line 1:
Automatic Segmentation is a concept proposed on the old wiki by Fractal. You start with a list of segmentation axes:
+
I think we should define new term for this things. The "Automatic Segmentation" word make me feel like "Dynamic Segmentation". Perhaps the "Auto-Choose Segmentation Set"? &raquo; <span style="font-size:0.9em;color:darkgreen;">[[User:Nat|Nat]] | [[User_talk:Nat|Talk]]</span> &raquo; 12:45, 15 June 2009 (UTC)
* Lateral Velocity (LV)
 
* Acceleration (A)
 
* Time Since Reversal (TSR)
 
  
You assemble all combinations of segments into a long list of every segmentation that uses these three axes:
+
Excellent thought - you can see I've moved the page. Thanks! -- [[User:Synapse|<font style="font-size:0.8em;font-variant:small-caps;">Synapse</font>]] 22:36, 15 June 2009 (UTC)
  
{|
+
Shouldn't this be on the actual page, not on the talk page? Or are you writing it here, perfecting it, then posting it? --[[User:J Litewski|Jacob Litewski]] 22:59, 15 June 2009 (UTC)
!Segmentation !! Depth
 
|-
 
|(no segmentation) || 0
 
|-
 
|LV || 1
 
|-
 
|A || 1
 
|-
 
|TSR || 1
 
|-
 
|LV / A || 2
 
|-
 
|LV / TSR || 2
 
|-
 
|A / TSR || 2
 
|-
 
|LV / A / TSR || 3
 
|}
 
  
Once you have your array of segmentations, you need a function to determine how good the data in that segmentation is. One simple way to do that is with what's called the Crest Factor. This is just the ratio of the peak data to the root-mean-squared value of the data (square root of ((the sum of each element's square) divided by the number of elements)).
+
I'm writing it here in small installments while I'm on my breaks at work, then once I'm happy with it I'll move it to the main page. If there's some better way to save "drafts" of a page I'm open to suggestions. -- [[User:Synapse|<font style="font-size:0.8em;font-variant:small-caps;">Synapse</font>]] 02:09, 16 June 2009 (UTC)
  
The segmentation that returns the most useful data (as determined by the Crest Factor) is used either for dodging or aiming, depending on what you need.
+
: <nowiki>{{stub}}</nowiki> will do =) &raquo; <span style="font-size:0.9em;color:darkgreen;">[[User:Nat|Nat]] | [[User_talk:Nat|Talk]]</span> &raquo; 02:42, 16 June 2009 (UTC)
  
(to be completed later)
+
I change something above, especially the math fomula. Hope is isn't too ugly. The first section will be complete later. Some though,  I don't think you should post the code here. If you want to post the code, do it in the Free code sub-page of either your user page or the robot's page. The code here should be pseudo-code. And the first implementation thing, I do think that everyone here do know about Polymorphism so I don't think you need to wrote that. &raquo; <span style="font-size:0.9em;color:darkgreen;">[[User:Nat|Nat]] | [[User_talk:Nat|Talk]]</span> &raquo; 03:20, 16 June 2009 (UTC)
  
----
+
I've updated the math formulas with the images from the main Wikipedia article on crest factor and root-mean-squared calculations. Thanks for the improvements, Nat! You can also see specific code for the assembly of the segments on [[Watermelon/Code#Autoselected_Segmentation | Watermelon/Code]]. -- [[User:Synapse|<font style="font-size:0.8em;font-variant:small-caps;">Synapse</font>]] 06:55, 16 June 2009 (UTC)
===Thoughts/Suggestions===
 
  
I think we should define new term for this things. The "Automatic Segmentation" word make me feel like "Dynamic Segmentation". Perhaps the "Auto-Choose Segmentation Set"? &raquo; <span style="font-size:0.9em;color:darkgreen;">[[User:Nat|Nat]] | [[User_talk:Nat|Talk]]</span> &raquo; 12:45, 15 June 2009 (UTC)
+
Thanks for noticing, Nat. I just noticed the same thing rereading the old article! :) -- [[User:Synapse|<font style="font-size:0.8em;font-variant:small-caps;">Synapse</font>]] 09:13, 12 September 2009 (UTC)
 +
 
 +
: This strategy has caught me attention before (and still on my must-do list, only at nearly bottom of it) so I know it before. Old wiki is really messy, isn't it? =) &raquo; <span style="font-size:0.9em;color:darkgreen;">[[User:Nat|Nat]] | [[User_talk:Nat|Talk]]</span> &raquo; 09:22, 12 September 2009 (UTC)
 +
 
 +
:: In my implementation I've gotten decent results without the [[Multiple Choice|multiple choice]] analysis [[User:Vuen|Vuen]] mentions in the original article. I've been restructuring things so I can try that out, at least on the gun, but so far just picking a good segment seems to work well. I don't think it's a golden bullet, but it was easy to understand once I grokked [[Wave Surfing|wave surfing]] and has gotten good results so far. -- [[User:Synapse|<font style="font-size:0.8em;font-variant:small-caps;">Synapse</font>]] 11:01, 12 September 2009 (UTC)
 +
 
 +
== Problems using this with binsmoothing ==
 +
 
 +
When I use this with binsmoothing, if there's a hit near &plusmn;1, it produces an artificially higher crest factor, since the smoothing is only applied in one direction. This causes segments with a peak near one end or the other to get a much higher priority. Suggestions? --&nbsp;<font style="font-size:0.8em;font-variant:small-caps;">[[User:Synapse|Synapse]]&nbsp;|&nbsp;[[User_talk:Synapse|talk]]</font> 07:12, 10 October 2009 (UTC)
 +
 
 +
The only idea that comes to mind here that would would neither distort crest factors, nor distort probabilities within the -1 to +1 range, would be extending your number of bins, to include bins slightly outside the -1 to +1 range, which will never actually be aimed at due to only being written to via smoothing. --[[User:Rednaxela|Rednaxela]] 17:37, 10 October 2009 (UTC)
 +
 
 +
== Standard Deviation ==
 +
 
 +
Wonder why can't we just use SD of each segment to determine which one is the best? --[[User:Nat|<span style="color:#099;">Nat</span>]] [[User talk:Nat|<span style="color:#0a5;">Pavasant</span>]] 12:06, 23 January 2010 (UTC)
  
Escellent thought - you can see I've moved the page. Thanks! -- [[User:Synapse|<font style="font-size:0.8em;font-variant:small-caps;">Synapse</font>]] 22:36, 15 June 2009 (UTC)
+
Standard deviation would probably work reasonably in some cases, but I'd consider it less optimal. The case of two very strong peaks far apart is very clearly useful data since it gives a reasonably high rate of hits, but standard deviation would rate it very low, and crest factor would rate it reasonably high. I think crest factor, (with some sort of multiplier to penalize when the number of samples is very low) definitely makes the most sense. --[[User:Rednaxela|Rednaxela]] 16:06, 23 January 2010 (UTC)

Latest revision as of 17:06, 23 January 2010

I think we should define new term for this things. The "Automatic Segmentation" word make me feel like "Dynamic Segmentation". Perhaps the "Auto-Choose Segmentation Set"? » Nat | Talk » 12:45, 15 June 2009 (UTC)

Excellent thought - you can see I've moved the page. Thanks! -- Synapse 22:36, 15 June 2009 (UTC)

Shouldn't this be on the actual page, not on the talk page? Or are you writing it here, perfecting it, then posting it? --Jacob Litewski 22:59, 15 June 2009 (UTC)

I'm writing it here in small installments while I'm on my breaks at work, then once I'm happy with it I'll move it to the main page. If there's some better way to save "drafts" of a page I'm open to suggestions. -- Synapse 02:09, 16 June 2009 (UTC)

{{stub}} will do =) » Nat | Talk » 02:42, 16 June 2009 (UTC)

I change something above, especially the math fomula. Hope is isn't too ugly. The first section will be complete later. Some though, I don't think you should post the code here. If you want to post the code, do it in the Free code sub-page of either your user page or the robot's page. The code here should be pseudo-code. And the first implementation thing, I do think that everyone here do know about Polymorphism so I don't think you need to wrote that. » Nat | Talk » 03:20, 16 June 2009 (UTC)

I've updated the math formulas with the images from the main Wikipedia article on crest factor and root-mean-squared calculations. Thanks for the improvements, Nat! You can also see specific code for the assembly of the segments on Watermelon/Code. -- Synapse 06:55, 16 June 2009 (UTC)

Thanks for noticing, Nat. I just noticed the same thing rereading the old article! :) -- Synapse 09:13, 12 September 2009 (UTC)

This strategy has caught me attention before (and still on my must-do list, only at nearly bottom of it) so I know it before. Old wiki is really messy, isn't it? =) » Nat | Talk » 09:22, 12 September 2009 (UTC)
In my implementation I've gotten decent results without the multiple choice analysis Vuen mentions in the original article. I've been restructuring things so I can try that out, at least on the gun, but so far just picking a good segment seems to work well. I don't think it's a golden bullet, but it was easy to understand once I grokked wave surfing and has gotten good results so far. -- Synapse 11:01, 12 September 2009 (UTC)

Problems using this with binsmoothing

When I use this with binsmoothing, if there's a hit near ±1, it produces an artificially higher crest factor, since the smoothing is only applied in one direction. This causes segments with a peak near one end or the other to get a much higher priority. Suggestions? -- Synapse | talk 07:12, 10 October 2009 (UTC)

The only idea that comes to mind here that would would neither distort crest factors, nor distort probabilities within the -1 to +1 range, would be extending your number of bins, to include bins slightly outside the -1 to +1 range, which will never actually be aimed at due to only being written to via smoothing. --Rednaxela 17:37, 10 October 2009 (UTC)

Standard Deviation

Wonder why can't we just use SD of each segment to determine which one is the best? --Nat Pavasant 12:06, 23 January 2010 (UTC)

Standard deviation would probably work reasonably in some cases, but I'd consider it less optimal. The case of two very strong peaks far apart is very clearly useful data since it gives a reasonably high rate of hits, but standard deviation would rate it very low, and crest factor would rate it reasonably high. I think crest factor, (with some sort of multiplier to penalize when the number of samples is very low) definitely makes the most sense. --Rednaxela 16:06, 23 January 2010 (UTC)