Difference between revisions of "User:Skilgannon/Free Code"

From Robowiki
Jump to navigation Jump to search
(variation)
(did some more testing, and clarify a bit)
Line 1: Line 1:
This is a neat method that I made up. It takes array of 'indexes' between 0 and max and returns a double between 0 and 1 of how 'clustered' your array is. 1 if all the values are the same, and 0 if there are infinite values spread perfectly evenly. Note, this is very different from a standard deviation calculation. In this code there can be as many 'dense' points on the graph as you want, and it won't try to accommodate them all from one mean. Instead, it relies on the fact that (d + 1)*(d + 1) is always greater than (d + 1) for any d > 0.
+
This is a neat method that I made up. It takes array of 'indexes' (be it guess factors, or indexes where you are logging hits) between 0 and max and returns a double between 0 and 1 of how 'clustered' your array is. 1 if all the values are the same, and 0 if there are infinite values spread perfectly evenly. Note, this is very different from a standard deviation calculation. In this code there can be as many 'dense' points on the graph as you want, and it won't try to accommodate them all from one mean. Instead, it relies on the fact that (d+k)^2  > d^2 > (d-k)^2 for any d > k > 0.
  
<pre>
 
      public static double clustering(float[] indexes, float max){
 
     
 
        float[] sorted = new float[indexes.length];
 
        System.arraycopy(indexes,0,sorted,0,indexes.length);
 
        java.util.Arrays.sort(sorted);
 
       
 
        double clustering = sorted[0] + max
 
            - sorted[sorted.length - 1] + 1;
 
        clustering *= clustering;
 
       
 
        for(int i = 1; i < sorted.length; i++){
 
            double diff = sorted[i] - sorted[i-1] + 1;
 
            clustering += diff*diff;
 
        }     
 
        return (clustering - sorted.length + 1)/((max + 1)*(max + 1));
 
     
 
      }
 
</pre>
 
 
Alternatively, if you know your step size is always greater than 1 (eg if you are using the indexes for logging to a set of bins) you can use the following code, which should be slightly quicker:
 
 
<pre>  
 
<pre>  
 
       public static double clustering(float[] indexes, float max){
 
       public static double clustering(float[] indexes, float max){

Revision as of 18:53, 13 April 2010

This is a neat method that I made up. It takes array of 'indexes' (be it guess factors, or indexes where you are logging hits) between 0 and max and returns a double between 0 and 1 of how 'clustered' your array is. 1 if all the values are the same, and 0 if there are infinite values spread perfectly evenly. Note, this is very different from a standard deviation calculation. In this code there can be as many 'dense' points on the graph as you want, and it won't try to accommodate them all from one mean. Instead, it relies on the fact that (d+k)^2 > d^2 > (d-k)^2 for any d > k > 0.

 
       public static double clustering(float[] indexes, float max){
         float[] sorted = new float[indexes.length];
         System.arraycopy(indexes,0,sorted,0,indexes.length);
         java.util.Arrays.sort(sorted);
         
         double clustering = sorted[0] + max 
            - sorted[sorted.length - 1];
         clustering *= clustering;
         
         for(int i = 1; i < sorted.length; i++){
            double diff = sorted[i] - sorted[i-1];
            clustering += diff*diff;
         }      
         return clustering/(max*max);
      	
      }