| 
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectgr.demokritos.iit.jinsect.algorithms.estimators.DistanceEstimator
public class DistanceEstimator
| Field Summary | |
|---|---|
protected  NGramSizeEstimator | 
Estimator
 | 
protected  int | 
MaxRank
 | 
protected  int | 
MinRank
 | 
protected  Distribution | 
NonSymbolsPerRank
 | 
protected  Distribution | 
SymbolsPerRank
 | 
| Constructor Summary | |
|---|---|
DistanceEstimator(Distribution tmSymbolsPerRank,
                  Distribution tmNonSymbolsPerRank)
Creates a new instance of DistanceEstimator, given two distribution of symbols and non-symbols, by getting a copy of the distributions.  | 
|
DistanceEstimator(Distribution tmSymbolsPerRank,
                  Distribution tmNonSymbolsPerRank,
                  NGramSizeEstimator nseEstimator)
Creates a new instance of DistanceEstimator, given two distribution of symbols and non-symbols, by getting a copy of the distributions.  | 
|
| Method Summary | |
|---|---|
 double | 
getAllSymbolProbability(int iMinRank,
                        int iMaxRank,
                        int iDistance)
Returns the probability that, for a given distance, all n-grams in it will be symbols, given a rank range.  | 
 double | 
getNonSymbolProbability(int iMinRank,
                        int iMaxRank,
                        int iDistance)
Returns the probability of occurence of a non-symbol given a range of n-gram ranks.  | 
 int | 
getOptimalDistance(int iMinDist,
                   int iMaxDist)
Returns the distance corresponding to the highest signal to noise ratio for a given distance range to examine, with respects to ranks identified by the rank estimator.  | 
 int | 
getOptimalDistance(int iMinDist,
                   int iMaxDist,
                   int iMinRank,
                   int iMaxRank)
Returns the distance corresponding to the highest signal to noise ratio for a given n-gram rank range, and a given distance range to examine.  | 
 double | 
getSignalToNoise(int iMinRank,
                 int iMaxRank,
                 int iDistance)
Returns the signal to noise ratio for a given n-gram rank range.  | 
 double | 
getSignalToNoise(int iMinRank,
                 int iMaxRank,
                 int iDistance,
                 int iCurNGramSize)
Returns the signal to noise ratio for a given n-gram rank range.  | 
 double | 
getSymbolToNonSymbolPercentage(int iMinRank,
                               int iMaxRank)
Returns the symbol to non-symbol percentage given a range of n-gram ranks.  | 
| Methods inherited from class java.lang.Object | 
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait | 
| Field Detail | 
|---|
protected Distribution SymbolsPerRank
protected Distribution NonSymbolsPerRank
protected int MinRank
protected int MaxRank
protected NGramSizeEstimator Estimator
| Constructor Detail | 
|---|
public DistanceEstimator(Distribution tmSymbolsPerRank,
                         Distribution tmNonSymbolsPerRank)
tmSymbolsPerRank - The distribution of symbols per n-gram rank.tmNonSymbolsPerRank - The distribution of non-symbols per n-gram rank.
public DistanceEstimator(Distribution tmSymbolsPerRank,
                         Distribution tmNonSymbolsPerRank,
                         NGramSizeEstimator nseEstimator)
tmSymbolsPerRank - The distribution of symbols per n-gram rank.tmNonSymbolsPerRank - The distribution of non-symbols per n-gram rank.nseEstimator - An estimator for various n-gram rank cardinalities.| Method Detail | 
|---|
public double getSymbolToNonSymbolPercentage(int iMinRank,
                                             int iMaxRank)
iMinRank - The minimum rank to take into account.iMaxRank - The maximum rank to take into account.
public double getNonSymbolProbability(int iMinRank,
                                      int iMaxRank,
                                      int iDistance)
iMinRank - The minimum rank to take into account.iMaxRank - The maximum rank to take into account.
public double getAllSymbolProbability(int iMinRank,
                                      int iMaxRank,
                                      int iDistance)
iMinRank - The minimum rank to take into account.iMaxRank - The maximum rank to take into account.iDistance - The distance (character range) within which we expect n-grams to be found.
public final double getSignalToNoise(int iMinRank,
                                     int iMaxRank,
                                     int iDistance)
iMinRank - The minimum rank to take into account.iMaxRank - The maximum rank to take into account.iDistance - The distance (character range) within which we expect n-grams to be found.
public final double getSignalToNoise(int iMinRank,
                                     int iMaxRank,
                                     int iDistance,
                                     int iCurNGramSize)
iMinRank - The minimum rank to take into account.iMaxRank - The maximum rank to take into account.iDistance - The distance (character range) within which we expect n-grams to be found.
public int getOptimalDistance(int iMinDist,
                              int iMaxDist,
                              int iMinRank,
                              int iMaxRank)
iMinDist - The minimum distance to examine.iMaxDist - The maximum distance to examine.iMinRank - The minimum rank to take into account.iMaxRank - The maximum rank to take into account.
public int getOptimalDistance(int iMinDist,
                              int iMaxDist)
iMinDist - The minimum distance to examine.iMaxDist - The maximum distance to examine.
  | 
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||