|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectgr.demokritos.iit.jinsect.algorithms.estimators.DistanceEstimator
public class DistanceEstimator
Field Summary | |
---|---|
protected NGramSizeEstimator |
Estimator
|
protected int |
MaxRank
|
protected int |
MinRank
|
protected Distribution |
NonSymbolsPerRank
|
protected Distribution |
SymbolsPerRank
|
Constructor Summary | |
---|---|
DistanceEstimator(Distribution tmSymbolsPerRank,
Distribution tmNonSymbolsPerRank)
Creates a new instance of DistanceEstimator, given two distribution of symbols and non-symbols, by getting a copy of the distributions. |
|
DistanceEstimator(Distribution tmSymbolsPerRank,
Distribution tmNonSymbolsPerRank,
NGramSizeEstimator nseEstimator)
Creates a new instance of DistanceEstimator, given two distribution of symbols and non-symbols, by getting a copy of the distributions. |
Method Summary | |
---|---|
double |
getAllSymbolProbability(int iMinRank,
int iMaxRank,
int iDistance)
Returns the probability that, for a given distance, all n-grams in it will be symbols, given a rank range. |
double |
getNonSymbolProbability(int iMinRank,
int iMaxRank,
int iDistance)
Returns the probability of occurence of a non-symbol given a range of n-gram ranks. |
int |
getOptimalDistance(int iMinDist,
int iMaxDist)
Returns the distance corresponding to the highest signal to noise ratio for a given distance range to examine, with respects to ranks identified by the rank estimator. |
int |
getOptimalDistance(int iMinDist,
int iMaxDist,
int iMinRank,
int iMaxRank)
Returns the distance corresponding to the highest signal to noise ratio for a given n-gram rank range, and a given distance range to examine. |
double |
getSignalToNoise(int iMinRank,
int iMaxRank,
int iDistance)
Returns the signal to noise ratio for a given n-gram rank range. |
double |
getSignalToNoise(int iMinRank,
int iMaxRank,
int iDistance,
int iCurNGramSize)
Returns the signal to noise ratio for a given n-gram rank range. |
double |
getSymbolToNonSymbolPercentage(int iMinRank,
int iMaxRank)
Returns the symbol to non-symbol percentage given a range of n-gram ranks. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
protected Distribution SymbolsPerRank
protected Distribution NonSymbolsPerRank
protected int MinRank
protected int MaxRank
protected NGramSizeEstimator Estimator
Constructor Detail |
---|
public DistanceEstimator(Distribution tmSymbolsPerRank, Distribution tmNonSymbolsPerRank)
tmSymbolsPerRank
- The distribution of symbols per n-gram rank.tmNonSymbolsPerRank
- The distribution of non-symbols per n-gram rank.public DistanceEstimator(Distribution tmSymbolsPerRank, Distribution tmNonSymbolsPerRank, NGramSizeEstimator nseEstimator)
tmSymbolsPerRank
- The distribution of symbols per n-gram rank.tmNonSymbolsPerRank
- The distribution of non-symbols per n-gram rank.nseEstimator
- An estimator for various n-gram rank cardinalities.Method Detail |
---|
public double getSymbolToNonSymbolPercentage(int iMinRank, int iMaxRank)
iMinRank
- The minimum rank to take into account.iMaxRank
- The maximum rank to take into account.
public double getNonSymbolProbability(int iMinRank, int iMaxRank, int iDistance)
iMinRank
- The minimum rank to take into account.iMaxRank
- The maximum rank to take into account.
public double getAllSymbolProbability(int iMinRank, int iMaxRank, int iDistance)
iMinRank
- The minimum rank to take into account.iMaxRank
- The maximum rank to take into account.iDistance
- The distance (character range) within which we expect n-grams to be found.
public final double getSignalToNoise(int iMinRank, int iMaxRank, int iDistance)
iMinRank
- The minimum rank to take into account.iMaxRank
- The maximum rank to take into account.iDistance
- The distance (character range) within which we expect n-grams to be found.
public final double getSignalToNoise(int iMinRank, int iMaxRank, int iDistance, int iCurNGramSize)
iMinRank
- The minimum rank to take into account.iMaxRank
- The maximum rank to take into account.iDistance
- The distance (character range) within which we expect n-grams to be found.
public int getOptimalDistance(int iMinDist, int iMaxDist, int iMinRank, int iMaxRank)
iMinDist
- The minimum distance to examine.iMaxDist
- The maximum distance to examine.iMinRank
- The minimum rank to take into account.iMaxRank
- The maximum rank to take into account.
public int getOptimalDistance(int iMinDist, int iMaxDist)
iMinDist
- The minimum distance to examine.iMaxDist
- The maximum distance to examine.
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |