public class MilneWittenMetric extends BaseSRMetric
Modifier and Type | Class and Description |
---|---|
static class | MilneWittenMetric.Provider |
Nested classes/interfaces inherited from class BaseSRMetric: BaseSRMetric.SRConfig
Constructor and Description |
---|
MilneWittenMetric(String name, Language language, LocalPageDao dao, SRMetric inlink, SRMetric outlink, Disambiguator dab) |
Modifier and Type | Method and Description |
---|---|
double[][] | cosimilarity(int[] rowIds, int[] columnIds) Construct a cosimilarity matrix of Wikipedia ids in a given language. |
BaseSRMetric.SRConfig | getConfig() |
SRResultList | mostSimilar(int pageId, int maxResults, gnu.trove.set.TIntSet validIds) Find the most similar local pages to a local page. |
void | read() Reads the metric from the current data directory. |
void | setTrainSubmetrics(boolean train) |
SRResult | similarity(int pageId1, int pageId2, boolean explanations) Determine the similarity between two local pages. |
void | trainMostSimilar(Dataset dataset, int numResults, gnu.trove.set.TIntSet validIds) Train the mostSimilar() function. The KnownSims may already be associated with Wikipedia ids (check wpId1 and wpId2). |
void | trainSimilarity(Dataset dataset) Train the similarity() function. |
void | write() Writes the metric to the current data directory. |
Methods inherited from class BaseSRMetric: clearMostSimilarCache, configureBase, cosimilarity, cosimilarity, cosimilarity, ensureMostSimilarTrained, ensureSimilarityTrained, getCachedMostSimilar, getDataDir, getDisambiguator, getLanguage, getLocalPageDao, getMostSimilarCache, getMostSimilarMatrixPath, getMostSimilarNormalizer, getName, getSimilarityNormalizer, mostSimilar, mostSimilar, mostSimilar, mostSimilarIsTrained, normalize, normalize, normalize, setBuildMostSimilarCache, setDataDir, setMostSimilarCacheRowIds, setMostSimilarNormalizer, setReadNormalizers, setSimilarityNormalizer, similarity, similarityIsTrained, writeMostSimilarCache, writeMostSimilarCache
public MilneWittenMetric(String name, Language language, LocalPageDao dao, SRMetric inlink, SRMetric outlink, Disambiguator dab)
public BaseSRMetric.SRConfig getConfig()
getConfig in class BaseSRMetric
public SRResult similarity(int pageId1, int pageId2, boolean explanations) throws DaoException
Description copied from interface: SRMetric
Determine the similarity between two local pages.
similarity in interface SRMetric
similarity in class BaseSRMetric
Parameters:
pageId1 - Id of the first page.
pageId2 - Id of the second page.
explanations - Whether explanations should be created.
Throws:
DaoException
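The entry above does not say how the score is computed. As a rough illustration only, the sketch below implements the link-based relatedness measure published by Milne and Witten, which the class name suggests this metric builds on. Nothing here is taken from this class: LinkRelatednessSketch, relatedness, and the plain Set<Integer> link sets are hypothetical names, and the way the real class combines its inlink and outlink submetrics is not documented on this page.

```java
import java.util.HashSet;
import java.util.Set;

/** Hypothetical, self-contained sketch; not part of the documented library. */
final class LinkRelatednessSketch {

    /**
     * Relatedness of two pages from their link sets:
     * 1 - (ln(max(|A|,|B|)) - ln(|A n B|)) / (ln(|W|) - ln(min(|A|,|B|))),
     * clamped to [0, 1]. Returns 0.0 when the sets share no links.
     *
     * @param linksA     ids of pages linked with the first page (assumption)
     * @param linksB     ids of pages linked with the second page (assumption)
     * @param totalPages number of pages in the whole collection, |W|
     */
    static double relatedness(Set<Integer> linksA, Set<Integer> linksB, int totalPages) {
        double larger = Math.max(linksA.size(), linksB.size());
        double smaller = Math.min(linksA.size(), linksB.size());
        if (totalPages < larger) {
            throw new IllegalArgumentException("totalPages is smaller than a link set");
        }

        // Count the links the two pages have in common.
        Set<Integer> shared = new HashSet<>(linksA);
        shared.retainAll(linksB);
        if (shared.isEmpty()) {
            return 0.0; // also covers the case where either set is empty
        }

        double denominator = Math.log(totalPages) - Math.log(smaller);
        if (denominator == 0.0) {
            return 1.0; // both sets contain every page, so they are identical
        }
        double distance = (Math.log(larger) - Math.log(shared.size())) / denominator;
        return Math.max(0.0, Math.min(1.0, 1.0 - distance));
    }
}
```

For example, link sets {1, 2, 3, 4} and {3, 4} in a collection of 16 pages give 1 - ln(2)/ln(8), which is about 0.667.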
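public double[][] cosimilarity(int[] rowIds, int[] columnIds) throws DaoException
Description copied from interface: SRMetric
Construct a cosimilarity matrix of Wikipedia ids in a given language.
cosimilarity in interface SRMetric
cosimilarity in class BaseSRMetric
Throws:
DaoException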
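To make the shape of the returned matrix concrete, here is a minimal sketch of a cosimilarity computation over two id arrays. It assumes the natural reading of the signature, namely that entry [r][c] holds the similarity of rowIds[r] and columnIds[c]; this page does not confirm that layout. CosimilaritySketch and PairwiseSimilarity are hypothetical names, and the pairwise metric is passed in as a lambda rather than being the library's own.

```java
/** Hypothetical, self-contained sketch; not part of the documented library. */
final class CosimilaritySketch {

    /** Stand-in for any pairwise similarity function (assumption). */
    @FunctionalInterface
    interface PairwiseSimilarity {
        double score(int rowId, int columnId);
    }

    /**
     * Builds a rowIds.length x columnIds.length matrix whose entry [r][c]
     * is the score of rowIds[r] against columnIds[c].
     */
    static double[][] cosimilarity(int[] rowIds, int[] columnIds, PairwiseSimilarity metric) {
        double[][] matrix = new double[rowIds.length][columnIds.length];
        for (int r = 0; r < rowIds.length; r++) {
            for (int c = 0; c < columnIds.length; c++) {
                matrix[r][c] = metric.score(rowIds[r], columnIds[c]);
            }
        }
        return matrix;
    }
}
```

A naive double loop like this costs one metric call per cell; a production implementation may well use cached vectors or sparse matrices instead.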
public void setTrainSubmetrics(boolean train)
public void trainSimilarity(Dataset dataset) throws DaoException
Description copied from interface: SRMetric
Train the similarity() function.
trainSimilarity in interface SRMetric
trainSimilarity in class BaseSRMetric
Parameters:
dataset - A gold standard dataset.
Throws:
DaoException
public void trainMostSimilar(Dataset dataset, int numResults, gnu.trove.set.TIntSet validIds)
Description copied from interface: SRMetric
Train the mostSimilar() function. The KnownSims may already be associated with Wikipedia ids (check wpId1 and wpId2).
trainMostSimilar in interface SRMetric
trainMostSimilar in class BaseSRMetric
Parameters:
dataset - A gold standard dataset.
numResults - The maximum number of similar articles computed per phrase.
validIds - The Wikipedia ids that should be considered in result sets. Null means all ids.
public void write() throws IOException
Description copied from interface: SRMetric
Writes the metric to the current data directory.
write in interface SRMetric
write in class BaseSRMetric
Throws:
IOException
public void read() throws IOException
Description copied from interface: SRMetric
Reads the metric from the current data directory.
read in interface SRMetric
read in class BaseSRMetric
Throws:
IOException
public SRResultList mostSimilar(int pageId, int maxResults, gnu.trove.set.TIntSet validIds) throws DaoException
Description copied from interface: SRMetric
Find the most similar local pages to a local page.
mostSimilar in interface SRMetric
mostSimilar in class BaseSRMetric
Parameters:
pageId - The id of the local page whose similarity we are examining.
maxResults - The maximum number of results to return.
validIds - The local page ids to be considered. Null means all ids in the language.
Throws:
DaoException
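The maxResults and validIds parameters can be illustrated with a small top-k sketch: candidates are filtered by validIds (with null meaning no filter), sorted by descending score, and truncated. This only mirrors the parameter semantics stated above. MostSimilarSketch, the precomputed score map, and the use of java.util.Set in place of gnu.trove.set.TIntSet are all assumptions made to keep the example self-contained.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Hypothetical, self-contained sketch; not part of the documented library. */
final class MostSimilarSketch {

    /**
     * Returns up to maxResults candidate ids, best score first.
     *
     * @param scores     candidate id mapped to its similarity score (assumption)
     * @param maxResults maximum number of ids to return
     * @param validIds   ids allowed in the result; null means all ids
     */
    static List<Integer> mostSimilar(Map<Integer, Double> scores, int maxResults, Set<Integer> validIds) {
        // Keep only candidates allowed by the filter.
        List<Map.Entry<Integer, Double>> candidates = new ArrayList<>();
        for (Map.Entry<Integer, Double> entry : scores.entrySet()) {
            if (validIds == null || validIds.contains(entry.getKey())) {
                candidates.add(entry);
            }
        }

        // Highest score first.
        candidates.sort(Map.Entry.<Integer, Double>comparingByValue().reversed());

        // Truncate to the requested number of results.
        List<Integer> ids = new ArrayList<>();
        int limit = Math.min(maxResults, candidates.size());
        for (int i = 0; i < limit; i++) {
            ids.add(candidates.get(i).getKey());
        }
        return ids;
    }
}
```

With scores {10: 0.9, 20: 0.5, 30: 0.7, 40: 0.1}, a call with maxResults 2 and a null filter yields [10, 30], while restricting validIds to {20, 40} yields [20, 40].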
Copyright © 2014. All rights reserved.