Package | Description |
---|---|
org.wikibrain.sr | |
org.wikibrain.sr.dataset | |
org.wikibrain.sr.ensemble | |
org.wikibrain.sr.evaluation | |
org.wikibrain.sr.milnewitten | |
org.wikibrain.sr.utils | |
org.wikibrain.sr.vector | |
Modifier and Type | Method and Description |
---|---|
Dataset |
SRBuilder.getDataset() |
Modifier and Type | Method and Description |
---|---|
void |
SRMetric.trainMostSimilar(Dataset dataset,
int numResults,
gnu.trove.set.TIntSet validIds)
Train the mostSimilar() function.
The KnownSims may already be associated with Wikipedia ids (check wpId1 and wpId2).
|
void |
BaseSRMetric.trainMostSimilar(Dataset dataset,
int numResults,
gnu.trove.set.TIntSet validIds) |
void |
SRMetric.trainSimilarity(Dataset dataset)
Train the similarity() function.
|
void |
BaseSRMetric.trainSimilarity(Dataset dataset) |
Modifier and Type | Method and Description |
---|---|
Dataset |
DatasetDao.get(Language language,
String name)
Reads a dataset from the classpath with a particular name.
|
Dataset |
Dataset.prune(double minSim,
double maxSim) |
Dataset |
DatasetDao.read(Language language,
File path)
Reads a dataset from the file at the given path.
|
protected Dataset |
DatasetDao.read(String name,
Language language,
BufferedReader reader)
Reads a dataset from a buffered reader.
|
Modifier and Type | Method and Description |
---|---|
List<Dataset> |
DatasetDao.getAllInLanguage(Language lang) |
List<Dataset> |
DatasetDao.getDatasetOrGroup(Language language,
String name) |
List<Dataset> |
DatasetDao.getGroup(Language language,
String name)
Return all the member datasets in the specified group.
|
List<Dataset> |
Dataset.split(int k)
Shuffles the dataset, splits it into k equally sized subsets, and returns all of them.
|
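The shuffle-and-split behavior described for `Dataset.split(int k)` can be sketched without the library. This is a minimal illustration, not WikiBrain's implementation: `SplitSketch`, its `split` helper, and the `seed` parameter are hypothetical names, and dealing the remainder round-robin when the size is not divisible by k is an assumption.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

// Hypothetical stand-in; this is not WikiBrain's Dataset class.
class SplitSketch {

    /** Shuffles a copy of items and deals them into k near-equal subsets. */
    static <T> List<List<T>> split(List<T> items, int k, long seed) {
        if (k <= 0) {
            throw new IllegalArgumentException("k must be positive");
        }
        // Copy first so the caller's list is left untouched.
        List<T> shuffled = new ArrayList<>(items);
        Collections.shuffle(shuffled, new Random(seed));

        List<List<T>> subsets = new ArrayList<>();
        for (int i = 0; i < k; i++) {
            subsets.add(new ArrayList<>());
        }
        // Round-robin dealing keeps subset sizes within one of each other.
        for (int i = 0; i < shuffled.size(); i++) {
            subsets.get(i % k).add(shuffled.get(i));
        }
        return subsets;
    }

    public static void main(String[] args) {
        List<Integer> items = new ArrayList<>();
        for (int i = 0; i < 10; i++) {
            items.add(i);
        }
        for (List<Integer> subset : split(items, 3, 42L)) {
            System.out.println(subset.size() + " items: " + subset);
        }
    }
}
```

A fixed seed makes the shuffle repeatable, which helps when comparing evaluation runs.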
Modifier and Type | Method and Description |
---|---|
void |
DatasetDao.write(Dataset dataset,
File path)
Writes a dataset out to a particular path.
|
Constructor and Description |
---|
Dataset(List<Dataset> datasets)
Concatenates a list of datasets into a new merged dataset.
|
Dataset(String name,
List<Dataset> datasets)
Concatenates a list of datasets into a new merged dataset with the given name.
|
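The merging constructors above concatenate several datasets into one. A rough standalone sketch of that idea follows. `MergeSketch`, `Pair`, and `SimpleDataset` are hypothetical stand-ins rather than WikiBrain types, and the choice to preserve input order without de-duplication is an assumption.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-ins; these are not WikiBrain's Dataset or KnownSim classes.
class MergeSketch {

    /** One labelled pair: two phrases and a gold-standard similarity. */
    static final class Pair {
        final String phrase1;
        final String phrase2;
        final double similarity;

        Pair(String phrase1, String phrase2, double similarity) {
            this.phrase1 = phrase1;
            this.phrase2 = phrase2;
            this.similarity = similarity;
        }
    }

    static final class SimpleDataset {
        final String name;
        final List<Pair> pairs;

        SimpleDataset(String name, List<Pair> pairs) {
            this.name = name;
            this.pairs = new ArrayList<>(pairs); // defensive copy
        }

        /** Concatenates the parts in order; duplicates are kept (assumption). */
        static SimpleDataset concat(String name, List<SimpleDataset> parts) {
            List<Pair> all = new ArrayList<>();
            for (SimpleDataset part : parts) {
                all.addAll(part.pairs);
            }
            return new SimpleDataset(name, all);
        }
    }

    public static void main(String[] args) {
        SimpleDataset a = new SimpleDataset("a",
                Arrays.asList(new Pair("cat", "dog", 0.8)));
        SimpleDataset b = new SimpleDataset("b",
                Arrays.asList(new Pair("car", "road", 0.6)));
        SimpleDataset merged = SimpleDataset.concat("merged", Arrays.asList(a, b));
        System.out.println(merged.name + " has " + merged.pairs.size() + " pairs");
    }
}
```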
Modifier and Type | Method and Description |
---|---|
void |
EnsembleMetric.trainMostSimilar(Dataset dataset,
int numResults,
gnu.trove.set.TIntSet validIds)
Training cascades to base metrics.
|
void |
EnsembleMetric.trainSimilarity(Dataset dataset)
Training cascades to base metrics.
|
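"Training cascades to base metrics" means that a training call on the ensemble is forwarded to each metric it wraps. The sketch below shows only that delegation pattern. `CascadeSketch`, `Trainable`, `CountingMetric`, and `Ensemble` are hypothetical names, and anything `EnsembleMetric` does beyond forwarding the call is not modeled here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-ins; these are not WikiBrain's SRMetric or EnsembleMetric.
class CascadeSketch {

    interface Trainable {
        void trainSimilarity(List<double[]> data);
    }

    /** A base metric that only records how often it was trained. */
    static final class CountingMetric implements Trainable {
        int timesTrained = 0;

        @Override
        public void trainSimilarity(List<double[]> data) {
            timesTrained++;
        }
    }

    /** An ensemble that forwards every training call to its base metrics. */
    static final class Ensemble implements Trainable {
        private final List<Trainable> bases;

        Ensemble(List<Trainable> bases) {
            this.bases = new ArrayList<>(bases);
        }

        @Override
        public void trainSimilarity(List<double[]> data) {
            for (Trainable base : bases) {
                base.trainSimilarity(data); // cascade to each base metric
            }
            // Fitting the ensemble's own combiner would follow here (omitted).
        }
    }

    public static void main(String[] args) {
        CountingMetric m1 = new CountingMetric();
        CountingMetric m2 = new CountingMetric();
        Ensemble ensemble = new Ensemble(Arrays.<Trainable>asList(m1, m2));
        ensemble.trainSimilarity(new ArrayList<double[]>());
        System.out.println(m1.timesTrained + " " + m2.timesTrained);
    }
}
```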
Modifier and Type | Method and Description |
---|---|
Dataset |
Split.getTest() |
Dataset |
Split.getTrain() |
Dataset |
MostSimilarDataset.toDataset()
Converts the most similar dataset back to a "normal" dataset.
|
Modifier and Type | Method and Description |
---|---|
List<Dataset> |
MostSimilarDataset.splitIntoDatasets(int n) |
Modifier and Type | Method and Description |
---|---|
void |
SimilarityEvaluator.addCrossfolds(Dataset ds,
int numFolds)
Adds a crossfold validation of a particular dataset.
|
void |
MostSimilarEvaluator.addCrossfolds(Dataset ds,
int numFolds)
Adds a crossfold validation of a particular dataset.
|
abstract void |
Evaluator.addCrossfolds(Dataset ds,
int numFolds) |
void |
PretrainedSRFactory.PretrainedMetric.trainMostSimilar(Dataset dataset,
int numResults,
gnu.trove.set.TIntSet validIds) |
void |
PretrainedSRFactory.PretrainedMetric.trainSimilarity(Dataset dataset) |
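The `addCrossfolds(Dataset ds, int numFolds)` methods above register a cross-fold validation of a dataset. As a sketch of the general technique, each subset serves once as the test set while the remaining subsets are merged into the training set. `CrossfoldSketch` and `Fold` are hypothetical names that loosely mirror the train/test pairing of `Split`, and how the evaluators actually build their folds is assumed, not confirmed.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-ins; these are not WikiBrain's Evaluator or Split classes.
class CrossfoldSketch {

    /** One train/test pairing. */
    static final class Fold<T> {
        final List<T> train;
        final List<T> test;

        Fold(List<T> train, List<T> test) {
            this.train = train;
            this.test = test;
        }
    }

    /** Builds one fold per subset: that subset is the test set, the rest train. */
    static <T> List<Fold<T>> crossfolds(List<List<T>> subsets) {
        List<Fold<T>> folds = new ArrayList<>();
        for (int i = 0; i < subsets.size(); i++) {
            List<T> train = new ArrayList<>();
            for (int j = 0; j < subsets.size(); j++) {
                if (j != i) {
                    train.addAll(subsets.get(j));
                }
            }
            folds.add(new Fold<>(train, new ArrayList<>(subsets.get(i))));
        }
        return folds;
    }

    public static void main(String[] args) {
        List<List<Integer>> subsets = new ArrayList<>();
        subsets.add(Arrays.asList(1, 2));
        subsets.add(Arrays.asList(3));
        subsets.add(Arrays.asList(4, 5));
        for (Fold<Integer> fold : crossfolds(subsets)) {
            System.out.println("train=" + fold.train + " test=" + fold.test);
        }
    }
}
```

Combined with a shuffle-and-split step, this gives k train/test pairs in which every item is tested exactly once.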
Constructor and Description |
---|
MostSimilarDataset(Dataset dataset) |
Split(String name,
String group,
Dataset train,
Dataset test) |
Constructor and Description |
---|
MostSimilarDataset(List<Dataset> datasets)
Creates a new most similar dataset based on some input datasets.
|
MostSimilarDataset(List<Dataset> datasets,
double threshold)
Creates a new most similar dataset based on some input datasets.
|
Modifier and Type | Method and Description |
---|---|
void |
SimpleMilneWitten.trainMostSimilar(Dataset dataset,
int numResults,
gnu.trove.set.TIntSet validIds) |
void |
MilneWittenMetric.trainMostSimilar(Dataset dataset,
int numResults,
gnu.trove.set.TIntSet validIds) |
void |
SimpleMilneWitten.trainSimilarity(Dataset dataset) |
void |
MilneWittenMetric.trainSimilarity(Dataset dataset) |
Modifier and Type | Method and Description |
---|---|
void |
SrNormalizers.trainMostSimilar(SRMetric metric,
Disambiguator disambiguator,
Dataset dataset,
gnu.trove.set.TIntSet validIds,
int maxResults) |
void |
SrNormalizers.trainSimilarity(SRMetric metric,
Dataset dataset) |
Modifier and Type | Method and Description |
---|---|
void |
VectorBasedSRMetric.trainMostSimilar(Dataset dataset,
int numResults,
gnu.trove.set.TIntSet validIds) |
void |
VectorBasedSRMetric.trainSimilarity(Dataset dataset)
Train the similarity() function.
|
Copyright © 2014. All rights reserved.