public class PreprocessingContext.AllPhrases extends Object
PreprocessingContext.documents. Each entry in each array corresponds to one
sequence.
All arrays in this class have the same length and values across different arrays correspond to each other for the same index.
| Modifier and Type | Field and Description |
|---|---|
int[] |
tf
Term frequency of the phrase.
|
int[][] |
tfByDocument
Term frequency of the phrase for each document.
|
int[][] |
wordIndices
Pointers to
PreprocessingContext.AllWords for each word in the phrase sequence. |
| Constructor and Description |
|---|
PreprocessingContext.AllPhrases() |
| Modifier and Type | Method and Description |
|---|---|
CharSequence |
getPhrase(int index)
Returns space-separated words that constitute this phrase.
|
int |
size()
Returns length of all arrays in this
PreprocessingContext.AllPhrases. |
String |
toString()
For debugging purposes.
|
public int[][] wordIndices
PreprocessingContext.AllWords for each word in the phrase sequence.
This array is produced by PhraseExtractor.
public int[] tf
This array is produced by PhraseExtractor.
public int[][] tfByDocument
PreprocessingContext.AllWords.tfByDocument: consecutive pairs of:
document index, frequency.
This array is produced by PhraseExtractor. The order of documents in this
array is not defined.
public CharSequence getPhrase(int index)
public int size()
PreprocessingContext.AllPhrases.