@Bindable(prefix="PreprocessingPipeline") public class BasicPreprocessingPipeline extends Object implements IPreprocessingPipeline
| Modifier and Type | Field and Description |
|---|---|
| `CaseNormalizer` | `caseNormalizer` Case normalizer used by the algorithm, contains bindable attributes. |
| `LanguageModelStemmer` | `languageModelStemmer` Stemmer used by the algorithm, contains bindable attributes. |
| `ILexicalDataFactory` | `lexicalDataFactory` Lexical data factory. |
| `IStemmerFactory` | `stemmerFactory` Stemmer factory. |
| `StopListMarker` | `stopListMarker` Stop list marker used by the algorithm, contains bindable attributes. |
| `Tokenizer` | `tokenizer` Tokenizer used by the algorithm, contains bindable attributes. |
| `ITokenizerFactory` | `tokenizerFactory` Tokenizer factory. |
| Constructor and Description |
|---|
| `BasicPreprocessingPipeline()` |
| Modifier and Type | Method and Description |
|---|---|
| `PreprocessingContext` | `preprocess(List<Document> documents, String query, LanguageCode language)` Performs preprocessing on the provided list of documents. |
public final Tokenizer tokenizer
public final CaseNormalizer caseNormalizer
public final LanguageModelStemmer languageModelStemmer
public final StopListMarker stopListMarker
@Input @Processing @Internal @Attribute @Level(value=ADVANCED) @Group(value="Preprocessing") public ITokenizerFactory tokenizerFactory
@Input @Processing @Internal @Attribute @Level(value=ADVANCED) @Group(value="Preprocessing") public IStemmerFactory stemmerFactory
@Input @Processing @Internal @Attribute @Level(value=ADVANCED) @Group(value="Preprocessing") public ILexicalDataFactory lexicalDataFactory
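The three factory fields above are public, non-final attributes, so a caller can replace them before preprocessing runs. The following is a minimal, self-contained sketch of that pattern only. Every type in it (`FactorySketch`, `TokenizerFactory`, `WhitespaceTokenizerFactory`, `Pipeline`) is a hypothetical stand-in, not a Carrot2 class, and the language code is modelled as a plain `String` for brevity.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

/** Hypothetical stand-ins only; none of these are Carrot2 types. */
final class FactorySketch {
    /** Plays the role of a tokenizer factory: maps a language code to a tokenizing function. */
    interface TokenizerFactory {
        Function<String, List<String>> getTokenizer(String languageCode);
    }

    /** Assumed default for this sketch: split on whitespace regardless of language. */
    static final class WhitespaceTokenizerFactory implements TokenizerFactory {
        @Override
        public Function<String, List<String>> getTokenizer(String languageCode) {
            return text -> Arrays.asList(text.trim().split("\\s+"));
        }
    }

    static final class Pipeline {
        /** Public, non-final field so that callers can swap in another factory. */
        public TokenizerFactory tokenizerFactory = new WhitespaceTokenizerFactory();

        List<String> tokenize(String text, String languageCode) {
            // The factory is consulted on every call, so a replacement takes effect immediately.
            return tokenizerFactory.getTokenizer(languageCode).apply(text);
        }
    }

    public static void main(String[] args) {
        Pipeline pipeline = new Pipeline();
        System.out.println(pipeline.tokenize("data mining, text mining", "en"));

        // Replace the factory: split on commas for every language.
        pipeline.tokenizerFactory = lang -> text -> Arrays.asList(text.split("\\s*,\\s*"));
        System.out.println(pipeline.tokenize("data mining, text mining", "en"));
    }
}
```

Holding a factory in the field, instead of a ready-made tokenizer, lets the component be chosen per call from the language argument.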
public PreprocessingContext preprocess(List<Document> documents, String query, LanguageCode language)
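Performs preprocessing on the provided list of documents and returns the results in a PreprocessingContext.
Specified by: preprocess in interface IPreprocessingPipeline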
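To illustrate what a pipeline built from these four components does, here is a minimal, self-contained sketch that runs tokenization, case normalization, stemming, and stop-word marking in the order the fields are declared above. That ordering is an assumption taken from the field listing, not a statement about the library's internals. `PipelineSketch`, `Token`, the toy stop list, and the naive plural-stripping stemmer are all hypothetical and far simpler than the real components.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

/** Hypothetical stand-in for a four-stage preprocessing pipeline; not a Carrot2 class. */
final class PipelineSketch {
    /** One token plus the derived data that each stage fills in. */
    static final class Token {
        final String image;   // original text, set by tokenization
        String normalized;    // set by case normalization
        String stem;          // set by stemming
        boolean stopWord;     // set by stop-list marking
        Token(String image) { this.image = image; }
    }

    /** Assumed toy stop list for the example. */
    static final Set<String> STOP_WORDS = Set.of("the", "of", "a", "and");

    /** Stage 1: split on runs of characters that are neither letters nor digits. */
    static List<Token> tokenize(String text) {
        List<Token> tokens = new ArrayList<>();
        for (String part : text.split("[^\\p{L}\\p{Nd}]+")) {
            if (!part.isEmpty()) tokens.add(new Token(part));
        }
        return tokens;
    }

    /** Stage 2: lower-case every token image. */
    static void normalizeCase(List<Token> tokens) {
        for (Token t : tokens) t.normalized = t.image.toLowerCase(Locale.ROOT);
    }

    /** Stage 3: naive stemmer that strips a trailing "s" from words longer than three letters. */
    static void stem(List<Token> tokens) {
        for (Token t : tokens) {
            String w = t.normalized;
            t.stem = (w.length() > 3 && w.endsWith("s")) ? w.substring(0, w.length() - 1) : w;
        }
    }

    /** Stage 4: flag tokens whose normalized form is on the stop list. */
    static void markStopWords(List<Token> tokens) {
        for (Token t : tokens) t.stopWord = STOP_WORDS.contains(t.normalized);
    }

    /** Runs all four stages over the documents and returns the annotated tokens. */
    static List<Token> preprocess(List<String> documents) {
        List<Token> all = new ArrayList<>();
        for (String doc : documents) all.addAll(tokenize(doc));
        normalizeCase(all);
        stem(all);
        markStopWords(all);
        return all;
    }

    public static void main(String[] args) {
        for (Token t : preprocess(List.of("The Clusters of Documents"))) {
            System.out.println(t.image + " -> " + t.stem + (t.stopWord ? " (stop word)" : ""));
        }
    }
}
```

Each stage only reads what earlier stages wrote, which is why the order matters: stemming and stop-word marking both work on the lower-cased form.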