@Bindable(prefix="PreprocessingPipeline") public class CompletePreprocessingPipeline extends BasicPreprocessingPipeline
Tokenizer.tokenize(PreprocessingContext)CaseNormalizer.normalize(PreprocessingContext)LanguageModelStemmer.stem(PreprocessingContext)StopListMarker.mark(PreprocessingContext)PhraseExtractor.extractPhrases(PreprocessingContext)LabelFilterProcessor.process(PreprocessingContext)DocumentAssigner.assign(PreprocessingContext)| Modifier and Type | Field and Description |
|---|---|
DocumentAssigner |
documentAssigner
Document assigner used by the algorithm, contains bindable attributes.
|
LabelFilterProcessor |
labelFilterProcessor
Label filter processor used by the algorithm, contains bindable attributes.
|
PhraseExtractor |
phraseExtractor
Phrase extractor used by the algorithm, contains bindable attributes.
|
caseNormalizer, languageModelStemmer, lexicalDataFactory, stemmerFactory, stopListMarker, tokenizer, tokenizerFactory| Constructor and Description |
|---|
CompletePreprocessingPipeline() |
| Modifier and Type | Method and Description |
|---|---|
PreprocessingContext |
preprocess(List<Document> documents,
String query,
LanguageCode language)
Performs preprocessing on the provided list of documents.
|
public final PhraseExtractor phraseExtractor
public final LabelFilterProcessor labelFilterProcessor
public final DocumentAssigner documentAssigner
public PreprocessingContext preprocess(List<Document> documents, String query, LanguageCode language)
BasicPreprocessingPipelinePreprocessingContext.preprocess in interface IPreprocessingPipelinepreprocess in class BasicPreprocessingPipeline