@Bindable(prefix="CaseNormalizer") public final class CaseNormalizer extends Object
This class saves the following results to the PreprocessingContext:
PreprocessingContext.AllTokens.wordIndexPreprocessingContext.AllWords.imagePreprocessingContext.AllWords.tfPreprocessingContext.AllWords.tfByDocument
This class requires that Tokenizer be invoked first.
| Modifier and Type | Field and Description |
|---|---|
int |
dfThreshold
Word Document Frequency threshold.
|
| Constructor and Description |
|---|
CaseNormalizer() |
| Modifier and Type | Method and Description |
|---|---|
void |
normalize(PreprocessingContext context)
Performs normalization and saves the results to the
context. |
@Processing @Input @Attribute @IntRange(min=1, max=100) @Label(value="Word document frequency threshold") @Level(value=ADVANCED) @Group(value="Preprocessing") public int dfThreshold
dfThreshold documents will be ignored.public void normalize(PreprocessingContext context)
context.