org.apache.lucene.analysis.nl
Class DutchStemFilter
java.lang.Object
org.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.TokenFilter
org.apache.lucene.analysis.nl.DutchStemFilter
public final class DutchStemFilter
- extends org.apache.lucene.analysis.TokenFilter
A TokenFilter that stems Dutch words.
It supports a table of words that should
not be stemmed at all. The stemmer used can be changed at runtime after the
filter object is created (as long as it is a DutchStemmer).
| Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource |
org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State |
| Fields inherited from class org.apache.lucene.analysis.TokenFilter |
input |
|
Constructor Summary |
DutchStemFilter(org.apache.lucene.analysis.TokenStream _in)
|
DutchStemFilter(org.apache.lucene.analysis.TokenStream _in,
Set exclusiontable)
Builds a DutchStemFilter that uses an exclusion table. |
DutchStemFilter(org.apache.lucene.analysis.TokenStream _in,
Set exclusiontable,
Map stemdictionary)
|
| Methods inherited from class org.apache.lucene.analysis.TokenFilter |
close, end, reset |
| Methods inherited from class org.apache.lucene.analysis.TokenStream |
getOnlyUseNewAPI, next, next, setOnlyUseNewAPI |
| Methods inherited from class org.apache.lucene.util.AttributeSource |
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, restoreState, toString |
DutchStemFilter
public DutchStemFilter(org.apache.lucene.analysis.TokenStream _in)
DutchStemFilter
public DutchStemFilter(org.apache.lucene.analysis.TokenStream _in,
Set exclusiontable)
- Builds a DutchStemFilter that uses an exclusion table.
DutchStemFilter
public DutchStemFilter(org.apache.lucene.analysis.TokenStream _in,
Set exclusiontable,
Map stemdictionary)
- Parameters:
stemdictionary - Dictionary of word stem pairs, that overrule the algorithm
incrementToken
public boolean incrementToken()
throws IOException
- Returns the next token in the stream, or null at EOS
- Overrides:
incrementToken in class org.apache.lucene.analysis.TokenStream
- Throws:
IOException
setStemmer
public void setStemmer(DutchStemmer stemmer)
- Set a alternative/custom
DutchStemmer for this filter.
setExclusionTable
public void setExclusionTable(HashSet exclusiontable)
- Set an alternative exclusion list for this filter.
setStemDictionary
public void setStemDictionary(HashMap dict)
- Set dictionary for stemming, this dictionary overrules the algorithm,
so you can correct for a particular unwanted word-stem pair.
Copyright © 2000-2010 Apache Software Foundation. All Rights Reserved.