|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.lucene.util.AttributeSource
org.apache.lucene.analysis.TokenStream
org.apache.lucene.analysis.TokenFilter
org.apache.lucene.analysis.fr.ElisionFilter
public class ElisionFilter
Removes elisions from a TokenStream. For example, "l'avion" (the plane) will be
tokenized as "avion" (plane).
Note that StandardTokenizer sees " ' " as a space, and cuts it out.
| Nested Class Summary |
|---|
| Nested classes/interfaces inherited from class org.apache.lucene.util.AttributeSource |
|---|
org.apache.lucene.util.AttributeSource.AttributeFactory, org.apache.lucene.util.AttributeSource.State |
| Field Summary |
|---|
| Fields inherited from class org.apache.lucene.analysis.TokenFilter |
|---|
input |
| Constructor Summary | |
|---|---|
protected |
ElisionFilter(org.apache.lucene.analysis.TokenStream input)
Constructs an elision filter with standard stop words |
|
ElisionFilter(org.apache.lucene.analysis.TokenStream input,
Set articles)
Constructs an elision filter with a Set of stop words |
|
ElisionFilter(org.apache.lucene.analysis.TokenStream input,
String[] articles)
Constructs an elision filter with an array of stop words |
| Method Summary | |
|---|---|
boolean |
incrementToken()
Increments the TokenStream with a TermAttribute without elisioned start |
org.apache.lucene.analysis.Token |
next()
Deprecated. Will be removed in Lucene 3.0. This method is final, as it should not be overridden. Delegates to the backwards compatibility layer. |
org.apache.lucene.analysis.Token |
next(org.apache.lucene.analysis.Token reusableToken)
Deprecated. Will be removed in Lucene 3.0. This method is final, as it should not be overridden. Delegates to the backwards compatibility layer. |
void |
setArticles(Set articles)
|
| Methods inherited from class org.apache.lucene.analysis.TokenFilter |
|---|
close, end, reset |
| Methods inherited from class org.apache.lucene.analysis.TokenStream |
|---|
getOnlyUseNewAPI, setOnlyUseNewAPI |
| Methods inherited from class org.apache.lucene.util.AttributeSource |
|---|
addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, restoreState, toString |
| Methods inherited from class java.lang.Object |
|---|
clone, finalize, getClass, notify, notifyAll, wait, wait, wait |
| Constructor Detail |
|---|
protected ElisionFilter(org.apache.lucene.analysis.TokenStream input)
public ElisionFilter(org.apache.lucene.analysis.TokenStream input,
Set articles)
public ElisionFilter(org.apache.lucene.analysis.TokenStream input,
String[] articles)
| Method Detail |
|---|
public void setArticles(Set articles)
public final boolean incrementToken()
throws IOException
TokenStream with a TermAttribute without elisioned start
incrementToken in class org.apache.lucene.analysis.TokenStreamIOException
public final org.apache.lucene.analysis.Token next(org.apache.lucene.analysis.Token reusableToken)
throws IOException
next in class org.apache.lucene.analysis.TokenStreamIOException
public final org.apache.lucene.analysis.Token next()
throws IOException
next in class org.apache.lucene.analysis.TokenStreamIOException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||