public class PreprocessedDocumentScanner extends Object
PreprocessingContext.| Modifier and Type | Field and Description |
|---|---|
static com.carrotsearch.hppc.predicates.ShortPredicate |
ON_DOCUMENT_SEPARATOR
Predicate for splitting on document separator.
|
static com.carrotsearch.hppc.predicates.ShortPredicate |
ON_FIELD_SEPARATOR
Predicate for splitting on field separator.
|
static com.carrotsearch.hppc.predicates.ShortPredicate |
ON_SENTENCE_SEPARATOR
Predicate for splitting on sentence separator.
|
| Constructor and Description |
|---|
PreprocessedDocumentScanner() |
| Modifier and Type | Method and Description |
|---|---|
protected void |
document(PreprocessingContext context,
int start,
int length)
Invoked for each document.
|
static com.carrotsearch.hppc.predicates.ShortPredicate |
equalTo(short t)
Return a new
ShortPredicate returning true
if the argument equals a given value. |
protected void |
field(PreprocessingContext context,
int start,
int length)
Invoked for each document's field.
|
void |
iterate(PreprocessingContext context)
Iterate over all documents, fields and sentences in
PreprocessingContext.allTokens. |
protected void |
sentence(PreprocessingContext context,
int start,
int length)
Invoked for each document's sentence.
|
public static final com.carrotsearch.hppc.predicates.ShortPredicate ON_DOCUMENT_SEPARATOR
public static final com.carrotsearch.hppc.predicates.ShortPredicate ON_FIELD_SEPARATOR
public static final com.carrotsearch.hppc.predicates.ShortPredicate ON_SENTENCE_SEPARATOR
public static final com.carrotsearch.hppc.predicates.ShortPredicate equalTo(short t)
ShortPredicate returning true
if the argument equals a given value.public final void iterate(PreprocessingContext context)
PreprocessingContext.allTokens.protected void document(PreprocessingContext context, int start, int length)
protected void field(PreprocessingContext context, int start, int length)
protected void sentence(PreprocessingContext context, int start, int length)