public static class WhitespaceTokenizer.WhitespaceTokenizerFactory<T extends HasWord> extends Object implements TokenizerFactory<T>
Constructor and Description |
---|
WhitespaceTokenizerFactory(LexedTokenFactory<T> factory) |
WhitespaceTokenizerFactory(LexedTokenFactory<T> factory,
boolean tokenizeNLs) |
WhitespaceTokenizerFactory(LexedTokenFactory<T> factory,
String options) |
Modifier and Type | Method and Description |
---|---|
Iterator<T> |
getIterator(Reader r)
Return an iterator over the contents read from r.
|
Tokenizer<T> |
getTokenizer(Reader r)
Get a tokenizer for this reader.
|
Tokenizer<T> |
getTokenizer(Reader r,
String extraOptions)
Get a tokenizer for this reader.
|
static TokenizerFactory<Word> |
newTokenizerFactory()
Constructs a new TokenizerFactory that returns Word objects and
treats carriage returns as normal whitespace.
|
void |
setOptions(String options)
Sets default options for how tokenizers built from this factory should behave.
|
public WhitespaceTokenizerFactory(LexedTokenFactory<T> factory)
public WhitespaceTokenizerFactory(LexedTokenFactory<T> factory, String options)
public WhitespaceTokenizerFactory(LexedTokenFactory<T> factory, boolean tokenizeNLs)
public static TokenizerFactory<Word> newTokenizerFactory()
public Iterator<T> getIterator(Reader r)
IteratorFromReaderFactory
getIterator
in interface IteratorFromReaderFactory<T extends HasWord>
r
- Where to read objects frompublic Tokenizer<T> getTokenizer(Reader r)
TokenizerFactory
getTokenizer
in interface TokenizerFactory<T extends HasWord>
r
- A Reader (which is assumed to already by buffered, if appropriate)public Tokenizer<T> getTokenizer(Reader r, String extraOptions)
TokenizerFactory
getTokenizer
in interface TokenizerFactory<T extends HasWord>
r
- A Reader (which is assumed to already by buffered, if appropriate)extraOptions
- Options for how this tokenizer should behavepublic void setOptions(String options)
TokenizerFactory
setOptions
in interface TokenizerFactory<T extends HasWord>
options
- Options for how this tokenizer should behave