org.apache.lucene.analysis
public class: WhitespaceTokenizer
java.lang.Object
  org.apache.lucene.util.AttributeSource
    org.apache.lucene.analysis.TokenStream
      org.apache.lucene.analysis.Tokenizer
        org.apache.lucene.analysis.CharTokenizer
          org.apache.lucene.analysis.WhitespaceTokenizer
All Implemented Interfaces:
Closeable
A WhitespaceTokenizer is a tokenizer that divides text at whitespace, as determined by Character.isWhitespace.
Each maximal run of adjacent non-whitespace characters forms one token.
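
The splitting rule described above can be illustrated without Lucene on the classpath. The following is a minimal sketch, not the library's implementation: the class `WhitespaceSplitSketch` and its `tokenize` helper are hypothetical names introduced here, and the sketch ignores attributes, offsets, buffering, and anything else the real CharTokenizer machinery handles. It only reuses the same predicate as `isTokenChar`.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-alone sketch; not part of Lucene.
class WhitespaceSplitSketch {

    // Same predicate as WhitespaceTokenizer.isTokenChar.
    static boolean isTokenChar(char c) {
        return !Character.isWhitespace(c);
    }

    // Collects each maximal run of token characters as one token.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (isTokenChar(c)) {
                current.append(c);
            } else if (current.length() > 0) {
                // Whitespace ends the current token, if one is open.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [The, quick, brown, fox]
        System.out.println(tokenize("  The quick\tbrown\nfox  "));
    }
}
```

Leading, trailing, and repeated whitespace produce no empty tokens, because a token is only emitted when at least one token character has been collected.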
| Method from org.apache.lucene.analysis.WhitespaceTokenizer Summary: |
|---|
| isTokenChar |
| Methods from org.apache.lucene.util.AttributeSource: |
|---|
| addAttribute, addAttributeImpl, captureState, clearAttributes, cloneAttributes, equals, getAttribute, getAttributeClassesIterator, getAttributeFactory, getAttributeImplsIterator, hasAttribute, hasAttributes, hashCode, restoreState, toString |
| Methods from java.lang.Object: |
|---|
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Method from org.apache.lucene.analysis.WhitespaceTokenizer Detail:
protected boolean isTokenChar(char c) {
    // A character belongs to a token unless Character.isWhitespace classifies it as whitespace.
    return !Character.isWhitespace(c);
}
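
Because the method delegates entirely to `Character.isWhitespace`, it inherits that method's definition of whitespace, which excludes non-breaking spaces such as U+00A0. The sketch below is a stand-alone illustration under that assumption; `IsTokenCharSketch` is a hypothetical class that copies the predicate so it can be called without Lucene.

```java
// Hypothetical stand-alone sketch; copies the predicate shown above.
class IsTokenCharSketch {

    static boolean isTokenChar(char c) {
        return !Character.isWhitespace(c);
    }

    public static void main(String[] args) {
        System.out.println(isTokenChar(' '));      // false: space separates tokens
        System.out.println(isTokenChar('\t'));     // false: tab separates tokens
        System.out.println(isTokenChar('\n'));     // false: line feed separates tokens
        System.out.println(isTokenChar('a'));      // true: letters are token characters
        System.out.println(isTokenChar('-'));      // true: punctuation is not whitespace
        System.out.println(isTokenChar('\u00A0')); // true: non-breaking space is not Java whitespace
    }
}
```

One consequence is that text joined by a non-breaking space stays in a single token, and punctuation remains attached to neighboring words.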