AnalyzerUtil | Various fulltext analysis utilities avoiding redundant code in several classes. | code | html |
AnalyzerUtil.MutableInteger | | code | html |
MemoryIndex | High-performance single-document main memory Apache Lucene fulltext search index. | code | html |
MemoryIndex.ArrayIntList | Efficient resizable auto-expanding list holding int elements; implemented with arrays. | code | html |
MemoryIndex.Info | Index data structure for a field; contains the tokenized term texts and their positions. | code | html |
MemoryIndex.MemoryIndexReader | Search support for Lucene framework integration; implements all methods required by the Lucene IndexReader contracts. | code | html |
MemoryIndex.VM | | code | html |
PatternAnalyzer | Efficient Lucene analyzer/tokenizer that preferably operates on a String rather than a java.io.Reader, that can flexibly separate text into terms via a regular expression Pattern (with behaviour identical to String#split(String)), and that combines the functionality of org.apache.lucene.analysis.LetterTokenizer, org.apache.lucene.analysis.LowerCaseTokenizer, org.apache.lucene.analysis.WhitespaceTokenizer, and org.apache.lucene.analysis.StopFilter into a single efficient multi-purpose class. | code | html |
PatternAnalyzer.FastStringReader | A StringReader that exposes its contained string for fast direct access. | code | html |
PatternAnalyzer.FastStringTokenizer | Special-case class for best performance in common cases; this class is otherwise unnecessary. | code | html |
PatternAnalyzer.PatternTokenizer | The workhorse; performance isn't fantastic, but it's not nearly as bad as one might think - kudos to the Sun regex developers. | code | html |
SynonymMap | Loads the WordNet Prolog file wn_s.pl into a thread-safe main-memory hash map that can be used for fast high-frequency lookups of synonyms for any given (lowercase) word string. | code | html |
SynonymTokenFilter | Injects additional tokens for synonyms of token terms fetched from the underlying child stream; the child stream must deliver lowercase tokens for synonyms to be found. | code | html |
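
The PatternAnalyzer entry above describes regex-based term separation combined with lowercasing and stop-word removal. The following is a minimal sketch of that idea using only java.util.regex, not the Lucene API. The class name PatternSplitSketch, the method tokenize, and its parameters are hypothetical names chosen for illustration. It also assumes that empty pieces should be dropped, whereas String#split(String) can return a leading empty string.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.regex.Pattern;

public class PatternSplitSketch {

    // Hypothetical helper (not part of Lucene): splits text on a separator
    // pattern, optionally lowercases each piece, and drops stop words.
    public static List<String> tokenize(String text, Pattern separator,
                                        boolean toLowerCase, Set<String> stopWords) {
        List<String> terms = new ArrayList<>();
        for (String piece : separator.split(text)) {
            // Assumption: skip empty pieces, e.g. the leading empty string
            // produced when the text starts with a separator.
            if (piece.isEmpty()) {
                continue;
            }
            String term = toLowerCase ? piece.toLowerCase(Locale.ROOT) : piece;
            // Stop words are compared after lowercasing, so the set should be lowercase.
            if (stopWords.contains(term)) {
                continue;
            }
            terms.add(term);
        }
        return terms;
    }

    public static void main(String[] args) {
        Pattern nonWord = Pattern.compile("\\W+");
        Set<String> stopWords = new HashSet<>(Arrays.asList("the", "a", "of"));
        List<String> terms = tokenize("The Quick brown fox, a friend of Lucene",
                                      nonWord, true, stopWords);
        System.out.println(terms); // prints [quick, brown, fox, friend, lucene]
    }
}
```

Doing the split, case folding, and stop-word check in one pass avoids chaining separate tokenizer and filter stages, which is the kind of consolidation the PatternAnalyzer description points to.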