org.crosswire.jsword.index.lucene.analysis
Class ChineseLuceneAnalyzer
java.lang.Object
org.apache.lucene.analysis.Analyzer
org.crosswire.jsword.index.lucene.analysis.AbstractBookAnalyzer
org.crosswire.jsword.index.lucene.analysis.ChineseLuceneAnalyzer
public class ChineseLuceneAnalyzer
- extends AbstractBookAnalyzer
Uses org.apache.lucene.analysis.cn.ChineseAnalyzer
Analysis: ChineseTokenizer, ChineseFilter
StopFilter, Stemming not implemented yet
Note: org.apache.lucene.analysis.cn.CJKAnalyzer takes overlapping two character tokenization approach
which leads to larger index size.
- Author:
- Sijo Cherian [sijocherian at yahoo dot com]
- See Also:
for license details.
The copyright to this program is held by it's authors.
Field Summary |
private org.apache.lucene.analysis.cn.ChineseAnalyzer |
myAnalyzer
|
Methods inherited from class org.apache.lucene.analysis.Analyzer |
getPositionIncrementGap, getPreviousTokenStream, reusableTokenStream, setPreviousTokenStream |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
myAnalyzer
private org.apache.lucene.analysis.cn.ChineseAnalyzer myAnalyzer
ChineseLuceneAnalyzer
public ChineseLuceneAnalyzer()
tokenStream
public final org.apache.lucene.analysis.TokenStream tokenStream(String fieldName,
Reader reader)
- Specified by:
tokenStream
in class org.apache.lucene.analysis.Analyzer