You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by bu...@apache.org on 2003/10/13 17:25:50 UTC
DO NOT REPLY [Bug 23782] New: -
[PATCH] KStem for Lucene
DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23782>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND
INSERTED IN THE BUG DATABASE.
http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23782
[PATCH] KStem for Lucene
Summary: [PATCH] KStem for Lucene
Product: Lucene
Version: unspecified
Platform: Other
OS/Version: Other
Status: NEW
Severity: Enhancement
Priority: Other
Component: Analysis
AssignedTo: lucene-dev@jakarta.apache.org
ReportedBy: otis@apache.org
September 10th 2003 contributionn from "Sergio Guzman-Lara" <gu...@cs.umass.edu>
Original email:
Hi all,
I have ported the kstem stemmer to Java and incorporated it to
Lucene. You can get the source code (Kstem.jar) from the following website:
http://ciir.cs.umass.edu/downloads/
Just click on "KStem Java Implementation" (you will need to register
your e-mail, for free of course, with the CIIR --Center for Intelligent
Information Retrieval, UMass -- and get an access code).
Content of Kstem.jar:
java/org/apache/lucene/analysis/KStemData1.java
java/org/apache/lucene/analysis/KStemData2.java
java/org/apache/lucene/analysis/KStemData3.java
java/org/apache/lucene/analysis/KStemData4.java
java/org/apache/lucene/analysis/KStemData5.java
java/org/apache/lucene/analysis/KStemData6.java
java/org/apache/lucene/analysis/KStemData7.java
java/org/apache/lucene/analysis/KStemData8.java
java/org/apache/lucene/analysis/KStemFilter.java
java/org/apache/lucene/analysis/KStemmer.java
KStemData1.java, ..., KStemData8.java Contain several lists of words
used by Kstem
KStemmer.java Implements the Kstem algorithm
KStemFilter.java Extends TokenFilter applying Kstem
To compile
unjar the file Kstem.jar to Lucene's "src" directory, and compile it
there.
What is Kstem?
A stemmer designed by Bob Krovetz (for more information see
http://ciir.cs.umass.edu/pubfiles/ir-35.pdf).
Copyright issues
This is open source. The actual license agreement is included at the
top of every source file.
Any comments/questions/suggestions are welcome,
Sergio Guzman-Lara
Senior Research Fellow
CIIR UMass
---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org