You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by bu...@apache.org on 2003/10/13 17:25:50 UTC

DO NOT REPLY [Bug 23782] New: - [PATCH] KStem for Lucene

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23782>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=23782

[PATCH] KStem for Lucene

           Summary: [PATCH] KStem for Lucene
           Product: Lucene
           Version: unspecified
          Platform: Other
        OS/Version: Other
            Status: NEW
          Severity: Enhancement
          Priority: Other
         Component: Analysis
        AssignedTo: lucene-dev@jakarta.apache.org
        ReportedBy: otis@apache.org


September 10th 2003 contributionn from "Sergio Guzman-Lara" <gu...@cs.umass.edu>

Original email:

Hi all,

  I have ported the kstem stemmer to Java and incorporated it to 
Lucene. You can get the source code (Kstem.jar) from the following website:

http://ciir.cs.umass.edu/downloads/

  Just click on "KStem Java Implementation" (you will need to register 
your e-mail, for free of course, with the CIIR --Center for Intelligent 
Information Retrieval, UMass -- and get an access code).


Content of Kstem.jar:

java/org/apache/lucene/analysis/KStemData1.java
java/org/apache/lucene/analysis/KStemData2.java
java/org/apache/lucene/analysis/KStemData3.java
java/org/apache/lucene/analysis/KStemData4.java
java/org/apache/lucene/analysis/KStemData5.java
java/org/apache/lucene/analysis/KStemData6.java
java/org/apache/lucene/analysis/KStemData7.java
java/org/apache/lucene/analysis/KStemData8.java
java/org/apache/lucene/analysis/KStemFilter.java
java/org/apache/lucene/analysis/KStemmer.java

KStemData1.java, ..., KStemData8.java   Contain several lists of words 
used by Kstem
KStemmer.java      Implements the Kstem algorithm 
KStemFilter.java     Extends TokenFilter applying Kstem


To compile

unjar the file Kstem.jar to Lucene's "src" directory, and compile it 
there. 


What is Kstem?

  A stemmer designed by Bob Krovetz (for more information see 
http://ciir.cs.umass.edu/pubfiles/ir-35.pdf). 


Copyright issues

  This is open source. The actual license agreement is included at the 
top of every source file.


 Any comments/questions/suggestions are welcome,


  Sergio Guzman-Lara
  Senior Research Fellow
  CIIR UMass

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org