You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Hoss Man (JIRA)" <ji...@apache.org> on 2007/06/07 07:13:27 UTC

[jira] Commented: (LUCENE-915) PorterStemmer is incorrectly truncating words ending in e

    [ https://issues.apache.org/jira/browse/LUCENE-915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502221 ] 

Hoss Man commented on LUCENE-915:
---------------------------------

can you elaborate on why you think this is a bug?  

This is a fairly basic function of the Porter Stemming Algorithm, and exists in the official java version of the algorith published by Martin Porter...

http://www.tartarus.org/~martin/PorterStemmer/java.txt

(you may not arge with Porter's decision to strip trailing Es, but it's in the algorithm, and the class implements the algorithm)

> PorterStemmer is incorrectly truncating words ending in e
> ---------------------------------------------------------
>
>                 Key: LUCENE-915
>                 URL: https://issues.apache.org/jira/browse/LUCENE-915
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: Index, QueryParser, Search
>    Affects Versions: 1.9
>         Environment: Java 1.5 on Mac OS X 10.4.
>            Reporter: Paul Curren
>
> Searching for the word 'orange' will result incorrectly in matches for 'orang'.
> Likewise, searching for 'apple' will incorrectly match 'appl'
> The problem is in step6() of the PorterStemmer class.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org