Posted to dev@lucene.apache.org by "Hoss Man (JIRA)" <ji...@apache.org> on 2007/06/07 07:13:27 UTC
[jira] Commented: (LUCENE-915) PorterStemmer is incorrectly truncating words ending in e
[ https://issues.apache.org/jira/browse/LUCENE-915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502221 ]
Hoss Man commented on LUCENE-915:
---------------------------------
Can you elaborate on why you think this is a bug?
This is a fairly basic function of the Porter Stemming Algorithm, and it exists in the official Java version of the algorithm published by Martin Porter...
http://www.tartarus.org/~martin/PorterStemmer/java.txt
(you may not agree with Porter's decision to strip trailing Es, but it's in the algorithm, and the class implements the algorithm)
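
For illustration, here is a minimal sketch of the trailing-e rule on its own, assuming lowercase ASCII input. The class and method names (FinalEStep, stripFinalE, and so on) are made up for this example and are not Lucene's API; it covers only the final-e condition from the last step of the algorithm, not the earlier suffix steps.

```java
// Hypothetical, stand-alone sketch of the Porter rule that drops a final 'e'.
// Assumes lowercase ASCII words; not the Lucene PorterStemmer class itself.
class FinalEStep {

    // A consonant is any letter other than a, e, i, o, u,
    // and other than 'y' preceded by a consonant.
    static boolean isConsonant(String s, int i) {
        switch (s.charAt(i)) {
            case 'a': case 'e': case 'i': case 'o': case 'u':
                return false;
            case 'y':
                return i == 0 || !isConsonant(s, i - 1);
            default:
                return true;
        }
    }

    // Porter's measure m: the number of vowel-run -> consonant-run
    // transitions in the string, i.e. m in [C](VC)^m[V].
    static int measure(String s) {
        int m = 0;
        boolean prevVowel = false;
        for (int i = 0; i < s.length(); i++) {
            boolean cons = isConsonant(s, i);
            if (cons && prevVowel) {
                m++;
            }
            prevVowel = !cons;
        }
        return m;
    }

    // True if s ends consonant-vowel-consonant and the last
    // consonant is not w, x or y.
    static boolean endsCvc(String s) {
        int n = s.length();
        if (n < 3) {
            return false;
        }
        if (!isConsonant(s, n - 1) || isConsonant(s, n - 2) || !isConsonant(s, n - 3)) {
            return false;
        }
        char last = s.charAt(n - 1);
        return last != 'w' && last != 'x' && last != 'y';
    }

    // Drop a final 'e' when m > 1, or when m == 1 and the stem
    // does not end in a cvc pattern.
    static String stripFinalE(String word) {
        if (!word.endsWith("e")) {
            return word;
        }
        String stem = word.substring(0, word.length() - 1);
        int m = measure(stem);
        if (m > 1 || (m == 1 && !endsCvc(stem))) {
            return stem;
        }
        return word;
    }

    public static void main(String[] args) {
        System.out.println(stripFinalE("orange")); // orang
        System.out.println(stripFinalE("apple"));  // appl
        System.out.println(stripFinalE("rate"));   // rate
    }
}
```

Under this rule 'orange' and 'apple' lose their final e by design, while a short word like 'rate' keeps it because its stem ends consonant-vowel-consonant.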
> PorterStemmer is incorrectly truncating words ending in e
> ---------------------------------------------------------
>
> Key: LUCENE-915
> URL: https://issues.apache.org/jira/browse/LUCENE-915
> Project: Lucene - Java
> Issue Type: Bug
> Components: Index, QueryParser, Search
> Affects Versions: 1.9
> Environment: Java 1.5 on Mac OS X 10.4.
> Reporter: Paul Curren
>
> Searching for the word 'orange' will result incorrectly in matches for 'orang'.
> Likewise, searching for 'apple' will incorrectly match 'appl'
> The problem is in step6() of the PorterStemmer class.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.