You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-dev@lucene.apache.org by "Hoss Man (JIRA)" <ji...@apache.org> on 2008/12/18 09:52:44 UTC

[jira] Resolved: (SOLR-876) Add ability to optionally splitOnNumerics WordDelimiterFilter/Factory

     [ https://issues.apache.org/jira/browse/SOLR-876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hoss Man resolved SOLR-876.
---------------------------

       Resolution: Fixed
    Fix Version/s: 1.4
         Assignee: Hoss Man

Thanks for the great patch Dan.

Committed revision 727677.

(i also added a few more tests to prove to myself that protwords was really working)

> Add ability to optionally splitOnNumerics WordDelimiterFilter/Factory
> ---------------------------------------------------------------------
>
>                 Key: SOLR-876
>                 URL: https://issues.apache.org/jira/browse/SOLR-876
>             Project: Solr
>          Issue Type: Improvement
>          Components: search
>            Reporter: Dan Rosher
>            Assignee: Hoss Man
>            Priority: Minor
>             Fix For: 1.4
>
>         Attachments: SOLR-876.patch, SOLR-876.patch, SOLR-876.patch
>
>
> Add ability to optionally splitOnNumerics WordDelimiterFilter/Factory
> Default behaviour is to splitOnNumerics as WordDelimiterFilter/Factory does now
> I was having issues with e.g. Java/J2SE becoming split into tokens 'Java','J' '2' and 'SE'  which isn't desired behavior in my instance, I wanted this to be tokens 'Java', 'J2SE'. Another option I thought about but not implemented was to have a protected list of words like solr.EnglishPorterFilterFactory

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.