You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-dev@lucene.apache.org by "Geoffrey Young (JIRA)" <ji...@apache.org> on 2008/07/03 15:36:45 UTC

[jira] Commented: (SOLR-14) Add the ability to preserve the original term when using WordDelimiterFilter

    [ https://issues.apache.org/jira/browse/SOLR-14?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12610229#action_12610229 ] 

Geoffrey Young commented on SOLR-14:
------------------------------------

looks good from a functional pov.  the ordering of the tokens looks funny (to me) in analysis.jsp, but all the right ones are there.

it still takes my full load roughly twice as long to index (3.5 minutes versus 1.75 minutes for a 120MB file) but the functionality is important enough to incur the cost.

thanks muchly.

> Add the ability to preserve the original term when using WordDelimiterFilter
> ----------------------------------------------------------------------------
>
>                 Key: SOLR-14
>                 URL: https://issues.apache.org/jira/browse/SOLR-14
>             Project: Solr
>          Issue Type: Improvement
>          Components: search
>            Reporter: Richard "Trey" Hyde
>            Assignee: Yonik Seeley
>         Attachments: SOLR-14.patch, SOLR-14.patch, SOLR-14.patch, SOLR-14.patch, TokenizerFactory.java, WordDelimiterFilter.patch, WordDelimiterFilter.patch
>
>
> When doing prefix searching, you need to hang on to the original term othewise you'll miss many matches you should be making.
> Data: ABC-12345
> WordDelimiterFitler may change this into
> ABC 12345 ABC12345
> A user may enter a search such as 
>  ABC\-123*
> Which will fail to find a match given the above scenario.
> The attached patch will allow the use of the "preserveOriginal" option to WordDelimiterFilter and will analyse as
> ABC 12345 ABC12345  ABC-12345 
> in which case we will get a postive match.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.