You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Robert Muir (Commented) (JIRA)" <ji...@apache.org> on 2012/04/11 15:21:17 UTC

[jira] [Commented] (SOLR-3099) Add query operator, index structure, and analyzer for "exact match" searching

    [ https://issues.apache.org/jira/browse/SOLR-3099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13251533#comment-13251533 ] 

Robert Muir commented on SOLR-3099:
-----------------------------------

{quote}
Currently you can tag a token with a TOKENTYPE, so the stemmer could add the stemmed token on same position with tokentype=stem.
{quote}

This is not the way to go, for many reasons, its been brought up many times before.

This feature already works. Just use a separate field. Stacking tokens on top of each other
will be about the same size in the index anyway, since its an inverted index.

stemmedBody = stemmed field
exactBody = unstemmed field.

Now i have an exact operator, "exactBody:stuff" that works.

                
> Add query operator, index structure, and analyzer for "exact match" searching
> -----------------------------------------------------------------------------
>
>                 Key: SOLR-3099
>                 URL: https://issues.apache.org/jira/browse/SOLR-3099
>             Project: Solr
>          Issue Type: Sub-task
>          Components: Schema and Analysis
>            Reporter: Mike
>             Fix For: 4.0
>
>
> A project I'm working on requires *exact match* searching with stemming turned off. The users are accostomed to Sphinx search, and thus expect a query like [ =runs ] to return only documents that contain the exact term, "runs", and not the stemmed word "run".
> In SOLR-2866, there is similar work, but I believe it is different because it uses a huge-synonym file rather than storing the original terms directly in the index. 
> What I'd like instead is two things:
> 1. An analyzer that says, "store the original form of all words in the index along with the stemmed variations." If necessary, it's fine if this is simply an unstemmed field, but that seems cumbersome schema-wise and performance-wise.
> 2. An operator in edismax that allows users to query the exact form of the word. Sphinx uses the equals sign (=), and that makes sense logically to me.
> This issue is part of a meta issue, SOLR-3028, that is requesting two other operators in edismax (quorum search and word order).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org