You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Rohit <ro...@in-rev.com> on 2012/01/11 11:20:03 UTC

Search Issue

Hi,

 

We are storing a large number of tweets and blogs feeds into solr. 

 

Now if the user searches for twitter mentions like, @rohit , records which
just contain the word rohit are also being returned. Even if we do an exact
match "@rohit", I understand this happens because of use of
WordDelimiterFilterFactory which splits on special charaters,  

 

http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimit
erFilterFactory

 

How can I force Solr to not return without "@"? Hope I am being clear.

 

 

 

Regards,

Rohit

 


Re: Search Issue

Posted by findbestopensource <fi...@gmail.com>.
While indexing @ is removed. You need to use your own Tokenizer which will
consider "@rohit" as one word.

Another option is to break the tweet in to two fields, @<username> and the
tweet. Index both the fields but don't use any tokenizer for the field "
@<username>". Just index as it is. While querying you need to search for
both the fields. This method will help to fetch tweets of the particular
user.

Regards
Aditya
www.findbestopensource.com

On Wed, Jan 11, 2012 at 3:50 PM, Rohit <ro...@in-rev.com> wrote:

> Hi,
>
>
>
> We are storing a large number of tweets and blogs feeds into solr.
>
>
>
> Now if the user searches for twitter mentions like, @rohit , records which
> just contain the word rohit are also being returned. Even if we do an exact
> match "@rohit", I understand this happens because of use of
> WordDelimiterFilterFactory which splits on special charaters,
>
>
>
>
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimit
> erFilterFactory
>
>
>
> How can I force Solr to not return without "@"? Hope I am being clear.
>
>
>
>
>
>
>
> Regards,
>
> Rohit
>
>
>
>