You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Rohit <ro...@in-rev.com> on 2012/01/11 11:20:03 UTC
Search Issue
Hi,
We are storing a large number of tweets and blogs feeds into solr.
Now if the user searches for twitter mentions like, @rohit , records which
just contain the word rohit are also being returned. Even if we do an exact
match "@rohit", I understand this happens because of use of
WordDelimiterFilterFactory which splits on special charaters,
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimit
erFilterFactory
How can I force Solr to not return without "@"? Hope I am being clear.
Regards,
Rohit
Re: Search Issue
Posted by findbestopensource <fi...@gmail.com>.
While indexing @ is removed. You need to use your own Tokenizer which will
consider "@rohit" as one word.
Another option is to break the tweet in to two fields, @<username> and the
tweet. Index both the fields but don't use any tokenizer for the field "
@<username>". Just index as it is. While querying you need to search for
both the fields. This method will help to fetch tweets of the particular
user.
Regards
Aditya
www.findbestopensource.com
On Wed, Jan 11, 2012 at 3:50 PM, Rohit <ro...@in-rev.com> wrote:
> Hi,
>
>
>
> We are storing a large number of tweets and blogs feeds into solr.
>
>
>
> Now if the user searches for twitter mentions like, @rohit , records which
> just contain the word rohit are also being returned. Even if we do an exact
> match "@rohit", I understand this happens because of use of
> WordDelimiterFilterFactory which splits on special charaters,
>
>
>
>
> http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.WordDelimit
> erFilterFactory
>
>
>
> How can I force Solr to not return without "@"? Hope I am being clear.
>
>
>
>
>
>
>
> Regards,
>
> Rohit
>
>
>
>