You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Joe Zhang <sm...@gmail.com> on 2012/11/28 07:00:55 UTC

stopwords in solr

I noticed that the stopwords.txt file that comes with the example solr
configuration is empty. Does it mean that no stopwords are considered in
this case, or are the basic stopwords hardcoded somewhere?

Re: stopwords in solr

Posted by Andy Lester <an...@petdance.com>.
On Nov 28, 2012, at 12:33 AM, Joe Zhang <sm...@gmail.com> wrote:

> that is really strange. so basic stopwords such as "a" "the' are not
> eliminated from the index?

There is no list of "basic stopwords" anywhere.  If you want stop words, you have to put them in the file yourself.  There are not really any sensible defaults for stopwords, so Solr doesn't provide them.

Just add them to the stopwords.txt and reindex your core.

xoa

--
Andy Lester => andy@petdance.com => www.petdance.com => AIM:petdance


Re: stopwords in solr

Posted by 曹霖 <ca...@babytree-inc.com>.
yep,lt is a bad idea to eliminate stopword during indexing,may be u
can eliminate stopword during querying,That is flexible

2012/11/28 Walter Underwood <wu...@wunderwood.org>

> Eliminating stopwords is generally a bad idea. It means you cannot search
> for "vitamin a".

Re: stopwords in solr

Posted by Walter Underwood <wu...@wunderwood.org>.
Eliminating stopwords is generally a bad idea. It means you cannot search for "vitamin a".

Back in the 1970's, search engines eliminated stopwords so they could work on 16-bit machines. That isn't a problem any more.

wunder

On Nov 27, 2012, at 10:33 PM, Joe Zhang wrote:

> that is really strange. so basic stopwords such as "a" "the' are not
> eliminated from the index?
> 
> On Tue, Nov 27, 2012 at 11:16 PM, 曹霖 <ca...@babytree-inc.com> wrote:
> 
>> justt no stopwords are considered in that case
>> 
>> 2012/11/28 Joe Zhang <sm...@gmail.com>
>> 
>>> t no stopwords are considered in
>>> this case
>>> 
>> 






Re: stopwords in solr

Posted by Joe Zhang <sm...@gmail.com>.
that is really strange. so basic stopwords such as "a" "the' are not
eliminated from the index?

On Tue, Nov 27, 2012 at 11:16 PM, 曹霖 <ca...@babytree-inc.com> wrote:

> justt no stopwords are considered in that case
>
> 2012/11/28 Joe Zhang <sm...@gmail.com>
>
> > t no stopwords are considered in
> > this case
> >
>

Re: stopwords in solr

Posted by 曹霖 <ca...@babytree-inc.com>.
justt no stopwords are considered in that case

2012/11/28 Joe Zhang <sm...@gmail.com>

> t no stopwords are considered in
> this case
>