You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by John Wang <jo...@gmail.com> on 2010/04/10 03:59:50 UTC

chinese stopwords

Hi:

   I am using the SmartChineseAnalyzer class and it is great!

   Was wondering if we should have a set of chinese stopwords. The default
set containts only punctuations.

Thanks

-John

Re: chinese stopwords

Posted by John Wang <jo...@gmail.com>.
Awesome, thanks!

Great job of the work!

-John

2010/4/10 Gao Pinker <xi...@gmail.com>

> That's  a good idea, I'll think about adding another stopword-list to let
> users have a chance to choose.
>
>
> On Sat, Apr 10, 2010 at 9:25 PM, John Wang <jo...@gmail.com> wrote:
>
>> Yeah, I found some as well.
>> Was wondering if we should have a standard list to be bundled with the
>> default.
>>
>> -John
>>
>>
>> On Sat, Apr 10, 2010 at 6:17 AM, Gao Pinker <xi...@gmail.com>wrote:
>>
>>> I remember there were some stopwords list on the internet.
>>> I found these:
>>> http://hi.baidu.com/zhaocy0113/blog/item/146b5c346a738c4d251f1496.html
>>> http://download.csdn.net/source/740407
>>>
>>>
>>> On Sat, Apr 10, 2010 at 9:59 AM, John Wang <jo...@gmail.com> wrote:
>>>
>>>> Hi:
>>>>
>>>>    I am using the SmartChineseAnalyzer class and it is great!
>>>>
>>>>    Was wondering if we should have a set of chinese stopwords. The
>>>> default set containts only punctuations.
>>>>
>>>> Thanks
>>>>
>>>> -John
>>>>
>>>
>>>
>>>
>>> --
>>> 高小平
>>>
>>
>>
>
>
> --
> 高小平
>

Re: chinese stopwords

Posted by Gao Pinker <xi...@gmail.com>.
That's  a good idea, I'll think about adding another stopword-list to let
users have a chance to choose.

On Sat, Apr 10, 2010 at 9:25 PM, John Wang <jo...@gmail.com> wrote:

> Yeah, I found some as well.
> Was wondering if we should have a standard list to be bundled with the
> default.
>
> -John
>
>
> On Sat, Apr 10, 2010 at 6:17 AM, Gao Pinker <xi...@gmail.com> wrote:
>
>> I remember there were some stopwords list on the internet.
>> I found these:
>> http://hi.baidu.com/zhaocy0113/blog/item/146b5c346a738c4d251f1496.html
>> http://download.csdn.net/source/740407
>>
>>
>> On Sat, Apr 10, 2010 at 9:59 AM, John Wang <jo...@gmail.com> wrote:
>>
>>> Hi:
>>>
>>>    I am using the SmartChineseAnalyzer class and it is great!
>>>
>>>    Was wondering if we should have a set of chinese stopwords. The
>>> default set containts only punctuations.
>>>
>>> Thanks
>>>
>>> -John
>>>
>>
>>
>>
>> --
>> 高小平
>>
>
>


-- 
高小平

Re: chinese stopwords

Posted by John Wang <jo...@gmail.com>.
Yeah, I found some as well.
Was wondering if we should have a standard list to be bundled with the
default.

-John

On Sat, Apr 10, 2010 at 6:17 AM, Gao Pinker <xi...@gmail.com> wrote:

> I remember there were some stopwords list on the internet.
> I found these:
> http://hi.baidu.com/zhaocy0113/blog/item/146b5c346a738c4d251f1496.html
> http://download.csdn.net/source/740407
>
>
> On Sat, Apr 10, 2010 at 9:59 AM, John Wang <jo...@gmail.com> wrote:
>
>> Hi:
>>
>>    I am using the SmartChineseAnalyzer class and it is great!
>>
>>    Was wondering if we should have a set of chinese stopwords. The default
>> set containts only punctuations.
>>
>> Thanks
>>
>> -John
>>
>
>
>
> --
> 高小平
>

Re: chinese stopwords

Posted by Gao Pinker <xi...@gmail.com>.
I remember there were some stopwords list on the internet.
I found these:
http://hi.baidu.com/zhaocy0113/blog/item/146b5c346a738c4d251f1496.html
http://download.csdn.net/source/740407

On Sat, Apr 10, 2010 at 9:59 AM, John Wang <jo...@gmail.com> wrote:

> Hi:
>
>    I am using the SmartChineseAnalyzer class and it is great!
>
>    Was wondering if we should have a set of chinese stopwords. The default
> set containts only punctuations.
>
> Thanks
>
> -John
>



-- 
高小平

Re: chinese stopwords

Posted by Grant Ingersoll <gs...@apache.org>.
+1

On Apr 9, 2010, at 9:59 PM, John Wang wrote:

> Hi:
> 
>    I am using the SmartChineseAnalyzer class and it is great! 
> 
>    Was wondering if we should have a set of chinese stopwords. The default set containts only punctuations.
> 
> Thanks
> 
> -John



---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: java-dev-help@lucene.apache.org