Posted to dev@lucene.apache.org by John Wang <jo...@gmail.com> on 2010/04/10 03:59:50 UTC
chinese stopwords
Hi:
I am using the SmartChineseAnalyzer class and it is great!
Was wondering if we should have a set of Chinese stopwords. The default
set contains only punctuation.
Thanks
-John
Re: chinese stopwords
Posted by John Wang <jo...@gmail.com>.
Awesome, thanks!
Great job on the work!
-John
2010/4/10 Gao Pinker <xi...@gmail.com>
> That's a good idea. I'll think about adding another stopword list so that
> users can choose.
Re: chinese stopwords
Posted by Gao Pinker <xi...@gmail.com>.
That's a good idea. I'll think about adding another stopword list so that
users can choose.
On Sat, Apr 10, 2010 at 9:25 PM, John Wang <jo...@gmail.com> wrote:
> Yeah, I found some as well.
> Was wondering if we should have a standard list to be bundled with the
> default.
>
> -John
--
高小平
Re: chinese stopwords
Posted by John Wang <jo...@gmail.com>.
Yeah, I found some as well.
Was wondering if we should have a standard list to be bundled with the
default.
-John
On Sat, Apr 10, 2010 at 6:17 AM, Gao Pinker <xi...@gmail.com> wrote:
> I remember there were some stopword lists on the internet.
> I found these:
> http://hi.baidu.com/zhaocy0113/blog/item/146b5c346a738c4d251f1496.html
> http://download.csdn.net/source/740407
Re: chinese stopwords
Posted by Gao Pinker <xi...@gmail.com>.
I remember there were some stopword lists on the internet.
I found these:
http://hi.baidu.com/zhaocy0113/blog/item/146b5c346a738c4d251f1496.html
http://download.csdn.net/source/740407
On Sat, Apr 10, 2010 at 9:59 AM, John Wang <jo...@gmail.com> wrote:
> Hi:
>
> I am using the SmartChineseAnalyzer class and it is great!
>
> Was wondering if we should have a set of Chinese stopwords. The default
> set contains only punctuation.
>
> Thanks
>
> -John
>
--
高小平
Re: chinese stopwords
Posted by Grant Ingersoll <gs...@apache.org>.
+1
On Apr 9, 2010, at 9:59 PM, John Wang wrote:
> Hi:
>
> I am using the SmartChineseAnalyzer class and it is great!
>
> Was wondering if we should have a set of Chinese stopwords. The default set contains only punctuation.
>
> Thanks
>
> -John
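
To make the idea in this thread concrete, here is a minimal sketch of what a user-selectable stopword list amounts to: a plain-text list, one word per line, loaded into a set and used to drop tokens after segmentation. It uses only the Java standard library and does not call Lucene. StopwordSketch, loadStopwords, removeStopwords, the "#" comment convention, and the sample words are all assumptions made for illustration; they are not the SmartChineseAnalyzer API or its bundled list, and how a custom set is handed to the analyzer depends on the Lucene version in use.

```java
import java.io.BufferedReader;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Illustrative sketch only: StopwordSketch and its methods are hypothetical
// names, not part of Lucene or SmartChineseAnalyzer.
final class StopwordSketch {

    // Reads one stopword per line. Blank lines and lines starting with "#"
    // are skipped; the "#" comment convention is an assumption of this sketch.
    static Set<String> loadStopwords(Reader source) {
        Set<String> words = new LinkedHashSet<>();
        new BufferedReader(source).lines()
                .map(String::trim)
                .filter(line -> !line.isEmpty() && !line.startsWith("#"))
                .forEach(words::add);
        return words;
    }

    // Returns the tokens that are not in the stopword set, preserving order.
    static List<String> removeStopwords(List<String> tokens, Set<String> stopwords) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (!stopwords.contains(token)) {
                kept.add(token);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        // A tiny made-up list: fullwidth comma (U+FF0C), ideographic full
        // stop (U+3002), then the function words de (U+7684), le (U+4E86),
        // and shi (U+662F). Unicode escapes keep the source file ASCII-only.
        String listText = "# punctuation\n\uff0c\n\u3002\n"
                + "# function words\n\u7684\n\u4e86\n\u662f\n";
        Set<String> stopwords = loadStopwords(new StringReader(listText));

        // Tokens as a word segmenter might emit them (hand-written here):
        // wo, de, shu, shi, xin, de, full stop.
        List<String> tokens = Arrays.asList(
                "\u6211", "\u7684", "\u4e66", "\u662f", "\u65b0", "\u7684", "\u3002");

        // Only the tokens absent from the stopword set are printed.
        System.out.println(removeStopwords(tokens, stopwords));
    }
}
```

With a punctuation-only list, the function words in the sample would pass through unchanged, which is the gap the original question points at. Keeping the list in a separate text file is what would let users swap in a list of their own choosing.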