Posted to solr-user@lucene.apache.org by waynelam <wa...@ln.edu.hk> on 2011/12/09 09:16:15 UTC

SmartChineseAnalyzer

Hi all,

I checked the documentation of SmartChineseAnalyzer, and it looks like it
is for Simplified Chinese only.
Has anyone tried to include Traditional Chinese characters as well? Since
the analyzer is based on a dictionary from ICTCLAS1.0, my first thought is
that maybe I can get it to work by simply converting the whole dictionary
to Traditional Chinese?
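
A minimal sketch of that dictionary-conversion idea, assuming the dictionary
can be exported as a plain list of words, which may not match how
SmartChineseAnalyzer actually stores it. The table S2T and both function
names are hypothetical and exist only for illustration; a real conversion
table would need thousands of entries.

```python
# Hypothetical, tiny Simplified -> Traditional table, for illustration only.
S2T = {"汉": "漢", "语": "語", "发": "發", "头": "頭"}


def to_traditional(word, table=S2T):
    """Map each character through the table; unknown characters pass through."""
    return "".join(table.get(ch, ch) for ch in word)


def convert_dictionary(words, table=S2T):
    """Return the original entries plus their Traditional forms, without duplicates."""
    seen = set()
    result = []
    for word in words:
        for form in (word, to_traditional(word, table)):
            if form not in seen:
                seen.add(form)
                result.append(form)
    return result
```

One caveat with a character-by-character table: some Simplified characters
correspond to more than one Traditional character. For example, 发 can be
either 發 or 髮, so the sketch above would turn 头发 into 頭發 rather than the
expected 頭髮. A plain one-to-one conversion of the dictionary would
therefore produce some wrong entries.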

Btw, I checked the official ICTCLAS website, and it seems the newest version
of the Java library supports GB2312, GBK, UTF-8, and BIG5.
So can I expect a roadmap for SmartChineseAnalyzer to support BIG5 later?



Any hints would be much appreciated.



Regards,

Wayne

Re: SmartChineseAnalyzer

Posted by Chris Hostetter <ho...@fucit.org>.
: Subject: SmartChineseAnalyzer
: References:
:     <CA...@mail.gmail.com>
:  <CA...@mail.gmail.com>
:  <CA...@mail.gmail.com>
: In-Reply-To:
:     <CA...@mail.gmail.com>

http://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing Lists

When starting a new discussion on a mailing list, please do not reply to 
an existing message; instead, start a fresh email. Even if you change the 
subject line of your email, other mail headers still track which thread 
you replied to, so your question is "hidden" in that thread and gets less 
attention. It also makes following discussions in the mailing list archives 
particularly difficult.



-Hoss