You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by waynelam <wa...@ln.edu.hk> on 2011/12/09 09:16:15 UTC
SmartChineseAnalyzer
Hi all,
I checked the documentation of SmartChineseAnalyzer, It looks like it is
for Simplified Chinese Only.
Does anyone tried to include Traditional Chinese characters also. As the
analyzer is based on a
dictionary from ICTCLAS1.0. My first thought is maybe i can get it work
by simply convert the
whole dictionary to Traditional Chinese?
Btw, I checked ICTCLAS official website and it seems the newest version
java library supports GB2312、GBK、UTF-8、BIG5.
So I can expect a roadmap for SmartChineseAnalyzer to support BIG5 later?
Anyone can show me some hint is much appreciated.
Regards,
Wayne
Re: SmartChineseAnalyzer
Posted by Chris Hostetter <ho...@fucit.org>.
: Subject: SmartChineseAnalyzer
: References:
: <CA...@mail.gmail.com>
: <CA...@mail.gmail.com>
: <CA...@mail.gmail.com>
: In-Reply-To:
: <CA...@mail.gmail.com>
http://people.apache.org/~hossman/#threadhijack
Thread Hijacking on Mailing Lists
When starting a new discussion on a mailing list, please do not reply to
an existing message, instead start a fresh email. Even if you change the
subject line of your email, other mail headers still track which thread
you replied to and your question is "hidden" in that thread and gets less
attention. It makes following discussions in the mailing list archives
particularly difficult.
-Hoss