You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by bu...@apache.org on 2003/04/11 09:44:45 UTC

DO NOT REPLY [Bug 18933] - Add support for Chinese, Japanese, and Korean to the core build.

DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG 
RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT
<http://nagoya.apache.org/bugzilla/show_bug.cgi?id=18933>.
ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND 
INSERTED IN THE BUG DATABASE.

http://nagoya.apache.org/bugzilla/show_bug.cgi?id=18933

Add support for Chinese, Japanese, and Korean to the core build.





------- Additional Comments From chedong@yeah.net  2003-04-11 07:44 -------
for most without natural space based word segment languages(like Chinese 
Japanese Korean) I use bigram to and make query parser default boolean query 
with 'and' relationship(It's an option in QueryParser of 1.3 release).

for example: 
C1C2C3C4 will segment to  C1C2 C2C3 C3C4 while indexing
and query parsed to C1C2 +C2C3 +C3C4 while searching.

Regards

Che, Dong
http://www.chedong.com/tech/

---------------------------------------------------------------------
To unsubscribe, e-mail: lucene-dev-unsubscribe@jakarta.apache.org
For additional commands, e-mail: lucene-dev-help@jakarta.apache.org