You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Solr List <so...@gmail.com> on 2016/06/14 17:27:52 UTC

Recommendations for analyzing Korean?

Hi -

What's the current recommendation for searching/analyzing Korean?

The reference guide only lists CJK:
https://cwiki.apache.org/confluence/display/solr/Language+Analysis

I see a bunch of work was done on
https://issues.apache.org/jira/browse/LUCENE-4956, but it doesn't look like
that was ever committed - and the last comment was years ago.

There seem to be a few version of this in the wild, both more recent:
https://github.com/juncon/arirang.lucene-analyzer-5.0.0, and the original:
https://sourceforge.net/projects/lucenekorean/ but I'm not sure what's the
canonical source at this point.

I also see this: https://bitbucket.org/eunjeon/mecab-ko-lucene-analyzer

Suggestions?

Thanks,

Tom