You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Namgyu Kim (JIRA)" <ji...@apache.org> on 2019/07/07 14:50:00 UTC

[jira] [Created] (LUCENE-8904) Enhance Nori DictionaryBuilder tool

Namgyu Kim created LUCENE-8904:
----------------------------------

             Summary: Enhance Nori DictionaryBuilder tool
                 Key: LUCENE-8904
                 URL: https://issues.apache.org/jira/browse/LUCENE-8904
             Project: Lucene - Core
          Issue Type: Improvement
            Reporter: Namgyu Kim


It is the Nori version of [~sokolov]'s LUCENE-8863.
 This patch has two changes.
 1) Improve exception handling
 2) Enable external dictionary for testing

Overall, it is the same as LUCENE-8863.

But there are some differences between Nori and Kuromoji.
These can be slightly different on the code.
1) CSV field size
Nori : 12
Kuromoji : 13
2) left context ID == right context ID
Nori : can be different
Kuromoji : always same
3) Dictionary Type
Nori : just one type
Kuromoji : IPADIC, UNIDIC

After this job, I'll apply LUCENE-8866 and LUCENE-8871 to Nori.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org