
Posted to dev@lucene.apache.org by "YOO JEONGIN (JIRA)" <ji...@apache.org> on 2019/04/19 02:34:00 UTC

[jira] [Created] (LUCENE-8772) [nori] A word that is registered in advance, but the words are not separated and recognized as 'UNKNOWN'

YOO JEONGIN created LUCENE-8772:
-----------------------------------

             Summary: [nori]  A word that is registered in advance, but the words are not separated and recognized as 'UNKNOWN'
                 Key: LUCENE-8772
                 URL: https://issues.apache.org/jira/browse/LUCENE-8772
             Project: Lucene - Core
          Issue Type: Bug
          Components: modules/analysis
    Affects Versions: 8.0, 7.7.1, 7.7, 7.6, 7.5
            Reporter: YOO JEONGIN
         Attachments: image-2019-04-19-11-32-56-310.png

With 'nori', if the input does not begin with a dictionary word, the whole input is analyzed as 'UNKNOWN', even when a registered word appears in the middle of it.
This raises two questions:
Does nori match dictionary words only from the left side, and never from the right side?
Can this be fixed?

 

Example:

Input: 갊수학

Dictionary:

Registered: 수학
Not registered: 갊

Result: 갊수학 (returned as one piece; 수학 is not separated)

!image-2019-04-19-11-32-56-310.png!
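
To make the expected result concrete, here is a minimal sketch of a segmenter that separates a registered word even when unregistered characters come before it. It is only an illustration of the output I expect (갊 and 수학 as two tokens). It is not how nori works internally and it does not use any Lucene API. The class name ExpectedSplitSketch and the method segment are hypothetical names made up for this example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only: NOT nori's algorithm and NOT a Lucene API.
public class ExpectedSplitSketch {

    // Scans the input from left to right. At each position it takes the longest
    // dictionary word that starts there; characters that match no dictionary
    // word are collected into one "unknown" token until the next match.
    public static List<String> segment(String input, Set<String> dictionary) {
        List<String> tokens = new ArrayList<>();
        StringBuilder unknown = new StringBuilder();
        int i = 0;
        while (i < input.length()) {
            String best = null;
            for (String word : dictionary) {
                if (!word.isEmpty() && input.startsWith(word, i)
                        && (best == null || word.length() > best.length())) {
                    best = word;
                }
            }
            if (best == null) {
                // No registered word starts here: extend the unknown run.
                unknown.append(input.charAt(i));
                i++;
            } else {
                // A registered word starts here: flush the unknown run first.
                if (unknown.length() > 0) {
                    tokens.add(unknown.toString());
                    unknown.setLength(0);
                }
                tokens.add(best);
                i += best.length();
            }
        }
        if (unknown.length() > 0) {
            tokens.add(unknown.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // 수학 is registered, 갊 is not (same condition as the example above).
        System.out.println(segment("갊수학", Set.of("수학")));
    }
}
```

With the dictionary from the example, segment("갊수학", Set.of("수학")) returns two tokens, 갊 and 수학, which is the separation I expected from nori instead of the single unseparated 갊수학.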



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
