You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "YOO JEONGIN (JIRA)" <ji...@apache.org> on 2019/04/19 02:34:00 UTC
[jira] [Created] (LUCENE-8772) [nori] A word that is registered in
advance, but the words are not separated and recognized as 'UNKNOWN'
YOO JEONGIN created LUCENE-8772:
-----------------------------------
Summary: [nori] A word that is registered in advance, but the words are not separated and recognized as 'UNKNOWN'
Key: LUCENE-8772
URL: https://issues.apache.org/jira/browse/LUCENE-8772
Project: Lucene - Core
Issue Type: Bug
Components: modules/analysis
Affects Versions: 8.0, 7.7.1, 7.7, 7.6, 7.5
Reporter: YOO JEONGIN
Attachments: image-2019-04-19-11-32-56-310.png
In case of 'nori', if there is no word starting from the left, 'UNKNOWN' is analyzed even if there is a word already registered in the middle.
So here is the question.
Does nori analyze only on the left side and do not analyze from the right side?
Could this be solved?
ex)
input => 갊수학
Condition
dictionary registered : 수학
dictionary Unregistered : 갊
result => 갊수학
!image-2019-04-19-11-32-56-310.png!
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org