You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by GitBox <gi...@apache.org> on 2022/09/27 04:44:24 UTC

[GitHub] [lucene] hakanai commented on issue #6967: Different behaviour of JapaneseAnalyzer at indexing time vs. at search time results in no matches for some words. [LUCENE-5905]

hakanai commented on issue #6967:
URL: https://github.com/apache/lucene/issues/6967#issuecomment-1258972455

   This issue is still alive and well in v8.5.1.
   
   The example discovered this time (in this case mode is set to SEARCH):
   
   1. input: 非公開
       tokens:
       - 非      (接頭詞)
       - 公開   (名詞)
   
   2. input: を非公開
       tokens:
       - を           (助詞)
       - 非公開    (名詞)
   
   Somehow を preceding the word makes it happen, while having は preceding the word instead does not.
   
   This is also reproducible by typing the same examples into the text field on [Atilika's site](https://www.atilika.org/).
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org