You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hivemall.apache.org by "Makoto Yui (Jira)" <ji...@apache.org> on 2021/07/02 04:54:00 UTC

[jira] [Created] (HIVEMALL-316) Improve error message for duplicate entries error in Kuromoji user dictionary.

Makoto Yui created HIVEMALL-316:
-----------------------------------

             Summary: Improve error message for duplicate entries error in Kuromoji user dictionary.
                 Key: HIVEMALL-316
                 URL: https://issues.apache.org/jira/browse/HIVEMALL-316
             Project: Hivemall
          Issue Type: Improvement
            Reporter: Makoto Yui
            Assignee: Makoto Yui
             Fix For: 0.6.2


Improve error message for duplicate entries error in user dictionary.

 
{code:java}
Caused by: org.apache.hadoop.hive.ql.exec.UDFArgumentException: Failed to parse the file in CSV format (UTF-8 encoding is expected): https://s3-cnj-vogue-to-td.s3-ap-northeast-1.amazonaws.com/vogue_master.csv
2java.lang.UnsupportedOperationException
3
4---- Debugging information ----
5* trace-exception   : java.lang.UnsupportedOperationException 
6	at org.apache.lucene.util.fst.Outputs.merge(Outputs.java:97)
7	at org.apache.lucene.util.fst.Builder.add(Builder.java:462)
8	at org.apache.lucene.analysis.ja.dict.UserDictionary.<init>(UserDictionary.java:132)
9	at org.apache.lucene.analysis.ja.dict.UserDictionary.open(UserDictionary.java:82)
{code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)