Posted to issues@hivemall.apache.org by "Makoto Yui (Jira)" <ji...@apache.org> on 2021/07/02 06:16:00 UTC

[jira] [Closed] (HIVEMALL-316) Improve error message for duplicate entries error in Kuromoji user dictionary.

     [ https://issues.apache.org/jira/browse/HIVEMALL-316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Makoto Yui closed HIVEMALL-316.
-------------------------------
    Resolution: Fixed

> Improve error message for duplicate entries error in Kuromoji user dictionary.
> ------------------------------------------------------------------------------
>
>                 Key: HIVEMALL-316
>                 URL: https://issues.apache.org/jira/browse/HIVEMALL-316
>             Project: Hivemall
>          Issue Type: Improvement
>            Reporter: Makoto Yui
>            Assignee: Makoto Yui
>            Priority: Trivial
>             Fix For: 0.6.2
>
>
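> Improve the error message reported when the Kuromoji user dictionary contains duplicate entries. At present the failure surfaces only as a generic CSV parse error wrapping an UnsupportedOperationException: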
>  
> {code:java}
> Caused by: org.apache.hadoop.hive.ql.exec.UDFArgumentException: Failed to parse the file in CSV format (UTF-8 encoding is expected): ...
> java.lang.UnsupportedOperationException
> ---- Debugging information ----
> * trace-exception   : java.lang.UnsupportedOperationException 
> 	at org.apache.lucene.util.fst.Outputs.merge(Outputs.java:97)
> 	at org.apache.lucene.util.fst.Builder.add(Builder.java:462)
> 	at org.apache.lucene.analysis.ja.dict.UserDictionary.<init>(UserDictionary.java:132)
> 	at org.apache.lucene.analysis.ja.dict.UserDictionary.open(UserDictionary.java:82)
> {code}
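
The trace above suggests that the failure happens when the FST builder is asked to add a key it has already seen (Builder.add calling Outputs.merge). One way to get a clearer message is to scan the dictionary lines for repeated surface forms before handing them to UserDictionary.open. The following is a minimal sketch of that idea, not the fix that was actually committed. The class and method names are hypothetical, the sample entries are placeholders, and the parsing assumes that the surface form is the first comma-separated field, that it contains no quoted commas, and that "#" starts a comment.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper: not part of Hivemall or Lucene.
class UserDictionaryDuplicateCheck {

    /**
     * Returns every surface form that appears on more than one line,
     * mapped to the 1-based line numbers where it occurs.
     */
    public static Map<String, List<Integer>> findDuplicateSurfaces(List<String> lines) {
        Map<String, List<Integer>> seen = new LinkedHashMap<>();
        for (int i = 0; i < lines.size(); i++) {
            // Assumption: '#' starts a comment; blank lines are skipped.
            String line = lines.get(i).replaceAll("#.*$", "").trim();
            if (line.isEmpty()) {
                continue;
            }
            // Assumption: the surface form is the first field and has no quoted commas.
            int comma = line.indexOf(',');
            String surface = (comma < 0 ? line : line.substring(0, comma)).trim();
            seen.computeIfAbsent(surface, k -> new ArrayList<>()).add(i + 1);
        }
        // Keep only the surface forms that occur at least twice.
        seen.values().removeIf(lineNumbers -> lineNumbers.size() < 2);
        return seen;
    }

    /** Throws a descriptive exception if any surface form is duplicated. */
    public static void requireNoDuplicates(List<String> lines) {
        Map<String, List<Integer>> duplicates = findDuplicateSurfaces(lines);
        if (!duplicates.isEmpty()) {
            throw new IllegalArgumentException(
                "Duplicate entries found in user dictionary (surface form -> line numbers): "
                    + duplicates);
        }
    }

    public static void main(String[] args) {
        // Placeholder entries in surface,segmentation,readings,part-of-speech order.
        List<String> lines = Arrays.asList(
            "# sample user dictionary",
            "foobar,foo bar,foo bar,custom-noun",
            "bazqux,baz qux,baz qux,custom-noun",
            "foobar,foobar,foobar,custom-noun");
        try {
            requireNoDuplicates(lines);
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Running such a check before the dictionary is built lets the error name the offending entries and their line numbers, instead of exposing the UnsupportedOperationException raised inside the FST builder.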



--
This message was sent by Atlassian Jira
(v8.3.4#803005)