Posted to dev@tika.apache.org by "Tyler Palsulich (JIRA)" <ji...@apache.org> on 2015/03/20 21:45:39 UTC
[jira] [Commented] (TIKA-1314) An inappropriate comment of CharsetDetector.detect()
[ https://issues.apache.org/jira/browse/TIKA-1314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14372066#comment-14372066 ]
Tyler Palsulich commented on TIKA-1314:
---------------------------------------
This is still an issue in Tika 1.8-SNAPSHOT. See [here|https://github.com/apache/tika/blob/4096059da7f6d50e3d6e018681b8c02a96d3933a/tika-parsers/src/main/java/org/apache/tika/parser/txt/CharsetDetector.java#L141-L172]. Any input on whether we should update the comment or throw an Exception?
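To make the two options concrete, here is a minimal sketch. It does not use the actual Tika classes: DetectContractSketch, detectOrNull, and detectOrThrow are hypothetical names, and a plain list of charset names stands in for the detector's ranked matches. It only illustrates the choice between documenting the null return and throwing.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

/** Hypothetical stand-in, not the Tika CharsetDetector. */
public class DetectContractSketch {

    /**
     * Option 1: keep the current behaviour and fix the javadoc.
     *
     * @param rankedMatches candidate charset names, best first (assumed input shape)
     * @return the best match, or null if no charset appears to match
     *         the input data or no input text has been provided
     */
    static String detectOrNull(List<String> rankedMatches) {
        if (rankedMatches == null || rankedMatches.isEmpty()) {
            return null;
        }
        return rankedMatches.get(0);
    }

    /**
     * Option 2: keep the javadoc and change the behaviour.
     *
     * @throws IllegalStateException if no charset appears to match
     *         the input data or no input text has been provided
     */
    static String detectOrThrow(List<String> rankedMatches) {
        String best = detectOrNull(rankedMatches);
        if (best == null) {
            throw new IllegalStateException("no charset appears to match the input data");
        }
        return best;
    }

    public static void main(String[] args) {
        List<String> none = Collections.<String>emptyList();
        List<String> some = Arrays.asList("UTF-8", "ISO-8859-1");

        System.out.println(detectOrNull(some));
        System.out.println(detectOrNull(none));
        try {
            detectOrThrow(none);
        } catch (IllegalStateException e) {
            System.out.println("threw: " + e.getMessage());
        }
    }
}
```

Option 1 leaves existing callers untouched. Option 2 could break any caller that currently relies on a null check, so it would be a behaviour change rather than a documentation fix.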
> An inappropriate comment of CharsetDetector.detect()
> ----------------------------------------------------
>
> Key: TIKA-1314
> URL: https://issues.apache.org/jira/browse/TIKA-1314
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.5
> Reporter: Yi EungJun
> Priority: Minor
>
> According to the javadoc of CharsetDetector.detect(), it raises an
> exception if no charset appears to match the data:
> * Raise an exception if
> * <ul>
> * <li>no charsets appear to match the input data.</li>
> * <li>no input text has been provided</li>
> * </ul>
> However, in such cases the method appears to return null rather than raise an exception.