You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2021/08/16 18:22:00 UTC

[jira] [Created] (TIKA-3525) Allow users to configure skipping of unsupported charsets in charset detection

Tim Allison created TIKA-3525:
---------------------------------

             Summary: Allow users to configure skipping of unsupported charsets in charset detection
                 Key: TIKA-3525
                 URL: https://issues.apache.org/jira/browse/TIKA-3525
             Project: Tika
          Issue Type: Task
            Reporter: Tim Allison


We've had two issues now where a charset detector is detecting a charset that is not supported by the jvm.  In both cases, the charset detector was wrong.  It is beyond our scope to fix the underlying charset detectors, but we can allow users to have the charset detectors skip unsupported charsets.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)