You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2021/08/16 18:22:00 UTC
[jira] [Created] (TIKA-3525) Allow users to configure skipping of
unsupported charsets in charset detection
Tim Allison created TIKA-3525:
---------------------------------
Summary: Allow users to configure skipping of unsupported charsets in charset detection
Key: TIKA-3525
URL: https://issues.apache.org/jira/browse/TIKA-3525
Project: Tika
Issue Type: Task
Reporter: Tim Allison
We've had two issues now where a charset detector is detecting a charset that is not supported by the jvm. In both cases, the charset detector was wrong. It is beyond our scope to fix the underlying charset detectors, but we can allow users to have the charset detectors skip unsupported charsets.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)