You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Sébastien Nicouleau (JIRA)" <ji...@apache.org> on 2018/08/21 09:20:00 UTC

[jira] [Created] (TIKA-2715) Detection Encoding Problem

Sébastien Nicouleau created TIKA-2715:
-----------------------------------------

             Summary: Detection Encoding Problem
                 Key: TIKA-2715
                 URL: https://issues.apache.org/jira/browse/TIKA-2715
             Project: Tika
          Issue Type: Bug
    Affects Versions: 1.18, 1.17
            Reporter: Sébastien Nicouleau


Hi,

I have a problem to detect encoding , with page headers:
Content-Encoding: gzip
Content-Length: 668
Content-Type: text/html;charset=ISO-8859-1

Debuging , in class AutoDetectReader 

metadata.get("Content-Type") return null



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)