You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Trevor Yann (JIRA)" <ji...@apache.org> on 2017/12/20 01:16:00 UTC

[jira] [Created] (TIKA-2532) Output for PDF file contains X-TIKA:content that is postscript

Trevor Yann created TIKA-2532:
---------------------------------

             Summary: Output for PDF file contains X-TIKA:content that is postscript
                 Key: TIKA-2532
                 URL: https://issues.apache.org/jira/browse/TIKA-2532
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 1.17, 1.16, 1.15
         Environment: Ubuntu 64 bit
JDK 1.8
            Reporter: Trevor Yann
            Priority: Minor


I have a PDF file that returns two elements in the recursive json output. The first element is text, as expected. The second element seems to be a fragment of postscript, rather than extracted text.




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)