You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/06/02 00:42:59 UTC

[jira] [Created] (TIKA-1992) Check for duplicate inline images via COSStream not name in PDFParser

Tim Allison created TIKA-1992:
---------------------------------

             Summary: Check for duplicate inline images via COSStream not name in PDFParser
                 Key: TIKA-1992
                 URL: https://issues.apache.org/jira/browse/TIKA-1992
             Project: Tika
          Issue Type: Bug
            Reporter: Tim Allison
            Assignee: Tim Allison
            Priority: Trivial


We're currently checking for duplicate inline images by image name, not by COSStream as is done in PDFBox's ExtractImages.  Our method doesn't work; we should follow their lead.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)