Posted to commits@tika.apache.org by ta...@apache.org on 2019/04/03 14:26:28 UTC

[tika] branch branch_1x updated (b437e21 -> 35e2222)

This is an automated email from the ASF dual-hosted git repository.

tallison pushed a change to branch branch_1x
in repository https://gitbox.apache.org/repos/asf/tika.git.


    from b437e21  TIKA-2841 - focusing on epub, but also fixing TIKA-2310, and handling embedded images/attachments
     new 844626a  TIKA-2845 -- need to override processPages
     new 35e2222  add import statement for TIKA-2845 test

The 4373 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 .../apache/tika/parser/pdf/AbstractPDF2XHTML.java  |   65 +-
 .../org/apache/tika/parser/pdf/PDFParserTest.java  |    9 +
 .../testPDFFileEmbInAnnotation_noContents.pdf      | 1894 ++++++++++++++++++++
 3 files changed, 1965 insertions(+), 3 deletions(-)
 create mode 100644 tika-parsers/src/test/resources/test-documents/testPDFFileEmbInAnnotation_noContents.pdf