Posted to commits@tika.apache.org by ta...@apache.org on 2019/07/31 18:43:55 UTC
[tika] branch branch_1x updated (88ea116 -> fd0eeb9)
This is an automated email from the ASF dual-hosted git repository.
tallison pushed a change to branch branch_1x
in repository https://gitbox.apache.org/repos/asf/tika.git.
from 88ea116 [maven-release-plugin] prepare for next development iteration
add fd0eeb9 TIKA-2917 -- extract metadata that accompanies inline images
No new revisions were added by this update.
Summary of changes:
.../tika/parser/image/xmp/JempboxExtractor.java    |   4 +-
.../apache/tika/parser/pdf/AbstractPDF2XHTML.java  |   2 +-
.../java/org/apache/tika/parser/pdf/PDF2XHTML.java | 137 +++++-----
.../java/org/apache/tika/parser/pdf/PDFParser.java | 278 +++------------------
.../tika/parser/pdf/PDMetadataExtractor.java       | 270 ++++++++++++++++++++
5 files changed, 375 insertions(+), 316 deletions(-)
create mode 100644 tika-parsers/src/main/java/org/apache/tika/parser/pdf/PDMetadataExtractor.java