You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2016/03/07 16:28:40 UTC
[jira] [Comment Edited] (TIKA-1894) Add XMPMM metadata extraction
to JempboxExtractor
[ https://issues.apache.org/jira/browse/TIKA-1894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15183137#comment-15183137 ]
Tim Allison edited comment on TIKA-1894 at 3/7/16 3:27 PM:
-----------------------------------------------------------
Update made to trunk with commit c5d4ec6c50824a9a40fdd2b492bf7557d8d693f3.
In 2.0, I'm not sure how to share JempboxExtractor with the multi-media-module and the pdf-module. As expected, we get a cyclic dependency error if I add the multi-media-module as a dependency to the pdf-module, and, even if it did work, that wasn't a good option.
Some options:
1. Create a tika-parser-xmp-module that would include helper functionality for extracting xmp packets & metadata. Is this enough to warrant a separate module?
2. Duplicate code (no!!!).
3. Other options?
was (Author: tallison@mitre.org):
Update made to trunk with commit c5d4ec6c50824a9a40fdd2b492bf7557d8d693f3.
In 2.0, I'm not sure how to share JempboxExtractor with the multi-media-module and the pdf-module. As expected, we get a cyclic dependency error if I add the multi-media-module as a dependency to the pdf-module, and, even if it did work, that wasn't a good option.
Some options:
#. Create a tika-parser-xmp-module that would include helper functionality for extracting xmp packets & metadata. Is this enough to warrant a separate module?
#. Duplicate code (no!!!).
#. Other options?
> Add XMPMM metadata extraction to JempboxExtractor
> -------------------------------------------------
>
> Key: TIKA-1894
> URL: https://issues.apache.org/jira/browse/TIKA-1894
> Project: Tika
> Issue Type: New Feature
> Reporter: Tim Allison
> Priority: Minor
>
> The XMP Media Management (XMPMM) section of xmp carries some useful information. We currently have keys for many of the important attributes in tika-core's o.a.t.metadata.XMPMM, and JempBox extracts the XMPMM schema, but the wiring between the two has not yet been installed.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)