You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by "Rupert Westenthaler (JIRA)" <ji...@apache.org> on 2013/10/16 14:31:42 UTC

[jira] [Commented] (STANBOL-762) XMP Extractor Engine

    [ https://issues.apache.org/jira/browse/STANBOL-762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13796721#comment-13796721 ] 

Rupert Westenthaler commented on STANBOL-762:
---------------------------------------------

Reto would this engine a candidate for inclusion in the 0.12.0 release?

> XMP Extractor Engine
> --------------------
>
>                 Key: STANBOL-762
>                 URL: https://issues.apache.org/jira/browse/STANBOL-762
>             Project: Stanbol
>          Issue Type: Improvement
>          Components: Enhancement Engines
>            Reporter: Reto Bachmann-Gmür
>            Assignee: Reto Bachmann-Gmür
>
> Many file formats (images, pdfs, videos) may contain XMP metadata. The XMP syntax is a subset of RDF/XML and can thus be parsed as RDF. While some of this data is aleready extracted by the tika engine the XMP block often contains more information. As the relevance of this information depends on usage scenarios a dedicated xmpextractor engine should be created so that clients can decide if they want that engine in the chain or not.



--
This message was sent by Atlassian JIRA
(v6.1#6144)