Posted to commits@stanbol.apache.org by "Fabian Christ (JIRA)" <ji...@apache.org> on 2012/12/12 16:29:21 UTC

[jira] [Updated] (STANBOL-762) XMP Extractor Engine

     [ https://issues.apache.org/jira/browse/STANBOL-762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Fabian Christ updated STANBOL-762:
----------------------------------

    Component/s: Engine - XMP Extractor
    
> XMP Extractor Engine
> --------------------
>
>                 Key: STANBOL-762
>                 URL: https://issues.apache.org/jira/browse/STANBOL-762
>             Project: Stanbol
>          Issue Type: Improvement
>          Components: Engine - XMP Extractor
>            Reporter: Reto Bachmann-Gmür
>            Assignee: Reto Bachmann-Gmür
>
> Many file formats (images, PDFs, videos) may contain XMP metadata. The XMP syntax is a subset of RDF/XML and can thus be parsed as RDF. While some of this data is already extracted by the Tika engine, the XMP block often contains more information. As the relevance of this information depends on the usage scenario, a dedicated xmpextractor engine should be created so that clients can decide whether they want that engine in the chain.
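
As an illustration of the point that an XMP block is RDF/XML embedded in an otherwise binary file, here is a minimal sketch in Python. It is not the Stanbol engine and does not use any Stanbol or Tika API. The function names `extract_xmp_packet` and `simple_properties` are hypothetical. The sketch assumes the packet is wrapped in an element written with the conventional `x:` prefix, and it uses a plain XML parser to read only simple properties rather than a full RDF parser.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"


def extract_xmp_packet(data: bytes):
    """Return the raw <x:xmpmeta>...</x:xmpmeta> block, or None if absent.

    Assumption: the wrapper element uses the conventional "x:" prefix.
    """
    start = data.find(b"<x:xmpmeta")
    if start == -1:
        return None
    end_tag = b"</x:xmpmeta>"
    end = data.find(end_tag, start)
    if end == -1:
        return None
    return data[start:end + len(end_tag)]


def simple_properties(packet: bytes):
    """Collect simple properties of every rdf:Description in the packet.

    Covers properties written as XML attributes or as text-only child
    elements; nested structures (rdf:Bag, rdf:Seq, ...) are skipped.
    Keys are namespace-qualified names in ElementTree's {uri}local form.
    """
    root = ET.fromstring(packet)
    props = {}
    for desc in root.iter(f"{{{RDF_NS}}}Description"):
        for name, value in desc.attrib.items():
            # Skip RDF bookkeeping attributes such as rdf:about.
            if not name.startswith(f"{{{RDF_NS}}}"):
                props[name] = value
        for child in desc:
            if len(child) == 0 and child.text and child.text.strip():
                props[child.tag] = child.text.strip()
    return props


if __name__ == "__main__":
    # Made-up sample: binary bytes surrounding a small XMP packet.
    sample = (
        b"\x89BIN\x00\x01"
        b'<x:xmpmeta xmlns:x="adobe:ns:meta/">'
        b'<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#">'
        b'<rdf:Description rdf:about=""'
        b' xmlns:xmp="http://ns.adobe.com/xap/1.0/"'
        b' xmp:CreatorTool="ExampleTool">'
        b"<xmp:Rating>3</xmp:Rating>"
        b"</rdf:Description></rdf:RDF></x:xmpmeta>"
        b"\x00trailing"
    )
    packet = extract_xmp_packet(sample)
    if packet is not None:
        for key, value in simple_properties(packet).items():
            print(key, "=", value)
```

A real engine would hand the extracted packet to an RDF/XML parser so that structured values are preserved as triples; the byte scan above only shows why the block can be located and read independently of the container format.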

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira