You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by "Walter Kasper (JIRA)" <ji...@apache.org> on 2012/09/19 10:49:08 UTC

[jira] [Resolved] (STANBOL-689) Refactor RDFa/Microformat extractor to be independent of external repositories

     [ https://issues.apache.org/jira/browse/STANBOL-689?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Walter Kasper resolved STANBOL-689.
-----------------------------------

    Resolution: Fixed

Provided the htmlextractor engine for Microformat and RDFa extractors (same as the Metax engine but without Aperture dependencies).

The underlying HTML parser was replaced by JTidy that provides sufficent robustness and namespace handling without patches.
                
> Refactor RDFa/Microformat extractor to be independent of external repositories
> ------------------------------------------------------------------------------
>
>                 Key: STANBOL-689
>                 URL: https://issues.apache.org/jira/browse/STANBOL-689
>             Project: Stanbol
>          Issue Type: New Feature
>          Components: Engine - Metaxa, Enhancer
>            Reporter: Walter Kasper
>            Assignee: Walter Kasper
>
> The RDFa and Microformat extractor as part of Metaxa depends on libraries not available in the Apache maven repositories, such preventing them being released.
> The HTML extractors for RDFa and Microformats will be refactored to a new engine that will not depend on libs from external repositories.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira