You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by "Enrico Daga (JIRA)" <ji...@apache.org> on 2011/03/01 18:56:36 UTC

[jira] Commented: (STANBOL-107) semantic description of the engines

    [ https://issues.apache.org/jira/browse/STANBOL-107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13000987#comment-13000987 ] 

Enrico Daga commented on STANBOL-107:
-------------------------------------

Some examples should be setup from existing engines, as ogrisel suggester. Don't do it now since there are lot of ways to do it and I - personally - want to think more about it.

> semantic description of the engines
> -----------------------------------
>
>                 Key: STANBOL-107
>                 URL: https://issues.apache.org/jira/browse/STANBOL-107
>             Project: Stanbol
>          Issue Type: New Feature
>          Components: Enhancer
>            Reporter: Enrico Daga
>            Priority: Minor
>
> It would be nice to find a way to let engines declare which is the contribution they are going to provide.
> I see at  least the following kinds of enhancements:
> 1) tagging: detect keywords, entities, concepts "within" the content
> 2) categorization/classification: locate the content in a conceptual place within a given framework. For example an engine could state that the document has "Secon World War" as primary topic, or "Theatre" in the framework of DBPedia categories, or state that is an E-mail, or a News, in the framework of the CMS document types;
> 3) metadata: the engine extracts metadata from within the content. For instance it returns the PDF metadata in RDF using the dublin core vocabulary
> 4) embedded knowledge: the source document is a rich HTML (with RDFa, Microformats) or it is a structured file (why not an RDF file, say a FOAF profile?)
> then, the enhancement engine should also say HOW it contributes in terms of vocabulary elements
> 1) Does the engine add annotation roles?
> 2) Does it add entity types?
> 3) Which metadata fields it will return?
> This could be done with an RDF description stating which are the terms the engine will introduce in relation to the ones of the Stanbol Enhancement base ontology (STANBOL-52). This is also related to STANBOL-3.

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira