You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by "Rupert Westenthaler (JIRA)" <ji...@apache.org> on 2012/10/10 08:19:03 UTC

[jira] [Created] (STANBOL-764) Add support for importing Quads to the JenaTDB indexing source

Rupert Westenthaler created STANBOL-764:
-------------------------------------------

             Summary: Add support for importing Quads to the JenaTDB indexing source
                 Key: STANBOL-764
                 URL: https://issues.apache.org/jira/browse/STANBOL-764
             Project: Stanbol
          Issue Type: New Feature
          Components: Entity Hub
            Reporter: Rupert Westenthaler
            Assignee: Rupert Westenthaler
            Priority: Minor


The Jena TDB indexing source should support quads (Nquads, Trig serialized RDF data) by ignoring the context.

This means that SPO (the triples) are stored to the default dataset of the Jena TDB store by ignoring the Context information of the quads.

see also http://markmail.org/message/jzy6w7abettauxqv

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (STANBOL-764) Add support for importing Quads to the JenaTDB indexing source

Posted by "Rupert Westenthaler (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/STANBOL-764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rupert Westenthaler resolved STANBOL-764.
-----------------------------------------

    Resolution: Implemented

implemented with http://svn.apache.org/viewvc?rev=1396507&view=rev
                
> Add support for importing Quads to the JenaTDB indexing source
> --------------------------------------------------------------
>
>                 Key: STANBOL-764
>                 URL: https://issues.apache.org/jira/browse/STANBOL-764
>             Project: Stanbol
>          Issue Type: New Feature
>          Components: Entity Hub
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>            Priority: Minor
>
> The Jena TDB indexing source should support quads (Nquads, Trig serialized RDF data) by ignoring the context.
> This means that SPO (the triples) are stored to the default dataset of the Jena TDB store by ignoring the Context information of the quads.
> see also http://markmail.org/message/jzy6w7abettauxqv

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Comment Edited] (STANBOL-764) Add support for importing Quads to the JenaTDB indexing source

Posted by "Rupert Westenthaler (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/STANBOL-764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473056#comment-13473056 ] 

Rupert Westenthaler edited comment on STANBOL-764 at 10/10/12 7:57 AM:
-----------------------------------------------------------------------

To avoid clashes with bank nodes (Bnodes) in different Graphs the importer processes those for Subjects and Objects of imported quads.

If the Context is a URI, than BNodes are converted by using the Context URI as base and the Bnode ID as local name. If the Context is a Bnode, that the Bnode ID of the Subject/Object is appended (by using '_' as separator) to the Bnode ID of the context.

e.g. the quad

    _:node16nh2jpdcx57591272 <http://xmlns.com/foaf/0.1/name> "Barack Obama"@en <http://www.huffingtonpost.com/2009/01/13/obama-issues-first-veto-t_n_157585.html>   .

would be imported as Triple 

    http://www.huffingtonpost.com/2009/01/13/obama-issues-first-veto-t_n_157585.htm#node16nh2jpdcx5759127 <http://xmlns.com/foaf/0.1/name> "Barack Obama"@en  .

NOTES:

* This does change the RDF data as anonymous BNodes are converted to dereferenceable URIs. Because of that this behavior might need some further discussion

* This does work for BNodes linking to other BNodes as BNodes are only valid within the same Graph. Meaning that Contexts of Bnodes linking to each other are ensured to be the same. Therefore also the UIRs, NodeIds of the processed nodes will match.
                
      was (Author: rwesten):
    To avoid clashes with bank nodes (Bnodes) the importer replaces Bnodes with URIs that use the URI of the context and do add the Bnode ID as local name. 

e.g. the quad

    _:node16nh2jpdcx57591272 <http://xmlns.com/foaf/0.1/name> "Barack Obama"@en <http://www.huffingtonpost.com/2009/01/13/obama-issues-first-veto-t_n_157585.html>   .

would be imported as Triple 

    http://www.huffingtonpost.com/2009/01/13/obama-issues-first-veto-t_n_157585.htm#node16nh2jpdcx5759127 <http://xmlns.com/foaf/0.1/name> "Barack Obama"@en  .

One needs to note that this does change the RDF data as anonymous BNodes are converted to dereferenceable URIs. Because of that this behavior might need some further discussion
                  
> Add support for importing Quads to the JenaTDB indexing source
> --------------------------------------------------------------
>
>                 Key: STANBOL-764
>                 URL: https://issues.apache.org/jira/browse/STANBOL-764
>             Project: Stanbol
>          Issue Type: New Feature
>          Components: Entity Hub
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>            Priority: Minor
>
> The Jena TDB indexing source should support quads (Nquads, Trig serialized RDF data) by ignoring the context.
> This means that SPO (the triples) are stored to the default dataset of the Jena TDB store by ignoring the Context information of the quads.
> see also http://markmail.org/message/jzy6w7abettauxqv

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (STANBOL-764) Add support for importing Quads to the JenaTDB indexing source

Posted by "Rupert Westenthaler (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/STANBOL-764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rupert Westenthaler updated STANBOL-764:
----------------------------------------

    Comment: was deleted

(was: To avoid clashes with bank nodes (Bnodes) in different Graphs the importer processes those for Subjects and Objects of imported quads.

If the Context is a URI, than BNodes are converted by using the Context URI as base and the Bnode ID as local name. If the Context is a Bnode, that the Bnode ID of the Subject/Object is appended (by using '_' as separator) to the Bnode ID of the context.

e.g. the quad

    _:node16nh2jpdcx57591272 <http://xmlns.com/foaf/0.1/name> "Barack Obama"@en <http://www.huffingtonpost.com/2009/01/13/obama-issues-first-veto-t_n_157585.html>   .

would be imported as Triple 

    http://www.huffingtonpost.com/2009/01/13/obama-issues-first-veto-t_n_157585.htm#node16nh2jpdcx5759127 <http://xmlns.com/foaf/0.1/name> "Barack Obama"@en  .

NOTES:

* This does change the RDF data as anonymous BNodes are converted to dereferenceable URIs. Because of that this behavior might need some further discussion

* This does work for BNodes linking to other BNodes as BNodes are only valid within the same Graph. Meaning that Contexts of Bnodes linking to each other are ensured to be the same. Therefore also the UIRs, NodeIds of the processed nodes will match.)
    
> Add support for importing Quads to the JenaTDB indexing source
> --------------------------------------------------------------
>
>                 Key: STANBOL-764
>                 URL: https://issues.apache.org/jira/browse/STANBOL-764
>             Project: Stanbol
>          Issue Type: New Feature
>          Components: Entity Hub
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>            Priority: Minor
>
> The Jena TDB indexing source should support quads (Nquads, Trig serialized RDF data) by ignoring the context.
> This means that SPO (the triples) are stored to the default dataset of the Jena TDB store by ignoring the Context information of the quads.
> see also http://markmail.org/message/jzy6w7abettauxqv

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (STANBOL-764) Add support for importing Quads to the JenaTDB indexing source

Posted by "Rupert Westenthaler (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/STANBOL-764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13473056#comment-13473056 ] 

Rupert Westenthaler commented on STANBOL-764:
---------------------------------------------

To avoid clashes with bank nodes (Bnodes) the importer replaces Bnodes with URIs that use the URI of the context and do add the Bnode ID as local name. 

e.g. the quad

    _:node16nh2jpdcx57591272 <http://xmlns.com/foaf/0.1/name> "Barack Obama"@en <http://www.huffingtonpost.com/2009/01/13/obama-issues-first-veto-t_n_157585.html>   .

would be imported as Triple 

    http://www.huffingtonpost.com/2009/01/13/obama-issues-first-veto-t_n_157585.htm#node16nh2jpdcx5759127 <http://xmlns.com/foaf/0.1/name> "Barack Obama"@en  .

One needs to note that this does change the RDF data as anonymous BNodes are converted to dereferenceable URIs. Because of that this behavior might need some further discussion
                
> Add support for importing Quads to the JenaTDB indexing source
> --------------------------------------------------------------
>
>                 Key: STANBOL-764
>                 URL: https://issues.apache.org/jira/browse/STANBOL-764
>             Project: Stanbol
>          Issue Type: New Feature
>          Components: Entity Hub
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>            Priority: Minor
>
> The Jena TDB indexing source should support quads (Nquads, Trig serialized RDF data) by ignoring the context.
> This means that SPO (the triples) are stored to the default dataset of the Jena TDB store by ignoring the Context information of the quads.
> see also http://markmail.org/message/jzy6w7abettauxqv

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira