You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@stanbol.apache.org by "Dileepa Jayakody (JIRA)" <ji...@apache.org> on 2013/09/19 22:59:51 UTC

[jira] [Updated] (STANBOL-1161) Entity Disambiguation using FOAF Co-reference

     [ https://issues.apache.org/jira/browse/STANBOL-1161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Dileepa Jayakody updated STANBOL-1161:
--------------------------------------

    Description: 
Entity Disambiguation using FOAF co-reference is implemented as a new Stanbol Enhancement Engine. The project includes 2 sub-tasks.

1. Entityhub site with substantial FOAF entities : 
FOAF datasources are dispersed all over the internet, and it's essential to aggregate a sufficient FOAF dataset in-order to perform co-reference disambiguation techniques over data. foaf-site is the entityhub site created by indexing btc2012 dataset including FOAF data. For more information on the foaf-site integration in Stanbol please refer the github project at [1]. The README.md contains the steps to integrate btc2012 dataset as an entityhub-site in Stanbol.

2. Disambiguation Engine using FOAF co-reference techniques :
The main algorithm calculates disambiguated confidence of an entity by analyzing the the connected-ness of the entity. This is calculated by processing the number of correlated URI References with other suggested entities in the content. Correlated URI links between entities derive their connected-ness and the highest connected entity will have high disambiguated confidence. The other disambiguation algorithm uses literal-matching of selected-text with foaf:name value of the entity. If matched, the confidence is increased. The foaf disambiguation engine project can be found at the github project [2]. To install the engine and configure Stanbol enhancement-chain please refer the README.md of the project.


[1] https://github.com/dileepajayakody/FOAFSite
[2] https://github.com/dileepajayakody/foaf-disambiguation

  was:
Entity Disambiguation using FOAF co-reference is implemented as a new Stanbol Enhancement Engine. The project includes 2 sub-tasks.

1. Entityhub site with substantial FOAF entities : 
FOAF datasources are dispersed all over the internet, and it's essential to aggregate a sufficient FOAF dataset in-order to perform co-reference disambiguation techniques over data. foaf-site is the entityhub site created by indexing btc2012 dataset including FOAF data. For more information on the foaf-site integration in Stanbol please refer the github project at [1]. The README.md contains the steps to integrate btc2012 dataset as an entityhub-site in Stanbol.

2. Disambiguation Engine using FOAF co-reference techniques :
The main algorithm calculates disambiguated confidence of an entity by analyzing the the connected-ness of the entity. This is calculated by processing the number of URI reference matches with others from suggested entities in the content. Co-references of URI links between entities derive their connected-ness and the highest connected entity will have more disambiguated confidence. The other disambiguation algorithm uses literal-matching of selected-text with foaf:name value of the entity. If matched, the confidence is increased. The foaf disambiguation engine project can be found at the github project [2]. To install the engine and configure Stanbol enhancement-chain please refer the README.md of the project.


[1] https://github.com/dileepajayakody/FOAFSite
[2] https://github.com/dileepajayakody/foaf-disambiguation

    
> Entity Disambiguation using FOAF Co-reference
> ---------------------------------------------
>
>                 Key: STANBOL-1161
>                 URL: https://issues.apache.org/jira/browse/STANBOL-1161
>             Project: Stanbol
>          Issue Type: Task
>          Components: Enhancement Engines, Entityhub
>            Reporter: Dileepa Jayakody
>   Original Estimate: 1,344h
>  Remaining Estimate: 1,344h
>
> Entity Disambiguation using FOAF co-reference is implemented as a new Stanbol Enhancement Engine. The project includes 2 sub-tasks.
> 1. Entityhub site with substantial FOAF entities : 
> FOAF datasources are dispersed all over the internet, and it's essential to aggregate a sufficient FOAF dataset in-order to perform co-reference disambiguation techniques over data. foaf-site is the entityhub site created by indexing btc2012 dataset including FOAF data. For more information on the foaf-site integration in Stanbol please refer the github project at [1]. The README.md contains the steps to integrate btc2012 dataset as an entityhub-site in Stanbol.
> 2. Disambiguation Engine using FOAF co-reference techniques :
> The main algorithm calculates disambiguated confidence of an entity by analyzing the the connected-ness of the entity. This is calculated by processing the number of correlated URI References with other suggested entities in the content. Correlated URI links between entities derive their connected-ness and the highest connected entity will have high disambiguated confidence. The other disambiguation algorithm uses literal-matching of selected-text with foaf:name value of the entity. If matched, the confidence is increased. The foaf disambiguation engine project can be found at the github project [2]. To install the engine and configure Stanbol enhancement-chain please refer the README.md of the project.
> [1] https://github.com/dileepajayakody/FOAFSite
> [2] https://github.com/dileepajayakody/foaf-disambiguation

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira