Posted to commits@stanbol.apache.org by "Rupert Westenthaler (JIRA)" <ji...@apache.org> on 2012/11/06 15:54:12 UTC

[jira] [Created] (STANBOL-797) Adapt the OpenNLP NER engine to support the AnalyzedText ContentPart

Rupert Westenthaler created STANBOL-797:
-------------------------------------------

             Summary: Adapt the OpenNLP NER engine to support the AnalyzedText ContentPart
                 Key: STANBOL-797
                 URL: https://issues.apache.org/jira/browse/STANBOL-797
             Project: Stanbol
          Issue Type: Sub-task
            Reporter: Rupert Westenthaler
            Assignee: Rupert Westenthaler


Currently the OpenNLP NER engine consumes the plain text, detects sentences, and tokenizes the text itself. However, if an EnhancementChain uses the NLP processing module, that data is likely already present in the AnalyzedText ContentPart.

This sub-task adds functionality that allows the OpenNLP NER engine to consume that data. If no AnalyzedText ContentPart is present, or the AnalyzedText does not contain Token information, the engine should still perform tokenization and sentence detection itself.
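
The fallback behaviour described above could be sketched roughly as follows. This is only an illustration under stated assumptions: AnalyzedTextSketch and NerInputSketch are hypothetical stand-ins and not the Stanbol AnalyzedText API, and the naive regex splitter merely takes the place of the OpenNLP sentence detector and tokenizer.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Optional;

/** Hypothetical stand-in for pre-computed NLP results; NOT the Stanbol AnalyzedText API. */
final class AnalyzedTextSketch {
    /** Tokens grouped by sentence; may be empty if no Token information exists. */
    final List<List<String>> sentences;

    AnalyzedTextSketch(List<List<String>> sentences) {
        this.sentences = sentences;
    }

    /** True if at least one sentence carries at least one token. */
    boolean hasTokens() {
        return sentences.stream().anyMatch(s -> !s.isEmpty());
    }
}

/** Hypothetical helper that selects the input for a NER step. */
final class NerInputSketch {

    /** Reuses existing tokens when available, otherwise tokenizes the plain text. */
    static List<List<String>> tokensFor(String plainText, Optional<AnalyzedTextSketch> analyzed) {
        if (analyzed.isPresent() && analyzed.get().hasTokens()) {
            return analyzed.get().sentences; // reuse the upstream analysis
        }
        return fallbackTokenize(plainText); // no usable analysis: do it ourselves
    }

    /**
     * Naive fallback (an assumption for this sketch): sentences end at '.', '!' or '?'
     * followed by whitespace, and tokens are separated by whitespace.
     */
    static List<List<String>> fallbackTokenize(String plainText) {
        List<List<String>> result = new ArrayList<>();
        for (String sentence : plainText.split("(?<=[.!?])\\s+")) {
            List<String> tokens = new ArrayList<>();
            for (String token : sentence.trim().split("\\s+")) {
                if (!token.isEmpty()) {
                    tokens.add(token);
                }
            }
            if (!tokens.isEmpty()) {
                result.add(tokens);
            }
        }
        return result;
    }
}
```

Calling NerInputSketch.tokensFor(text, Optional.empty()) exercises the fallback path; passing an AnalyzedTextSketch that contains tokens returns those tokens unchanged.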

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (STANBOL-797) Adapt the OpenNLP NER engine to support the AnalyzedText ContentPart

Posted by "Rupert Westenthaler (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/STANBOL-797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rupert Westenthaler resolved STANBOL-797.
-----------------------------------------

    Resolution: Fixed

Implementation provided with http://svn.apache.org/viewvc?rev=1406168&view=rev
                
> Adapt the OpenNLP NER engine to support the AnalyzedText ContentPart
> --------------------------------------------------------------------
>
>                 Key: STANBOL-797
>                 URL: https://issues.apache.org/jira/browse/STANBOL-797
>             Project: Stanbol
>          Issue Type: Sub-task
>          Components: Enhancer
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>
> Currently the OpenNLP NER engine consumes the plain text, detects sentences, and tokenizes the text itself. However, if an EnhancementChain uses the NLP processing module, that data is likely already present in the AnalyzedText ContentPart.
> This sub-task adds functionality that allows the OpenNLP NER engine to consume that data. If no AnalyzedText ContentPart is present, or the AnalyzedText does not contain Token information, the engine should still perform tokenization and sentence detection itself.


[jira] [Reopened] (STANBOL-797) Adapt the OpenNLP NER engine to support the AnalyzedText ContentPart

Posted by "Rupert Westenthaler (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/STANBOL-797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rupert Westenthaler reopened STANBOL-797:
-----------------------------------------


Add functionality to also write the NLP annotations to the AnalyzedText ContentPart, so that other engines that prefer the NLP API can consume them without accessing the RDF enhancement structure.
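
As a rough illustration of that idea, the sketch below attaches NER annotations to a shared in-memory analysis object that a later engine can query directly. All names here (NerSpanSketch, SharedAnalysisSketch, addNer, mentionsOfType) are hypothetical and chosen for this example only; they are not the Stanbol NLP API or its NerTag class.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical NER annotation on a character span; NOT the Stanbol NerTag class. */
final class NerSpanSketch {
    final int start;   // inclusive character offset
    final int end;     // exclusive character offset
    final String type; // e.g. "PERSON" or "LOCATION"

    NerSpanSketch(int start, int end, String type) {
        this.start = start;
        this.end = end;
        this.type = type;
    }
}

/** Hypothetical shared analysis object handed from one engine to the next. */
final class SharedAnalysisSketch {
    private final String text;
    private final List<NerSpanSketch> nerSpans = new ArrayList<>();

    SharedAnalysisSketch(String text) {
        this.text = text;
    }

    /** Called by the NER step to record a detected entity mention. */
    void addNer(int start, int end, String type) {
        if (start < 0 || end > text.length() || start >= end) {
            throw new IllegalArgumentException("invalid span " + start + ".." + end);
        }
        nerSpans.add(new NerSpanSketch(start, end, type));
    }

    /** Called by a downstream step; no RDF lookup is involved. */
    List<String> mentionsOfType(String type) {
        List<String> mentions = new ArrayList<>();
        for (NerSpanSketch span : nerSpans) {
            if (span.type.equals(type)) {
                mentions.add(text.substring(span.start, span.end));
            }
        }
        return mentions;
    }
}
```

In this sketch the NER step would call addNer for each detected mention, and a later engine would call mentionsOfType to read the results directly from the shared object.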
                

[jira] [Resolved] (STANBOL-797) Adapt the OpenNLP NER engine to support the AnalyzedText ContentPart

Posted by "Rupert Westenthaler (JIRA)" <ji...@apache.org>.

     [ https://issues.apache.org/jira/browse/STANBOL-797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rupert Westenthaler resolved STANBOL-797.
-----------------------------------------

    Resolution: Fixed

NerTags are now added to the AnalyzedText ContentPart.
                