You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@opennlp.apache.org by "James Kosin (Created) (JIRA)" <ji...@apache.org> on 2012/01/05 04:34:39 UTC

[jira] [Created] (OPENNLP-417) Back-to-Back tags get improperly set when tagging

Back-to-Back <START><END> tags get improperly set when tagging
--------------------------------------------------------------

                 Key: OPENNLP-417
                 URL: https://issues.apache.org/jira/browse/OPENNLP-417
             Project: OpenNLP
          Issue Type: Bug
          Components: Name Finder
    Affects Versions: tools-1.5.2-incubating, tools-1.5.1-incubating
            Reporter: James Kosin
            Assignee: James Kosin


1) Special thanks go to Angel Luis Jimenez Martinez for both finding the problem, and finding the one line of code that caused this problem.

This only affects the TokenNameFinder when parsing user input and tagging the output.
The problem was when the name finder model returns the spans, it would properly find the Spans and return them; however, it would improperly label the Spans when constructing them, if they where back-to-back.
This can cause the NameFinder to improperly report the wrong tag in this situation.



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Closed] (OPENNLP-417) Back-to-Back tags get improperly set when tagging

Posted by "James Kosin (Closed) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/OPENNLP-417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

James Kosin closed OPENNLP-417.
-------------------------------

    
> Back-to-Back <START><END> tags get improperly set when tagging
> --------------------------------------------------------------
>
>                 Key: OPENNLP-417
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-417
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: Name Finder
>    Affects Versions: tools-1.5.1-incubating, tools-1.5.2-incubating
>            Reporter: James Kosin
>            Assignee: James Kosin
>              Labels: namefinder, tags, wrong
>             Fix For: tools-1.5.3-incubating
>
>
> 1) Special thanks go to Angel Luis Jimenez Martinez for both finding the problem, and finding the one line of code that caused this problem.
> This only affects the TokenNameFinder when parsing user input and tagging the output.
> The problem was when the name finder model returns the spans, it would properly find the Spans and return them; however, it would improperly label the Spans when constructing them, if they where back-to-back.
> This can cause the NameFinder to improperly report the wrong tag in this situation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (OPENNLP-417) Back-to-Back tags get improperly set when tagging

Posted by "James Kosin (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/OPENNLP-417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13180184#comment-13180184 ] 

James Kosin commented on OPENNLP-417:
-------------------------------------

I'm going to assume then that the back-to-back issue isn't that prevalent if you are just looking for one type of name.

                
> Back-to-Back <START><END> tags get improperly set when tagging
> --------------------------------------------------------------
>
>                 Key: OPENNLP-417
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-417
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: Name Finder
>    Affects Versions: tools-1.5.1-incubating, tools-1.5.2-incubating
>            Reporter: James Kosin
>            Assignee: James Kosin
>              Labels: namefinder, tags, wrong
>
> 1) Special thanks go to Angel Luis Jimenez Martinez for both finding the problem, and finding the one line of code that caused this problem.
> This only affects the TokenNameFinder when parsing user input and tagging the output.
> The problem was when the name finder model returns the spans, it would properly find the Spans and return them; however, it would improperly label the Spans when constructing them, if they where back-to-back.
> This can cause the NameFinder to improperly report the wrong tag in this situation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Resolved] (OPENNLP-417) Back-to-Back tags get improperly set when tagging

Posted by "James Kosin (Resolved) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/OPENNLP-417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

James Kosin resolved OPENNLP-417.
---------------------------------

       Resolution: Fixed
    Fix Version/s: tools-1.5.3-incubating

I'm closing the issue as fixed.  I don't see any degragation in performance with the changes.
                
> Back-to-Back <START><END> tags get improperly set when tagging
> --------------------------------------------------------------
>
>                 Key: OPENNLP-417
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-417
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: Name Finder
>    Affects Versions: tools-1.5.1-incubating, tools-1.5.2-incubating
>            Reporter: James Kosin
>            Assignee: James Kosin
>              Labels: namefinder, tags, wrong
>             Fix For: tools-1.5.3-incubating
>
>
> 1) Special thanks go to Angel Luis Jimenez Martinez for both finding the problem, and finding the one line of code that caused this problem.
> This only affects the TokenNameFinder when parsing user input and tagging the output.
> The problem was when the name finder model returns the spans, it would properly find the Spans and return them; however, it would improperly label the Spans when constructing them, if they where back-to-back.
> This can cause the NameFinder to improperly report the wrong tag in this situation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (OPENNLP-417) Back-to-Back tags get improperly set when tagging

Posted by "James Kosin (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/OPENNLP-417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13180183#comment-13180183 ] 

James Kosin commented on OPENNLP-417:
-------------------------------------

So far, it has improved the performance of the name finder that contains all types...

Dutch [ned] test data:

Precision: 0.664424218440839
Recall: 0.6418195718654435
F-Measure: 0.6529263076025666


up over 1.5.2 release:

Precision: 0.6509695290858726
Recall: 0.628822629969419
F-Measure: 0.6397044526540929


                
> Back-to-Back <START><END> tags get improperly set when tagging
> --------------------------------------------------------------
>
>                 Key: OPENNLP-417
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-417
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: Name Finder
>    Affects Versions: tools-1.5.1-incubating, tools-1.5.2-incubating
>            Reporter: James Kosin
>            Assignee: James Kosin
>              Labels: namefinder, tags, wrong
>
> 1) Special thanks go to Angel Luis Jimenez Martinez for both finding the problem, and finding the one line of code that caused this problem.
> This only affects the TokenNameFinder when parsing user input and tagging the output.
> The problem was when the name finder model returns the spans, it would properly find the Spans and return them; however, it would improperly label the Spans when constructing them, if they where back-to-back.
> This can cause the NameFinder to improperly report the wrong tag in this situation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (OPENNLP-417) Back-to-Back tags get improperly set when tagging

Posted by "James Kosin (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/OPENNLP-417?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13180169#comment-13180169 ] 

James Kosin commented on OPENNLP-417:
-------------------------------------

Changes are done; I'm now testing the changes.
                
> Back-to-Back <START><END> tags get improperly set when tagging
> --------------------------------------------------------------
>
>                 Key: OPENNLP-417
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-417
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: Name Finder
>    Affects Versions: tools-1.5.1-incubating, tools-1.5.2-incubating
>            Reporter: James Kosin
>            Assignee: James Kosin
>              Labels: namefinder, tags, wrong
>
> 1) Special thanks go to Angel Luis Jimenez Martinez for both finding the problem, and finding the one line of code that caused this problem.
> This only affects the TokenNameFinder when parsing user input and tagging the output.
> The problem was when the name finder model returns the spans, it would properly find the Spans and return them; however, it would improperly label the Spans when constructing them, if they where back-to-back.
> This can cause the NameFinder to improperly report the wrong tag in this situation.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira