You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@ctakes.apache.org by "Tim Miller (JIRA)" <ji...@apache.org> on 2013/01/22 17:18:13 UTC

[jira] [Created] (CTAKES-133) constituency-parser: escaped types confused for function tags

Tim Miller created CTAKES-133:
---------------------------------

             Summary: constituency-parser: escaped types confused for function tags
                 Key: CTAKES-133
                 URL: https://issues.apache.org/jira/browse/CTAKES-133
             Project: cTAKES
          Issue Type: Bug
          Components: ctakes-constituency-parser
            Reporter: Tim Miller
            Assignee: Tim Miller
            Priority: Minor


There is code to separate constituent labels from function tags - e.g., NP-SBJ for a noun phrase (NP) that is a subject (SBJ).  There are other tags that are special cases to escape things that screw up tree formatting -- a paren in the text "(" is rewritten as -LRB- to not be mistaken for a tree delimiter. But the code looking for the dash in a label-function tag will find the dash in the -LRB-.  The fix is to check for a dash at the start of the label -- escaped tags will not have function elements.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira