Posted to notifications@ctakes.apache.org by "James Joseph Masanz (JIRA)" <ji...@apache.org> on 2013/08/29 17:35:52 UTC
[jira] [Created] (CTAKES-231) missing NEs because of inconsistent chunking for parallel sentence constructions
James Joseph Masanz created CTAKES-231:
------------------------------------------
Summary: missing NEs because of inconsistent chunking for parallel sentence constructions
Key: CTAKES-231
URL: https://issues.apache.org/jira/browse/CTAKES-231
Project: cTAKES
Issue Type: Bug
Components: ctakes-chunker
Affects Versions: 3.0-incubating
Reporter: James Joseph Masanz
Attachments: liver.cancer.chunking.issue.xmi.xml
The phrase
    cancer of colon, lung and liver
results in an annotation for liver cancer, but the phrase
    cancer of colon, liver and lung.
does not result in an annotation for liver cancer or for lung cancer.
Thanks Dennis Lee Hon Kit for reporting this.
Details:
Reproduced by running 3.0.0-incubating with the separately downloadable UMLS resources and the AggregatePlaintextUMLSProcessor.xml descriptor. The run results in these chunk annotations:
[0] org.apache.ctakes.typesystem.type.syntax.NP
[1] org.apache.ctakes.typesystem.type.syntax.PP
[2] org.apache.ctakes.typesystem.type.syntax.NP
[3] org.apache.ctakes.typesystem.type.syntax.NP
[4] org.apache.ctakes.typesystem.type.syntax.PP
[5] org.apache.ctakes.typesystem.type.syntax.NP
[6] org.apache.ctakes.typesystem.type.syntax.O
[7] org.apache.ctakes.typesystem.type.syntax.O
[8] org.apache.ctakes.typesystem.type.syntax.NP
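
To compare chunker output across the two phrasings, it can help to reduce a listing like the one above to a compact tag sequence. The following is a minimal sketch, assuming the listing is pasted as plain text in the `[index] fully.qualified.Type` form shown; the class `ChunkTagSequence` and its method `tags` are hypothetical helpers for illustration and are not part of cTAKES.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of cTAKES: reduces a pasted chunk listing
// such as "[0] org.apache.ctakes.typesystem.type.syntax.NP" to short tags.
public class ChunkTagSequence {

    // Returns the simple type name of each listed chunk, in listing order.
    public static List<String> tags(String listing) {
        List<String> result = new ArrayList<>();
        for (String line : listing.split("\\R")) {
            String trimmed = line.trim();
            if (trimmed.isEmpty()) {
                continue; // skip blank lines in the pasted text
            }
            // Keep only the text after the last '.', e.g. "NP", "PP", "O".
            result.add(trimmed.substring(trimmed.lastIndexOf('.') + 1));
        }
        return result;
    }

    public static void main(String[] args) {
        // The nine chunk annotations exactly as listed in this report.
        String listing = String.join("\n",
            "[0] org.apache.ctakes.typesystem.type.syntax.NP",
            "[1] org.apache.ctakes.typesystem.type.syntax.PP",
            "[2] org.apache.ctakes.typesystem.type.syntax.NP",
            "[3] org.apache.ctakes.typesystem.type.syntax.NP",
            "[4] org.apache.ctakes.typesystem.type.syntax.PP",
            "[5] org.apache.ctakes.typesystem.type.syntax.NP",
            "[6] org.apache.ctakes.typesystem.type.syntax.O",
            "[7] org.apache.ctakes.typesystem.type.syntax.O",
            "[8] org.apache.ctakes.typesystem.type.syntax.NP");
        // Prints: NP PP NP NP PP NP O O NP
        System.out.println(String.join(" ", tags(listing)));
    }
}
```

Running the same helper on the listing produced for each phrasing separately would give two short sequences that can be compared side by side.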