Posted to dev@uima.apache.org by "Marshall Schor (JIRA)" <ui...@incubator.apache.org> on 2008/04/04 23:01:25 UTC

[jira] Created: (UIMA-958) Document Analyzer not showing PersonTitle when running with xml tagged source

Document Analyzer not showing PersonTitle when running with xml tagged source
-----------------------------------------------------------------------------

Key: UIMA-958
URL: https://issues.apache.org/jira/browse/UIMA-958
Project: UIMA
Issue Type: Bug
Components: Tools
Reporter: Marshall Schor
Assignee: Marshall Schor
Priority: Minor
Fix For: 2.2.2

Running the Document Analyzer on XML input documents with an XML tag specified (TEXT) causes the PersonTitle annotator to show no results. The annotator is being given a language of x-unspecified, even though the collection reader set the document language to "en". The cause is the XmlDetagger that is inserted into the aggregate: it passes a "plain text" CAS view to the PersonTitle annotator, but it does not copy the language specifier from its input CAS view. In fact, the input CAS view is "xmlDocument", and that view doesn't have the language either; the collection reader puts the language only into "_InitialView". Fix by having the XmlDetagger copy the initial view's language into the plain text view it creates for downstream annotators to work on.

Note that there are two copies of this class (one in tools, the other in examples); fix both of them.
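
The proposed fix can be illustrated with a small, self-contained sketch. It does not use the UIMA API: ViewSketch, CasSketch and DetaggerSketch are hypothetical stand-ins, the view name "plainTextDocument" is an assumption for the "plain text" view, and the tag stripping is a crude simplification of what the real XmlDetagger does. In UIMA itself the equivalent step would presumably go through getDocumentLanguage() and setDocumentLanguage() on the respective CAS views.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical stand-in for a single CAS view; not the UIMA API. */
final class ViewSketch {
    String text = "";
    // Default language the issue reports for views that never had one set.
    String language = "x-unspecified";
}

/** Hypothetical stand-in for a CAS that holds named views. */
final class CasSketch {
    private final Map<String, ViewSketch> views = new HashMap<>();

    /** Returns the named view, creating it with defaults if it is missing. */
    ViewSketch view(String name) {
        return views.computeIfAbsent(name, n -> new ViewSketch());
    }
}

/** Sketch of a detagger that propagates the language to its output view. */
final class DetaggerSketch {
    static final String INITIAL_VIEW = "_InitialView";          // named in the issue
    static final String XML_VIEW = "xmlDocument";               // named in the issue
    static final String PLAIN_TEXT_VIEW = "plainTextDocument";  // assumed name

    static ViewSketch detag(CasSketch cas) {
        ViewSketch xml = cas.view(XML_VIEW);
        ViewSketch plain = cas.view(PLAIN_TEXT_VIEW);

        // Simplification: drop anything that looks like a tag.
        plain.text = xml.text.replaceAll("<[^>]*>", "");

        // The fix: take the language from the initial view, because the
        // collection reader sets it only there and not on the XML view.
        plain.language = cas.view(INITIAL_VIEW).language;
        return plain;
    }

    public static void main(String[] args) {
        CasSketch cas = new CasSketch();
        cas.view(INITIAL_VIEW).language = "en";             // collection reader
        cas.view(XML_VIEW).text = "<TEXT>Dr. Smith</TEXT>"; // XML input
        ViewSketch plain = detag(cas);
        System.out.println(plain.language + ": " + plain.text);
    }
}
```

Without the language-copy line, the plain text view would keep x-unspecified, which matches the symptom described above: a language-sensitive annotator downstream would produce no results.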

--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (UIMA-958) Document Analyzer not showing PersonTitle when running with xml tagged source

Posted by "Marshall Schor (JIRA)" <ui...@incubator.apache.org>.

     [ https://issues.apache.org/jira/browse/UIMA-958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor updated UIMA-958:
--------------------------------

    Fix Version/s:     (was: 2.2.2)

In my opinion we don't have to include this in 2.2.2, since it only affects examples. Changing the fix release to "unknown".

[jira] Updated: (UIMA-958) Document Analyzer not showing PersonTitle when running with xml tagged source

Posted by "Michael Baessler (JIRA)" <ui...@incubator.apache.org>.

     [ https://issues.apache.org/jira/browse/UIMA-958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael Baessler updated UIMA-958:
----------------------------------

    Fix Version/s: 2.3

[jira] Commented: (UIMA-958) Document Analyzer not showing PersonTitle when running with xml tagged source

Posted by "Michael Baessler (JIRA)" <ui...@incubator.apache.org>.

    [ https://issues.apache.org/jira/browse/UIMA-958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12585940#action_12585940 ] 

Michael Baessler commented on UIMA-958:
---------------------------------------

Fine with me to ship this with the next release. I will change the fix version to 2.3.

[jira] Closed: (UIMA-958) Document Analyzer not showing PersonTitle when running with xml tagged source

Posted by "Marshall Schor (JIRA)" <ui...@incubator.apache.org>.

     [ https://issues.apache.org/jira/browse/UIMA-958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Marshall Schor closed UIMA-958.
-------------------------------

    Resolution: Fixed

> Document Analyzer not showing PersonTitle when running with xml tagged source
> -----------------------------------------------------------------------------
>
>                 Key: UIMA-958
>                 URL: https://issues.apache.org/jira/browse/UIMA-958
>             Project: UIMA
>          Issue Type: Bug
>          Components: Tools
>            Reporter: Marshall Schor
>            Assignee: Marshall Schor
>            Priority: Minor
>             Fix For: 2.2.2
