You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@uima.apache.org by "Eddie Epstein (JIRA)" <ui...@incubator.apache.org> on 2009/12/15 19:57:18 UTC

[jira] Created: (UIMA-1707) String features in out-of-typesystem data not normalized on service replies

String features in out-of-typesystem data not normalized on service replies
---------------------------------------------------------------------------

                 Key: UIMA-1707
                 URL: https://issues.apache.org/jira/browse/UIMA-1707
             Project: UIMA
          Issue Type: Bug
          Components: C++ Framework
    Affects Versions: 2.2.2
            Reporter: Eddie Epstein
            Assignee: Eddie Epstein
             Fix For: 2.3C


Christoph Büscher  to uima-user

I currently encountered a problem with the XMI deserialization of a feature structure after calling a remote C++ AS annotator from a CPE. The szenario is the following:

1. I add a custom feature structure "DocumentData" containing an String Feature (the document URL) to the CAS in my CPE. The exact URL causing the problem is:

documentURL="http://www.gesundheitsnachrichten.net/live/navigation/live.php?navigation_id=11&_psmand=1"

2. The CAS get's serialized to XMI before sending it to a remote C++ TAE. I added a breakpoint to UimaSerializer.serializeCasToXmi() and get the following part in the XMI string:

documentURL="http://www.gesundheitsnachrichten.net/live/navigation/live.php?navigation_id=11&amp;_psmand=1"

So here the "&" character seems to be excaped correctly.

3. When the document comes back, the same feature in the XMI string received in UimaSerializer.deserializeCasFromXmi() reads:

documentURL="http://www.gesundheitsnachrichten.net/live/navigation/live.php?navigation_id=11&_psmand=1"

an now the SAXParser throws the following exception:

org.xml.sax.SAXParseException: The reference to entity "_psmand" must end with the ';' delimiter.
       at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
       at org.apache.uima.aae.UimaSerializer.deserializeCasFromXmi(UimaSerializer.java:170)
       at ...

because the "&" comes back unescaped.
-----------------------------------

Christoph Büscher to uima-user

I did some further testing and the problem seems to happen when the FS is not
declared in the remote C++ TAE but declared and set in an AE in the (lokal) main
 application (in out case a CPE running various Java AEs).
------------------------------------------

Nice work Christoph!


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Closed: (UIMA-1707) String features in out-of-typesystem data not normalized on service replies

Posted by "Eddie Epstein (JIRA)" <ui...@incubator.apache.org>.
     [ https://issues.apache.org/jira/browse/UIMA-1707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Eddie Epstein closed UIMA-1707.
-------------------------------

    Resolution: Fixed

> String features in out-of-typesystem data not normalized on service replies
> ---------------------------------------------------------------------------
>
>                 Key: UIMA-1707
>                 URL: https://issues.apache.org/jira/browse/UIMA-1707
>             Project: UIMA
>          Issue Type: Bug
>          Components: C++ Framework
>    Affects Versions: 2.2.2
>            Reporter: Eddie Epstein
>            Assignee: Eddie Epstein
>             Fix For: 2.3C
>
>
> Christoph Büscher  to uima-user
> I currently encountered a problem with the XMI deserialization of a feature structure after calling a remote C++ AS annotator from a CPE. The szenario is the following:
> 1. I add a custom feature structure "DocumentData" containing an String Feature (the document URL) to the CAS in my CPE. The exact URL causing the problem is:
> documentURL="http://www.gesundheitsnachrichten.net/live/navigation/live.php?navigation_id=11&_psmand=1"
> 2. The CAS get's serialized to XMI before sending it to a remote C++ TAE. I added a breakpoint to UimaSerializer.serializeCasToXmi() and get the following part in the XMI string:
> documentURL="http://www.gesundheitsnachrichten.net/live/navigation/live.php?navigation_id=11&amp;_psmand=1"
> So here the "&" character seems to be excaped correctly.
> 3. When the document comes back, the same feature in the XMI string received in UimaSerializer.deserializeCasFromXmi() reads:
> documentURL="http://www.gesundheitsnachrichten.net/live/navigation/live.php?navigation_id=11&_psmand=1"
> an now the SAXParser throws the following exception:
> org.xml.sax.SAXParseException: The reference to entity "_psmand" must end with the ';' delimiter.
>        at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
>        at org.apache.uima.aae.UimaSerializer.deserializeCasFromXmi(UimaSerializer.java:170)
>        at ...
> because the "&" comes back unescaped.
> -----------------------------------
> Christoph Büscher to uima-user
> I did some further testing and the problem seems to happen when the FS is not
> declared in the remote C++ TAE but declared and set in an AE in the (lokal) main
>  application (in out case a CPE running various Java AEs).
> ------------------------------------------
> Nice work Christoph!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (UIMA-1707) String features in out-of-typesystem data not normalized on service replies

Posted by "Christoph Buescher (JIRA)" <ui...@incubator.apache.org>.
    [ https://issues.apache.org/jira/browse/UIMA-1707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12796706#action_12796706 ] 

Christoph Buescher commented on UIMA-1707:
------------------------------------------

Great. We were able to verify that your fix is working with for our test cases. Many thanks. Looking forward to the offical UIMA 2.3. Release.

> String features in out-of-typesystem data not normalized on service replies
> ---------------------------------------------------------------------------
>
>                 Key: UIMA-1707
>                 URL: https://issues.apache.org/jira/browse/UIMA-1707
>             Project: UIMA
>          Issue Type: Bug
>          Components: C++ Framework
>    Affects Versions: 2.2.2
>            Reporter: Eddie Epstein
>            Assignee: Eddie Epstein
>             Fix For: 2.3C
>
>
> Christoph Büscher  to uima-user
> I currently encountered a problem with the XMI deserialization of a feature structure after calling a remote C++ AS annotator from a CPE. The szenario is the following:
> 1. I add a custom feature structure "DocumentData" containing an String Feature (the document URL) to the CAS in my CPE. The exact URL causing the problem is:
> documentURL="http://www.gesundheitsnachrichten.net/live/navigation/live.php?navigation_id=11&_psmand=1"
> 2. The CAS get's serialized to XMI before sending it to a remote C++ TAE. I added a breakpoint to UimaSerializer.serializeCasToXmi() and get the following part in the XMI string:
> documentURL="http://www.gesundheitsnachrichten.net/live/navigation/live.php?navigation_id=11&amp;_psmand=1"
> So here the "&" character seems to be excaped correctly.
> 3. When the document comes back, the same feature in the XMI string received in UimaSerializer.deserializeCasFromXmi() reads:
> documentURL="http://www.gesundheitsnachrichten.net/live/navigation/live.php?navigation_id=11&_psmand=1"
> an now the SAXParser throws the following exception:
> org.xml.sax.SAXParseException: The reference to entity "_psmand" must end with the ';' delimiter.
>        at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
>        at org.apache.uima.aae.UimaSerializer.deserializeCasFromXmi(UimaSerializer.java:170)
>        at ...
> because the "&" comes back unescaped.
> -----------------------------------
> Christoph Büscher to uima-user
> I did some further testing and the problem seems to happen when the FS is not
> declared in the remote C++ TAE but declared and set in an AE in the (lokal) main
>  application (in out case a CPE running various Java AEs).
> ------------------------------------------
> Nice work Christoph!

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.