You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@camel.apache.org by "Aki Yoshida (JIRA)" <ji...@apache.org> on 2014/07/08 10:57:34 UTC

[jira] [Resolved] (CAMEL-7584) XML-Aware Tokenizer failing with utf-8 multibyte characters

     [ https://issues.apache.org/jira/browse/CAMEL-7584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Aki Yoshida resolved CAMEL-7584.
--------------------------------

    Resolution: Fixed

> XML-Aware Tokenizer failing with utf-8 multibyte characters
> -----------------------------------------------------------
>
>                 Key: CAMEL-7584
>                 URL: https://issues.apache.org/jira/browse/CAMEL-7584
>             Project: Camel
>          Issue Type: Bug
>          Components: camel-core
>            Reporter: Aki Yoshida
>            Assignee: Aki Yoshida
>             Fix For: 2.14.0
>
>
> There is some issue in the underlining Stax reader's  getLocation().getCharOffset() when the input data is an InputStream to the stax reader.
> This issue was brought up in the woodstox community. But I believe fixing it seems to be non trivial as woodstox internally uses char/Reader and keeps the offset value to the character sequence and not to the original input stream.
> We change the tokenzer to pass java.io.Reader to the woodstox parser instead of passing java.io.InputStream directly.



--
This message was sent by Atlassian JIRA
(v6.2#6252)