You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@camel.apache.org by "Aki Yoshida (JIRA)" <ji...@apache.org> on 2014/07/08 10:57:34 UTC
[jira] [Resolved] (CAMEL-7584) XML-Aware Tokenizer failing with
utf-8 multibyte characters
[ https://issues.apache.org/jira/browse/CAMEL-7584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Aki Yoshida resolved CAMEL-7584.
--------------------------------
Resolution: Fixed
> XML-Aware Tokenizer failing with utf-8 multibyte characters
> -----------------------------------------------------------
>
> Key: CAMEL-7584
> URL: https://issues.apache.org/jira/browse/CAMEL-7584
> Project: Camel
> Issue Type: Bug
> Components: camel-core
> Reporter: Aki Yoshida
> Assignee: Aki Yoshida
> Fix For: 2.14.0
>
>
> There is some issue in the underlining Stax reader's getLocation().getCharOffset() when the input data is an InputStream to the stax reader.
> This issue was brought up in the woodstox community. But I believe fixing it seems to be non trivial as woodstox internally uses char/Reader and keeps the offset value to the character sequence and not to the original input stream.
> We change the tokenzer to pass java.io.Reader to the woodstox parser instead of passing java.io.InputStream directly.
--
This message was sent by Atlassian JIRA
(v6.2#6252)