You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@camel.apache.org by "Aki Yoshida (JIRA)" <ji...@apache.org> on 2014/07/07 18:18:33 UTC
[jira] [Created] (CAMEL-7584) XML-Aware Tokenizer failing with
utf-8 multibyte characters
Aki Yoshida created CAMEL-7584:
----------------------------------
Summary: XML-Aware Tokenizer failing with utf-8 multibyte characters
Key: CAMEL-7584
URL: https://issues.apache.org/jira/browse/CAMEL-7584
Project: Camel
Issue Type: Bug
Components: camel-core
Reporter: Aki Yoshida
Assignee: Aki Yoshida
Fix For: 2.14.0
There is some issue in the underlining Stax reader's getLocation().getCharOffset() when the input data is an InputStream to the stax reader.
This issue was brought up in the woodstox community. But I believe fixing it seems to be non trivial as woodstox internally uses char/Reader and keeps the offset value to the character sequence and not to the original input stream.
We change the tokenzer to pass java.io.Reader to the woodstox parser instead of passing java.io.InputStream directly.
--
This message was sent by Atlassian JIRA
(v6.2#6252)