You are viewing a plain text version of this content. The canonical link for it is here.
Posted to j-dev@xerces.apache.org by Anthony Shawver <as...@actaview.com> on 2002/02/07 21:54:42 UTC
SAX Parser input size limit
Hi All,
I am using the xerces (1.4.1) SAXParser to read an XML Document. During the
parsing of the document, the parser seems to hit a limit at 16k (in between
the 16384th & 16385th characters) and "splits" the current piece of data
that it's reading. Is there a setting I can change to prevent this?
Thanks,
Tony
RE: SAX Parser input size limit
Posted by Anthony Shawver <as...@actaview.com>.
No, but for example, when I get to the following:
<COMPANY>The Company Name</COMPANY>
and the 'C' is the 16384th char, I get 2 Company records;
Company1:"The C"
Company2:"ompany Name"
The xml doc that I am parsing is about 85k and I get 5 bad records (actually
5 records that end up as 10 bad records) at each of the splits.
Tony
-----Original Message-----
From: Andy Clark [mailto:andyc@apache.org]
Sent: Thursday, February 07, 2002 7:07 PM
To: xerces-j-dev@xml.apache.org
Subject: Re: SAX Parser input size limit
Anthony Shawver wrote:
> During the parsing of the document, the parser seems to hit a limit at
> 16k (in between the 16384th & 16385th characters) and "splits" the
> current piece of data that it's reading. Is there a setting I can
> change to prevent this?
Define "split". Are you losing data?
--
Andy Clark * andyc@apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-j-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-j-dev-help@xml.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-j-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-j-dev-help@xml.apache.org
Re: SAX Parser input size limit
Posted by Andy Clark <an...@apache.org>.
Anthony Shawver wrote:
> During the parsing of the document, the parser seems to hit a limit at
> 16k (in between the 16384th & 16385th characters) and "splits" the
> current piece of data that it's reading. Is there a setting I can
> change to prevent this?
Define "split". Are you losing data?
--
Andy Clark * andyc@apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-j-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-j-dev-help@xml.apache.org