Posted to j-dev@xerces.apache.org by Anthony Shawver <as...@actaview.com> on 2002/02/07 21:54:42 UTC

SAX Parser input size limit

Hi All,
I am using the Xerces (1.4.1) SAXParser to read an XML document. While
parsing, the parser seems to hit a limit at 16k (between the 16384th and
16385th characters) and "splits" the piece of data that it is currently
reading. Is there a setting I can change to prevent this?

Thanks,
Tony

RE: SAX Parser input size limit

Posted by Anthony Shawver <as...@actaview.com>.
No, but for example, when I get to the following:
<COMPANY>The Company Name</COMPANY>

and the 'C' is the 16384th char, I get 2 Company records;
Company1:"The C"
Company2:"ompany Name"

The XML doc that I am parsing is about 85k, and I get 5 bad records (actually
5 records that end up as 10 bad records), one at each of the splits.
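
For reference, this symptom is consistent with the SAX contract itself: the
ContentHandler.characters() documentation allows a parser to deliver
contiguous character data in a single chunk or in several chunks, so a
handler that treats each callback as a complete value can see "The C" and
"ompany Name" as two records. Whether the 16k boundary is an internal buffer
size in Xerces 1.4.1 is an assumption here, not something confirmed in this
thread. A minimal sketch of the usual workaround, accumulating text until
endElement(), is shown below. The class name CompanyHandler and the helper
parse() are hypothetical, and the sketch assumes JAXP and a newer Java than
was current for Xerces 1.4.1 (on an older JDK, StringBuffer and a raw List
would stand in for StringBuilder and generics).

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler: collects the text of each COMPANY element,
// no matter how many characters() calls the parser uses to deliver it.
class CompanyHandler extends DefaultHandler {
    private final StringBuilder text = new StringBuilder();
    private final List<String> companies = new ArrayList<>();

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        // Start a fresh buffer for every element.
        text.setLength(0);
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        // May be called more than once per text run, so append rather than emit.
        text.append(ch, start, length);
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        // Only here is the element's text known to be complete.
        if ("COMPANY".equals(qName)) {
            companies.add(text.toString());
        }
    }

    public List<String> getCompanies() {
        return companies;
    }

    // Hypothetical convenience method: parse an XML string with the default JAXP parser.
    public static List<String> parse(String xml) throws Exception {
        CompanyHandler handler = new CompanyHandler();
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xml)), handler);
        return handler.getCompanies();
    }
}
```

With a handler along these lines, the record is created in endElement()
rather than in characters(), so the number of chunks the parser happens to
produce should no longer affect the number of Company records.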

Tony

-----Original Message-----
From: Andy Clark [mailto:andyc@apache.org]
Sent: Thursday, February 07, 2002 7:07 PM
To: xerces-j-dev@xml.apache.org
Subject: Re: SAX Parser input size limit


Anthony Shawver wrote:
> During the parsing of the document, the parser seems to hit a limit at
> 16k (in between the 16384th & 16385th characters) and "splits" the
> current piece of data that it's reading.   Is there a setting I can
> change to prevent this?

Define "split". Are you losing data?

--
Andy Clark * andyc@apache.org

---------------------------------------------------------------------
To unsubscribe, e-mail: xerces-j-dev-unsubscribe@xml.apache.org
For additional commands, e-mail: xerces-j-dev-help@xml.apache.org




Re: SAX Parser input size limit

Posted by Andy Clark <an...@apache.org>.
Anthony Shawver wrote:
> During the parsing of the document, the parser seems to hit a limit at
> 16k (in between the 16384th & 16385th characters) and "splits" the
> current piece of data that it's reading.   Is there a setting I can
> change to prevent this?

Define "split". Are you losing data?

-- 
Andy Clark * andyc@apache.org
