You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commons-dev@ws.apache.org by ve...@apache.org on 2009/12/21 16:07:59 UTC

svn commit: r892845 - /webservices/commons/trunk/modules/axiom/src/docbkx/userguide.xml

Author: veithen
Date: Mon Dec 21 15:07:58 2009
New Revision: 892845

URL: http://svn.apache.org/viewvc?rev=892845&view=rev
Log:
User guide: added a section describing how to switch off coalescing.

Modified:
    webservices/commons/trunk/modules/axiom/src/docbkx/userguide.xml

Modified: webservices/commons/trunk/modules/axiom/src/docbkx/userguide.xml
URL: http://svn.apache.org/viewvc/webservices/commons/trunk/modules/axiom/src/docbkx/userguide.xml?rev=892845&r1=892844&r2=892845&view=diff
==============================================================================
--- webservices/commons/trunk/modules/axiom/src/docbkx/userguide.xml (original)
+++ webservices/commons/trunk/modules/axiom/src/docbkx/userguide.xml Mon Dec 21 15:07:58 2009
@@ -1007,6 +1007,30 @@
 <screen><![CDATA[<root>Test
 with CRLF</root>]]></screen>
                 </section>
+                <section>
+                    <title>Preserving CDATA sections during parsing</title>
+                    <para>
+                        By default, <classname>StAXUtils</classname> creates StAX parsers in coaelescing mode.
+                        In this mode, the parser will never return two character data events in sequence, while
+                        in non coaelescing mode, the parser is allowed to break up character data into smaller
+                        chunks and to return multiple consecutive character events, which may improve throughput
+                        for documents containing large text nodes.
+                        It should be noted that <classname>StAXUtils</classname> overrides the default settings
+                        mandated by the StAX specification, which specifies that by default, a StAX parser must
+                        be in non coalescing mode. The primary reason is compatibility: older versions of
+                        Woodstox had coalescing switched on by default.
+                    </para>
+                    <para>
+                        A side effect of the default settings chosen by Axiom is that by default, CDATA sections
+                        are not reported by parser created by
+                        <classname>StAXUtils</classname>. The reason is that in coalescing mode, the parser will
+                        not only coaelsce adjacent text nodes, but also CDATA sections. Applications that require
+                        correct reporting of CDATA sections should therefore disable coalescing. This can be
+                        achieved by creating a <filename>XMLInputFactory.properties</filename> file with the
+                        following content:
+                    </para>
+<programlisting>javax.xml.stream.isCoalescing=false</programlisting>
+                </section>
             </section>
         </section>
         <section>