You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@jena.apache.org by GitBox <gi...@apache.org> on 2022/07/07 11:25:20 UTC

[GitHub] [jena] afs commented on issue #1324: Poor performance when parsing huge literal in query (e.g. 100MB)

afs commented on issue #1324:
URL: https://github.com/apache/jena/issues/1324#issuecomment-1177453788

   The issue we experience is buffer management. Linear growth of a few Kbytes for 100Mb is a lot of recopying. If it grew at say x1.5 (like a Java ArrayList) the effect would be much less. (The same issues can arise with Arraylist but much less pronounced).
   
   The grammar is the grammar in the SPARQL specification - it really "is the grammar" because the HTML in th spec was produced from this JavaCC grammar!
   
   The negation is only 3 chars ahead maximum. 
   
   @SimonBin - is this triple-quoted literals or single-quoted?
   
   In both cases, if the XML uses " for attributes, then a `'` quoting may make a difference but I suspect the buffer expansion is going to dominate.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@jena.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@jena.apache.org
For additional commands, e-mail: issues-help@jena.apache.org