You are viewing a plain text version of this content. The canonical link for it is here.
Posted to pr@jena.apache.org by GitBox <gi...@apache.org> on 2022/02/07 10:57:35 UTC

[GitHub] [jena] neumarcx commented on pull request #1164: JENA-2208: Improve TDB2 xloader

neumarcx commented on pull request #1164:
URL: https://github.com/apache/jena/pull/1164#issuecomment-1031335639


   @anlam <https://github.com/anlam> try to use the gz files directly for the
   load as a workaround. no need to uncompress. data will be streamed for
   processing. it will just add a little more time in terms of cpu cycles used
   to uncompress the data dynamically.
   
   On Mon, Feb 7, 2022 at 10:38 AM An Ngoc Lam ***@***.***>
   wrote:
   
   > @afs <https://github.com/afs>, I am loading wikidata which is about 800GB
   > with 16 billion triples. tdbloader2 failed due to Out Of Memory error on
   > a machine with 512GB RAM. So I tried to use xloader. It seemed to be
   > promising, but like I said, it failed at "Build index SPO" due to "Can't
   > find gzip program" error. Do you have any workaround to this?.
   >
   > —
   > Reply to this email directly, view it on GitHub
   > <https://github.com/apache/jena/pull/1164#issuecomment-1031319546>, or
   > unsubscribe
   > <https://github.com/notifications/unsubscribe-auth/ACPL5GWRNH3JYHE2PWH5VYDUZ6OJPANCNFSM5L5DAK6A>
   > .
   > Triage notifications on the go with GitHub Mobile for iOS
   > <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
   > or Android
   > <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
   >
   > You are receiving this because you are subscribed to this thread.Message
   > ID: ***@***.***>
   >
   
   
   -- 
   
   
   ---
   Marco Neumann
   KONA
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: pr-unsubscribe@jena.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: pr-unsubscribe@jena.apache.org
For additional commands, e-mail: pr-help@jena.apache.org