You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@jena.apache.org by an...@apache.org on 2022/01/14 16:17:52 UTC

[jena-site] branch xloader-threads created (now 2520624)

This is an automated email from the ASF dual-hosted git repository.

andy pushed a change to branch xloader-threads
in repository https://gitbox.apache.org/repos/asf/jena-site.git.


      at 2520624  xloader --thread argument

This branch includes the following new commits:

     new 2520624  xloader --thread argument

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


[jena-site] 01/01: xloader --thread argument

Posted by an...@apache.org.
This is an automated email from the ASF dual-hosted git repository.

andy pushed a commit to branch xloader-threads
in repository https://gitbox.apache.org/repos/asf/jena-site.git

commit 25206246fd27f2a0b443e43c528113a82f641a42
Author: Andy Seaborne <an...@apache.org>
AuthorDate: Fri Jan 14 16:17:46 2022 +0000

    xloader --thread argument
---
 source/documentation/tdb/tdb-xloader.md | 10 ++++++++--
 1 file changed, 8 insertions(+), 2 deletions(-)

diff --git a/source/documentation/tdb/tdb-xloader.md b/source/documentation/tdb/tdb-xloader.md
index 82c8878..f23056d 100644
--- a/source/documentation/tdb/tdb-xloader.md
+++ b/source/documentation/tdb/tdb-xloader.md
@@ -6,11 +6,12 @@ TDB xloader ("x" for external) is a bulkloader for very large datasets. The goal
 is stability and reliability for long running loading, running on modest
 hardware and can be use to load a database on rotating disk or SSD.
 
-xloader is not a replacement for regular TDB1 and TDB2 loaders.
+`xloader` is not a replacement for regular TDB1 and TDB2 loaders. It is for very
+large datasets.
 
 There are two scripts to load data using the xloader subsystem.
 
-"tdb1.xloader", which was called "tdbloader2" and has some improvements.
+"tdb1.xloader", which was called "tdbloader2", has some improvements.
 
 It is not as fast as other TDB loaders on dataset where the general loaders work
 without encountering progressive slowdown.
@@ -40,6 +41,11 @@ temporary files.
 `FILE` is any RDF syntax supported by Jena. Syntax is determined by the file
 extension and can include an addtional ".gz" or ".bz2" for compressed files.
 
+`tdb2.xloader also supports `--threads` to set the number of threads to use with
+`sort(1)`. The default is 2. The recommendation for an inital setting is to set
+it to the number of cores (not hardware threads) minus 1. This is sensitive to
+the hardware environment. Experimentation may show a different best setting.
+
 ### Advice
 
 To avoid a load failing due to a syntax or other data error, it is advisable to