You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@jena.apache.org by an...@apache.org on 2022/01/14 16:20:26 UTC

[jena-site] branch xloader-threads updated (d4e777f -> 20c5d3c)

This is an automated email from the ASF dual-hosted git repository.

andy pushed a change to branch xloader-threads
in repository https://gitbox.apache.org/repos/asf/jena-site.git.


 discard d4e777f  xloader --thread argument
     new 20c5d3c  xloader --thread argument

This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version.  This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:

 * -- * -- B -- O -- O -- O   (d4e777f)
            \
             N -- N -- N   refs/heads/xloader-threads (20c5d3c)

You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.

Any revisions marked "omit" are not gone; other references still
refer to them.  Any revisions marked "discard" are gone forever.

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 source/documentation/tdb/tdb-xloader.md | 9 +++++----
 1 file changed, 5 insertions(+), 4 deletions(-)

[jena-site] 01/01: xloader --thread argument

Posted by an...@apache.org.
This is an automated email from the ASF dual-hosted git repository.

andy pushed a commit to branch xloader-threads
in repository https://gitbox.apache.org/repos/asf/jena-site.git

commit 20c5d3cbb6b18d1ce64b59ed75144a5ecba325f4
Author: Andy Seaborne <an...@apache.org>
AuthorDate: Fri Jan 14 16:17:46 2022 +0000

    xloader --thread argument
---
 source/documentation/tdb/tdb-xloader.md | 13 ++++++++++---
 1 file changed, 10 insertions(+), 3 deletions(-)

diff --git a/source/documentation/tdb/tdb-xloader.md b/source/documentation/tdb/tdb-xloader.md
index 82c8878..fd3d79f 100644
--- a/source/documentation/tdb/tdb-xloader.md
+++ b/source/documentation/tdb/tdb-xloader.md
@@ -6,13 +6,14 @@ TDB xloader ("x" for external) is a bulkloader for very large datasets. The goal
 is stability and reliability for long running loading, running on modest
 hardware and can be use to load a database on rotating disk or SSD.
 
-xloader is not a replacement for regular TDB1 and TDB2 loaders.
+`xloader` is not a replacement for regular TDB1 and TDB2 loaders. It is for very
+large datasets.
 
 There are two scripts to load data using the xloader subsystem.
 
-"tdb1.xloader", which was called "tdbloader2" and has some improvements.
+"tdb1.xloader", which was called "tdbloader2", has some improvements.
 
-It is not as fast as other TDB loaders on dataset where the general loaders work
+It is not as fast as other TDB loaders on datasets where the general loaders work
 without encountering progressive slowdown.
 
 The xloaders for TDB1 and TDB2 are not identical. The TDB2 xloader is more
@@ -40,6 +41,12 @@ temporary files.
 `FILE` is any RDF syntax supported by Jena. Syntax is determined by the file
 extension and can include an addtional ".gz" or ".bz2" for compressed files.
 
+`tdb2.xloader` also supports argument `--threads` to set the number of threads
+to use with `sort(1)`. The default is 2. The recommendation for an inital
+setting is to set it to the number of cores (not hardware threads) minus 1. This
+is sensitive to the hardware environment. Experimentation may show a different
+best setting.
+
 ### Advice
 
 To avoid a load failing due to a syntax or other data error, it is advisable to