Posted to commits@jena.apache.org by an...@apache.org on 2022/01/14 16:19:23 UTC

[jena-site] branch xloader-threads updated (2520624 -> d4e777f)

This is an automated email from the ASF dual-hosted git repository.

andy pushed a change to branch xloader-threads
in repository https://gitbox.apache.org/repos/asf/jena-site.git.


 discard 2520624  xloader --thread argument
     new d4e777f  xloader --thread argument

This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version.  This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:

 * -- * -- B -- O -- O -- O   (2520624)
            \
             N -- N -- N   refs/heads/xloader-threads (d4e777f)

You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.

Any revisions marked "omit" are not gone; other references still
refer to them.  Any revisions marked "discard" are gone forever.

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 source/documentation/tdb/tdb-xloader.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

[jena-site] 01/01: xloader --thread argument

Posted by an...@apache.org.

andy pushed a commit to branch xloader-threads
in repository https://gitbox.apache.org/repos/asf/jena-site.git

commit d4e777fedaa81778834474b2e5a29fd898fef9b4
Author: Andy Seaborne <an...@apache.org>
AuthorDate: Fri Jan 14 16:17:46 2022 +0000

    xloader --thread argument
---
 source/documentation/tdb/tdb-xloader.md | 12 +++++++++---
 1 file changed, 9 insertions(+), 3 deletions(-)

diff --git a/source/documentation/tdb/tdb-xloader.md b/source/documentation/tdb/tdb-xloader.md
index 82c8878..7f07855 100644
--- a/source/documentation/tdb/tdb-xloader.md
+++ b/source/documentation/tdb/tdb-xloader.md
@@ -6,13 +6,14 @@ TDB xloader ("x" for external) is a bulkloader for very large datasets. The goal
 is stability and reliability for long running loading, running on modest
 hardware and can be used to load a database on rotating disk or SSD.
 
-xloader is not a replacement for regular TDB1 and TDB2 loaders.
+`xloader` is not a replacement for regular TDB1 and TDB2 loaders. It is for very
+large datasets.
 
 There are two scripts to load data using the xloader subsystem.
 
-"tdb1.xloader", which was called "tdbloader2" and has some improvements.
+"tdb1.xloader", which was called "tdbloader2", has some improvements.
 
-It is not as fast as other TDB loaders on dataset where the general loaders work
+It is not as fast as other TDB loaders on datasets where the general loaders work
 without encountering progressive slowdown.
 
 The xloaders for TDB1 and TDB2 are not identical. The TDB2 xloader is more
@@ -40,6 +41,11 @@ temporary files.
 `FILE` is any RDF syntax supported by Jena. Syntax is determined by the file
 extension and can include an additional ".gz" or ".bz2" for compressed files.
 
+`tdb2.xloader` also supports `--threads` to set the number of threads to use with
+`sort(1)`. The default is 2. The recommendation for an initial setting is to set
+it to the number of cores (not hardware threads) minus 1. This is sensitive to
+the hardware environment. Experimentation may show a different best setting.
+
 ### Advice
 
 To avoid a load failing due to a syntax or other data error, it is advisable to
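
The `--threads` recommendation added in this commit (number of cores, not hardware threads, minus 1) could be turned into a starting value with a small shell sketch like the one below. It is illustrative only: `recommended_threads` and `physical_cores` are hypothetical helper names, the `lscpu` pipeline assumes a Linux system with util-linux installed, and `DB2`, `data.nt.gz` and the `--loc` option are placeholders assumed from the xloader usage rather than taken from this diff. The script only prints the command line; it does not run the loader.

```shell
#!/bin/sh
# Sketch: derive a starting --threads value as (physical cores - 1), minimum 1.

# recommended_threads CORES: print CORES - 1, but never less than 1.
recommended_threads() {
    cores="$1"
    if [ "$cores" -gt 1 ]; then
        echo $((cores - 1))
    else
        echo 1
    fi
}

# physical_cores: count unique (core, socket) pairs reported by lscpu,
# which excludes extra hardware threads on the same core.
# Falls back to 2 when lscpu is unavailable (an assumption for illustration).
physical_cores() {
    if command -v lscpu >/dev/null 2>&1; then
        lscpu -p=Core,Socket | grep -v '^#' | sort -u | wc -l
    else
        echo 2
    fi
}

THREADS=$(recommended_threads "$(physical_cores)")

# DB2 and data.nt.gz are placeholder names; --loc is assumed, not from this diff.
echo "tdb2.xloader --threads $THREADS --loc DB2 data.nt.gz"
```

As the documentation text notes, the best value depends on the hardware, so the number printed here is only a first guess to adjust by experiment.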