You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@giraph.apache.org by Francesco Sclano <sc...@gmail.com> on 2018/12/12 15:31:08 UTC

In a real cluster (not pseudo-distributed mode) composed by N datanodes eachone having X vCPU, is more appropriate to set giraph.numComputeThreads with N or with X*N ?

Hi,
I'm using a cluster composed of N datanodes and I'd like to properly set
following parameters:

-ca giraph.numComputeThreads=
-ca giraph.numInputThreads=
-ca giraph.numOutputThreads=

The cluster is dedicated to my giraph program (no others job run in this
cluster).
I'd like to know if is more appropriate to set above parameters with N *
with X*N ? (where X is the number of vCPU in every datanode).

p.s. As Claudio Martella says in a previous post, maybe is better to use
X-1 instead of X because giraph needs 1 thread for internal work. Anyway I
didn't understand if this previous post is related to a pseudo-distributed
mode or a distributed mode. Hence i post this question :-P

Thanks!

-- 
Francesco Sclano