Posted to dev@jena.apache.org by Andy Seaborne <an...@apache.org> on 2012/03/06 09:35:35 UTC

Re: svn commit: r1297010 - /incubator/jena/Jena2/TDB/trunk/bin/tdbloader2

Paolo,

This has been in for a day now.  It will have been put into the nightly 
build.

Why not have a differently-named file that only your testing uses so 
that everyone else is not picking up your debug changes?  Or a branch? 
Or a custom build?

	Andy

On 05/03/12 11:35, castagna@apache.org wrote:
> Author: castagna
> Date: Mon Mar  5 11:35:39 2012
> New Revision: 1297010
>
> URL: http://svn.apache.org/viewvc?rev=1297010&view=rev
> Log:
> Print the classpath, temporarily.
>
> Modified:
>      incubator/jena/Jena2/TDB/trunk/bin/tdbloader2
>
> Modified: incubator/jena/Jena2/TDB/trunk/bin/tdbloader2
> URL: http://svn.apache.org/viewvc/incubator/jena/Jena2/TDB/trunk/bin/tdbloader2?rev=1297010&r1=1297009&r2=1297010&view=diff
> ==============================================================================
> --- incubator/jena/Jena2/TDB/trunk/bin/tdbloader2 (original)
> +++ incubator/jena/Jena2/TDB/trunk/bin/tdbloader2 Mon Mar  5 11:35:39 2012
> @@ -62,6 +62,8 @@ log "Data phase"
>   DATA_TRIPLES="$LOC/data-triples.$TMP"
>   DATA_QUADS="$LOC/data-quads.$TMP"
>
> +# This is temporary, to help me debugging a classpath issue when running on EC2 directly from trunk. I'll remove it, when I find the problem. -- PC
> +echo $CP
>   JVM_ARGS=${JVM_ARGS:--Xmx1200M -server}
>   java $JVM_ARGS -cp "$CP" "$PKG".CmdNodeTableBuilder \
>       "--loc=$LOC" "--triples=$DATA_TRIPLES" "--quads=$DATA_QUADS" $FILES
>
>


Re: svn commit: r1297010 - /incubator/jena/Jena2/TDB/trunk/bin/tdbloader2

Posted by Paolo Castagna <ca...@googlemail.com>.
Hi Andy

Removed.

The ability to control JVM arguments is there (as for the other TDB scripts):

JVM_ARGS=${JVM_ARGS:--Xmx1200M -server}
java $JVM_ARGS -cp "$CP" "$PKG".CmdNodeTableBuilder \
    "--loc=$LOC" "--triples=$DATA_TRIPLES" "--quads=$DATA_QUADS" $FILES

I had OutOfMemoryError(s) with a 600 million dataset using the -Xmx1200M
default setting. I tried with -Xmx2048m, same issue. I am now re-running
using -Xmx4096m.
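
As a rough sketch of how that default behaves: the ${JVM_ARGS:-...} expansion
uses the caller's JVM_ARGS when it is set and non-empty, and falls back to
"-Xmx1200M -server" otherwise. The helper name resolve_jvm_args is hypothetical
and only wraps the same expansion so it can be run in isolation; it is not part
of tdbloader2.

```shell
#!/bin/sh
# Sketch of the ${VAR:-default} expansion used in the script lines above.
# resolve_jvm_args is a hypothetical helper, not part of tdbloader2.
resolve_jvm_args() {
    # Use the caller's JVM_ARGS if set and non-empty, else the script default.
    echo "${JVM_ARGS:--Xmx1200M -server}"
}

# No JVM_ARGS set: the default applies.
unset JVM_ARGS
DEFAULT_ARGS=$(resolve_jvm_args)

# JVM_ARGS set by the caller: it replaces the default entirely,
# so -server has to be repeated if it is still wanted.
JVM_ARGS="-Xmx4096m -server"
OVERRIDE_ARGS=$(resolve_jvm_args)

echo "default:  $DEFAULT_ARGS"
echo "override: $OVERRIDE_ARGS"
```

Assuming the script picks the variable up from the environment as above, a
larger heap would be requested with something like
JVM_ARGS="-Xmx4096m -server" tdbloader2 --loc /path/to/DB data.nt
(the location and file name here are placeholders).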

Paolo
