Posted to user@predictionio.apache.org by George Yarish <gy...@griddynamics.com> on 2018/07/26 12:25:14 UTC

Increase heap size for pio deploy

Hi!

Can someone please advise me how to set the Java heap size for the pio
deploy process?

My current issue is "[ERROR] [LocalFSModels]  Java heap space" during pio
deploy.
My model takes ~350 MB on localfs in the model store.

I tried something like "JAVA_OPTS=-Xmx4g pio deploy", but it doesn't work
for me.

Thanks,
George

Re: Increase heap size for pio deploy

Posted by George Yarish <gy...@griddynamics.com>.

ok solved by --driver-memory 10g

Sorry for bothering,
George
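
A minimal sketch of where a flag like this is usually placed, for readers who hit the same error. The exact command used in this thread was not posted, so this assumes that pio forwards everything after a bare "--" to spark-submit, the pattern the PredictionIO docs show for pio train. The variable names are illustrative only. If deploy is launched through spark-submit, as assumed here, the driver heap is governed by this flag rather than by JAVA_OPTS, which would be consistent with the report above.

```shell
# Assumption: options after the bare "--" are handed to spark-submit.
DRIVER_MEMORY="10g"   # the value reported to work in this thread

# Build the command as a string so it can be inspected before running it.
DEPLOY_CMD="pio deploy -- --driver-memory ${DRIVER_MEMORY}"
echo "${DEPLOY_CMD}"

# To actually deploy, run the printed command from the engine directory.
```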


Re: Increase heap size for pio deploy

Posted by Pat Ferrel <pa...@occamsmachete.com>.

Depending on the template you are using, driver and executor memory requirements will grow as your data grows. Spark keeps data in memory instead of in temp files, which is how it gets its speed advantage over something like Hadoop MapReduce. This yields orders-of-magnitude speedups, but it does mean that with big data PIO (and, more specifically, Spark) is a memory hog, by design. The memory requirements will be far larger than you are used to with DBs or other services. The good thing about Spark is that the data can be spread over the members of a cluster, so if you need a 100g in-memory data structure you can put 10g on each executor, or something like this. Also, the data structures may be only loosely linked to the size of your input.

TL;DR: Experiment to find the driver and executor memory required to run train and deploy for your template. For instance, the Universal Recommender will need a lot of memory for train but almost none for deploy, because it does not use Spark for deploy. Other templates may need more memory for deploy. Unfortunately, the template and algorithm greatly affect these numbers, and there is generally no way to determine them other than by experiment.
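
One way to organize that experiment is to list a few candidate memory sizes and try them in order until train (and then deploy) stops failing. The sketch below only prints the commands. The sizes are hypothetical starting points, candidate_cmds is an illustrative helper name, and it assumes spark-submit options are passed after a bare "--".

```shell
# Hypothetical candidate sizes; adjust them to your data and cluster.
CANDIDATE_SIZES="4g 8g 16g"

# Print one train command per candidate size (nothing is executed).
candidate_cmds() {
  for mem in ${CANDIDATE_SIZES}; do
    echo "pio train -- --driver-memory ${mem} --executor-memory ${mem}"
  done
}

candidate_cmds
```

Run the printed commands one at a time, smallest first, and keep the first setting that completes without a heap error. The same approach applies to pio deploy for templates that use Spark at deploy time.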

From: George Yarish <gy...@griddynamics.com>
Reply: user@predictionio.apache.org <us...@predictionio.apache.org>
Date: July 26, 2018 at 5:51:44 AM
To: user@predictionio.apache.org <us...@predictionio.apache.org>
Subject: Re: Increase heap size for pio deploy

ok solved by --driver-memory 10g

Sorry for bothering,
George
