Posted to user@spark.apache.org by muhammet pakyürek <mp...@hotmail.com> on 2016/09/26 08:08:39 UTC

How to decide which parts of a process should use a Spark DataFrame and which a pandas DataFrame?


Is there a clear guide for deciding the above?

Re: How to decide which parts of a process should use a Spark DataFrame and which a pandas DataFrame?

Posted by Peyman Mohajerian <mo...@gmail.com>.
A simple way to do that is to collect the data to the driver when you need to
use Python pandas.
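
As a rough sketch of that idea (not a definitive recipe): keep the heavy,
distributed work in a Spark DataFrame, and call toPandas() only once the
result is small enough for the driver. The helper name collect_if_small, the
max_rows threshold, and the sample data in demo() are assumptions made up for
illustration; none of them come from Spark itself.

```python
def collect_if_small(sdf, max_rows=100_000):
    """Return a Spark DataFrame as a pandas DataFrame if it is small enough.

    max_rows is an arbitrary illustrative threshold, not a Spark setting.
    """
    # limit(max_rows + 1) bounds the work needed to check the size.
    n = sdf.limit(max_rows + 1).count()
    if n > max_rows:
        raise ValueError(
            "more than %d rows; reduce the data in Spark first" % max_rows
        )
    # toPandas() pulls every row into the driver's memory.
    return sdf.toPandas()


def demo():
    # Imports are local so the helper above does not itself require Spark.
    from pyspark.sql import SparkSession, functions as F

    spark = (
        SparkSession.builder.master("local[*]")
        .appName("spark-to-pandas-sketch")
        .getOrCreate()
    )

    # Hypothetical data: one million ids split into ten buckets.
    sdf = spark.range(1_000_000).withColumn("bucket", F.col("id") % 10)

    # The large aggregation stays distributed in Spark.
    summary = sdf.groupBy("bucket").agg(
        F.count("*").alias("n"),
        F.avg("id").alias("mean_id"),
    )

    # The result has only ten rows, so it is safe to hand to pandas.
    pdf = collect_if_small(summary)
    print(pdf.sort_values("bucket"))

    spark.stop()
```

Calling demo() needs a local Spark installation plus pandas. The size check
is there because collecting a DataFrame that does not fit in driver memory
will typically fail, so filtering, sampling, or aggregating in Spark first is
usually the safer order.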
