You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by Dennis Suhari <d....@icloud.com.INVALID> on 2019/10/08 15:48:34 UTC

Best practise local vs distributed python

Hi,

is there any „best practise“ rule of thumb when to use local python instead of distributed python on spark (data size, massive computation etc.) ? I mean spark can also generate overhead and sometimes local processing is faster.

Br,

Dennis

---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscribe@spark.apache.org