You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@spark.apache.org by 牛兆捷 <nz...@gmail.com> on 2015/07/09 10:19:26 UTC

Questions about Fault tolerance of Spark

Hi All:

We already know that Spark utilizes the lineage to recompute the RDDs when
failure occurs.
I want to study the performance of this fault-tolerant approach and have
some questions about it.

1) Is there any benchmark (or standard failure model) to test the fault
tolerance of these kinds of in-memory data processing systems?

2) How do you emulate the failures in testing spark?  (e.g., kill a
computation task? or kill the computation nodes?)

Thanks!!!

-- 
*Regards,*
*Zhaojie*