You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Matt Massie (JIRA)" <ji...@apache.org> on 2015/07/14 23:10:05 UTC

[jira] [Created] (SPARK-9043) Serialize key, value and combiner classes in ShuffleDependency

Matt Massie created SPARK-9043:
----------------------------------

             Summary: Serialize key, value and combiner classes in ShuffleDependency
                 Key: SPARK-9043
                 URL: https://issues.apache.org/jira/browse/SPARK-9043
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core
            Reporter: Matt Massie


ShuffleManager implementations are currently not given type information regarding the key, value and combiner classes. Serialization of shuffle objects relies on them being JavaSerializable, with methods defined for reading/writing the object or, alternatively, serialization via Kryo which uses reflection.

Serialization systems like Avro, Thrift and Protobuf generate classes with zero argument constructors and explicit schema information (e.g. IndexedRecords in Avro have get, put and getSchema methods).

By serializing the key, value and combiner class names in {ShuffleDependency}, shuffle implementations will have access to schema information when {registerShuffle} is called.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org