Posted to issues@spark.apache.org by "Suzen Fylke (Jira)" <ji...@apache.org> on 2020/06/24 19:20:00 UTC
[jira] [Created] (SPARK-32094) Patch cloudpickle.py with typing module side-effect fix
Suzen Fylke created SPARK-32094:
-----------------------------------
Summary: Patch cloudpickle.py with typing module side-effect fix
Key: SPARK-32094
URL: https://issues.apache.org/jira/browse/SPARK-32094
Project: Spark
Issue Type: Bug
Components: PySpark
Affects Versions: 3.0.0, 2.4.6
Reporter: Suzen Fylke
PySpark's cloudpickle.py and cloudpickle versions below 1.3.0 interfere with dill unpickling: as a side effect they define types.ClassType, a name that dill's type map does not contain on the unpickling side. This results in the following error:
{code:java}
Traceback (most recent call last):
  File "/usr/local/lib/python3.6/site-packages/apache_beam/internal/pickler.py", line 279, in loads
    return dill.loads(s)
  File "/usr/local/lib/python3.6/site-packages/dill/_dill.py", line 317, in loads
    return load(file, ignore)
  File "/usr/local/lib/python3.6/site-packages/dill/_dill.py", line 305, in load
    obj = pik.load()
  File "/usr/local/lib/python3.6/site-packages/dill/_dill.py", line 577, in _load_type
    return _reverse_typemap[name]
KeyError: 'ClassType'
{code}
(See [https://github.com/cloudpipe/cloudpickle/issues/82])
This was fixed for cloudpickle 1.3.0+ ([https://github.com/cloudpipe/cloudpickle/pull/337]), but PySpark's cloudpickle.py doesn't have this change yet.
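For anyone trying to picture the failure mode, here is a minimal sketch of how a stray attribute on the types module can end up as a key in a name-to-type lookup table in one process and be missing in another. It does not use dill or cloudpickle. build_reverse_typemap is a hypothetical stand-in written for illustration, not dill's actual code, and the assignment types.ClassType = type is an assumed form of the side effect described above.

```python
import types


def build_reverse_typemap():
    """Return a name -> type table built from the `types` module.

    Hypothetical stand-in for a pickler's lookup table; not dill's code.
    """
    return {
        name: obj
        for name, obj in vars(types).items()
        if isinstance(obj, type)
    }


# "Pickling" process: some library has patched the types module on import.
types.ClassType = type  # assumed form of the side effect from the report
try:
    patched_map = build_reverse_typemap()
finally:
    del types.ClassType  # undo the patch so the rest of the sketch is clean

# "Unpickling" process: a clean interpreter where nothing patched `types`.
clean_map = build_reverse_typemap()

# The patched table can resolve the name; the clean one cannot.
resolved = patched_map["ClassType"]
try:
    clean_map["ClassType"]
    lookup_failed = False
except KeyError:
    lookup_failed = True
```

Under these assumptions, a payload that refers to a type by the name 'ClassType' can only be resolved where the patch has been applied, which is consistent with the KeyError in the traceback above.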