You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2019/10/21 12:57:29 UTC

[GitHub] [spark] HyukjinKwon opened a new pull request #26194: [SPARK-29536][PYTHON] Upgrade cloudpickle to 1.2.2 to support Python 3.8

HyukjinKwon opened a new pull request #26194: [SPARK-29536][PYTHON] Upgrade cloudpickle to 1.2.2 to support Python 3.8
URL: https://github.com/apache/spark/pull/26194
 
 
   ### What changes were proposed in this pull request?
   
   Inline cloudpickle in PySpark to cloudpickle 1.2.2. See https://github.com/cloudpipe/cloudpickle/blob/v1.2.2/cloudpickle/cloudpickle.py
   
   ### Why are the changes needed?
   
   positional-only arguments was newly introduced from Python 3.8 (see https://docs.python.org/3/whatsnew/3.8.html#positional-only-parameters)
   
   Particularly the newly added argument to `types.CodeType` was the problem (https://docs.python.org/3/whatsnew/3.8.html#changes-in-the-python-api):
   
   > types.CodeType has a new parameter in the second position of the constructor (posonlyargcount) to support positional-only arguments defined in PEP 570. The first argument (argcount) now represents the total number of positional arguments (including positional-only arguments). The new replace() method of types.CodeType can be used to make the code future-proof.
   
   ### Does this PR introduce any user-facing change?
   
   No.
   
   ### How was this patch tested?
   
   Manually tested. Note that the optional dependency PyArrow looks not yet supporting Python 3.8; therefore, it was not tested.
   
   ```bash
   cd python
   ./run-tests --python-executables=python3.8
   ```
   
   ```
   Running PySpark tests. Output is in /Users/hyukjin.kwon/workspace/forked/spark/python/unit-tests.log
   Will test against the following Python executables: ['python3.8']
   Will test the following Python modules: ['pyspark-core', 'pyspark-ml', 'pyspark-mllib', 'pyspark-sql', 'pyspark-streaming']
   Starting test(python3.8): pyspark.ml.tests.test_algorithms
   Starting test(python3.8): pyspark.ml.tests.test_feature
   Starting test(python3.8): pyspark.ml.tests.test_base
   Starting test(python3.8): pyspark.ml.tests.test_evaluation
   Finished test(python3.8): pyspark.ml.tests.test_base (12s)
   Starting test(python3.8): pyspark.ml.tests.test_image
   Finished test(python3.8): pyspark.ml.tests.test_evaluation (14s)
   Starting test(python3.8): pyspark.ml.tests.test_linalg
   Finished test(python3.8): pyspark.ml.tests.test_feature (23s)
   Starting test(python3.8): pyspark.ml.tests.test_param
   Finished test(python3.8): pyspark.ml.tests.test_image (22s)
   Starting test(python3.8): pyspark.ml.tests.test_persistence
   Finished test(python3.8): pyspark.ml.tests.test_param (25s)
   Starting test(python3.8): pyspark.ml.tests.test_pipeline
   Finished test(python3.8): pyspark.ml.tests.test_linalg (37s)
   Starting test(python3.8): pyspark.ml.tests.test_stat
   Finished test(python3.8): pyspark.ml.tests.test_pipeline (7s)
   Starting test(python3.8): pyspark.ml.tests.test_training_summary
   Finished test(python3.8): pyspark.ml.tests.test_stat (21s)
   Starting test(python3.8): pyspark.ml.tests.test_tuning
   Finished test(python3.8): pyspark.ml.tests.test_persistence (45s)
   Starting test(python3.8): pyspark.ml.tests.test_wrapper
   Finished test(python3.8): pyspark.ml.tests.test_algorithms (83s)
   Starting test(python3.8): pyspark.mllib.tests.test_algorithms
   Finished test(python3.8): pyspark.ml.tests.test_training_summary (32s)
   Starting test(python3.8): pyspark.mllib.tests.test_feature
   Finished test(python3.8): pyspark.ml.tests.test_wrapper (20s)
   Starting test(python3.8): pyspark.mllib.tests.test_linalg
   Finished test(python3.8): pyspark.mllib.tests.test_feature (32s)
   Starting test(python3.8): pyspark.mllib.tests.test_stat
   Finished test(python3.8): pyspark.mllib.tests.test_algorithms (70s)
   Starting test(python3.8): pyspark.mllib.tests.test_streaming_algorithms
   Finished test(python3.8): pyspark.mllib.tests.test_stat (37s)
   Starting test(python3.8): pyspark.mllib.tests.test_util
   Finished test(python3.8): pyspark.mllib.tests.test_linalg (70s)
   Starting test(python3.8): pyspark.sql.tests.test_arrow
   Finished test(python3.8): pyspark.sql.tests.test_arrow (1s) ... 53 tests were skipped
   Starting test(python3.8): pyspark.sql.tests.test_catalog
   Finished test(python3.8): pyspark.mllib.tests.test_util (15s)
   Starting test(python3.8): pyspark.sql.tests.test_column
   Finished test(python3.8): pyspark.sql.tests.test_catalog (24s)
   Starting test(python3.8): pyspark.sql.tests.test_conf
   Finished test(python3.8): pyspark.sql.tests.test_column (21s)
   Starting test(python3.8): pyspark.sql.tests.test_context
   Finished test(python3.8): pyspark.ml.tests.test_tuning (125s)
   Starting test(python3.8): pyspark.sql.tests.test_dataframe
   Finished test(python3.8): pyspark.sql.tests.test_conf (9s)
   Starting test(python3.8): pyspark.sql.tests.test_datasources
   Finished test(python3.8): pyspark.sql.tests.test_context (29s)
   Starting test(python3.8): pyspark.sql.tests.test_functions
   Finished test(python3.8): pyspark.sql.tests.test_datasources (32s)
   Starting test(python3.8): pyspark.sql.tests.test_group
   Finished test(python3.8): pyspark.sql.tests.test_dataframe (39s) ... 3 tests were skipped
   Starting test(python3.8): pyspark.sql.tests.test_pandas_udf
   Finished test(python3.8): pyspark.sql.tests.test_pandas_udf (1s) ... 6 tests were skipped
   Starting test(python3.8): pyspark.sql.tests.test_pandas_udf_cogrouped_map
   Finished test(python3.8): pyspark.sql.tests.test_pandas_udf_cogrouped_map (0s) ... 14 tests were skipped
   Starting test(python3.8): pyspark.sql.tests.test_pandas_udf_grouped_agg
   Finished test(python3.8): pyspark.sql.tests.test_pandas_udf_grouped_agg (1s) ... 15 tests were skipped
   Starting test(python3.8): pyspark.sql.tests.test_pandas_udf_grouped_map
   Finished test(python3.8): pyspark.sql.tests.test_pandas_udf_grouped_map (1s) ... 20 tests were skipped
   Starting test(python3.8): pyspark.sql.tests.test_pandas_udf_scalar
   Finished test(python3.8): pyspark.sql.tests.test_pandas_udf_scalar (1s) ... 49 tests were skipped
   Starting test(python3.8): pyspark.sql.tests.test_pandas_udf_window
   Finished test(python3.8): pyspark.sql.tests.test_pandas_udf_window (1s) ... 14 tests were skipped
   Starting test(python3.8): pyspark.sql.tests.test_readwriter
   Finished test(python3.8): pyspark.sql.tests.test_functions (29s)
   Starting test(python3.8): pyspark.sql.tests.test_serde
   Finished test(python3.8): pyspark.sql.tests.test_group (20s)
   Starting test(python3.8): pyspark.sql.tests.test_session
   Finished test(python3.8): pyspark.mllib.tests.test_streaming_algorithms (126s)
   Starting test(python3.8): pyspark.sql.tests.test_streaming
   Finished test(python3.8): pyspark.sql.tests.test_serde (25s)
   Starting test(python3.8): pyspark.sql.tests.test_types
   Finished test(python3.8): pyspark.sql.tests.test_readwriter (38s)
   Starting test(python3.8): pyspark.sql.tests.test_udf
   Finished test(python3.8): pyspark.sql.tests.test_session (32s)
   Starting test(python3.8): pyspark.sql.tests.test_utils
   Finished test(python3.8): pyspark.sql.tests.test_utils (17s)
   Starting test(python3.8): pyspark.streaming.tests.test_context
   Finished test(python3.8): pyspark.sql.tests.test_types (45s)
   Starting test(python3.8): pyspark.streaming.tests.test_dstream
   Finished test(python3.8): pyspark.sql.tests.test_udf (44s)
   Starting test(python3.8): pyspark.streaming.tests.test_kinesis
   Finished test(python3.8): pyspark.streaming.tests.test_kinesis (0s) ... 2 tests were skipped
   Starting test(python3.8): pyspark.streaming.tests.test_listener
   Finished test(python3.8): pyspark.streaming.tests.test_context (28s)
   Starting test(python3.8): pyspark.tests.test_appsubmit
   Finished test(python3.8): pyspark.sql.tests.test_streaming (60s)
   Starting test(python3.8): pyspark.tests.test_broadcast
   Finished test(python3.8): pyspark.streaming.tests.test_listener (11s)
   Starting test(python3.8): pyspark.tests.test_conf
   Finished test(python3.8): pyspark.tests.test_conf (17s)
   Starting test(python3.8): pyspark.tests.test_context
   Finished test(python3.8): pyspark.tests.test_broadcast (39s)
   Starting test(python3.8): pyspark.tests.test_daemon
   Finished test(python3.8): pyspark.tests.test_daemon (5s)
   Starting test(python3.8): pyspark.tests.test_join
   Finished test(python3.8): pyspark.tests.test_context (31s)
   Starting test(python3.8): pyspark.tests.test_profiler
   Finished test(python3.8): pyspark.tests.test_join (9s)
   Starting test(python3.8): pyspark.tests.test_rdd
   Finished test(python3.8): pyspark.tests.test_profiler (12s)
   Starting test(python3.8): pyspark.tests.test_readwrite
   Finished test(python3.8): pyspark.tests.test_readwrite (23s) ... 3 tests were skipped
   Starting test(python3.8): pyspark.tests.test_serializers
   Finished test(python3.8): pyspark.tests.test_appsubmit (94s)
   Starting test(python3.8): pyspark.tests.test_shuffle
   Finished test(python3.8): pyspark.streaming.tests.test_dstream (110s)
   Starting test(python3.8): pyspark.tests.test_taskcontext
   Finished test(python3.8): pyspark.tests.test_rdd (42s)
   Starting test(python3.8): pyspark.tests.test_util
   Finished test(python3.8): pyspark.tests.test_serializers (11s)
   Starting test(python3.8): pyspark.tests.test_worker
   Finished test(python3.8): pyspark.tests.test_shuffle (12s)
   Starting test(python3.8): pyspark.accumulators
   Finished test(python3.8): pyspark.tests.test_util (7s)
   Starting test(python3.8): pyspark.broadcast
   Finished test(python3.8): pyspark.accumulators (8s)
   Starting test(python3.8): pyspark.conf
   Finished test(python3.8): pyspark.broadcast (8s)
   Starting test(python3.8): pyspark.context
   Finished test(python3.8): pyspark.tests.test_worker (19s)
   Starting test(python3.8): pyspark.ml.classification
   Finished test(python3.8): pyspark.conf (4s)
   Starting test(python3.8): pyspark.ml.clustering
   Finished test(python3.8): pyspark.context (22s)
   Starting test(python3.8): pyspark.ml.evaluation
   Finished test(python3.8): pyspark.tests.test_taskcontext (49s)
   Starting test(python3.8): pyspark.ml.feature
   Finished test(python3.8): pyspark.ml.clustering (43s)
   Starting test(python3.8): pyspark.ml.fpm
   Finished test(python3.8): pyspark.ml.evaluation (27s)
   Starting test(python3.8): pyspark.ml.image
   Finished test(python3.8): pyspark.ml.image (8s)
   Starting test(python3.8): pyspark.ml.linalg.__init__
   Finished test(python3.8): pyspark.ml.linalg.__init__ (0s)
   Starting test(python3.8): pyspark.ml.recommendation
   Finished test(python3.8): pyspark.ml.classification (63s)
   Starting test(python3.8): pyspark.ml.regression
   Finished test(python3.8): pyspark.ml.fpm (23s)
   Starting test(python3.8): pyspark.ml.stat
   Finished test(python3.8): pyspark.ml.stat (30s)
   Starting test(python3.8): pyspark.ml.tuning
   Finished test(python3.8): pyspark.ml.regression (51s)
   Starting test(python3.8): pyspark.mllib.classification
   Finished test(python3.8): pyspark.ml.feature (93s)
   Starting test(python3.8): pyspark.mllib.clustering
   Finished test(python3.8): pyspark.ml.tuning (39s)
   Starting test(python3.8): pyspark.mllib.evaluation
   Finished test(python3.8): pyspark.mllib.classification (38s)
   Starting test(python3.8): pyspark.mllib.feature
   Finished test(python3.8): pyspark.mllib.evaluation (25s)
   Starting test(python3.8): pyspark.mllib.fpm
   Finished test(python3.8): pyspark.mllib.clustering (64s)
   Starting test(python3.8): pyspark.mllib.linalg.__init__
   Finished test(python3.8): pyspark.ml.recommendation (131s)
   Starting test(python3.8): pyspark.mllib.linalg.distributed
   Finished test(python3.8): pyspark.mllib.linalg.__init__ (0s)
   Starting test(python3.8): pyspark.mllib.random
   Finished test(python3.8): pyspark.mllib.feature (36s)
   Starting test(python3.8): pyspark.mllib.recommendation
   Finished test(python3.8): pyspark.mllib.fpm (31s)
   Starting test(python3.8): pyspark.mllib.regression
   Finished test(python3.8): pyspark.mllib.random (16s)
   Starting test(python3.8): pyspark.mllib.stat.KernelDensity
   Finished test(python3.8): pyspark.mllib.stat.KernelDensity (1s)
   Starting test(python3.8): pyspark.mllib.stat._statistics
   Finished test(python3.8): pyspark.mllib.stat._statistics (25s)
   Starting test(python3.8): pyspark.mllib.tree
   Finished test(python3.8): pyspark.mllib.regression (44s)
   Starting test(python3.8): pyspark.mllib.util
   Finished test(python3.8): pyspark.mllib.recommendation (49s)
   Starting test(python3.8): pyspark.profiler
   Finished test(python3.8): pyspark.mllib.linalg.distributed (53s)
   Starting test(python3.8): pyspark.rdd
   Finished test(python3.8): pyspark.profiler (14s)
   Starting test(python3.8): pyspark.serializers
   Finished test(python3.8): pyspark.mllib.tree (30s)
   Starting test(python3.8): pyspark.shuffle
   Finished test(python3.8): pyspark.shuffle (2s)
   Starting test(python3.8): pyspark.sql.avro.functions
   Finished test(python3.8): pyspark.mllib.util (30s)
   Starting test(python3.8): pyspark.sql.catalog
   Finished test(python3.8): pyspark.serializers (17s)
   Starting test(python3.8): pyspark.sql.column
   Finished test(python3.8): pyspark.rdd (31s)
   Starting test(python3.8): pyspark.sql.conf
   Finished test(python3.8): pyspark.sql.conf (7s)
   Starting test(python3.8): pyspark.sql.context
   Finished test(python3.8): pyspark.sql.avro.functions (19s)
   Starting test(python3.8): pyspark.sql.dataframe
   Finished test(python3.8): pyspark.sql.catalog (16s)
   Starting test(python3.8): pyspark.sql.functions
   Finished test(python3.8): pyspark.sql.column (27s)
   Starting test(python3.8): pyspark.sql.group
   Finished test(python3.8): pyspark.sql.context (26s)
   Starting test(python3.8): pyspark.sql.readwriter
   Finished test(python3.8): pyspark.sql.group (52s)
   Starting test(python3.8): pyspark.sql.session
   Finished test(python3.8): pyspark.sql.dataframe (73s)
   Starting test(python3.8): pyspark.sql.streaming
   Finished test(python3.8): pyspark.sql.functions (75s)
   Starting test(python3.8): pyspark.sql.types
   Finished test(python3.8): pyspark.sql.readwriter (57s)
   Starting test(python3.8): pyspark.sql.udf
   Finished test(python3.8): pyspark.sql.types (13s)
   Starting test(python3.8): pyspark.sql.window
   Finished test(python3.8): pyspark.sql.session (32s)
   Starting test(python3.8): pyspark.streaming.util
   Finished test(python3.8): pyspark.streaming.util (1s)
   Starting test(python3.8): pyspark.util
   Finished test(python3.8): pyspark.util (0s)
   Finished test(python3.8): pyspark.sql.streaming (30s)
   Finished test(python3.8): pyspark.sql.udf (27s)
   Finished test(python3.8): pyspark.sql.window (22s)
   Tests passed in 855 seconds
   ```

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org