You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2019/07/27 03:23:42 UTC

[GitHub] [spark] HyukjinKwon opened a new pull request #25271: [SPARK-28536][SQL][PYTHON][TESTS] Reduce shuffle partitions in Python UDF tests in SQLQueryTestSuite

HyukjinKwon opened a new pull request #25271: [SPARK-28536][SQL][PYTHON][TESTS] Reduce shuffle partitions in Python UDF tests in SQLQueryTestSuite
URL: https://github.com/apache/spark/pull/25271
 
 
   ## What changes were proposed in this pull request?
   
   In Python UDF tests, the number of shuffle partitions matters considerably in the testing time because it requires to fork and communicate between external processes.
   
   See:
   
   ![image](https://user-images.githubusercontent.com/6477701/61989374-465c0080-b069-11e9-9936-b386d0cccf7a.png)
   
   We should check Jenkins results before merging it as well.
   
   ## How was this patch tested?
   
   Manually tested in my local. We should check Jenkins results too.
   
   **Before:**
   
   ```
   [info] SQLQueryTestSuite:
   [info] - udf/udf-window.sql - Scala UDF (58 seconds, 558 milliseconds)
   [info] - udf/udf-window.sql - Regular Python UDF (58 seconds, 371 milliseconds)
   [info] - udf/udf-window.sql - Scalar Pandas UDF (1 minute, 8 seconds)
   ```
   
   **After:**
   
   ```
   [info] SQLQueryTestSuite:
   [info] - udf/udf-window.sql - Scala UDF (14 seconds, 690 milliseconds)
   [info] - udf/udf-window.sql - Regular Python UDF (10 seconds, 467 milliseconds)
   [info] - udf/udf-window.sql - Scalar Pandas UDF (10 seconds, 895 milliseconds)
   ```

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org