- [GitHub] [spark] vitaliili-db commented on pull request #37621: [SPARK-40185][SQL] Remove column suggestion when the candidate list is empty - posted by GitBox <gi...@apache.org> on 2022/09/01 00:16:04 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36606: [SPARK-39232][CORE] History Server Main Page App List Filtering - posted by GitBox <gi...@apache.org> on 2022/09/01 00:21:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37741: [SPARK-40283][INFRA] Bump MiMa's previousSparkVersion to 3.3.0 - posted by GitBox <gi...@apache.org> on 2022/09/01 00:31:06 UTC, 1 replies.
- [GitHub] [spark] JoshRosen closed pull request #37713: [SPARK-40261][CORE]Exclude DirectTaskResult metadata when calculating result size - posted by GitBox <gi...@apache.org> on 2022/09/01 00:39:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37741: [SPARK-40283][INFRA] Bump MiMa's previousSparkVersion to 3.3.0 - posted by GitBox <gi...@apache.org> on 2022/09/01 01:08:19 UTC, 23 replies.
- [GitHub] [spark] itholic commented on pull request #37210: add ignore for the recently added and failing mypy error 'type-var' SPARK-39811 - posted by GitBox <gi...@apache.org> on 2022/09/01 01:14:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37728: [SPARK-40276][CORE] Reduce the result size of RDD.takeOrdered - posted by GitBox <gi...@apache.org> on 2022/09/01 01:19:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37728: [SPARK-40276][CORE] Reduce the result size of RDD.takeOrdered - posted by GitBox <gi...@apache.org> on 2022/09/01 01:19:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37728: [SPARK-40276][CORE] Reduce the result size of RDD.takeOrdered - posted by GitBox <gi...@apache.org> on 2022/09/01 01:32:01 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #37463: [SPARK-40033][SQL] Nested schema pruning support through element_at - posted by GitBox <gi...@apache.org> on 2022/09/01 01:34:32 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #37750: [SPARK-40296] Error class for DISTINCT function not found - posted by GitBox <gi...@apache.org> on 2022/09/01 01:40:36 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #37750: [SPARK-40296] Error class for DISTINCT function not found - posted by GitBox <gi...@apache.org> on 2022/09/01 01:40:45 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #37750: [SPARK-40296] Error class for DISTINCT function not found - posted by GitBox <gi...@apache.org> on 2022/09/01 01:42:22 UTC, 4 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #37612: [SPARK-39915][SQL] Ensure the output partitioning is user-specified in AQE - posted by GitBox <gi...@apache.org> on 2022/09/01 01:59:23 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37612: [SPARK-39915][SQL] Ensure the output partitioning is user-specified in AQE - posted by GitBox <gi...@apache.org> on 2022/09/01 02:07:14 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37483: [SPARK-40112][SQL] Improve the TO_BINARY() function - posted by GitBox <gi...@apache.org> on 2022/09/01 02:18:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37483: [SPARK-40112][SQL] Improve the TO_BINARY() function - posted by GitBox <gi...@apache.org> on 2022/09/01 02:19:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37749: [SPARK-40295][SQL] Allow v2 functions with literal args in write distribution/ordering - posted by GitBox <gi...@apache.org> on 2022/09/01 02:23:46 UTC, 5 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37407: [SPARK-39876][SQL] Add UNPIVOT to SQL syntax - posted by GitBox <gi...@apache.org> on 2022/09/01 02:26:36 UTC, 50 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37520: [SPARK-40098][SQL] Format error messages in the Thrift Server - posted by GitBox <gi...@apache.org> on 2022/09/01 02:39:12 UTC, 0 replies.
- [GitHub] [spark] maryannxue opened a new pull request, #37751: [SPARK-40297][SQL] CTE outer reference nested in CTE main body cannot be resolved - posted by GitBox <gi...@apache.org> on 2022/09/01 02:56:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37737: [SPARK-40055][SQL][FOLLOWUP] CatalogManager.listCatalogs should include spark_catalog - posted by GitBox <gi...@apache.org> on 2022/09/01 02:57:16 UTC, 0 replies.
- [GitHub] [spark] maryannxue commented on pull request #37751: [SPARK-40297][SQL] CTE outer reference nested in CTE main body cannot be resolved - posted by GitBox <gi...@apache.org> on 2022/09/01 02:57:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37737: [SPARK-40055][SQL][FOLLOWUP] CatalogManager.listCatalogs should include spark_catalog - posted by GitBox <gi...@apache.org> on 2022/09/01 02:57:49 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #37746: [SPARK-40293][SQL] Make the V2 table error message more meaningful - posted by GitBox <gi...@apache.org> on 2022/09/01 03:05:19 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #37736: [SPARK-40285][SQL] Simplify the `roundTo[Numeric]` for Spark `Decimal` - posted by GitBox <gi...@apache.org> on 2022/09/01 03:20:21 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37748: [SPARK-40210][PYTHON][CORE] Fix math atan2, hypot, pow and pmod float argument call - posted by GitBox <gi...@apache.org> on 2022/09/01 03:21:19 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37751: [SPARK-40297][SQL] CTE outer reference nested in CTE main body cannot be resolved - posted by GitBox <gi...@apache.org> on 2022/09/01 03:22:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37741: [SPARK-40283][INFRA] Bump MiMa's previousSparkVersion to 3.3.0 - posted by GitBox <gi...@apache.org> on 2022/09/01 03:26:53 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37723: Queries can see group by result - posted by GitBox <gi...@apache.org> on 2022/09/01 03:28:55 UTC, 0 replies.
- [GitHub] [spark] 1zg12 commented on pull request #37738: add Support Java Class with circular references - posted by GitBox <gi...@apache.org> on 2022/09/01 03:35:51 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #37738: add Support Java Class with circular references - posted by GitBox <gi...@apache.org> on 2022/09/01 03:39:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37741: [SPARK-40283][INFRA] Bump MiMa's previousSparkVersion to 3.3.0 - posted by GitBox <gi...@apache.org> on 2022/09/01 04:00:21 UTC, 2 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #37749: [SPARK-40295][SQL] Allow v2 functions with literal args in write distribution/ordering - posted by GitBox <gi...@apache.org> on 2022/09/01 04:28:19 UTC, 8 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37736: [SPARK-40285][SQL] Simplify the `roundTo[Numeric]` for Spark `Decimal` - posted by GitBox <gi...@apache.org> on 2022/09/01 04:41:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37736: [SPARK-40285][SQL] Simplify the `roundTo[Numeric]` for Spark `Decimal` - posted by GitBox <gi...@apache.org> on 2022/09/01 04:41:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37741: [SPARK-40283][INFRA] Bump MiMa's previousSparkVersion to 3.3.0 and clean up expired rules - posted by GitBox <gi...@apache.org> on 2022/09/01 04:43:40 UTC, 8 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37746: [SPARK-40293][SQL] Make the V2 table error message more meaningful - posted by GitBox <gi...@apache.org> on 2022/09/01 04:44:50 UTC, 10 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37741: [SPARK-40283][INFRA] Bump MiMa's previousSparkVersion to 3.3.0 and clean up expired rules - posted by GitBox <gi...@apache.org> on 2022/09/01 04:47:40 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37622: [SPARK-40187][DOCS] Add `Apache YuniKorn` scheduler docs - posted by GitBox <gi...@apache.org> on 2022/09/01 05:25:59 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37622: [SPARK-40187][DOCS] Add `Apache YuniKorn` scheduler docs - posted by GitBox <gi...@apache.org> on 2022/09/01 05:27:00 UTC, 0 replies.
- [GitHub] [spark] linhongliu-db commented on pull request #37742: [SPARK-40291][SQL] Improve the message for column not in group by clause error - posted by GitBox <gi...@apache.org> on 2022/09/01 05:47:45 UTC, 2 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #37749: [SPARK-40295][SQL] Allow v2 functions with literal args in write distribution/ordering - posted by GitBox <gi...@apache.org> on 2022/09/01 06:49:05 UTC, 3 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37533: [SPARK-40096]Fix finalize shuffle stage slow due to connection creation slow - posted by GitBox <gi...@apache.org> on 2022/09/01 06:59:31 UTC, 19 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37752: [SPARK-40301][PYTHON] Add parameter validations in pyspark.rdd - posted by GitBox <gi...@apache.org> on 2022/09/01 07:36:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37744: [SPARK-40300][SQL] Migrate onto the `DATATYPE_MISMATCH` error class - posted by GitBox <gi...@apache.org> on 2022/09/01 07:55:03 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37748: [SPARK-40210][PYTHON][CORE] Fix math atan2, hypot, pow and pmod float argument call - posted by GitBox <gi...@apache.org> on 2022/09/01 08:01:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37746: [SPARK-40293][SQL] Make the V2 table error message more meaningful - posted by GitBox <gi...@apache.org> on 2022/09/01 08:58:49 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #37753: [SPARK-40302][K8S][TESTS] Add YuniKornSuite - posted by GitBox <gi...@apache.org> on 2022/09/01 09:01:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37745: [SPARK-33605][BUILD] Add `gcs-connector` to `hadoop-cloud` module - posted by GitBox <gi...@apache.org> on 2022/09/01 09:29:35 UTC, 10 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #37754: [SPARK-39906][INFRA][FOLLOWGUP] Eliminate build warnings - sbt 0.13 hell syntax is deprecated; use slash syntax instead - posted by GitBox <gi...@apache.org> on 2022/09/01 09:34:14 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37745: [SPARK-33605][BUILD] Add `gcs-connector` to `hadoop-cloud` module - posted by GitBox <gi...@apache.org> on 2022/09/01 09:54:06 UTC, 5 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37742: [SPARK-40291][SQL] Improve the message for column not in group by clause error - posted by GitBox <gi...@apache.org> on 2022/09/01 10:08:32 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37753: [SPARK-40302][K8S][TESTS] Add `YuniKornSuite` - posted by GitBox <gi...@apache.org> on 2022/09/01 10:12:11 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #37755: [SPARK-40304][K8S][TESTS] Add decomTestTag to K8s Integration Test - posted by GitBox <gi...@apache.org> on 2022/09/01 10:14:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37755: [SPARK-40304][K8S][TESTS] Add `decomTestTag` to K8s Integration Test - posted by GitBox <gi...@apache.org> on 2022/09/01 10:18:10 UTC, 2 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2022/09/01 11:15:04 UTC, 1 replies.
- [GitHub] [spark] zero323 commented on a diff in pull request #37748: [SPARK-40210][PYTHON][CORE] Fix math atan2, hypot, pow and pmod float argument call - posted by GitBox <gi...@apache.org> on 2022/09/01 11:15:14 UTC, 1 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2022/09/01 11:21:11 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37756: [SPARK-40305][PS] Implement Groupby.sem - posted by GitBox <gi...@apache.org> on 2022/09/01 11:41:58 UTC, 0 replies.
- [GitHub] [spark] gitlabsam opened a new pull request, #37757: Branch 3.3 sam - posted by GitBox <gi...@apache.org> on 2022/09/01 11:52:04 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #37758: [SPARK-40149][SQL] Propagate metadata columns through Project - posted by GitBox <gi...@apache.org> on 2022/09/01 12:25:38 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37758: [SPARK-40149][SQL] Propagate metadata columns through Project - posted by GitBox <gi...@apache.org> on 2022/09/01 12:26:15 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37739: [SPARK-40265][PS] Fix the inconsistent behavior for Index.intersection. - posted by GitBox <gi...@apache.org> on 2022/09/01 12:45:42 UTC, 2 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #37411: [SPARK-39984][CORE] Check workerLastHeartbeat with master before HeartbeatReceiver expires an executor - posted by GitBox <gi...@apache.org> on 2022/09/01 13:20:15 UTC, 1 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37700: [SPARK-40251][BUILD][MLLIB] Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2 & breeze from 2.0 to 2.1.0 - posted by GitBox <gi...@apache.org> on 2022/09/01 13:29:46 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #37731: [SPARK-40279][DOC] Document spark.yarn.report.interval - posted by GitBox <gi...@apache.org> on 2022/09/01 13:30:19 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37731: [SPARK-40279][DOC] Document spark.yarn.report.interval - posted by GitBox <gi...@apache.org> on 2022/09/01 13:30:46 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37750: [SPARK-40296] Error class for DISTINCT function not found - posted by GitBox <gi...@apache.org> on 2022/09/01 13:39:47 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #37700: [SPARK-40251][BUILD][MLLIB] Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2 & breeze from 2.0 to 2.1.0 - posted by GitBox <gi...@apache.org> on 2022/09/01 13:43:13 UTC, 2 replies.
- [GitHub] [spark] cloud-fan closed pull request #37751: [SPARK-40297][SQL] CTE outer reference nested in CTE main body cannot be resolved - posted by GitBox <gi...@apache.org> on 2022/09/01 14:04:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37751: [SPARK-40297][SQL] CTE outer reference nested in CTE main body cannot be resolved - posted by GitBox <gi...@apache.org> on 2022/09/01 14:07:26 UTC, 2 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #37744: [SPARK-40300][SQL] Migrate onto the `DATATYPE_MISMATCH` error class - posted by GitBox <gi...@apache.org> on 2022/09/01 14:24:16 UTC, 1 replies.
- [GitHub] [spark] peter-toth commented on pull request #37751: [SPARK-40297][SQL] CTE outer reference nested in CTE main body cannot be resolved - posted by GitBox <gi...@apache.org> on 2022/09/01 14:25:21 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #37697: [SPARK-40248][SQL] Use larger number of bits to build Bloom filter - posted by GitBox <gi...@apache.org> on 2022/09/01 14:37:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37700: [SPARK-40251][BUILD][MLLIB] Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2 & breeze from 2.0 to 2.1.0 - posted by GitBox <gi...@apache.org> on 2022/09/01 14:37:18 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #37759: [SPARK-40306][SQL]Support more than Integer.MAX_VALUE of the same join key - posted by GitBox <gi...@apache.org> on 2022/09/01 15:08:05 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #37533: [SPARK-40096]Fix finalize shuffle stage slow due to connection creation slow - posted by GitBox <gi...@apache.org> on 2022/09/01 15:11:57 UTC, 9 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #37760: [SPARK-38404][SQL][3.3] Improve CTE resolution when a nested CTE references an outer CTE - posted by GitBox <gi...@apache.org> on 2022/09/01 15:14:13 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #37760: [SPARK-38404][SQL][3.3] Improve CTE resolution when a nested CTE references an outer CTE - posted by GitBox <gi...@apache.org> on 2022/09/01 15:15:27 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #37759: [SPARK-40306][SQL]Support more than Integer.MAX_VALUE of the same join key - posted by GitBox <gi...@apache.org> on 2022/09/01 15:21:23 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #37745: [SPARK-33605][BUILD] Add `gcs-connector` to `hadoop-cloud` module - posted by GitBox <gi...@apache.org> on 2022/09/01 15:48:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37745: [SPARK-33605][BUILD] Add `gcs-connector` to `hadoop-cloud` module - posted by GitBox <gi...@apache.org> on 2022/09/01 16:08:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37753: [SPARK-40302][K8S][TESTS] Add `YuniKornSuite` - posted by GitBox <gi...@apache.org> on 2022/09/01 16:27:16 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #37755: [SPARK-40304][K8S][TESTS] Add `decomTestTag` to K8s Integration Test - posted by GitBox <gi...@apache.org> on 2022/09/01 16:32:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37755: [SPARK-40304][K8S][TESTS] Add `decomTestTag` to K8s Integration Test - posted by GitBox <gi...@apache.org> on 2022/09/01 16:35:17 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37744: [SPARK-40300][SQL] Migrate onto the `DATATYPE_MISMATCH` error class - posted by GitBox <gi...@apache.org> on 2022/09/01 16:52:37 UTC, 10 replies.
- [GitHub] [spark] santosh-d3vpl3x opened a new pull request, #37761: Add withColumnsRenamed to scala API of spark - posted by GitBox <gi...@apache.org> on 2022/09/01 17:27:07 UTC, 0 replies.
- [GitHub] [spark] sigmod commented on a diff in pull request #37697: [SPARK-40248][SQL] Use larger number of bits to build Bloom filter - posted by GitBox <gi...@apache.org> on 2022/09/01 17:40:28 UTC, 0 replies.
- [GitHub] [spark] leewyang commented on a diff in pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2022/09/01 18:18:51 UTC, 8 replies.
- [GitHub] [spark] steveloughran commented on a diff in pull request #37745: [SPARK-33605][BUILD] Add `gcs-connector` to `hadoop-cloud` module - posted by GitBox <gi...@apache.org> on 2022/09/01 18:20:11 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #37762: SPARK-39996[BUILD] Upgrade 'postgresql' to 42.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/01 18:20:16 UTC, 0 replies.
- [GitHub] [spark] yangwwei commented on pull request #37622: [SPARK-40187][DOCS] Add `Apache YuniKorn` scheduler docs - posted by GitBox <gi...@apache.org> on 2022/09/01 18:28:35 UTC, 0 replies.
- [GitHub] [spark] yangwwei commented on pull request #37753: [SPARK-40302][K8S][TESTS] Add `YuniKornSuite` - posted by GitBox <gi...@apache.org> on 2022/09/01 18:30:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37745: [SPARK-33605][BUILD] Add `gcs-connector` to `hadoop-cloud` module - posted by GitBox <gi...@apache.org> on 2022/09/01 21:30:44 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #37745: [SPARK-33605][BUILD] Add `gcs-connector` to `hadoop-cloud` module - posted by GitBox <gi...@apache.org> on 2022/09/01 21:35:25 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #37763: [SPARK-40308][SQL] Allow non-foldable delimiter arguments to `str_to_map` function - posted by GitBox <gi...@apache.org> on 2022/09/01 22:34:48 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on a diff in pull request #37763: [SPARK-40308][SQL] Allow non-foldable delimiter arguments to `str_to_map` function - posted by GitBox <gi...@apache.org> on 2022/09/01 22:37:34 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #37764: [SPARK-40310][SQL] try_sum() should throw the exceptions from its child - posted by GitBox <gi...@apache.org> on 2022/09/01 23:24:23 UTC, 0 replies.
- [GitHub] [spark] lyssg commented on pull request #35667: [SPARK-38425][K8S] Avoid possible errors due to incorrect file size or type supplied in hadoop conf - posted by GitBox <gi...@apache.org> on 2022/09/02 01:07:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37756: [SPARK-40305][PS] Implement Groupby.sem - posted by GitBox <gi...@apache.org> on 2022/09/02 02:49:27 UTC, 2 replies.
- [GitHub] [spark] hgs19921112 opened a new pull request, #37765: [SPARK-40288][SQL]After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression. - posted by GitBox <gi...@apache.org> on 2022/09/02 03:38:11 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #37756: [SPARK-40305][PS] Implement Groupby.sem - posted by GitBox <gi...@apache.org> on 2022/09/02 04:10:19 UTC, 2 replies.
- [GitHub] [spark] hgs19921112 commented on pull request #37765: [SPARK-40288][SQL]After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression. - posted by GitBox <gi...@apache.org> on 2022/09/02 04:27:26 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 closed pull request #37765: [SPARK-40288][SQL]After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression. - posted by GitBox <gi...@apache.org> on 2022/09/02 04:29:14 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 opened a new pull request, #37766: [SPARK-40288][SQL]After RemoveRedundantAggregates, PullOutGroupingExpressions should applied to avoid attribute missing when use complex expression. - posted by GitBox <gi...@apache.org> on 2022/09/02 04:34:13 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 commented on pull request #37766: [SPARK-40288][SQL]After RemoveRedundantAggregates, PullOutGroupingExpressions should applied to avoid attribute missing when use complex expression. - posted by GitBox <gi...@apache.org> on 2022/09/02 04:35:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37756: [SPARK-40305][PS] Implement Groupby.sem - posted by GitBox <gi...@apache.org> on 2022/09/02 04:39:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37767: [SPARK-39284][FOLLOW] Add Groupby.mad to API references - posted by GitBox <gi...@apache.org> on 2022/09/02 04:55:18 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #37746: [SPARK-40293][SQL] Make the V2 table error message more meaningful - posted by GitBox <gi...@apache.org> on 2022/09/02 05:57:59 UTC, 3 replies.
- [GitHub] [spark] kevin85421 commented on a diff in pull request #37411: [SPARK-39984][CORE] Check workerLastHeartbeat with master before HeartbeatReceiver expires an executor - posted by GitBox <gi...@apache.org> on 2022/09/02 06:27:03 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37744: [SPARK-40300][SQL] Migrate onto the `DATATYPE_MISMATCH` error class - posted by GitBox <gi...@apache.org> on 2022/09/02 07:29:43 UTC, 6 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37752: [SPARK-40301][PYTHON] Add parameter validations in pyspark.rdd - posted by GitBox <gi...@apache.org> on 2022/09/02 08:20:38 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37766: [SPARK-40288][SQL]After RemoveRedundantAggregates, PullOutGroupingExpressions should applied to avoid attribute missing when use complex expression. - posted by GitBox <gi...@apache.org> on 2022/09/02 08:44:16 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on a diff in pull request #37724: [SPARK-40273][PYTHON][DOCS] Fix the documents "Contributing and Maintaining Type Hints". - posted by GitBox <gi...@apache.org> on 2022/09/02 09:04:01 UTC, 1 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #37610: [SPARK-38888][BUILD][CORE][YARN][DOCS] Add `RocksDB` support for shuffle state store - posted by GitBox <gi...@apache.org> on 2022/09/02 09:13:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37768: [SPARK-40313][PS] Make `ps.DataFrame(data, index)` support the same anchor - posted by GitBox <gi...@apache.org> on 2022/09/02 09:22:28 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #37769: [SPARK-40312][CORE][DOCS] Add missing configuration documentation in Spark History Server - posted by GitBox <gi...@apache.org> on 2022/09/02 09:22:52 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on a diff in pull request #37748: [SPARK-40210][PYTHON] Fix math atan2, hypot, pow and pmod float argument call - posted by GitBox <gi...@apache.org> on 2022/09/02 09:23:58 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37748: [SPARK-40210][PYTHON] Fix math atan2, hypot, pow and pmod float argument call - posted by GitBox <gi...@apache.org> on 2022/09/02 09:25:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37748: [SPARK-40210][PYTHON] Fix math atan2, hypot, pow and pmod float argument call - posted by GitBox <gi...@apache.org> on 2022/09/02 09:25:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37748: [SPARK-40210][PYTHON] Fix math atan2, hypot, pow and pmod float argument call - posted by GitBox <gi...@apache.org> on 2022/09/02 09:26:58 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37767: [SPARK-39284][FOLLOW] Add Groupby.mad to API references - posted by GitBox <gi...@apache.org> on 2022/09/02 09:31:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37767: [SPARK-39284][FOLLOW] Add Groupby.mad to API references - posted by GitBox <gi...@apache.org> on 2022/09/02 09:32:11 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37752: [SPARK-40301][PYTHON] Add parameter validations in pyspark.rdd - posted by GitBox <gi...@apache.org> on 2022/09/02 09:39:09 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on a diff in pull request #37748: [SPARK-40210][PYTHON] Fix math atan2, hypot, pow and pmod float argument call - posted by GitBox <gi...@apache.org> on 2022/09/02 10:03:32 UTC, 0 replies.
- [GitHub] [spark] Resol1992 commented on a diff in pull request #37404: [SPARK-39866][SQL] Memory leak when closing a session of Spark ThriftServer - posted by GitBox <gi...@apache.org> on 2022/09/02 10:04:47 UTC, 0 replies.
- [GitHub] [spark] steveloughran commented on pull request #37745: [SPARK-33605][BUILD] Add `gcs-connector` to `hadoop-cloud` module - posted by GitBox <gi...@apache.org> on 2022/09/02 11:13:06 UTC, 1 replies.
- [GitHub] [spark] Kimahriman opened a new pull request, #37770: [SPARK-40314][SQL] Add scala and python bindings for inline and inline_outer - posted by GitBox <gi...@apache.org> on 2022/09/02 11:22:21 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37762: SPARK-39996[BUILD] Upgrade `postgresql` to 42.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/02 11:25:10 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37761: [SPARK-40311][SQL] Add withColumnsRenamed to scala API of spark - posted by GitBox <gi...@apache.org> on 2022/09/02 11:25:13 UTC, 0 replies.
- [GitHub] [spark] sos3k commented on pull request #37413: [SPARK-39983][CORE][SQL] Do not cache unserialized broadcast relations on the driver - posted by GitBox <gi...@apache.org> on 2022/09/02 12:07:19 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on pull request #37610: [SPARK-38888][BUILD][CORE][YARN][DOCS] Add `RocksDB` support for shuffle state store - posted by GitBox <gi...@apache.org> on 2022/09/02 12:12:42 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #36200: [SPARK-38909][CORE][YARN] Encapsulate `LevelDB` used to store remote/external shuffle state as `DB` - posted by GitBox <gi...@apache.org> on 2022/09/02 12:20:03 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #37648: [SPARK-38909][BUILD][CORE][YARN][FOLLOWUP] Make some code cleanup related to shuffle state db - posted by GitBox <gi...@apache.org> on 2022/09/02 12:22:28 UTC, 0 replies.
- [GitHub] [spark] c27kwan opened a new pull request, #37771: [SPARK-40315] Add equals() and hashCode() to ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/02 12:38:25 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37648: [SPARK-38909][BUILD][CORE][YARN][FOLLOWUP] Make some code cleanup related to shuffle state db - posted by GitBox <gi...@apache.org> on 2022/09/02 12:44:50 UTC, 11 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37769: [SPARK-40312][CORE][DOCS] Add missing configuration documentation in Spark History Server - posted by GitBox <gi...@apache.org> on 2022/09/02 12:54:24 UTC, 0 replies.
- [GitHub] [spark] c27kwan commented on a diff in pull request #37771: [SPARK-40315][SQL] Add equals() and hashCode() to ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/02 12:58:56 UTC, 2 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37759: [SPARK-40306][SQL]Support more than Integer.MAX_VALUE of the same join key - posted by GitBox <gi...@apache.org> on 2022/09/02 13:10:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37772: [SPARK-40163][CORE][TESTS][FOLLOWUP] Use Junit `Assert` api instead of Java `assert` - posted by GitBox <gi...@apache.org> on 2022/09/02 13:46:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37772: [SPARK-40163][CORE][TESTS][FOLLOWUP] Use Junit `Assert` api instead of Java `assert` in `JavaSparkSessionSuite.java` - posted by GitBox <gi...@apache.org> on 2022/09/02 13:48:17 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37773: [SPARK-40098][SQL][FOLLOWUP] Revert the pretty format of error messages in the Thrift Server - posted by GitBox <gi...@apache.org> on 2022/09/02 13:48:43 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #37759: [SPARK-40306][SQL]Support more than Integer.MAX_VALUE of the same join key - posted by GitBox <gi...@apache.org> on 2022/09/02 14:12:23 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37672: [SPARK-40228][SQL] Do not simplify multiLike if child is not a cheap expression - posted by GitBox <gi...@apache.org> on 2022/09/02 14:45:06 UTC, 2 replies.
- [GitHub] [spark] wangyum closed pull request #37672: [SPARK-40228][SQL] Do not simplify multiLike if child is not a cheap expression - posted by GitBox <gi...@apache.org> on 2022/09/02 14:45:22 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37757: Branch 3.3 sam - posted by GitBox <gi...@apache.org> on 2022/09/02 14:56:19 UTC, 0 replies.
- [GitHub] [spark] steven-aerts commented on a diff in pull request #36506: [SPARK-25050][SQL] Avro: writing complex unions - posted by GitBox <gi...@apache.org> on 2022/09/02 15:02:32 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37771: [SPARK-40315][SQL] Add equals() and hashCode() to ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/02 15:35:55 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on pull request #37764: [SPARK-40310][SQL] try_sum() should throw the exceptions from its child - posted by GitBox <gi...@apache.org> on 2022/09/02 16:13:28 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #37764: [SPARK-40310][SQL] try_sum() should throw the exceptions from its child - posted by GitBox <gi...@apache.org> on 2022/09/02 16:14:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37750: [SPARK-40296] Error class for DISTINCT function not found - posted by GitBox <gi...@apache.org> on 2022/09/02 16:14:44 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37773: [SPARK-40098][SQL][FOLLOWUP] Revert the pretty format of error messages in the Thrift Server - posted by GitBox <gi...@apache.org> on 2022/09/02 16:23:11 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37773: [SPARK-40098][SQL][FOLLOWUP] Revert the pretty format of error messages in the Thrift Server - posted by GitBox <gi...@apache.org> on 2022/09/02 16:29:56 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37754: [SPARK-39906][INFRA][FOLLOWGUP] Eliminate build warnings - sbt 0.13 hell syntax is deprecated; use slash syntax instead - posted by GitBox <gi...@apache.org> on 2022/09/02 16:41:50 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37700: [SPARK-40251][BUILD][MLLIB] Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2 & breeze from 2.0 to 2.1.0 - posted by GitBox <gi...@apache.org> on 2022/09/02 16:42:28 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37700: [SPARK-40251][BUILD][MLLIB] Upgrade dev.ludovic.netlib from 2.2.1 to 3.0.2 & breeze from 2.0 to 2.1.0 - posted by GitBox <gi...@apache.org> on 2022/09/02 16:42:55 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #37752: [SPARK-40301][PYTHON] Add parameter validations in pyspark.rdd - posted by GitBox <gi...@apache.org> on 2022/09/02 16:50:51 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37763: [SPARK-40308][SQL] Allow non-foldable delimiter arguments to `str_to_map` function - posted by GitBox <gi...@apache.org> on 2022/09/02 17:29:35 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37750: [SPARK-40296] Error class for DISTINCT function not found - posted by GitBox <gi...@apache.org> on 2022/09/02 17:34:08 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37754: [SPARK-39906][INFRA][FOLLOWGUP] Eliminate build warnings - sbt 0.13 hell syntax is deprecated; use slash syntax instead - posted by GitBox <gi...@apache.org> on 2022/09/02 18:12:44 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37754: [SPARK-39906][INFRA][FOLLOWGUP] Eliminate build warnings - sbt 0.13 hell syntax is deprecated; use slash syntax instead - posted by GitBox <gi...@apache.org> on 2022/09/02 18:13:55 UTC, 0 replies.
- [GitHub] [spark] otterc commented on a diff in pull request #37533: [SPARK-40096]Fix finalize shuffle stage slow due to connection creation slow - posted by GitBox <gi...@apache.org> on 2022/09/02 18:14:28 UTC, 4 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #37758: [SPARK-40149][SQL] Propagate metadata columns through Project - posted by GitBox <gi...@apache.org> on 2022/09/02 18:26:39 UTC, 2 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37750: [SPARK-40296] Error class for DISTINCT function not found - posted by GitBox <gi...@apache.org> on 2022/09/02 18:27:15 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov opened a new pull request, #37774: [SPARK-40210][PYTHON][TEST]Follow-up to speed up new tests using one action instead of many - posted by GitBox <gi...@apache.org> on 2022/09/02 18:40:17 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on pull request #37774: [SPARK-40210][PYTHON][TEST]Follow-up to speed up new tests using one action instead of many - posted by GitBox <gi...@apache.org> on 2022/09/02 19:33:39 UTC, 1 replies.
- [GitHub] [spark] seunggabi commented on pull request #37772: [SPARK-40163][SQL][TESTS][FOLLOWUP] Use Junit `Assert` api instead of Java `assert` in `JavaSparkSessionSuite.java` - posted by GitBox <gi...@apache.org> on 2022/09/02 19:48:32 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37772: [SPARK-40163][SQL][TESTS][FOLLOWUP] Use Junit `Assert` api instead of Java `assert` in `JavaSparkSessionSuite.java` - posted by GitBox <gi...@apache.org> on 2022/09/02 19:49:30 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37772: [SPARK-40163][SQL][TESTS][FOLLOWUP] Use Junit `Assert` api instead of Java `assert` in `JavaSparkSessionSuite.java` - posted by GitBox <gi...@apache.org> on 2022/09/02 19:49:42 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37748: [SPARK-40210][PYTHON] Fix math atan2, hypot, pow and pmod float argument call - posted by GitBox <gi...@apache.org> on 2022/09/02 20:12:54 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #37775: [SPARK-40318][SQL] try_avg() should throw the exceptions from its child - posted by GitBox <gi...@apache.org> on 2022/09/02 20:33:45 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #37776: [SPARK-40319][SQL] Remove duplicated query execution error method for PARSE_DATETIME_BY_NEW_PARSER - posted by GitBox <gi...@apache.org> on 2022/09/02 21:16:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37741: [SPARK-40283][INFRA] Make MiMa check default exclude `private object` and bump `previousSparkVersion` to 3.3.0 - posted by GitBox <gi...@apache.org> on 2022/09/02 21:18:07 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #37777: [SPARK-40309][PYTHON] Introduce `sql_conf` context manager for `pyspark.sql` - posted by GitBox <gi...@apache.org> on 2022/09/02 21:23:19 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #37777: [SPARK-40309][PYTHON] Introduce `sql_conf` context manager for `pyspark.sql` - posted by GitBox <gi...@apache.org> on 2022/09/02 21:32:16 UTC, 1 replies.
- [GitHub] [spark] linhongliu-db commented on pull request #37532: [SPARK-39989][SQL][FollowUp] Improve foldable expression stats estimate for string and binary - posted by GitBox <gi...@apache.org> on 2022/09/02 21:34:11 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #37777: [SPARK-40309][PYTHON] Introduce `sql_conf` context manager for `pyspark.sql` - posted by GitBox <gi...@apache.org> on 2022/09/02 21:46:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37741: [SPARK-40283][INFRA] Make MiMa check default exclude `private object` and bump `previousSparkVersion` to 3.3.0 - posted by GitBox <gi...@apache.org> on 2022/09/02 21:46:34 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #37747: [SPARK-40280][SQL] Add support for parquet push down for annotated int and long - posted by GitBox <gi...@apache.org> on 2022/09/02 22:06:36 UTC, 1 replies.
- [GitHub] [spark] huaxingao commented on pull request #37747: [SPARK-40280][SQL] Add support for parquet push down for annotated int and long - posted by GitBox <gi...@apache.org> on 2022/09/02 22:12:15 UTC, 1 replies.
- [GitHub] [spark] edmondo1984 opened a new pull request, #37778: Unsafe loop - posted by GitBox <gi...@apache.org> on 2022/09/02 23:34:52 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 closed pull request #37766: [SPARK-40288][SQL]After RemoveRedundantAggregates, PullOutGroupingExpressions should applied to avoid attribute missing when use complex expression. - posted by GitBox <gi...@apache.org> on 2022/09/03 01:22:59 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37776: [SPARK-40319][SQL] Remove duplicated query execution error method for PARSE_DATETIME_BY_NEW_PARSER - posted by GitBox <gi...@apache.org> on 2022/09/03 05:01:46 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37776: [SPARK-40319][SQL] Remove duplicated query execution error method for PARSE_DATETIME_BY_NEW_PARSER - posted by GitBox <gi...@apache.org> on 2022/09/03 05:02:41 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #37780: [SPARK-39414][BUILD][FOLLOWUP] Update Scala to 2.12.16 in doc - posted by GitBox <gi...@apache.org> on 2022/09/03 05:05:50 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #37532: [SPARK-39989][SQL][FollowUp] Improve foldable expression stats estimate for string and binary - posted by GitBox <gi...@apache.org> on 2022/09/03 05:18:08 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #37769: [SPARK-40312][CORE][DOCS] Add missing configuration documentation in Spark History Server - posted by GitBox <gi...@apache.org> on 2022/09/03 05:22:16 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37769: [SPARK-40312][CORE][DOCS] Add missing configuration documentation in Spark History Server - posted by GitBox <gi...@apache.org> on 2022/09/03 05:22:40 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37763: [SPARK-40308][SQL] Allow non-foldable delimiter arguments to `str_to_map` function - posted by GitBox <gi...@apache.org> on 2022/09/03 05:32:25 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37746: [SPARK-40293][SQL] Make the V2 table error message more meaningful - posted by GitBox <gi...@apache.org> on 2022/09/03 06:09:20 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37752: [SPARK-40301][PYTHON] Add parameter validations in pyspark.rdd - posted by GitBox <gi...@apache.org> on 2022/09/03 06:27:33 UTC, 1 replies.
- [GitHub] [spark] viirya closed pull request #37463: [SPARK-40033][SQL] Nested schema pruning support through element_at - posted by GitBox <gi...@apache.org> on 2022/09/03 06:33:40 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #37463: [SPARK-40033][SQL] Nested schema pruning support through element_at - posted by GitBox <gi...@apache.org> on 2022/09/03 06:35:08 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #37780: [SPARK-39414][BUILD][FOLLOWUP] Update Scala to 2.12.16 in doc - posted by GitBox <gi...@apache.org> on 2022/09/03 06:43:31 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37780: [SPARK-39414][BUILD][FOLLOWUP] Update Scala to 2.12.16 in doc - posted by GitBox <gi...@apache.org> on 2022/09/03 06:43:41 UTC, 0 replies.
- [GitHub] [spark] Yikun closed pull request #37781: static image - posted by GitBox <gi...@apache.org> on 2022/09/03 07:23:27 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #37729: Revert "[SPARK-33861][SQL] Simplify conditional in predicate" - posted by GitBox <gi...@apache.org> on 2022/09/03 07:31:30 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37729: Revert "[SPARK-33861][SQL] Simplify conditional in predicate" - posted by GitBox <gi...@apache.org> on 2022/09/03 08:48:33 UTC, 1 replies.
- [GitHub] [spark] khalidmammadov commented on pull request #37774: [SPARK-40210][PYTHON][FOLLOW-UP][TEST] Speed up new tests using one action instead of many - posted by GitBox <gi...@apache.org> on 2022/09/03 10:28:24 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 opened a new pull request, #37782: [SPARK-40288][SQL]After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/03 12:06:28 UTC, 0 replies.
- [GitHub] [spark] zero323 closed pull request #37774: [SPARK-40210][PYTHON][FOLLOW-UP][TEST] Speed up new tests using one action instead of many - posted by GitBox <gi...@apache.org> on 2022/09/03 12:19:16 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on pull request #37774: [SPARK-40210][PYTHON][FOLLOW-UP][TEST] Speed up new tests using one action instead of many - posted by GitBox <gi...@apache.org> on 2022/09/03 12:19:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37783: [SPARK-40321][BUILD] Upgrade rocksdbjni to 7.5.3 - posted by GitBox <gi...@apache.org> on 2022/09/03 13:03:06 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 closed pull request #37782: [SPARK-40288][SQL]After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/03 13:39:10 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37724: [SPARK-40273][PYTHON][DOCS] Fix the documents "Contributing and Maintaining Type Hints". - posted by GitBox <gi...@apache.org> on 2022/09/03 15:00:32 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37778: Unsafe loop - posted by GitBox <gi...@apache.org> on 2022/09/03 15:01:16 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37778: Unsafe loop - posted by GitBox <gi...@apache.org> on 2022/09/03 15:01:28 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37757: Branch 3.3 sam - posted by GitBox <gi...@apache.org> on 2022/09/03 15:02:04 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37762: [SPARK-39996][BUILD] Upgrade `postgresql` to 42.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/03 15:02:47 UTC, 1 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37771: [SPARK-40315][SQL] Add equals() and hashCode() to ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/03 15:05:43 UTC, 1 replies.
- [GitHub] [spark] hgs19921112 opened a new pull request, #37784: [SPARK-40288][SQL]After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/03 17:37:01 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 closed pull request #37784: [SPARK-40288][SQL]After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/03 17:42:30 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 opened a new pull request, #37785: [SPARK-40288][SQL]After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/03 17:45:09 UTC, 1 replies.
- [GitHub] [spark] hgs19921112 commented on pull request #37785: [SPARK-40288][SQL]After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/03 17:55:03 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on pull request #37413: [SPARK-39983][CORE][SQL] Do not cache unserialized broadcast relations on the driver - posted by GitBox <gi...@apache.org> on 2022/09/03 18:04:11 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov opened a new pull request, #37786: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 5, ~28 functions) - posted by GitBox <gi...@apache.org> on 2022/09/03 18:52:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37763: [SPARK-40308][SQL] Allow non-foldable delimiter arguments to `str_to_map` function - posted by GitBox <gi...@apache.org> on 2022/09/03 20:05:57 UTC, 0 replies.
- [GitHub] [spark] santosh-d3vpl3x commented on pull request #37761: [SPARK-40311][SQL][CORE][Python] Add withColumnsRenamed to scala and pyspark API - posted by GitBox <gi...@apache.org> on 2022/09/03 20:07:51 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37786: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 5, ~28 functions) - posted by GitBox <gi...@apache.org> on 2022/09/04 00:01:18 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37785: [SPARK-40288][SQL]After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/04 00:01:21 UTC, 0 replies.
- [GitHub] [spark] williamhyun opened a new pull request, #37787: [SPARK-40323][BUILD] Update ORC to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/09/04 00:19:30 UTC, 0 replies.
- [GitHub] [spark] IrishBird commented on pull request #34684: [SPARK-37442][SQL] InMemoryRelation statistics bug causing broadcast join failures with AQE enabled - posted by GitBox <gi...@apache.org> on 2022/09/04 01:12:19 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37779: [SPARK-40320][Core] Executor should exit when it failed to initialize for fatal error - posted by GitBox <gi...@apache.org> on 2022/09/04 01:45:22 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37777: [WIP][SPARK-40309][PYTHON][PS] Introduce `sql_conf` context manager for `pyspark.sql` - posted by GitBox <gi...@apache.org> on 2022/09/04 01:45:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37787: [SPARK-40323][BUILD] Update ORC to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/09/04 02:26:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37729: Revert "[SPARK-33861][SQL] Simplify conditional in predicate" - posted by GitBox <gi...@apache.org> on 2022/09/04 02:28:44 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37648: [SPARK-38909][BUILD][CORE][YARN][FOLLOWUP] Make some code cleanup related to shuffle state db - posted by GitBox <gi...@apache.org> on 2022/09/04 03:14:58 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37624: [SPARK-40186][CORE][YARN] Ensure `mergedShuffleCleaner` have been shutdown before `db` close - posted by GitBox <gi...@apache.org> on 2022/09/04 03:15:48 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37771: [SPARK-40315][SQL] Add equals() and hashCode() to ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/04 03:34:04 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37770: [SPARK-40314][SQL] Add scala and python bindings for inline and inline_outer - posted by GitBox <gi...@apache.org> on 2022/09/04 03:34:08 UTC, 0 replies.
- [GitHub] [spark] SelfImpr001 commented on a diff in pull request #37732: [SPARK-40253] [SQL] Fixed loss of precision for writing 0.00 specific… - posted by GitBox <gi...@apache.org> on 2022/09/04 05:24:40 UTC, 2 replies.
- [GitHub] [spark] hgs19921112 closed pull request #37785: [SPARK-40288][SQL]After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/04 06:12:18 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37621: [SPARK-40185][SQL] Remove column suggestion when the candidate list is empty - posted by GitBox <gi...@apache.org> on 2022/09/04 06:18:55 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37756: [SPARK-40305][PS] Implement Groupby.sem - posted by GitBox <gi...@apache.org> on 2022/09/04 06:26:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37752: [SPARK-40301][PYTHON] Add parameter validations in pyspark.rdd - posted by GitBox <gi...@apache.org> on 2022/09/04 06:29:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37774: [SPARK-40210][PYTHON][FOLLOW-UP][TEST] Speed up new tests using one action instead of many - posted by GitBox <gi...@apache.org> on 2022/09/04 06:49:49 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on pull request #37786: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 5, ~28 functions) - posted by GitBox <gi...@apache.org> on 2022/09/04 07:44:29 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 opened a new pull request, #37788: Fix SPARK-40288 - posted by GitBox <gi...@apache.org> on 2022/09/04 08:33:58 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 closed pull request #37788: [SPARK-40288][SQL] After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/04 08:54:12 UTC, 0 replies.
- [GitHub] [spark] jyong-somnambulist opened a new pull request, #37789: Branch 3.3 - posted by GitBox <gi...@apache.org> on 2022/09/04 08:55:35 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 opened a new pull request, #37790: [SPARK-40288][SQL] After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/04 08:56:25 UTC, 2 replies.
- [GitHub] [spark] hgs19921112 closed pull request #37790: [SPARK-40288][SQL] After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/04 08:58:41 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #37791: [SPARK-40251][MLLIB][DOCS][FOLLOWUP] Update doc for mllib linear algebra acceleration - posted by GitBox <gi...@apache.org> on 2022/09/04 09:36:08 UTC, 0 replies.
- [GitHub] [spark] jyong-somnambulist opened a new pull request, #37792: add sparksql wirte mysql support update ,the design from replace into… - posted by GitBox <gi...@apache.org> on 2022/09/04 10:14:33 UTC, 0 replies.
- [GitHub] [spark] jyong-somnambulist commented on pull request #37792: add sparksql wirte mysql support update ,the design from replace into… - posted by GitBox <gi...@apache.org> on 2022/09/04 10:14:49 UTC, 0 replies.
- [GitHub] [spark] jyong-somnambulist closed pull request #37792: add sparksql wirte mysql support update ,the design from replace into… - posted by GitBox <gi...@apache.org> on 2022/09/04 10:15:22 UTC, 0 replies.
- [GitHub] [spark] jyong-somnambulist opened a new pull request, #37793: add sparksql wirte mysql support update ,the design from replace into… - posted by GitBox <gi...@apache.org> on 2022/09/04 10:19:30 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37794: [WIP][SPARK-40324][SQL] Provide a query context of `ParseException` - posted by GitBox <gi...@apache.org> on 2022/09/04 10:58:29 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37762: [SPARK-39996][BUILD] Upgrade `postgresql` to 42.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/04 13:25:30 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37783: [SPARK-40321][BUILD] Upgrade rocksdbjni to 7.5.3 - posted by GitBox <gi...@apache.org> on 2022/09/04 13:27:35 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37783: [SPARK-40321][BUILD] Upgrade rocksdbjni to 7.5.3 - posted by GitBox <gi...@apache.org> on 2022/09/04 13:27:35 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37732: [SPARK-40253] [SQL] Fixed loss of precision for writing 0.00 specific… - posted by GitBox <gi...@apache.org> on 2022/09/04 13:28:35 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #37789: Branch 3.3 - posted by GitBox <gi...@apache.org> on 2022/09/04 13:29:43 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37791: [SPARK-40251][MLLIB][DOCS][FOLLOWUP] Update doc for mllib linear algebra acceleration - posted by GitBox <gi...@apache.org> on 2022/09/04 13:30:27 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37791: [SPARK-40251][MLLIB][DOCS][FOLLOWUP] Update doc for mllib linear algebra acceleration - posted by GitBox <gi...@apache.org> on 2022/09/04 13:30:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37783: [SPARK-40321][BUILD] Upgrade rocksdbjni to 7.5.3 - posted by GitBox <gi...@apache.org> on 2022/09/04 13:41:02 UTC, 0 replies.
- [GitHub] [spark] StephenQQ opened a new pull request, #37795: fix the question of SparkSQL call iceberg's expire_snapshots procedur… - posted by GitBox <gi...@apache.org> on 2022/09/04 13:47:33 UTC, 0 replies.
- [GitHub] [spark] StephenQQ commented on pull request #37795: fix the question of SparkSQL call iceberg's expire_snapshots procedur… - posted by GitBox <gi...@apache.org> on 2022/09/04 13:49:46 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #37796: [SPARK-40326][BUILD] Upgrade `fasterxml.jackson.version` to 2.13.4 - posted by GitBox <gi...@apache.org> on 2022/09/04 17:14:54 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #37796: [SPARK-40326][BUILD] Upgrade `fasterxml.jackson.version` to 2.13.4 - posted by GitBox <gi...@apache.org> on 2022/09/04 17:17:06 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #37795: fix the question of SparkSQL call iceberg's expire_snapshots procedur… - posted by GitBox <gi...@apache.org> on 2022/09/04 17:42:45 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37743: [SPARK-40294][SQL] Fix repeat calls to `PartitionReader.hasNext` timing out - posted by GitBox <gi...@apache.org> on 2022/09/04 19:32:07 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov opened a new pull request, #37797: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 6, ~50 functions) - posted by GitBox <gi...@apache.org> on 2022/09/04 20:06:38 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on pull request #37797: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 6, ~50 functions) - posted by GitBox <gi...@apache.org> on 2022/09/04 20:08:15 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi closed pull request #37749: [SPARK-40295][SQL] Allow v2 functions with literal args in write distribution/ordering - posted by GitBox <gi...@apache.org> on 2022/09/04 20:28:03 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #37749: [SPARK-40295][SQL] Allow v2 functions with literal args in write distribution/ordering - posted by GitBox <gi...@apache.org> on 2022/09/04 20:28:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37761: [SPARK-40311][SQL][CORE][PYTHON] Add withColumnsRenamed to scala and pyspark API - posted by GitBox <gi...@apache.org> on 2022/09/05 00:03:46 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #37621: [SPARK-40185][SQL] Remove column suggestion when the candidate list is empty - posted by GitBox <gi...@apache.org> on 2022/09/05 00:26:45 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #37635: [SPARK-40131][PYTHON] Support NumPy ndarray in built-in functions - posted by GitBox <gi...@apache.org> on 2022/09/05 01:08:35 UTC, 4 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #37635: [SPARK-40131][PYTHON] Support NumPy ndarray in built-in functions - posted by GitBox <gi...@apache.org> on 2022/09/05 01:08:59 UTC, 1 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #37758: [SPARK-40149][SQL] Propagate metadata columns through Project - posted by GitBox <gi...@apache.org> on 2022/09/05 01:16:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37635: [SPARK-40131][PYTHON] Support NumPy ndarray in built-in functions - posted by GitBox <gi...@apache.org> on 2022/09/05 01:19:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37777: [WIP][SPARK-40309][PYTHON][PS] Introduce `sql_conf` context manager for `pyspark.sql` - posted by GitBox <gi...@apache.org> on 2022/09/05 02:30:00 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37724: [SPARK-40273][PYTHON][DOCS] Fix the documents "Contributing and Maintaining Type Hints". - posted by GitBox <gi...@apache.org> on 2022/09/05 02:46:52 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #37796: [SPARK-40326][BUILD] Upgrade `fasterxml.jackson.version` to 2.13.4 - posted by GitBox <gi...@apache.org> on 2022/09/05 03:38:23 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #37732: [SPARK-40253] [SQL] Fixed loss of precision for writing 0.00 specific… - posted by GitBox <gi...@apache.org> on 2022/09/05 03:41:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37786: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 5, ~28 functions) - posted by GitBox <gi...@apache.org> on 2022/09/05 03:54:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37786: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 5, ~28 functions) - posted by GitBox <gi...@apache.org> on 2022/09/05 03:54:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37797: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 6, ~50 functions) - posted by GitBox <gi...@apache.org> on 2022/09/05 03:55:44 UTC, 7 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37797: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 6, ~50 functions) - posted by GitBox <gi...@apache.org> on 2022/09/05 04:00:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #37798: [SPARK-40271][PYTHON][TESTS][FOLLOW-UP] Make test_lit_list test pass with ANSI mode on - posted by GitBox <gi...@apache.org> on 2022/09/05 04:16:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37798: [SPARK-40271][PYTHON][TESTS][FOLLOW-UP] Make test_lit_list test pass with ANSI mode on - posted by GitBox <gi...@apache.org> on 2022/09/05 04:16:13 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37793: add sparksql wirte mysql support update ,the design from replace into… - posted by GitBox <gi...@apache.org> on 2022/09/05 04:17:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37770: [SPARK-40314][SQL] Add scala and python bindings for inline and inline_outer - posted by GitBox <gi...@apache.org> on 2022/09/05 04:22:57 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37770: [SPARK-40314][SQL][PYTHON] Add scala and python bindings for inline and inline_outer - posted by GitBox <gi...@apache.org> on 2022/09/05 04:24:32 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37747: [SPARK-40280][SQL] Add support for parquet push down for annotated int and long - posted by GitBox <gi...@apache.org> on 2022/09/05 04:26:35 UTC, 1 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #37747: [SPARK-40280][SQL] Add support for parquet push down for annotated int and long - posted by GitBox <gi...@apache.org> on 2022/09/05 04:47:59 UTC, 1 replies.
- [GitHub] [spark] Kimahriman commented on a diff in pull request #37770: [SPARK-40314][SQL][PYTHON] Add scala and python bindings for inline and inline_outer - posted by GitBox <gi...@apache.org> on 2022/09/05 05:04:40 UTC, 5 replies.
- [GitHub] [spark] pralabhkumar commented on pull request #37417: [SPARK-33782][K8S][CORE]Place spark.files, spark.jars and spark.files under the current working directory on the driver in K8S cluster mode - posted by GitBox <gi...@apache.org> on 2022/09/05 05:27:23 UTC, 7 replies.
- [GitHub] [spark] itholic commented on pull request #37739: [SPARK-40265][PS] Fix the inconsistent behavior for Index.intersection. - posted by GitBox <gi...@apache.org> on 2022/09/05 06:04:06 UTC, 1 replies.
- [GitHub] [spark] chong0929 commented on pull request #37721: [SPARK-40272][CORE]Support service port custom with range - posted by GitBox <gi...@apache.org> on 2022/09/05 06:12:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37799: [SPARK-40331][DOCS] Recommend use Java 11+ as the runtime of Spark 3.4.0 - posted by GitBox <gi...@apache.org> on 2022/09/05 06:12:50 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37797: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 6, ~50 functions) - posted by GitBox <gi...@apache.org> on 2022/09/05 06:15:53 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11+ as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/05 06:19:27 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37786: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 5, ~28 functions) - posted by GitBox <gi...@apache.org> on 2022/09/05 06:25:38 UTC, 2 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #37800: [SPARK-39830][SQL][TESTS] Reading ORC table that requires type promotion may throw AIOOBE - posted by GitBox <gi...@apache.org> on 2022/09/05 06:27:55 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #37800: [SPARK-39830][SQL][TESTS] Reading ORC table that requires type promotion may throw AIOOBE - posted by GitBox <gi...@apache.org> on 2022/09/05 06:29:28 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #37756: [SPARK-40305][PS] Implement Groupby.sem - posted by GitBox <gi...@apache.org> on 2022/09/05 06:31:22 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11+ as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/05 06:40:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11+ as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/05 06:45:31 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37770: [SPARK-40314][SQL][PYTHON] Add scala and python bindings for inline and inline_outer - posted by GitBox <gi...@apache.org> on 2022/09/05 06:50:36 UTC, 2 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #37407: [SPARK-39876][SQL] Add UNPIVOT to SQL syntax - posted by GitBox <gi...@apache.org> on 2022/09/05 06:52:56 UTC, 47 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37739: [SPARK-40265][PS] Fix the inconsistent behavior for Index.intersection. - posted by GitBox <gi...@apache.org> on 2022/09/05 06:57:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37798: [SPARK-40271][PYTHON][TESTS][FOLLOW-UP] Make test_lit_list test pass with ANSI mode on - posted by GitBox <gi...@apache.org> on 2022/09/05 06:57:46 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37768: [SPARK-40313][PS] Make `ps.DataFrame(data, index)` support the same anchor - posted by GitBox <gi...@apache.org> on 2022/09/05 07:02:27 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37796: [SPARK-40326][BUILD] Upgrade `fasterxml.jackson.version` to 2.13.4 - posted by GitBox <gi...@apache.org> on 2022/09/05 07:06:18 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37786: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 5, ~28 functions) - posted by GitBox <gi...@apache.org> on 2022/09/05 07:37:32 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37768: [SPARK-40313][PS] Make `ps.DataFrame(data, index)` support the same anchor - posted by GitBox <gi...@apache.org> on 2022/09/05 07:50:24 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37794: [WIP][SPARK-40324][SQL] Provide a query context of `ParseException` - posted by GitBox <gi...@apache.org> on 2022/09/05 08:18:17 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37794: [SPARK-40324][SQL] Provide a query context of `ParseException` - posted by GitBox <gi...@apache.org> on 2022/09/05 08:23:56 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37801: [SPARK-40333][PS] Implement `GroupBy.nth` - posted by GitBox <gi...@apache.org> on 2022/09/05 08:34:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37801: [SPARK-40333][PS] Implement `GroupBy.nth` - posted by GitBox <gi...@apache.org> on 2022/09/05 08:46:48 UTC, 2 replies.
- [GitHub] [spark] wangyum commented on pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/05 09:09:19 UTC, 2 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37801: [SPARK-40333][PS] Implement `GroupBy.nth` - posted by GitBox <gi...@apache.org> on 2022/09/05 09:45:25 UTC, 3 replies.
- [GitHub] [spark] zheniantoushipashi opened a new pull request, #37802: [SPARK-40350][Kubernetes] Use spark config to configure the parameters of volcano podgroup - posted by GitBox <gi...@apache.org> on 2022/09/05 09:56:48 UTC, 0 replies.
- [GitHub] [spark] fanyilun opened a new pull request, #37803: [SPARK-39546][Kubernetes] Executor pod template should support port definitions - posted by GitBox <gi...@apache.org> on 2022/09/05 10:59:30 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37804: [WIP][SQL] Add function aliases: len, datepart, dateadd, date_diff and curdate - posted by GitBox <gi...@apache.org> on 2022/09/05 11:15:13 UTC, 0 replies.
- [GitHub] [spark] meimiao0730 commented on pull request #31302: [SPARK-34210][SQL] After upgrading 3.0.1, Spark SQL access hive on HBase table access exception - posted by GitBox <gi...@apache.org> on 2022/09/05 11:28:47 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #37805: [WIP][Do not merge] Spark Decimal support Int128 as the underlying implementation. - posted by GitBox <gi...@apache.org> on 2022/09/05 12:27:39 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37801: [SPARK-40333][PS] Implement `GroupBy.nth` - posted by GitBox <gi...@apache.org> on 2022/09/05 12:33:42 UTC, 6 replies.
- [GitHub] [spark] ivoson commented on pull request #37268: [SPARK-39853][CORE] Support stage level task resource profile for standalone cluster when dynamic allocation disabled - posted by GitBox <gi...@apache.org> on 2022/09/05 14:03:32 UTC, 3 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37796: [SPARK-40326][BUILD] Upgrade `fasterxml.jackson.version` to 2.13.4 - posted by GitBox <gi...@apache.org> on 2022/09/05 14:53:12 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #37796: [SPARK-40326][BUILD] Upgrade `fasterxml.jackson.version` to 2.13.4 - posted by GitBox <gi...@apache.org> on 2022/09/05 15:19:39 UTC, 0 replies.
- [GitHub] [spark] ashutoshcipher commented on pull request #37796: [SPARK-40326][BUILD] Upgrade `fasterxml.jackson.version` to 2.13.4 - posted by GitBox <gi...@apache.org> on 2022/09/05 17:33:51 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #37410: [WIP][SPARK-38493][PS][FOLLOWUP] Improve more coverage for Pandas API on Spark - posted by GitBox <gi...@apache.org> on 2022/09/05 22:57:05 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #37671: [SPARK-40229][PS][TEST] Re-enable excel I/O test for pandas API on Spark - posted by GitBox <gi...@apache.org> on 2022/09/05 23:06:51 UTC, 2 replies.
- [GitHub] [spark] itholic commented on pull request #37798: [SPARK-40271][PYTHON][TESTS][FOLLOW-UP] Make test_lit_list test pass with ANSI mode on - posted by GitBox <gi...@apache.org> on 2022/09/05 23:12:22 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37803: [SPARK-39546][Kubernetes] Executor pod template should support port definitions - posted by GitBox <gi...@apache.org> on 2022/09/05 23:16:32 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37802: [SPARK-40350][Kubernetes] Use spark config to configure the parameters of volcano podgroup - posted by GitBox <gi...@apache.org> on 2022/09/05 23:16:35 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37635: [SPARK-40131][PYTHON] Support NumPy ndarray in built-in functions - posted by GitBox <gi...@apache.org> on 2022/09/05 23:42:55 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37801: [SPARK-40333][PS] Implement `GroupBy.nth` - posted by GitBox <gi...@apache.org> on 2022/09/06 00:05:37 UTC, 8 replies.
- [GitHub] [spark] srowen commented on pull request #37796: [SPARK-40326][BUILD] Upgrade `fasterxml.jackson.version` to 2.13.4 - posted by GitBox <gi...@apache.org> on 2022/09/06 00:51:51 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37796: [SPARK-40326][BUILD] Upgrade `fasterxml.jackson.version` to 2.13.4 - posted by GitBox <gi...@apache.org> on 2022/09/06 00:51:54 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37800: [SPARK-39830][SQL][TESTS] Reading ORC table that requires type promotion may throw AIOOBE - posted by GitBox <gi...@apache.org> on 2022/09/06 01:02:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37768: [SPARK-40313][PS] Make `ps.DataFrame(data, index)` support the same anchor - posted by GitBox <gi...@apache.org> on 2022/09/06 01:24:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37635: [SPARK-40131][PYTHON] Support NumPy ndarray in built-in functions - posted by GitBox <gi...@apache.org> on 2022/09/06 01:24:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37768: [SPARK-40313][PS] Make `ps.DataFrame(data, index)` support the same anchor - posted by GitBox <gi...@apache.org> on 2022/09/06 01:25:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37635: [SPARK-40131][PYTHON] Support NumPy ndarray in built-in functions - posted by GitBox <gi...@apache.org> on 2022/09/06 01:26:40 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37671: [SPARK-40229][PS][TEST] Re-enable excel I/O test for pandas API on Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 01:28:35 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37410: [SPARK-38493][PS][FOLLOWUP] Improve more coverage for Pandas API on Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 01:29:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37410: [SPARK-38493][PS][FOLLOWUP] Improve more coverage for Pandas API on Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 01:30:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 01:30:15 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 01:32:22 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37671: [SPARK-40229][PS][TEST] Re-enable excel I/O test for pandas API on Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 01:40:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 01:44:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37768: [SPARK-40313][PS] Make `ps.DataFrame(data, index)` support the same anchor - posted by GitBox <gi...@apache.org> on 2022/09/06 01:49:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 01:50:53 UTC, 3 replies.
- [GitHub] [spark] kevin85421 commented on pull request #37411: [SPARK-39984][CORE] Check workerLastHeartbeat with master before HeartbeatReceiver expires an executor - posted by GitBox <gi...@apache.org> on 2022/09/06 02:10:07 UTC, 1 replies.
- [GitHub] [spark] yabola commented on pull request #37779: [wip][SPARK-40320][Core] Executor should exit when it failed to initialize for fatal error - posted by GitBox <gi...@apache.org> on 2022/09/06 02:33:03 UTC, 11 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37797: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 6, ~50 functions) - posted by GitBox <gi...@apache.org> on 2022/09/06 02:48:40 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37796: [SPARK-40326][BUILD] Upgrade `fasterxml.jackson.version` to 2.13.4 - posted by GitBox <gi...@apache.org> on 2022/09/06 02:48:43 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37795: fix the question of SparkSQL call iceberg's expire_snapshots procedur… - posted by GitBox <gi...@apache.org> on 2022/09/06 02:48:46 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 03:21:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 03:37:33 UTC, 5 replies.
- [GitHub] [spark] Ted-Jiang opened a new pull request, #37806: [MINOR]Print stacktrace when NoClassDefFoundError in HiveDelegationToken - posted by GitBox <gi...@apache.org> on 2022/09/06 03:50:58 UTC, 1 replies.
- [GitHub] [spark] Ted-Jiang commented on pull request #37806: [MINOR]Print stacktrace when NoClassDefFoundError in HiveDelegationToken - posted by GitBox <gi...@apache.org> on 2022/09/06 03:51:13 UTC, 0 replies.
- [GitHub] [spark] Ted-Jiang closed pull request #37806: [MINOR]Print stacktrace when NoClassDefFoundError in HiveDelegationToken - posted by GitBox <gi...@apache.org> on 2022/09/06 03:59:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37760: [SPARK-38404][SQL][3.3] Improve CTE resolution when a nested CTE references an outer CTE - posted by GitBox <gi...@apache.org> on 2022/09/06 04:33:47 UTC, 2 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37793: add sparksql wirte mysql support update ,the design from replace into… - posted by GitBox <gi...@apache.org> on 2022/09/06 04:35:23 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37790: [SPARK-40288][SQL] After `RemoveRedundantAggregates`, `PullOutGroupingExpressions` should applied to avoid attribute missing when use complex expression - posted by GitBox <gi...@apache.org> on 2022/09/06 04:35:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 04:40:04 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on pull request #37610: [SPARK-38888][BUILD][CORE][YARN][DOCS] Add `RocksDB` support for shuffle service state store - posted by GitBox <gi...@apache.org> on 2022/09/06 04:53:45 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37758: [SPARK-40149][SQL] Propagate metadata columns through Project - posted by GitBox <gi...@apache.org> on 2022/09/06 04:55:06 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37532: [SPARK-39989][SQL][FollowUp] Improve foldable expression stats estimate for string and binary - posted by GitBox <gi...@apache.org> on 2022/09/06 05:13:17 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37800: [SPARK-39830][SQL][TESTS] Add a test case to read ORC table that requires type promotion - posted by GitBox <gi...@apache.org> on 2022/09/06 05:25:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37672: [SPARK-40228][SQL] Do not simplify multiLike if child is not a cheap expression - posted by GitBox <gi...@apache.org> on 2022/09/06 05:26:46 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37621: [SPARK-40185][SQL] Remove column suggestion when the candidate list is empty - posted by GitBox <gi...@apache.org> on 2022/09/06 05:33:29 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37800: [SPARK-39830][SQL][TESTS] Add a test case to read ORC table that requires type promotion - posted by GitBox <gi...@apache.org> on 2022/09/06 05:37:31 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37648: [SPARK-38909][BUILD][CORE][YARN][FOLLOWUP] Make some code cleanup related to shuffle state db - posted by GitBox <gi...@apache.org> on 2022/09/06 05:40:25 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 05:43:00 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #37621: [SPARK-40185][SQL] Remove column suggestion when the candidate list is empty - posted by GitBox <gi...@apache.org> on 2022/09/06 05:47:24 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #37800: [SPARK-39830][SQL][TESTS] Add a test case to read ORC table that requires type promotion - posted by GitBox <gi...@apache.org> on 2022/09/06 05:55:28 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37761: [SPARK-40311][SQL][CORE][PYTHON] Add withColumnsRenamed to scala and pyspark API - posted by GitBox <gi...@apache.org> on 2022/09/06 06:14:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37761: [SPARK-40311][SQL][CORE][PYTHON] Add withColumnsRenamed to scala and pyspark API - posted by GitBox <gi...@apache.org> on 2022/09/06 06:14:34 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #37761: [SPARK-40311][SQL][CORE][PYTHON] Add withColumnsRenamed to scala and pyspark API - posted by GitBox <gi...@apache.org> on 2022/09/06 06:42:15 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #37761: [SPARK-40311][SQL][PYTHON] Add withColumnsRenamed to scala and pyspark API - posted by GitBox <gi...@apache.org> on 2022/09/06 06:45:10 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37804: [SPARK-40352][SQL] Add function aliases: len, datepart, dateadd, date_diff and curdate - posted by GitBox <gi...@apache.org> on 2022/09/06 07:45:01 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #37268: [SPARK-39853][CORE] Support stage level task resource profile for standalone cluster when dynamic allocation disabled - posted by GitBox <gi...@apache.org> on 2022/09/06 07:50:16 UTC, 5 replies.
- [GitHub] [spark] santosh-d3vpl3x commented on a diff in pull request #37761: [SPARK-40311][SQL][PYTHON] Add withColumnsRenamed to scala and pyspark API - posted by GitBox <gi...@apache.org> on 2022/09/06 08:11:43 UTC, 6 replies.
- [GitHub] [spark] fanyilun commented on pull request #37803: [SPARK-39546][Kubernetes] Executor pod template should support port definitions - posted by GitBox <gi...@apache.org> on 2022/09/06 08:21:43 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37804: [SPARK-40352][SQL] Add function aliases: len, datepart, dateadd, date_diff and curdate - posted by GitBox <gi...@apache.org> on 2022/09/06 08:50:01 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #37744: [SPARK-40300][SQL] Migrate onto the `DATATYPE_MISMATCH` error class - posted by GitBox <gi...@apache.org> on 2022/09/06 08:53:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #36850: [SPARK-39069][SQL] Pushing EqualTo with Literal to other conditions - posted by GitBox <gi...@apache.org> on 2022/09/06 08:55:12 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #36850: [SPARK-39069][SQL] Enhance ConstantPropagation to replace constants in inequality predicates - posted by GitBox <gi...@apache.org> on 2022/09/06 09:02:23 UTC, 12 replies.
- [GitHub] [spark] MaxGekk closed pull request #37794: [SPARK-40324][SQL] Provide a query context of `ParseException` - posted by GitBox <gi...@apache.org> on 2022/09/06 09:18:23 UTC, 0 replies.
- [GitHub] [spark] c27kwan commented on pull request #37771: [SPARK-40315][SQL] Add equals() and hashCode() to ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/06 09:33:59 UTC, 2 replies.
- [GitHub] [spark] c27kwan opened a new pull request, #37807: [SPARK-40315][SQL] Add hashCode() for Literal of ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/06 10:08:44 UTC, 0 replies.
- [GitHub] [spark] c27kwan commented on a diff in pull request #37807: [SPARK-40315][SQL] Add hashCode() for Literal of ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/06 10:11:14 UTC, 0 replies.
- [GitHub] [spark] c27kwan closed pull request #37771: [SPARK-40315][SQL] Add equals() and hashCode() to ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/06 10:21:05 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #37808: [SPARK-39830][SQL][TESTS][3.3] Add a test case to read ORC table that requires type promotion - posted by GitBox <gi...@apache.org> on 2022/09/06 11:24:30 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #37808: [SPARK-39830][SQL][TESTS][3.3] Add a test case to read ORC table that requires type promotion - posted by GitBox <gi...@apache.org> on 2022/09/06 11:24:45 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #37809: [SPARK-40355][SQL] Improve pushdown for orc & parquet when cast scenario - posted by GitBox <gi...@apache.org> on 2022/09/06 11:32:03 UTC, 0 replies.
- [GitHub] [spark] Yikun opened a new pull request, #37810: [SPARK-40356][INFRA][PS] Upgrade pandas to 1.4.4 (infra and docs) - posted by GitBox <gi...@apache.org> on 2022/09/06 12:42:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37771: [SPARK-40315][SQL] Add equals() and hashCode() to ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/06 12:50:42 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37804: [SPARK-40352][SQL] Add function aliases: len, datepart, dateadd, date_diff and curdate - posted by GitBox <gi...@apache.org> on 2022/09/06 13:03:32 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37804: [SPARK-40352][SQL] Add function aliases: len, datepart, dateadd, date_diff and curdate - posted by GitBox <gi...@apache.org> on 2022/09/06 13:04:07 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #37417: [SPARK-33782][K8S][CORE]Place spark.files, spark.jars and spark.files under the current working directory on the driver in K8S cluster mode - posted by GitBox <gi...@apache.org> on 2022/09/06 13:11:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37807: [SPARK-40315][SQL] Add hashCode() for Literal of ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/06 13:11:41 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on pull request #37417: [SPARK-33782][K8S][CORE]Place spark.files, spark.jars and spark.files under the current working directory on the driver in K8S cluster mode - posted by GitBox <gi...@apache.org> on 2022/09/06 13:12:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37807: [SPARK-40315][SQL] Add hashCode() for Literal of ArrayBasedMapData - posted by GitBox <gi...@apache.org> on 2022/09/06 13:15:47 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37802: [SPARK-40350][Kubernetes] Use spark config to configure the parameters of volcano podgroup - posted by GitBox <gi...@apache.org> on 2022/09/06 13:29:37 UTC, 4 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #37624: [SPARK-40186][CORE][YARN] Ensure `mergedShuffleCleaner` have been shutdown before `db` close - posted by GitBox <gi...@apache.org> on 2022/09/06 13:34:08 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37624: [SPARK-40186][CORE][YARN] Ensure `mergedShuffleCleaner` have been shutdown before `db` close - posted by GitBox <gi...@apache.org> on 2022/09/06 13:46:09 UTC, 6 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #37716: [SPARK-40269][CORE] Randomize the orders of peer in BlockManagerDecommissioner - posted by GitBox <gi...@apache.org> on 2022/09/06 14:03:21 UTC, 4 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #37603: [SPARK-40168][CORE] Handle FileNotFoundException when shuffle file deleted in decommissioner - posted by GitBox <gi...@apache.org> on 2022/09/06 14:18:22 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37809: [SPARK-40355][SQL] Improve pushdown for orc & parquet when cast scenario - posted by GitBox <gi...@apache.org> on 2022/09/06 14:49:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37743: [SPARK-40294][SQL] Fix repeat calls to `PartitionReader.hasNext` timing out - posted by GitBox <gi...@apache.org> on 2022/09/06 14:53:51 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on a diff in pull request #37268: [SPARK-39853][CORE] Support stage level task resource profile for standalone cluster when dynamic allocation disabled - posted by GitBox <gi...@apache.org> on 2022/09/06 15:31:20 UTC, 3 replies.
- [GitHub] [spark] roczei commented on pull request #37679: [SPARK-35242][SQL] Support changing session catalog's default database - posted by GitBox <gi...@apache.org> on 2022/09/06 15:45:19 UTC, 9 replies.
- [GitHub] [spark] sigmod commented on pull request #37697: [SPARK-40248][SQL] Use larger number of bits to build Bloom filter - posted by GitBox <gi...@apache.org> on 2022/09/06 16:06:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 16:11:51 UTC, 0 replies.
- [GitHub] [spark] sigmod commented on pull request #36850: [SPARK-39069][SQL] Enhance ConstantPropagation to replace constants in inequality predicates - posted by GitBox <gi...@apache.org> on 2022/09/06 16:13:06 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37624: [SPARK-40186][CORE][YARN] Ensure `mergedShuffleCleaner` have been shutdown before `db` close - posted by GitBox <gi...@apache.org> on 2022/09/06 17:20:10 UTC, 2 replies.
- [GitHub] [spark] srowen commented on pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/06 17:26:40 UTC, 0 replies.
- [GitHub] [spark] revans2 commented on pull request #37747: [SPARK-40280][SQL] Add support for parquet push down for annotated int and long - posted by GitBox <gi...@apache.org> on 2022/09/06 17:43:32 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #37775: [SPARK-40318][SQL] try_avg() should throw the exceptions from its child - posted by GitBox <gi...@apache.org> on 2022/09/06 17:58:55 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #37775: [SPARK-40318][SQL] try_avg() should throw the exceptions from its child - posted by GitBox <gi...@apache.org> on 2022/09/06 17:59:47 UTC, 0 replies.
- [GitHub] [spark] thejdeep commented on pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side metrics - posted by GitBox <gi...@apache.org> on 2022/09/06 18:07:34 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on a diff in pull request #36506: [SPARK-25050][SQL] Avro: writing complex unions - posted by GitBox <gi...@apache.org> on 2022/09/06 18:48:07 UTC, 3 replies.
- [GitHub] [spark] vitaliili-db commented on a diff in pull request #37621: [SPARK-40185][SQL] Remove column suggestion when the candidate list is empty - posted by GitBox <gi...@apache.org> on 2022/09/06 19:08:39 UTC, 1 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #37468: [SPARK-40034][SQL] PathOutputCommitters to support dynamic partitions - posted by GitBox <gi...@apache.org> on 2022/09/06 20:09:16 UTC, 4 replies.
- [GitHub] [spark] warrenzhu25 commented on a diff in pull request #37716: [SPARK-40269][CORE] Randomize the orders of peer in BlockManagerDecommissioner - posted by GitBox <gi...@apache.org> on 2022/09/06 20:52:05 UTC, 0 replies.
- [GitHub] [spark] srielau opened a new pull request, #37811: [SPARK-40360] TABLE_OR_VIEW_ALREADY_EXISTS_ERROR - posted by GitBox <gi...@apache.org> on 2022/09/06 21:13:42 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #37716: [SPARK-40269][CORE] Randomize the orders of peer in BlockManagerDecommissioner - posted by GitBox <gi...@apache.org> on 2022/09/06 22:01:10 UTC, 1 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #37812: fix - posted by GitBox <gi...@apache.org> on 2022/09/06 23:25:56 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #37812: fix - posted by GitBox <gi...@apache.org> on 2022/09/06 23:26:57 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #37812: fix - posted by GitBox <gi...@apache.org> on 2022/09/06 23:29:30 UTC, 0 replies.
- [GitHub] [spark] linhongliu-db commented on a diff in pull request #37532: [SPARK-39989][SQL][FollowUp] Improve foldable expression stats estimate for string and binary - posted by GitBox <gi...@apache.org> on 2022/09/07 01:06:03 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #37813: [SPARK-40228][SQL][3.3] Do not simplify multiLike if child is not a cheap expression - posted by GitBox <gi...@apache.org> on 2022/09/07 01:14:13 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37810: [SPARK-40356][INFRA][PS] Upgrade pandas to 1.4.4 (infra and docs) - posted by GitBox <gi...@apache.org> on 2022/09/07 01:19:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37810: [SPARK-40356][INFRA][PS] Upgrade pandas to 1.4.4 (infra and docs) - posted by GitBox <gi...@apache.org> on 2022/09/07 01:26:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37810: [SPARK-40356][INFRA][PS] Upgrade pandas to 1.4.4 (infra and docs) - posted by GitBox <gi...@apache.org> on 2022/09/07 01:27:04 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on a diff in pull request #37603: [SPARK-40168][CORE] Handle FileNotFoundException when shuffle file deleted in decommissioner - posted by GitBox <gi...@apache.org> on 2022/09/07 01:35:34 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #37759: [SPARK-40306][SQL]Support more than Integer.MAX_VALUE of the same join key - posted by GitBox <gi...@apache.org> on 2022/09/07 03:02:05 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #37809: [SPARK-40355][SQL] Improve pushdown for orc & parquet when cast scenario - posted by GitBox <gi...@apache.org> on 2022/09/07 03:37:19 UTC, 2 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #37809: [SPARK-40355][SQL] Improve pushdown for orc & parquet when cast scenario - posted by GitBox <gi...@apache.org> on 2022/09/07 05:25:08 UTC, 5 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #37814: [SPARK-40365][BUILD] Bump ANTLR runtime version from 4.8 to 4.11.1 - posted by GitBox <gi...@apache.org> on 2022/09/07 05:43:18 UTC, 0 replies.
- [GitHub] [spark] zheniantoushipashi commented on pull request #37802: [SPARK-40350][Kubernetes] Use spark config to configure the parameters of volcano podgroup - posted by GitBox <gi...@apache.org> on 2022/09/07 06:11:09 UTC, 4 replies.
- [GitHub] [spark] zheniantoushipashi commented on a diff in pull request #37802: [SPARK-40350][Kubernetes] Use spark config to configure the parameters of volcano podgroup - posted by GitBox <gi...@apache.org> on 2022/09/07 06:12:42 UTC, 0 replies.
- [GitHub] [spark] Yikun opened a new pull request, #37815: [SPARK-40366][INFRA] Add `spark` namespace to spark ci image - posted by GitBox <gi...@apache.org> on 2022/09/07 06:37:35 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37814: [SPARK-40365][BUILD] Bump ANTLR runtime version from 4.8 to 4.11.1 - posted by GitBox <gi...@apache.org> on 2022/09/07 06:45:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37814: [WIP][SPARK-40365][BUILD] Bump ANTLR runtime version from 4.8 to 4.11.1 - posted by GitBox <gi...@apache.org> on 2022/09/07 06:49:16 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37648: [SPARK-38909][BUILD][CORE][YARN][FOLLOWUP] Make some code cleanup related to shuffle state db - posted by GitBox <gi...@apache.org> on 2022/09/07 06:49:23 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #37814: [WIP][SPARK-40365][BUILD] Bump ANTLR runtime version from 4.8 to 4.11.1 - posted by GitBox <gi...@apache.org> on 2022/09/07 06:53:56 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37815: [SPARK-40366][INFRA] Add `spark` namespace to spark ci image - posted by GitBox <gi...@apache.org> on 2022/09/07 06:55:03 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37814: [WIP][SPARK-40365][BUILD] Bump ANTLR runtime version from 4.8 to 4.9.3 - posted by GitBox <gi...@apache.org> on 2022/09/07 06:55:42 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #37813: [SPARK-40228][SQL][3.3] Do not simplify multiLike if child is not a cheap expression - posted by GitBox <gi...@apache.org> on 2022/09/07 08:13:29 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #37525: [SPARK-40086][SQL] Improve AliasAwareOutputPartitioning to take all aliases into account - posted by GitBox <gi...@apache.org> on 2022/09/07 08:50:11 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #36027: [SPARK-38717][SQL] Handle Hive's bucket spec case preserving behaviour - posted by GitBox <gi...@apache.org> on 2022/09/07 08:51:29 UTC, 2 replies.
- [GitHub] [spark] cloud-fan closed pull request #37760: [SPARK-38404][SQL][3.3] Improve CTE resolution when a nested CTE references an outer CTE - posted by GitBox <gi...@apache.org> on 2022/09/07 08:59:38 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37814: [WIP][SPARK-40365][BUILD] Bump ANTLR runtime version from 4.8 to 4.9.3 - posted by GitBox <gi...@apache.org> on 2022/09/07 09:11:01 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37761: [SPARK-40311][SQL][PYTHON] Add withColumnsRenamed to scala and pyspark API - posted by GitBox <gi...@apache.org> on 2022/09/07 09:21:49 UTC, 3 replies.
- [GitHub] [spark] Yikun commented on pull request #37710: [DRAFT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/07 09:24:19 UTC, 2 replies.
- [GitHub] [spark] zero323 closed pull request #37724: [SPARK-40273][PYTHON][DOCS] Fix the documents "Contributing and Maintaining Type Hints". - posted by GitBox <gi...@apache.org> on 2022/09/07 09:30:24 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on pull request #37724: [SPARK-40273][PYTHON][DOCS] Fix the documents "Contributing and Maintaining Type Hints". - posted by GitBox <gi...@apache.org> on 2022/09/07 09:30:41 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37801: [SPARK-40333][PS] Implement `GroupBy.nth` - posted by GitBox <gi...@apache.org> on 2022/09/07 09:36:13 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #37801: [SPARK-40333][PS] Implement `GroupBy.nth` - posted by GitBox <gi...@apache.org> on 2022/09/07 09:39:28 UTC, 0 replies.
- [GitHub] [spark] steveloughran commented on a diff in pull request #37468: [SPARK-40034][SQL] PathOutputCommitters to support dynamic partitions - posted by GitBox <gi...@apache.org> on 2022/09/07 09:55:11 UTC, 5 replies.
- [GitHub] [spark] grundprinzip commented on pull request #37710: [DRAFT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/07 09:57:50 UTC, 3 replies.
- [GitHub] [spark] Yikun opened a new pull request, #37816: [SPARK-40332][PS] Implement `GroupBy.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/07 10:08:42 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37816: [SPARK-40332][PS] Implement `GroupBy.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/07 10:10:52 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37816: [SPARK-40332][PS] Implement `GroupBy.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/07 10:21:14 UTC, 8 replies.
- [GitHub] [spark] cloud-fan closed pull request #37758: [SPARK-40149][SQL] Propagate metadata columns through Project - posted by GitBox <gi...@apache.org> on 2022/09/07 10:46:12 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37811: [SPARK-40360] TABLE_OR_VIEW_ALREADY_EXISTS error - posted by GitBox <gi...@apache.org> on 2022/09/07 10:57:42 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37809: [SPARK-40355][SQL] Improve pushdown for orc & parquet when cast scenario - posted by GitBox <gi...@apache.org> on 2022/09/07 10:57:45 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37808: [SPARK-39830][SQL][TESTS][3.3] Add a test case to read ORC table that requires type promotion - posted by GitBox <gi...@apache.org> on 2022/09/07 10:57:48 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37806: [MINOR][SQL] Print stacktrace when NoClassDefFoundError in HiveDelegationToken - posted by GitBox <gi...@apache.org> on 2022/09/07 10:57:51 UTC, 0 replies.
- [GitHub] [spark] ELHoussineT opened a new pull request, #37817: Avoid Numpy deprecation warning - posted by GitBox <gi...@apache.org> on 2022/09/07 11:14:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #37818: [SPARK-40149][SQL][3.2] Propagate metadata columns through Project - posted by GitBox <gi...@apache.org> on 2022/09/07 11:25:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37818: [SPARK-40149][SQL][3.2] Propagate metadata columns through Project - posted by GitBox <gi...@apache.org> on 2022/09/07 11:26:50 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37621: [SPARK-40185][SQL] Remove column suggestion when the candidate list is empty - posted by GitBox <gi...@apache.org> on 2022/09/07 11:50:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37621: [SPARK-40185][SQL] Remove column suggestion when the candidate list is empty - posted by GitBox <gi...@apache.org> on 2022/09/07 11:50:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #36850: [SPARK-39069][SQL] Enhance ConstantPropagation to replace constants in inequality predicates - posted by GitBox <gi...@apache.org> on 2022/09/07 12:08:30 UTC, 6 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37809: [SPARK-40355][SQL] Improve pushdown for orc & parquet when cast scenario - posted by GitBox <gi...@apache.org> on 2022/09/07 12:11:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #36027: [SPARK-38717][SQL] Handle Hive's bucket spec case preserving behaviour - posted by GitBox <gi...@apache.org> on 2022/09/07 12:15:46 UTC, 10 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37679: [SPARK-35242][SQL] Support changing session catalog's default database - posted by GitBox <gi...@apache.org> on 2022/09/07 12:23:42 UTC, 16 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37679: [SPARK-35242][SQL] Support changing session catalog's default database - posted by GitBox <gi...@apache.org> on 2022/09/07 12:24:41 UTC, 3 replies.
- [GitHub] [spark] LeeeeLiu opened a new pull request, #37819: SPARK-40377 Allow customize maxBroadcastTableBytes and maxBroadcastRows - posted by GitBox <gi...@apache.org> on 2022/09/07 12:58:57 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #37809: [SPARK-40355][SQL] Improve pushdown for orc & parquet when cast scenario - posted by GitBox <gi...@apache.org> on 2022/09/07 13:19:39 UTC, 0 replies.
- [GitHub] [spark] ivoson commented on a diff in pull request #37268: [SPARK-39853][CORE] Support stage level task resource profile for standalone cluster when dynamic allocation disabled - posted by GitBox <gi...@apache.org> on 2022/09/07 13:22:41 UTC, 11 replies.
- [GitHub] [spark] srowen commented on pull request #37817: [SPARK-40376] Avoid Numpy deprecation warning - posted by GitBox <gi...@apache.org> on 2022/09/07 13:34:03 UTC, 0 replies.
- [GitHub] [spark] ELHoussineT commented on pull request #37817: [SPARK-40376] Avoid Numpy deprecation warning - posted by GitBox <gi...@apache.org> on 2022/09/07 14:01:44 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on a diff in pull request #37648: [SPARK-38909][BUILD][CORE][YARN][FOLLOWUP] Make some code cleanup related to shuffle state db - posted by GitBox <gi...@apache.org> on 2022/09/07 14:06:44 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37816: [SPARK-40332][PS] Implement `GroupBy.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/07 14:22:06 UTC, 8 replies.
- [GitHub] [spark] plokhotnyuk commented on a diff in pull request #37604: [DON'T MERGE] Try to replace all `json4s` with `Jackson` - posted by GitBox <gi...@apache.org> on 2022/09/07 14:48:33 UTC, 0 replies.
- [GitHub] [spark] plokhotnyuk commented on pull request #37604: [DON'T MERGE] Try to replace all `json4s` with `Jackson` - posted by GitBox <gi...@apache.org> on 2022/09/07 14:48:48 UTC, 0 replies.
- [GitHub] [spark] steveloughran commented on pull request #37468: [SPARK-40034][SQL] PathOutputCommitters to support dynamic partitions - posted by GitBox <gi...@apache.org> on 2022/09/07 15:13:17 UTC, 1 replies.
- [GitHub] [spark] Yikun closed pull request #37815: [SPARK-40366][INFRA] Add `spark` namespace to spark ci image - posted by GitBox <gi...@apache.org> on 2022/09/07 15:18:50 UTC, 1 replies.
- [GitHub] [spark] Yikun opened a new pull request, #37820: [SPARK-38961][PS][FOLLOWUP] Fix note in missing pandas - posted by GitBox <gi...@apache.org> on 2022/09/07 15:27:34 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37820: [SPARK-38961][PS][FOLLOWUP] Fix note in missing pandas - posted by GitBox <gi...@apache.org> on 2022/09/07 15:28:52 UTC, 0 replies.
- [GitHub] [spark] beobest2 commented on pull request #37820: [SPARK-38961][PS][FOLLOWUP] Fix note in missing pandas - posted by GitBox <gi...@apache.org> on 2022/09/07 15:35:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37818: [SPARK-40149][SQL][3.2] Propagate metadata columns through Project - posted by GitBox <gi...@apache.org> on 2022/09/07 15:45:28 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37802: [SPARK-40350][Kubernetes] Use spark config to configure the parameters of volcano podgroup - posted by GitBox <gi...@apache.org> on 2022/09/07 16:03:32 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37815: [SPARK-40366][INFRA] Add `spark` namespace to spark ci image - posted by GitBox <gi...@apache.org> on 2022/09/07 16:09:18 UTC, 0 replies.
- [GitHub] [spark] sunchao closed pull request #37749: [SPARK-40295][SQL] Allow v2 functions with literal args in write distribution/ordering - posted by GitBox <gi...@apache.org> on 2022/09/07 16:16:18 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #37749: [SPARK-40295][SQL] Allow v2 functions with literal args in write distribution/ordering - posted by GitBox <gi...@apache.org> on 2022/09/07 16:16:44 UTC, 0 replies.
- [GitHub] [spark] holdenk opened a new pull request, #37821: [SPARK-40379][K8S]: Propagate decommission executor loss reason in K8s - posted by GitBox <gi...@apache.org> on 2022/09/07 16:18:56 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #37818: [SPARK-40149][SQL][3.2] Propagate metadata columns through Project - posted by GitBox <gi...@apache.org> on 2022/09/07 16:40:02 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #37733: [SPARK-40267][DOC] Add description for ExecutorAllocationManager metrics - posted by GitBox <gi...@apache.org> on 2022/09/07 17:33:32 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #37634: [SPARK-40199][SQL] Provide useful error when projecting a non-null column encounters null value - posted by GitBox <gi...@apache.org> on 2022/09/07 17:43:15 UTC, 1 replies.
- [GitHub] [spark] warrenzhu25 opened a new pull request, #37822: [SPARK-40381][DEPLOY] Support standalone worker recommission - posted by GitBox <gi...@apache.org> on 2022/09/07 18:30:20 UTC, 0 replies.
- [GitHub] [spark] rednaxelafx opened a new pull request, #37823: [SPARK-40380][SQL] Fix constant-folding of InvokeLike to avoid non-serializable literal embedded in the plan - posted by GitBox <gi...@apache.org> on 2022/09/07 18:39:06 UTC, 0 replies.
- [GitHub] [spark] rednaxelafx commented on pull request #37823: [SPARK-40380][SQL] Fix constant-folding of InvokeLike to avoid non-serializable literal embedded in the plan - posted by GitBox <gi...@apache.org> on 2022/09/07 18:43:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37746: [SPARK-40293][SQL] Make the V2 table error message more meaningful - posted by GitBox <gi...@apache.org> on 2022/09/07 19:15:12 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37742: [SPARK-40291][SQL] Improve the message for column not in group by clause error - posted by GitBox <gi...@apache.org> on 2022/09/07 19:31:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37742: [SPARK-40291][SQL] Improve the message for column not in group by clause error - posted by GitBox <gi...@apache.org> on 2022/09/07 19:32:48 UTC, 0 replies.
- [GitHub] [spark] ahshahid opened a new pull request, #37824: Spark 40362 Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/07 20:03:30 UTC, 0 replies.
- [GitHub] [spark] abhishekboga commented on pull request #28953: [SPARK-32013][SQL] Support query execution before reading DataFrame and before/after writing DataFrame over JDBC - posted by GitBox <gi...@apache.org> on 2022/09/07 23:29:48 UTC, 2 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #37825: [SPARK-40382][SQL] Group distinct aggregate expressions by semantically equivalent children in `RewriteDistinctAggregates` - posted by GitBox <gi...@apache.org> on 2022/09/07 23:42:47 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37820: [SPARK-38961][PS][FOLLOWUP] Fix note in missing pandas - posted by GitBox <gi...@apache.org> on 2022/09/07 23:46:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37823: [SPARK-40380][SQL] Fix constant-folding of InvokeLike to avoid non-serializable literal embedded in the plan - posted by GitBox <gi...@apache.org> on 2022/09/07 23:58:08 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37820: [SPARK-38961][PS][FOLLOWUP] Fix note in missing pandas - posted by GitBox <gi...@apache.org> on 2022/09/08 00:29:22 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37816: [SPARK-40332][PS] Implement `GroupBy.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/08 00:47:28 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #28953: [SPARK-32013][SQL] Support query execution before reading DataFrame and before/after writing DataFrame over JDBC - posted by GitBox <gi...@apache.org> on 2022/09/08 00:52:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37671: [SPARK-40229][PS][TEST] Re-enable excel I/O test for pandas API on Spark - posted by GitBox <gi...@apache.org> on 2022/09/08 01:00:33 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37814: [SPARK-40365][BUILD] Bump ANTLR runtime version from 4.8 to 4.9.3 - posted by GitBox <gi...@apache.org> on 2022/09/08 01:08:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37814: [SPARK-40365][BUILD] Bump ANTLR runtime version from 4.8 to 4.9.3 - posted by GitBox <gi...@apache.org> on 2022/09/08 01:09:13 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37820: [MINOR][PS][DOCS] Fix note in missing pandas - posted by GitBox <gi...@apache.org> on 2022/09/08 01:38:51 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37820: [MINOR][PS][DOCS] Fix note in missing pandas - posted by GitBox <gi...@apache.org> on 2022/09/08 01:39:04 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #37648: [SPARK-38909][BUILD][CORE][YARN][FOLLOWUP] Make some code cleanup related to shuffle state db - posted by GitBox <gi...@apache.org> on 2022/09/08 01:47:03 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37648: [SPARK-38909][BUILD][CORE][YARN][FOLLOWUP] Make some code cleanup related to shuffle state db - posted by GitBox <gi...@apache.org> on 2022/09/08 01:47:34 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37816: [SPARK-40332][PS] Implement `GroupBy.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/08 01:54:38 UTC, 8 replies.
- [GitHub] [spark] mridulm closed pull request #37624: [SPARK-40186][CORE][YARN] Ensure `mergedShuffleCleaner` have been shutdown before `db` close - posted by GitBox <gi...@apache.org> on 2022/09/08 01:55:53 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37624: [SPARK-40186][CORE][YARN] Ensure `mergedShuffleCleaner` have been shutdown before `db` close - posted by GitBox <gi...@apache.org> on 2022/09/08 01:57:17 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #37817: [SPARK-40376] Avoid Numpy deprecation warning - posted by GitBox <gi...@apache.org> on 2022/09/08 02:21:20 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37468: [SPARK-40034][SQL] PathOutputCommitters to support dynamic partitions - posted by GitBox <gi...@apache.org> on 2022/09/08 02:50:08 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37610: [SPARK-38888][BUILD][CORE][YARN][DOCS] Add `RocksDB` support for shuffle service state store - posted by GitBox <gi...@apache.org> on 2022/09/08 02:59:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37610: [SPARK-38888][BUILD][CORE][YARN][DOCS] Add `RocksDB` support for shuffle service state store - posted by GitBox <gi...@apache.org> on 2022/09/08 03:02:48 UTC, 1 replies.
- [GitHub] [spark] AngersZhuuuu commented on a diff in pull request #37823: [SPARK-40380][SQL] Fix constant-folding of InvokeLike to avoid non-serializable literal embedded in the plan - posted by GitBox <gi...@apache.org> on 2022/09/08 03:03:09 UTC, 0 replies.
- [GitHub] [spark] caican00 commented on a diff in pull request #37479: [SPARK-40045][SQL]Optimize the order of filtering predicates - posted by GitBox <gi...@apache.org> on 2022/09/08 03:55:49 UTC, 1 replies.
- [GitHub] [spark] attilapiros commented on pull request #37824: Spark 40362 Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/08 04:05:18 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #37824: Spark 40362 Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/08 04:13:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37826: [SPARK-40364][CORE] Use the unified `DBProvider#initDB ` method - posted by GitBox <gi...@apache.org> on 2022/09/08 04:14:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37827: [SPARK-40383][INFRA] Pin mypy ==0.920 in dev/requirements.txt - posted by GitBox <gi...@apache.org> on 2022/09/08 04:15:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37826: [SPARK-40364][CORE] Use the unified `DBProvider#initDB ` method - posted by GitBox <gi...@apache.org> on 2022/09/08 04:16:04 UTC, 8 replies.
- [GitHub] [spark] Yikun opened a new pull request, #37828: [SPARK-40384][INFRA] Do base image real in time build only when infra dockerfile is changed - posted by GitBox <gi...@apache.org> on 2022/09/08 05:12:59 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37828: [SPARK-40384][INFRA] Do base image real in time build only when infra dockerfile is changed - posted by GitBox <gi...@apache.org> on 2022/09/08 05:13:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37829: [SPARK-40386][PS][SQL] Implement `ddof` in `DataFrame.cov` - posted by GitBox <gi...@apache.org> on 2022/09/08 06:11:35 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #37830: [SPARK-40387][SQL] Improve the implementation of Spark Decimal - posted by GitBox <gi...@apache.org> on 2022/09/08 06:43:26 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #37831: [SPARK-40354][SQL] Support eliminate dynamic partition for datasource v1 writes - posted by GitBox <gi...@apache.org> on 2022/09/08 06:44:49 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37826: [SPARK-40364][CORE] Use the unified `DBProvider#initDB ` method - posted by GitBox <gi...@apache.org> on 2022/09/08 07:03:52 UTC, 0 replies.
- [GitHub] [spark] rednaxelafx commented on a diff in pull request #37823: [SPARK-40380][SQL] Fix constant-folding of InvokeLike to avoid non-serializable literal embedded in the plan - posted by GitBox <gi...@apache.org> on 2022/09/08 07:06:52 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #37832: [SPARK-40389][SQL] Decimals can't upcast as integral types if the cast can overflow - posted by GitBox <gi...@apache.org> on 2022/09/08 07:19:12 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #37832: [SPARK-40389][SQL] Decimals can't upcast as integral types if the cast can overflow - posted by GitBox <gi...@apache.org> on 2022/09/08 07:19:35 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37803: [SPARK-39546][Kubernetes] Executor pod template should support port definitions - posted by GitBox <gi...@apache.org> on 2022/09/08 07:36:25 UTC, 0 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #37833: [SPARK-40292] Fix column names in "arrays_zip" function when arrays are referenced from nested structs - posted by GitBox <gi...@apache.org> on 2022/09/08 07:37:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37803: [SPARK-39546][Kubernetes] Executor pod template should support port definitions - posted by GitBox <gi...@apache.org> on 2022/09/08 07:37:51 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #37833: [SPARK-40292][SQL] Fix column names in "arrays_zip" function when arrays are referenced from nested structs - posted by GitBox <gi...@apache.org> on 2022/09/08 07:38:21 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37834: [WIP][SQL] Pass error message parameters to exceptions as a map - posted by GitBox <gi...@apache.org> on 2022/09/08 07:50:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37801: [SPARK-40333][PS] Implement `GroupBy.nth` - posted by GitBox <gi...@apache.org> on 2022/09/08 08:04:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37803: [SPARK-39546][K8S] Support `ports` definition in executor pod template - posted by GitBox <gi...@apache.org> on 2022/09/08 08:05:43 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #37832: [SPARK-40389][SQL] Decimals can't upcast as integral types if the cast can overflow - posted by GitBox <gi...@apache.org> on 2022/09/08 08:06:03 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37832: [SPARK-40389][SQL] Decimals can't upcast as integral types if the cast can overflow - posted by GitBox <gi...@apache.org> on 2022/09/08 08:22:59 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #37832: [SPARK-40389][SQL] Decimals can't upcast as integral types if the cast can overflow - posted by GitBox <gi...@apache.org> on 2022/09/08 08:31:00 UTC, 1 replies.
- [GitHub] [spark] boneanxs commented on pull request #30894: [SPARK-33152][SQL] Improve the performance of constraint propagation for Project and Aggregate - posted by GitBox <gi...@apache.org> on 2022/09/08 08:56:27 UTC, 0 replies.
- [GitHub] [spark] mskapilks commented on pull request #37674: [SPARK-40231][SQL][TEST] Add 1TB TPCDS Plan stability tests - posted by GitBox <gi...@apache.org> on 2022/09/08 08:58:04 UTC, 0 replies.
- [GitHub] [spark] ELHoussineT commented on pull request #37817: [SPARK-40376][PYTHON] Avoid Numpy deprecation warning - posted by GitBox <gi...@apache.org> on 2022/09/08 09:13:01 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #37674: [SPARK-40231][SQL][TEST] Add 1TB TPCDS Plan stability tests - posted by GitBox <gi...@apache.org> on 2022/09/08 09:30:25 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #37749: [SPARK-40295][SQL] Allow v2 functions with literal args in write distribution/ordering - posted by GitBox <gi...@apache.org> on 2022/09/08 09:31:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37814: [SPARK-40365][BUILD] Bump ANTLR runtime version from 4.8 to 4.9.3 - posted by GitBox <gi...@apache.org> on 2022/09/08 10:12:41 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #28953: [SPARK-32013][SQL] Support query execution before reading DataFrame and before/after writing DataFrame over JDBC - posted by GitBox <gi...@apache.org> on 2022/09/08 10:24:18 UTC, 1 replies.
- [GitHub] [spark] fanyilun commented on pull request #37803: [SPARK-39546][K8S] Support `ports` definition in executor pod template - posted by GitBox <gi...@apache.org> on 2022/09/08 11:08:49 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #37830: [SPARK-40387][SQL] Improve the implementation of Spark Decimal - posted by GitBox <gi...@apache.org> on 2022/09/08 11:13:52 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37829: [SPARK-40386][PS][SQL] Implement `ddof` in `DataFrame.cov` - posted by GitBox <gi...@apache.org> on 2022/09/08 11:51:37 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37827: [SPARK-40383][INFRA] Pin `mypy==0.920` in dev/requirements.txt - posted by GitBox <gi...@apache.org> on 2022/09/08 11:51:53 UTC, 1 replies.
- [GitHub] [spark] Yikun opened a new pull request, #37835: [SPARK-40393][PS][TESTS] Refactor expanding and rolling test for function with input - posted by GitBox <gi...@apache.org> on 2022/09/08 12:15:35 UTC, 0 replies.
- [GitHub] [spark] Yikun opened a new pull request, #37836: [WIP][SPARK-40339][PS] Implement quantile in Rolling/RollingGroupby/Expanding/ExpandingGroupby - posted by GitBox <gi...@apache.org> on 2022/09/08 12:21:25 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37836: [WIP][SPARK-40339][PS] Implement quantile in Rolling/RollingGroupby/Expanding/ExpandingGroupby - posted by GitBox <gi...@apache.org> on 2022/09/08 12:22:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37823: [SPARK-40380][SQL] Fix constant-folding of InvokeLike to avoid non-serializable literal embedded in the plan - posted by GitBox <gi...@apache.org> on 2022/09/08 13:20:42 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37823: [SPARK-40380][SQL] Fix constant-folding of InvokeLike to avoid non-serializable literal embedded in the plan - posted by GitBox <gi...@apache.org> on 2022/09/08 13:21:21 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #37747: [SPARK-40280][SQL] Add support for parquet push down for annotated int and long - posted by GitBox <gi...@apache.org> on 2022/09/08 13:54:58 UTC, 0 replies.
- [GitHub] [spark] eejbyfeldt opened a new pull request, #37837: [SPARK-40385][SQL] Fix interpreted path for companion object constructor - posted by GitBox <gi...@apache.org> on 2022/09/08 13:56:34 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #37747: [SPARK-40280][SQL] Add support for parquet push down for annotated int and long - posted by GitBox <gi...@apache.org> on 2022/09/08 14:00:54 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #37824: Spark 40362 Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/08 14:09:40 UTC, 2 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37837: [SPARK-40385][SQL] Fix interpreted path for companion object constructor - posted by GitBox <gi...@apache.org> on 2022/09/08 14:44:11 UTC, 0 replies.
- [GitHub] [spark] ahshahid commented on a diff in pull request #37824: Spark 40362 Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/08 16:15:39 UTC, 1 replies.
- [GitHub] [spark] ahshahid commented on pull request #37824: Spark 40362 Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/08 16:17:13 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #37603: [SPARK-40168][CORE] Handle FileNotFoundException when shuffle file deleted in decommissioner - posted by GitBox <gi...@apache.org> on 2022/09/08 16:28:59 UTC, 0 replies.
- [GitHub] [spark] kazuyukitanimura commented on pull request #37674: [SPARK-40231][SQL][TEST] Add 1TB TPCDS Plan stability tests - posted by GitBox <gi...@apache.org> on 2022/09/08 16:46:49 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #36027: [SPARK-38717][SQL] Handle Hive's bucket spec case preserving behaviour - posted by GitBox <gi...@apache.org> on 2022/09/08 16:55:18 UTC, 13 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37803: [SPARK-39546][K8S] Support `ports` definition in executor pod template - posted by GitBox <gi...@apache.org> on 2022/09/08 17:54:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37803: [SPARK-39546][K8S] Support `ports` definition in executor pod template - posted by GitBox <gi...@apache.org> on 2022/09/08 17:56:32 UTC, 0 replies.
- [GitHub] [spark] abubakrsiddq opened a new pull request, #37838: correct typo in rdd-programming-guide.md - posted by GitBox <gi...@apache.org> on 2022/09/08 18:08:52 UTC, 1 replies.
- [GitHub] [spark] abubakrsiddq closed pull request #37838: correct typo in rdd-programming-guide.md - posted by GitBox <gi...@apache.org> on 2022/09/08 18:13:56 UTC, 0 replies.
- [GitHub] [spark] ahshahid commented on pull request #37824: [SPARK-40362][SQL] Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/08 18:15:06 UTC, 8 replies.
- [GitHub] [spark] ahshahid commented on a diff in pull request #37824: [SPARK-40362][SQL] Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/08 18:16:07 UTC, 0 replies.
- [GitHub] [spark] abubakrsiddq opened a new pull request, #37839: correct typo in rdd-programming-guide.md - posted by GitBox <gi...@apache.org> on 2022/09/08 18:17:16 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37833: [SPARK-40292][SQL] Fix column names in "arrays_zip" function when arrays are referenced from nested structs - posted by GitBox <gi...@apache.org> on 2022/09/08 20:06:43 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #37832: [SPARK-40389][SQL] Decimals can't upcast as integral types if the cast can overflow - posted by GitBox <gi...@apache.org> on 2022/09/08 20:23:34 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37817: [SPARK-40376][PYTHON] Avoid Numpy deprecation warning - posted by GitBox <gi...@apache.org> on 2022/09/08 23:10:47 UTC, 3 replies.
- [GitHub] [spark] srowen closed pull request #37797: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 6, ~50 functions) - posted by GitBox <gi...@apache.org> on 2022/09/08 23:12:00 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37797: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 6, ~50 functions) - posted by GitBox <gi...@apache.org> on 2022/09/08 23:12:01 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #37840: [SPARK-40394][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/08 23:39:53 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #37840: [SPARK-40394][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/08 23:44:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37827: [SPARK-40383][INFRA] Pin `mypy==0.920` in dev/requirements.txt - posted by GitBox <gi...@apache.org> on 2022/09/09 00:18:32 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37816: [SPARK-40332][PS] Implement `GroupBy.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/09 00:24:49 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #37841: [SPARK-40324][SQL] Provide query context in AnalysisException - posted by GitBox <gi...@apache.org> on 2022/09/09 00:27:26 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #37841: [SPARK-40324][SQL] Provide query context in AnalysisException - posted by GitBox <gi...@apache.org> on 2022/09/09 00:28:11 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37835: [SPARK-40393][PS][TESTS] Refactor expanding and rolling test for function with input - posted by GitBox <gi...@apache.org> on 2022/09/09 00:28:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37830: [SPARK-40387][SQL] Improve the implementation of Spark Decimal - posted by GitBox <gi...@apache.org> on 2022/09/09 02:05:16 UTC, 1 replies.
- [GitHub] [spark] mridulm closed pull request #37610: [SPARK-38888][BUILD][CORE][YARN][DOCS] Add `RocksDB` support for shuffle service state store - posted by GitBox <gi...@apache.org> on 2022/09/09 02:06:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37842: [SPARK-40396][BUILD] Update scalatest and scalatestplus related dependencies to use 3.2.13 - posted by GitBox <gi...@apache.org> on 2022/09/09 03:26:58 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37835: [SPARK-40393][PS][TESTS] Refactor expanding and rolling test for function with input - posted by GitBox <gi...@apache.org> on 2022/09/09 03:28:12 UTC, 1 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37671: [SPARK-40229][PS][TEST] Re-enable excel I/O test for pandas API on Spark - posted by GitBox <gi...@apache.org> on 2022/09/09 03:37:58 UTC, 2 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37835: [SPARK-40393][PS][TESTS] Refactor expanding and rolling test for function with input - posted by GitBox <gi...@apache.org> on 2022/09/09 04:01:46 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #37831: [SPARK-40354][SQL] Support eliminate dynamic partition for datasource v1 writes - posted by GitBox <gi...@apache.org> on 2022/09/09 04:20:13 UTC, 3 replies.
- [GitHub] [spark] ulysses-you commented on pull request #37831: [SPARK-40354][SQL] Support eliminate dynamic partition for datasource v1 writes - posted by GitBox <gi...@apache.org> on 2022/09/09 04:27:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37610: [SPARK-38888][BUILD][CORE][YARN][DOCS] Add `RocksDB` support for shuffle service state store - posted by GitBox <gi...@apache.org> on 2022/09/09 05:33:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37843: [SPARK-40398][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/09 05:52:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37843: [WIP][SPARK-40398][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/09 06:10:40 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37604: [DON'T MERGE] Try to replace all `json4s` with `Jackson` - posted by GitBox <gi...@apache.org> on 2022/09/09 06:14:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37468: [SPARK-40034][SQL] PathOutputCommitters to support dynamic partitions - posted by GitBox <gi...@apache.org> on 2022/09/09 06:42:11 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37844: [DON'T MERGE] Upgrade slf4j to 2.0.0 - posted by GitBox <gi...@apache.org> on 2022/09/09 06:53:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37844: [DON'T MERGE] Upgrade slf4j to 2.0.0 - posted by GitBox <gi...@apache.org> on 2022/09/09 06:54:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37831: [SPARK-40354][SQL] Support eliminate dynamic partition for datasource v1 writes - posted by GitBox <gi...@apache.org> on 2022/09/09 07:23:21 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37841: [SPARK-40324][SQL] Provide query context in AnalysisException - posted by GitBox <gi...@apache.org> on 2022/09/09 07:48:59 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37830: [SPARK-40387][SQL] Improve the implementation of Spark Decimal - posted by GitBox <gi...@apache.org> on 2022/09/09 07:59:28 UTC, 4 replies.
- [GitHub] [spark] peter-toth commented on pull request #37824: [SPARK-40362][SQL] Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/09 08:00:42 UTC, 4 replies.
- [GitHub] [spark] SparksFyz commented on pull request #37825: [SPARK-40382][SQL] Group distinct aggregate expressions by semantically equivalent children in `RewriteDistinctAggregates` - posted by GitBox <gi...@apache.org> on 2022/09/09 08:22:57 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #37830: [SPARK-40387][SQL] Improve the implementation of Spark Decimal - posted by GitBox <gi...@apache.org> on 2022/09/09 08:26:08 UTC, 2 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37824: [SPARK-40362][SQL] Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/09 08:36:05 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37822: [SPARK-40381][DEPLOY] Support standalone worker recommission - posted by GitBox <gi...@apache.org> on 2022/09/09 08:36:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37845: [SPARK-40399][PS] DataFrame.corr `Pearson` support missing values and `min_periods ` - posted by GitBox <gi...@apache.org> on 2022/09/09 08:57:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37824: [SPARK-40362][SQL] Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/09 09:10:15 UTC, 0 replies.
- [GitHub] [spark] martin-g commented on a diff in pull request #37844: [DON'T MERGE][BUILD] Upgrade slf4j to 2.0.0 - posted by GitBox <gi...@apache.org> on 2022/09/09 09:10:36 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37845: [SPARK-40399][PS] Make `pearson` correlation in `DataFrame.corr` support missing values and `min_periods ` - posted by GitBox <gi...@apache.org> on 2022/09/09 09:11:36 UTC, 5 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37844: [DON'T MERGE][BUILD] Upgrade slf4j to 2.0.0 - posted by GitBox <gi...@apache.org> on 2022/09/09 09:12:06 UTC, 3 replies.
- [GitHub] [spark] LuciferYang closed pull request #37844: [DON'T MERGE][BUILD] Upgrade slf4j to 2.0.0 - posted by GitBox <gi...@apache.org> on 2022/09/09 09:12:24 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37844: [DON'T MERGE][BUILD] Upgrade slf4j to 2.0.0 - posted by GitBox <gi...@apache.org> on 2022/09/09 09:12:24 UTC, 2 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37819: [SPARK-40377][SQL] Allow customize maxBroadcastTableBytes and maxBroadcastRows - posted by GitBox <gi...@apache.org> on 2022/09/09 11:16:18 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37817: [SPARK-40376][PYTHON] Avoid Numpy deprecation warning - posted by GitBox <gi...@apache.org> on 2022/09/09 11:16:22 UTC, 0 replies.
- [GitHub] [spark] zzcclp opened a new pull request, #37846: [SPARK-40280][SQL][FOLLOWUP][3.2] Fix 'ParquetFilterSuite' issue - posted by GitBox <gi...@apache.org> on 2022/09/09 12:43:42 UTC, 0 replies.
- [GitHub] [spark] zzcclp opened a new pull request, #37847: [SPARK-40280][SQL][FOLLOWUP][3.3] Fix 'ParquetFilterSuite' issue - posted by GitBox <gi...@apache.org> on 2022/09/09 12:58:18 UTC, 0 replies.
- [GitHub] [spark] zzcclp commented on pull request #37846: [SPARK-40280][SQL][FOLLOWUP][3.2] Fix 'ParquetFilterSuite' issue - posted by GitBox <gi...@apache.org> on 2022/09/09 13:00:37 UTC, 1 replies.
- [GitHub] [spark] zzcclp commented on pull request #37847: [SPARK-40280][SQL][FOLLOWUP][3.3] Fix 'ParquetFilterSuite' issue - posted by GitBox <gi...@apache.org> on 2022/09/09 13:00:45 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #37845: [SPARK-40399][PS] Make `pearson` correlation in `DataFrame.corr` support missing values and `min_periods ` - posted by GitBox <gi...@apache.org> on 2022/09/09 13:22:11 UTC, 2 replies.
- [GitHub] [spark] wangyum closed pull request #37732: [SPARK-40253] [SQL] Fixed loss of precision for writing 0.00 specific… - posted by GitBox <gi...@apache.org> on 2022/09/09 13:23:39 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37842: [SPARK-40396][BUILD] Update scalatest and scalatestplus related dependencies to use stable version - posted by GitBox <gi...@apache.org> on 2022/09/09 13:24:23 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #37839: correct typo in rdd-programming-guide.md - posted by GitBox <gi...@apache.org> on 2022/09/09 13:24:36 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37839: correct typo in rdd-programming-guide.md - posted by GitBox <gi...@apache.org> on 2022/09/09 13:24:37 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #37847: [SPARK-40280][SQL][FOLLOWUP][3.3] Fix 'ParquetFilterSuite' issue - posted by GitBox <gi...@apache.org> on 2022/09/09 13:28:22 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #37846: [SPARK-40280][SQL][FOLLOWUP][3.2] Fix 'ParquetFilterSuite' issue - posted by GitBox <gi...@apache.org> on 2022/09/09 13:28:26 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on a diff in pull request #37826: [SPARK-40364][CORE] Use the unified `DBProvider#initDB ` method - posted by GitBox <gi...@apache.org> on 2022/09/09 14:28:44 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #37840: [SPARK-40394][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/09 14:51:21 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37842: [SPARK-40396][BUILD] Update scalatest and scalatestplus related dependencies to use stable version - posted by GitBox <gi...@apache.org> on 2022/09/09 16:09:36 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37843: [WIP][SPARK-40398][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/09 16:30:57 UTC, 8 replies.
- [GitHub] [spark] Bahigac commented on pull request #35220: [SPARK-37922][SQL] Combine to one cast if we can safely up-cast two casts - posted by GitBox <gi...@apache.org> on 2022/09/09 16:34:09 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #37533: [SPARK-40096]Fix finalize shuffle stage slow due to connection creation slow - posted by GitBox <gi...@apache.org> on 2022/09/09 17:07:26 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #37841: [SPARK-40324][SQL] Provide query context in AnalysisException - posted by GitBox <gi...@apache.org> on 2022/09/09 17:40:18 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #37840: [SPARK-40394][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/09 17:49:09 UTC, 12 replies.
- [GitHub] [spark] huaxingao commented on pull request #37847: [SPARK-40280][SQL][FOLLOWUP][3.3] Fix 'ParquetFilterSuite' issue - posted by GitBox <gi...@apache.org> on 2022/09/09 18:06:03 UTC, 4 replies.
- [GitHub] [spark] gengliangwang commented on pull request #37847: [SPARK-40280][SQL][FOLLOWUP][3.3] Fix 'ParquetFilterSuite' issue - posted by GitBox <gi...@apache.org> on 2022/09/09 18:09:54 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #37846: [SPARK-40280][SQL][FOLLOWUP][3.2] Fix 'ParquetFilterSuite' issue - posted by GitBox <gi...@apache.org> on 2022/09/09 18:12:56 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37468: [SPARK-40034][SQL] PathOutputCommitters to support dynamic partitions - posted by GitBox <gi...@apache.org> on 2022/09/09 18:29:49 UTC, 0 replies.
- [GitHub] [spark] huaxingao closed pull request #37847: [SPARK-40280][SQL][FOLLOWUP][3.3] Fix 'ParquetFilterSuite' issue - posted by GitBox <gi...@apache.org> on 2022/09/09 18:33:35 UTC, 0 replies.
- [GitHub] [spark] huaxingao closed pull request #37846: [SPARK-40280][SQL][FOLLOWUP][3.2] Fix 'ParquetFilterSuite' issue - posted by GitBox <gi...@apache.org> on 2022/09/09 18:35:00 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37733: [SPARK-40267][DOC] Add description for ExecutorAllocationManager metrics - posted by GitBox <gi...@apache.org> on 2022/09/09 18:37:48 UTC, 1 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #37848: [SPARK-40389][SQL][FollowUp][3.3] Fix a test failure in SQLQuerySuite - posted by GitBox <gi...@apache.org> on 2022/09/09 18:47:35 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37848: [SPARK-40389][SQL][FollowUp][3.3] Fix a test failure in SQLQuerySuite - posted by GitBox <gi...@apache.org> on 2022/09/09 18:49:52 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37848: [SPARK-40389][SQL][FollowUp][3.3] Fix a test failure in SQLQuerySuite - posted by GitBox <gi...@apache.org> on 2022/09/09 18:57:41 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #37841: [SPARK-40324][SQL] Provide query context in AnalysisException - posted by GitBox <gi...@apache.org> on 2022/09/09 18:58:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37716: [SPARK-40269][CORE] Randomize the orders of peer in BlockManagerDecommissioner - posted by GitBox <gi...@apache.org> on 2022/09/09 19:02:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37826: [SPARK-40364][CORE] Use the unified `DBProvider#initDB ` method - posted by GitBox <gi...@apache.org> on 2022/09/09 19:06:51 UTC, 7 replies.
- [GitHub] [spark] roczei commented on a diff in pull request #37679: [SPARK-35242][SQL] Support changing session catalog's default database - posted by GitBox <gi...@apache.org> on 2022/09/09 19:59:36 UTC, 11 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37603: [SPARK-40168][CORE] Handle `SparkException` during shuffle block migration - posted by GitBox <gi...@apache.org> on 2022/09/09 21:22:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37813: [SPARK-40228][SQL][3.3] Do not simplify multiLike if child is not a cheap expression - posted by GitBox <gi...@apache.org> on 2022/09/09 21:33:06 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37808: [SPARK-39830][SQL][TESTS][3.3] Add a test case to read ORC table that requires type promotion - posted by GitBox <gi...@apache.org> on 2022/09/09 21:37:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37730: [SPARK-39915][SQL][3.3] Dataset.repartition(N) may not create N partitions Non-AQE part - posted by GitBox <gi...@apache.org> on 2022/09/09 21:43:48 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37806: [MINOR][SQL] Print stacktrace when NoClassDefFoundError in HiveDelegationToken - posted by GitBox <gi...@apache.org> on 2022/09/09 23:17:01 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37806: [MINOR][SQL] Print stacktrace when NoClassDefFoundError in HiveDelegationToken - posted by GitBox <gi...@apache.org> on 2022/09/09 23:17:02 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37820: [MINOR][PS][DOCS] Fix note in missing pandas - posted by GitBox <gi...@apache.org> on 2022/09/09 23:17:43 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37820: [MINOR][PS][DOCS] Fix note in missing pandas - posted by GitBox <gi...@apache.org> on 2022/09/09 23:17:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37802: [SPARK-40350][Kubernetes] Use spark config to configure the parameters of volcano podgroup - posted by GitBox <gi...@apache.org> on 2022/09/09 23:37:26 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37802: [SPARK-40350][Kubernetes] Use spark config to configure the parameters of volcano podgroup - posted by GitBox <gi...@apache.org> on 2022/09/09 23:37:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37813: [SPARK-40228][SQL][3.3] Do not simplify multiLike if child is not a cheap expression - posted by GitBox <gi...@apache.org> on 2022/09/09 23:50:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37813: [SPARK-40228][SQL][3.3] Do not simplify multiLike if child is not a cheap expression - posted by GitBox <gi...@apache.org> on 2022/09/09 23:50:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #37849: [SPARK-40401][CORE] Remove the support of deprecated `spark.akka.*` configs - posted by GitBox <gi...@apache.org> on 2022/09/10 00:28:36 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37849: [SPARK-40401][CORE] Remove the support of deprecated `spark.akka.*` configs - posted by GitBox <gi...@apache.org> on 2022/09/10 00:32:18 UTC, 1 replies.
- [GitHub] [spark] wangyum closed pull request #37849: [SPARK-40401][CORE] Remove the support of deprecated `spark.akka.*` configs - posted by GitBox <gi...@apache.org> on 2022/09/10 04:17:45 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37849: [SPARK-40401][CORE] Remove the support of deprecated `spark.akka.*` configs - posted by GitBox <gi...@apache.org> on 2022/09/10 04:18:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37834: [SPARK-40400][SQL] Pass error message parameters to exceptions as maps - posted by GitBox <gi...@apache.org> on 2022/09/10 06:11:50 UTC, 2 replies.
- [GitHub] [spark] khalidmammadov opened a new pull request, #37850: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 7, ~30 functions) - posted by GitBox <gi...@apache.org> on 2022/09/10 13:08:57 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37843: [SPARK-40398][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/10 13:35:26 UTC, 8 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37841: [SPARK-40324][SQL] Provide query context in AnalysisException - posted by GitBox <gi...@apache.org> on 2022/09/10 15:02:45 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37840: [SPARK-40394][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/10 15:02:46 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37841: [SPARK-40324][SQL] Provide query context in AnalysisException - posted by GitBox <gi...@apache.org> on 2022/09/10 15:04:00 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #37826: [SPARK-40364][CORE] Use the unified `DBProvider#initDB ` method - posted by GitBox <gi...@apache.org> on 2022/09/10 16:38:02 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37840: [SPARK-40394][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/10 17:49:15 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #37851: [SPARK-40362][SQL] Fix BinaryComparison canonicalization - posted by GitBox <gi...@apache.org> on 2022/09/10 18:04:06 UTC, 0 replies.
- [GitHub] [spark] ahshahid commented on pull request #37851: [SPARK-40362][SQL] Fix BinaryComparison canonicalization - posted by GitBox <gi...@apache.org> on 2022/09/10 18:17:04 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37833: [SPARK-40292][SQL] Fix column names in "arrays_zip" function when arrays are referenced from nested structs - posted by GitBox <gi...@apache.org> on 2022/09/10 18:18:56 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #37843: [SPARK-40398][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/10 18:36:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37843: [SPARK-40398][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/10 18:47:57 UTC, 2 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37843: [SPARK-40398][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/10 20:40:56 UTC, 0 replies.
- [GitHub] [spark] santosh-d3vpl3x closed pull request #37555: [SPARK-40119][CORE] Add cancel job group reason - posted by GitBox <gi...@apache.org> on 2022/09/10 22:12:13 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #37851: [SPARK-40362][SQL] Fix BinaryComparison canonicalization - posted by GitBox <gi...@apache.org> on 2022/09/11 07:31:54 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37842: [SPARK-40396][BUILD] Update scalatest and scalatestplus related dependencies to use stable version - posted by GitBox <gi...@apache.org> on 2022/09/11 10:39:44 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37842: [SPARK-40396][BUILD] Update scalatest and scalatestplus related dependencies to use stable version - posted by GitBox <gi...@apache.org> on 2022/09/11 10:40:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37842: [SPARK-40396][BUILD] Update scalatest and scalatestplus related dependencies to use stable version - posted by GitBox <gi...@apache.org> on 2022/09/11 11:06:10 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #37799: [SPARK-40331][DOCS] Recommend use Java 11/17 as the runtime environment of Spark - posted by GitBox <gi...@apache.org> on 2022/09/11 13:41:47 UTC, 0 replies.
- [GitHub] [spark] Kimahriman commented on a diff in pull request #34895: [SPARK-6305][BUILD] Migrate from log4j1 to log4j2 - posted by GitBox <gi...@apache.org> on 2022/09/11 14:25:13 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37843: [SPARK-40398][CORE][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/11 15:17:24 UTC, 31 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #34895: [SPARK-6305][BUILD] Migrate from log4j1 to log4j2 - posted by GitBox <gi...@apache.org> on 2022/09/11 15:47:23 UTC, 2 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37843: [SPARK-40398][CORE][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/11 16:27:14 UTC, 3 replies.
- [GitHub] [spark] bersprockets commented on a diff in pull request #37825: [SPARK-40382][SQL] Group distinct aggregate expressions by semantically equivalent children in `RewriteDistinctAggregates` - posted by GitBox <gi...@apache.org> on 2022/09/11 17:19:13 UTC, 4 replies.
- [GitHub] [spark] bersprockets commented on pull request #37825: [SPARK-40382][SQL] Group distinct aggregate expressions by semantically equivalent children in `RewriteDistinctAggregates` - posted by GitBox <gi...@apache.org> on 2022/09/11 17:40:29 UTC, 2 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #34895: [SPARK-6305][BUILD] Migrate from log4j1 to log4j2 - posted by GitBox <gi...@apache.org> on 2022/09/11 17:58:57 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37850: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 7, ~30 functions) - posted by GitBox <gi...@apache.org> on 2022/09/11 23:25:32 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #37833: [SPARK-40292][SQL] Fix column names in "arrays_zip" function when arrays are referenced from nested structs - posted by GitBox <gi...@apache.org> on 2022/09/11 23:46:44 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #37852: [SPARK-40403][SQL] Calculate unsafe array size using longs to avoid negative size in error message - posted by GitBox <gi...@apache.org> on 2022/09/12 00:51:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37845: [SPARK-40399][PS] Make `pearson` correlation in `DataFrame.corr` support missing values and `min_periods ` - posted by GitBox <gi...@apache.org> on 2022/09/12 01:27:40 UTC, 4 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37853: [SPARK-40404][DOCS] Fix the error description related to `spark.shuffle.service.db` in the document - posted by GitBox <gi...@apache.org> on 2022/09/12 02:58:57 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37854: [SPARK-40406][CORE] Change default logging to stderr to consistent with the behavior of log4j1 - posted by GitBox <gi...@apache.org> on 2022/09/12 03:25:05 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on a diff in pull request #37733: [SPARK-40267][DOC] Add description for ExecutorAllocationManager metrics - posted by GitBox <gi...@apache.org> on 2022/09/12 03:56:16 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #36717: [SPARK-33274][SS] Stop query in cp mode when total cores less than total kafka partition - posted by GitBox <gi...@apache.org> on 2022/09/12 03:58:50 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37833: [SPARK-40292][SQL] Fix column names in "arrays_zip" function when arrays are referenced from nested structs - posted by GitBox <gi...@apache.org> on 2022/09/12 04:34:03 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37852: [SPARK-40403][SQL] Calculate unsafe array size using longs to avoid negative size in error message - posted by GitBox <gi...@apache.org> on 2022/09/12 04:39:20 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37843: [SPARK-40398][CORE][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/12 07:13:07 UTC, 10 replies.
- [GitHub] [spark] RS131419 closed pull request #37230: [SPARK-33326][SQL] Fix the problem of writing hive partition table without updating metadata information - posted by GitBox <gi...@apache.org> on 2022/09/12 07:33:34 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37853: [SPARK-40404][DOCS] Fix the wrong description related to `spark.shuffle.service.db.enabled` in the document - posted by GitBox <gi...@apache.org> on 2022/09/12 07:39:04 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on pull request #37826: [SPARK-40364][CORE] Use the unified `DBProvider#initDB ` method - posted by GitBox <gi...@apache.org> on 2022/09/12 07:42:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37853: [SPARK-40404][DOCS] Fix the wrong description related to `spark.shuffle.service.db.enabled` in the document - posted by GitBox <gi...@apache.org> on 2022/09/12 07:50:30 UTC, 11 replies.
- [GitHub] [spark] wbo4958 opened a new pull request, #37855: [WIP][SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/12 08:57:55 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37834: [SPARK-40400][SQL] Pass error message parameters to exceptions as maps - posted by GitBox <gi...@apache.org> on 2022/09/12 10:14:22 UTC, 23 replies.
- [GitHub] [spark] tgravescs commented on pull request #37826: [SPARK-40364][CORE] Use the unified `DBProvider#initDB ` method - posted by GitBox <gi...@apache.org> on 2022/09/12 13:54:06 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #37834: [SPARK-40400][SQL] Pass error message parameters to exceptions as maps - posted by GitBox <gi...@apache.org> on 2022/09/12 14:16:50 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #37840: [SPARK-40394][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/12 14:19:56 UTC, 8 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37851: [SPARK-40362][SQL] Fix BinaryComparison canonicalization - posted by GitBox <gi...@apache.org> on 2022/09/12 14:20:01 UTC, 5 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #37851: [SPARK-40362][SQL] Fix BinaryComparison canonicalization - posted by GitBox <gi...@apache.org> on 2022/09/12 14:26:52 UTC, 10 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #37856: [SPARK-40107][SQL][FOLLOW-UP] Update `empty2null` check - posted by GitBox <gi...@apache.org> on 2022/09/12 14:32:29 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #37843: [SPARK-40398][CORE][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/12 15:41:22 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37857: [WIP][SPARK-38734][SQL] Remove the error class `INDEX_OUT_OF_BOUNDS` - posted by GitBox <gi...@apache.org> on 2022/09/12 16:05:21 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #37635: [SPARK-40131][PYTHON] Support NumPy ndarray in built-in functions - posted by GitBox <gi...@apache.org> on 2022/09/12 17:07:34 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37857: [SPARK-38734][SQL] Remove the error class `INDEX_OUT_OF_BOUNDS` - posted by GitBox <gi...@apache.org> on 2022/09/12 17:12:27 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #37840: [SPARK-40394][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/12 18:07:34 UTC, 0 replies.
- [GitHub] [spark] ahshahid commented on pull request #33983: [SPARK-33152] [SQL] New algorithm for ConstraintsPropagation rule to solve the problem of performance & OOM if the query plans have large expressions involving multiple aliases - posted by GitBox <gi...@apache.org> on 2022/09/12 19:20:06 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #37856: [SPARK-40107][SQL][FOLLOW-UP] Update `empty2null` check - posted by GitBox <gi...@apache.org> on 2022/09/12 19:24:03 UTC, 0 replies.
- [GitHub] [spark] santosh-d3vpl3x commented on pull request #37761: [SPARK-40311][SQL][PYTHON] Add withColumnsRenamed to scala and pyspark API - posted by GitBox <gi...@apache.org> on 2022/09/12 19:56:34 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #37858: [SPARK-40410][TESTS] Migrate trait QueryErrorsSuiteBase into SparkFunSuite - posted by GitBox <gi...@apache.org> on 2022/09/12 21:06:37 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #33983: [SPARK-33152] [SQL] New algorithm for ConstraintsPropagation rule to solve the problem of performance & OOM if the query plans have large expressions involving multiple aliases - posted by GitBox <gi...@apache.org> on 2022/09/12 21:43:43 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on pull request #37850: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 7, ~30 functions) - posted by GitBox <gi...@apache.org> on 2022/09/12 21:51:34 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #37834: [SPARK-40400][SQL] Pass error message parameters to exceptions as maps - posted by GitBox <gi...@apache.org> on 2022/09/12 21:58:44 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37850: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 7, ~30 functions) - posted by GitBox <gi...@apache.org> on 2022/09/12 22:12:20 UTC, 0 replies.
- [GitHub] [spark] ahshahid closed pull request #37824: [SPARK-40362][SQL] Bug in Canonicalization of expressions like Add & Multiply i.e Commutative Operators - posted by GitBox <gi...@apache.org> on 2022/09/12 22:16:41 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #37811: [SPARK-40360] *_ALREADY_EXISTS and *_NOT_FOUND error - posted by GitBox <gi...@apache.org> on 2022/09/12 23:32:59 UTC, 2 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37850: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 7, ~30 functions) - posted by GitBox <gi...@apache.org> on 2022/09/13 01:37:20 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37817: [SPARK-40376][PYTHON] Avoid Numpy deprecation warning - posted by GitBox <gi...@apache.org> on 2022/09/13 01:46:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37858: [SPARK-40410][TESTS] Migrate trait QueryErrorsSuiteBase into SparkFunSuite - posted by GitBox <gi...@apache.org> on 2022/09/13 01:54:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37858: [SPARK-40410][TESTS] Migrate trait QueryErrorsSuiteBase into SparkFunSuite - posted by GitBox <gi...@apache.org> on 2022/09/13 01:55:10 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37845: [SPARK-40399][PS] Make `pearson` correlation in `DataFrame.corr` support missing values and `min_periods ` - posted by GitBox <gi...@apache.org> on 2022/09/13 01:59:42 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #37845: [SPARK-40399][PS] Make `pearson` correlation in `DataFrame.corr` support missing values and `min_periods ` - posted by GitBox <gi...@apache.org> on 2022/09/13 02:26:18 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37835: [SPARK-40393][PS][TESTS] Refactor expanding and rolling test for function with input - posted by GitBox <gi...@apache.org> on 2022/09/13 02:37:12 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #37827: [SPARK-40383][INFRA] Pin `mypy==0.920` in dev/requirements.txt - posted by GitBox <gi...@apache.org> on 2022/09/13 02:43:08 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #37817: [SPARK-40376][PYTHON] Avoid Numpy deprecation warning - posted by GitBox <gi...@apache.org> on 2022/09/13 02:44:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37858: [SPARK-40410][TESTS] Migrate trait QueryErrorsSuiteBase into SparkFunSuite - posted by GitBox <gi...@apache.org> on 2022/09/13 02:49:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #37859: [SPARK-40411][SS] Refactor FlatMapGroupsWithStateExec to have a parent trait - posted by GitBox <gi...@apache.org> on 2022/09/13 03:05:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37859: [SPARK-40411][SS] Refactor FlatMapGroupsWithStateExec to have a parent trait - posted by GitBox <gi...@apache.org> on 2022/09/13 03:05:29 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37671: [SPARK-40229][PS][TEST] Re-enable excel I/O test for pandas API on Spark - posted by GitBox <gi...@apache.org> on 2022/09/13 03:22:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37835: [SPARK-40393][PS][TESTS] Refactor expanding and rolling test for function with input - posted by GitBox <gi...@apache.org> on 2022/09/13 03:27:46 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37829: [SPARK-40386][PS][SQL] Implement `ddof` in `DataFrame.cov` - posted by GitBox <gi...@apache.org> on 2022/09/13 03:44:12 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37828: [SPARK-40384][INFRA] Only do base image real in time build when infra dockerfile is changed - posted by GitBox <gi...@apache.org> on 2022/09/13 03:44:28 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37845: [SPARK-40399][PS] Make `pearson` correlation in `DataFrame.corr` support missing values and `min_periods ` - posted by GitBox <gi...@apache.org> on 2022/09/13 03:49:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37609: [SPARK-40175][SQL]Speed up conversion of Tuple2 to Scala Map - posted by GitBox <gi...@apache.org> on 2022/09/13 04:00:27 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37859: [SPARK-40411][SS] Refactor FlatMapGroupsWithStateExec to have a parent trait - posted by GitBox <gi...@apache.org> on 2022/09/13 04:02:43 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37856: [SPARK-40107][SQL][FOLLOW-UP] Update `empty2null` check - posted by GitBox <gi...@apache.org> on 2022/09/13 04:08:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37856: [SPARK-40107][SQL][FOLLOW-UP] Update `empty2null` check - posted by GitBox <gi...@apache.org> on 2022/09/13 04:08:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37854: [SPARK-40406][CORE] Change default logging to stderr to consistent with the behavior of log4j1 - posted by GitBox <gi...@apache.org> on 2022/09/13 04:10:19 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #37859: [SPARK-40411][SS] Refactor FlatMapGroupsWithStateExec to have a parent trait - posted by GitBox <gi...@apache.org> on 2022/09/13 04:15:54 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #37854: [SPARK-40406][CORE] Change default logging to stderr to consistent with the behavior of log4j1 - posted by GitBox <gi...@apache.org> on 2022/09/13 04:23:05 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37852: [SPARK-40403][SQL] Calculate unsafe array size using longs to avoid negative size in error message - posted by GitBox <gi...@apache.org> on 2022/09/13 04:32:52 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37852: [SPARK-40403][SQL] Calculate unsafe array size using longs to avoid negative size in error message - posted by GitBox <gi...@apache.org> on 2022/09/13 04:38:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37609: [SPARK-40175][SQL]Speed up conversion of Tuple2 to Scala Map - posted by GitBox <gi...@apache.org> on 2022/09/13 04:40:26 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37834: [SPARK-40400][SQL] Pass error message parameters to exceptions as maps - posted by GitBox <gi...@apache.org> on 2022/09/13 04:46:07 UTC, 2 replies.
- [GitHub] [spark] viirya closed pull request #37854: [SPARK-40406][CORE] Change default logging to stderr to consistent with the behavior of log4j1 - posted by GitBox <gi...@apache.org> on 2022/09/13 05:05:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37835: [SPARK-40393][PS][TESTS] Refactor expanding and rolling test for function with input - posted by GitBox <gi...@apache.org> on 2022/09/13 05:37:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37835: [SPARK-40393][PS][TESTS] Refactor expanding and rolling test for function with input - posted by GitBox <gi...@apache.org> on 2022/09/13 05:39:21 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #37730: [SPARK-39915][SQL][3.3] Dataset.repartition(N) may not create N partitions Non-AQE part - posted by GitBox <gi...@apache.org> on 2022/09/13 05:51:31 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37855: [WIP][SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/13 05:52:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37837: [SPARK-40385][SQL] Fix interpreted path for companion object constructor - posted by GitBox <gi...@apache.org> on 2022/09/13 06:06:10 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37837: [SPARK-40385][SQL] Fix interpreted path for companion object constructor - posted by GitBox <gi...@apache.org> on 2022/09/13 06:06:47 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37859: [SPARK-40411][SS] Refactor FlatMapGroupsWithStateExec to have a parent trait - posted by GitBox <gi...@apache.org> on 2022/09/13 06:15:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37855: [WIP][SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/13 06:34:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37836: [SPARK-40339][SPARK-40342][SPARK-40345][SPARK-40348][PS] Implement quantile in Rolling/RollingGroupby/Expanding/ExpandingGroupby - posted by GitBox <gi...@apache.org> on 2022/09/13 06:35:48 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37845: [SPARK-40399][PS] Make `pearson` correlation in `DataFrame.corr` support missing values and `min_periods ` - posted by GitBox <gi...@apache.org> on 2022/09/13 06:44:51 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #37859: [SPARK-40411][SS] Refactor FlatMapGroupsWithStateExec to have a parent trait - posted by GitBox <gi...@apache.org> on 2022/09/13 06:47:29 UTC, 1 replies.
- [GitHub] [spark] eejbyfeldt commented on a diff in pull request #37837: [SPARK-40385][SQL] Fix interpreted path for companion object constructor - posted by GitBox <gi...@apache.org> on 2022/09/13 06:50:04 UTC, 5 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37487: [SPARK-40053][CORE][SQL][TESTS] Add `assume` to dynamic cancel cases which requiring Python runtime environment - posted by GitBox <gi...@apache.org> on 2022/09/13 06:53:34 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37721: [SPARK-40272][CORE]Support service port custom with range - posted by GitBox <gi...@apache.org> on 2022/09/13 07:01:35 UTC, 2 replies.
- [GitHub] [spark] pan3793 commented on pull request #37842: [SPARK-40396][BUILD] Update scalatest and scalatestplus related dependencies to use stable version - posted by GitBox <gi...@apache.org> on 2022/09/13 07:38:23 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37857: [SPARK-38734][SQL] Remove the error class `INDEX_OUT_OF_BOUNDS` - posted by GitBox <gi...@apache.org> on 2022/09/13 07:55:32 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #37861: [SPARK-40324][SQL][FOLLOWUP] Fix a bug in setting query context in Analyzer - posted by GitBox <gi...@apache.org> on 2022/09/13 08:36:44 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #37861: [SPARK-40324][SQL][FOLLOWUP] Fix a bug in setting query context in Analyzer - posted by GitBox <gi...@apache.org> on 2022/09/13 08:39:26 UTC, 1 replies.
- [GitHub] [spark] weixiuli opened a new pull request, #37862: [MINOR][SQL] Remove an unnecessary parameter of the PartitionedFileUtil.splitFiles - posted by GitBox <gi...@apache.org> on 2022/09/13 09:23:40 UTC, 0 replies.
- [GitHub] [spark] wbo4958 commented on a diff in pull request #37855: [WIP][SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/13 11:43:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37860: [DO-NOT-MERGE] - posted by GitBox <gi...@apache.org> on 2022/09/13 11:50:33 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37487: [SPARK-40053][CORE][SQL][TESTS] Add `assume` to dynamic cancel cases which requiring Python runtime environment - posted by GitBox <gi...@apache.org> on 2022/09/13 12:02:42 UTC, 0 replies.
- [GitHub] [spark] LeeeeLiu commented on pull request #37819: [SPARK-40377][SQL] Allow customize maxBroadcastTableBytes and maxBroadcastRows - posted by GitBox <gi...@apache.org> on 2022/09/13 12:19:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37828: [SPARK-40384][INFRA] Only do base image real in time build when infra dockerfile is changed - posted by GitBox <gi...@apache.org> on 2022/09/13 12:27:24 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #37863: [WIP][DO-NOT-MERGE] Reference PR for flatMapGroupsWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/13 12:42:17 UTC, 0 replies.
- [GitHub] [spark] wbo4958 commented on pull request #37855: [WIP][SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/13 12:47:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37612: [SPARK-39915][SQL] Ensure the output partitioning is user-specified in AQE - posted by GitBox <gi...@apache.org> on 2022/09/13 12:56:57 UTC, 1 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37828: [SPARK-40384][INFRA] Only do base image real in time build when infra dockerfile is changed - posted by GitBox <gi...@apache.org> on 2022/09/13 12:57:17 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37828: [SPARK-40384][INFRA] Only do base image real in time build when infra dockerfile is changed - posted by GitBox <gi...@apache.org> on 2022/09/13 13:09:00 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #37864: [SPARK-40414][SQL][PYSPARK] More generic type on PythonArrowInput and PythonArrowOutput - posted by GitBox <gi...@apache.org> on 2022/09/13 13:10:45 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #37612: [SPARK-39915][SQL] Ensure the output partitioning is user-specified in AQE - posted by GitBox <gi...@apache.org> on 2022/09/13 13:11:07 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37864: [SPARK-40414][SQL][PYSPARK] More generic type on PythonArrowInput and PythonArrowOutput - posted by GitBox <gi...@apache.org> on 2022/09/13 13:12:24 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37828: [SPARK-40384][INFRA] Only do base image real in time build when infra dockerfile is changed - posted by GitBox <gi...@apache.org> on 2022/09/13 13:31:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37864: [SPARK-40414][SQL][PYSPARK] More generic type on PythonArrowInput and PythonArrowOutput - posted by GitBox <gi...@apache.org> on 2022/09/13 13:45:22 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37836: [SPARK-40339][SPARK-40342][SPARK-40345][SPARK-40348][PS] Implement quantile in Rolling/RollingGroupby/Expanding/ExpandingGroupby - posted by GitBox <gi...@apache.org> on 2022/09/13 13:48:57 UTC, 2 replies.
- [GitHub] [spark] MaxGekk closed pull request #37834: [SPARK-40400][SQL] Pass error message parameters to exceptions as maps - posted by GitBox <gi...@apache.org> on 2022/09/13 14:25:26 UTC, 0 replies.
- [GitHub] [spark] Yikun opened a new pull request, #37865: [SPARK-40384][INFRA][FOLLOWUP] Also trigger PySpark and SparkR job when changing dockerfile - posted by GitBox <gi...@apache.org> on 2022/09/13 14:31:51 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37865: [SPARK-40384][INFRA][FOLLOWUP] Also trigger PySpark and SparkR job when changing dockerfile - posted by GitBox <gi...@apache.org> on 2022/09/13 14:37:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37851: [SPARK-40362][SQL] Fix BinaryComparison canonicalization - posted by GitBox <gi...@apache.org> on 2022/09/13 14:52:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37851: [SPARK-40362][SQL] Fix BinaryComparison canonicalization - posted by GitBox <gi...@apache.org> on 2022/09/13 14:52:47 UTC, 1 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #37866: [SPARK-40362][SQL][3.3] Fix BinaryComparison canonicalization - posted by GitBox <gi...@apache.org> on 2022/09/13 15:02:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37867: [SPARK-40415][BUILD] Add explicit Maven dependency for okio - posted by GitBox <gi...@apache.org> on 2022/09/13 15:26:46 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37867: [SPARK-40415][BUILD] Add explicit Maven dependency for okio - posted by GitBox <gi...@apache.org> on 2022/09/13 15:32:00 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37867: [SPARK-40415][BUILD][K8S] Add explicit Maven dependency for okio - posted by GitBox <gi...@apache.org> on 2022/09/13 15:35:29 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37867: [SPARK-40415][BUILD][K8S] Add explicit Maven dependency for okio - posted by GitBox <gi...@apache.org> on 2022/09/13 15:43:08 UTC, 3 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37868: [SPARK-40397][BUILD] Upgrade `org.scalatestplus:selenium` to 3.12.13 - posted by GitBox <gi...@apache.org> on 2022/09/13 16:07:25 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37869: [WIP][SQL] Migrate type check fails in CAST to error classes - posted by GitBox <gi...@apache.org> on 2022/09/13 16:08:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37868: [SPARK-40397][BUILD] Upgrade `org.scalatestplus:selenium` to 3.12.13 - posted by GitBox <gi...@apache.org> on 2022/09/13 16:08:35 UTC, 11 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37867: [SPARK-40415][BUILD][K8S] Add explicit Maven dependency for okio - posted by GitBox <gi...@apache.org> on 2022/09/13 16:08:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #37867: [SPARK-40415][BUILD][K8S] Add explicit Maven dependency for okio - posted by GitBox <gi...@apache.org> on 2022/09/13 16:09:54 UTC, 0 replies.
- [GitHub] [spark] ahshahid opened a new pull request, #37870: [SPARK-33152] [SQL] New algorithm for ConstraintsPropagation rule to solve the problem of performance & OOM if the query plans have large expressions involving multiple aliases - posted by GitBox <gi...@apache.org> on 2022/09/13 16:39:43 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #37866: [SPARK-40362][SQL][3.3] Fix BinaryComparison canonicalization - posted by GitBox <gi...@apache.org> on 2022/09/13 17:01:46 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #37840: [SPARK-40416][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/13 17:18:26 UTC, 0 replies.
- [GitHub] [spark] mengxr commented on a diff in pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2022/09/13 17:18:52 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #37710: [DRAFT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/13 18:03:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37871: [WIP][SQL] Return a map from SparkThrowable.getMessageParameters - posted by GitBox <gi...@apache.org> on 2022/09/13 18:09:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #37872: [SPARK-40417][K8S][DOCS] Use YuniKorn v1.1+ - posted by GitBox <gi...@apache.org> on 2022/09/13 18:26:39 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #37864: [SPARK-40414][SQL][PYTHON] More generic type on PythonArrowInput and PythonArrowOutput - posted by GitBox <gi...@apache.org> on 2022/09/13 18:41:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37872: [SPARK-40417][K8S][DOCS] Use YuniKorn v1.1+ - posted by GitBox <gi...@apache.org> on 2022/09/13 19:18:59 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37866: [SPARK-40362][SQL][3.3] Fix BinaryComparison canonicalization - posted by GitBox <gi...@apache.org> on 2022/09/13 19:42:48 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #37872: [SPARK-40417][K8S][DOCS] Use YuniKorn v1.1+ - posted by GitBox <gi...@apache.org> on 2022/09/13 19:47:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37872: [SPARK-40417][K8S][DOCS] Use YuniKorn v1.1+ - posted by GitBox <gi...@apache.org> on 2022/09/13 19:52:39 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37861: [SPARK-40324][SQL][FOLLOWUP] Fix a bug in setting query context in Analyzer - posted by GitBox <gi...@apache.org> on 2022/09/13 20:22:33 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #37861: [SPARK-40324][SQL][FOLLOWUP] Fix a bug in setting query context in Analyzer - posted by GitBox <gi...@apache.org> on 2022/09/13 20:24:58 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on a diff in pull request #37852: [SPARK-40403][SQL] Calculate unsafe array size using longs to avoid negative size in error message - posted by GitBox <gi...@apache.org> on 2022/09/13 20:46:10 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #37840: [SPARK-40416][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/13 21:35:05 UTC, 12 replies.
- [GitHub] [spark] Yikun closed pull request #37865: [SPARK-40384][INFRA][FOLLOWUP] Also trigger PySpark and SparkR job when changing dockerfile - posted by GitBox <gi...@apache.org> on 2022/09/13 22:26:42 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37870: [SPARK-33152] [SQL] New algorithm for ConstraintsPropagation rule to solve the problem of performance & OOM if the query plans have large expressions involving multiple aliases - posted by GitBox <gi...@apache.org> on 2022/09/14 00:43:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37612: [SPARK-39915][SQL] Ensure the output partitioning is user-specified in AQE - posted by GitBox <gi...@apache.org> on 2022/09/14 00:43:15 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #37864: [SPARK-40414][SQL][PYTHON] More generic type on PythonArrowInput and PythonArrowOutput - posted by GitBox <gi...@apache.org> on 2022/09/14 01:14:16 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37864: [SPARK-40414][SQL][PYTHON] More generic type on PythonArrowInput and PythonArrowOutput - posted by GitBox <gi...@apache.org> on 2022/09/14 02:13:46 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37852: [SPARK-40403][SQL] Calculate unsafe array size using longs to avoid negative size in error message - posted by GitBox <gi...@apache.org> on 2022/09/14 02:46:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37868: [SPARK-40397][BUILD] Upgrade `org.scalatestplus:selenium` to 3.12.13 - posted by GitBox <gi...@apache.org> on 2022/09/14 03:16:51 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37868: [SPARK-40397][BUILD] Upgrade `org.scalatestplus:selenium` to 3.12.13 - posted by GitBox <gi...@apache.org> on 2022/09/14 03:17:49 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37836: [SPARK-40339][SPARK-40342][SPARK-40345][SPARK-40348][PS] Implement quantile in Rolling/RollingGroupby/Expanding/ExpandingGroupby - posted by GitBox <gi...@apache.org> on 2022/09/14 04:12:45 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #37873: [SPARK-40419][SQL][TESTS] Integrate Grouped Aggregate Pandas UDFs into *.sql test cases - posted by GitBox <gi...@apache.org> on 2022/09/14 04:45:14 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #37861: [SPARK-40324][SQL][FOLLOWUP] Fix a bug in setting query context in Analyzer - posted by GitBox <gi...@apache.org> on 2022/09/14 05:07:26 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #37811: [SPARK-40360] *_ALREADY_EXISTS and *_NOT_FOUND error - posted by GitBox <gi...@apache.org> on 2022/09/14 05:10:56 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #37840: [SPARK-40416][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/14 05:13:05 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37864: [SPARK-40414][SQL][PYTHON] More generic type on PythonArrowInput and PythonArrowOutput - posted by GitBox <gi...@apache.org> on 2022/09/14 05:18:10 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #37811: [SPARK-40360] *_ALREADY_EXISTS and *_NOT_FOUND error - posted by GitBox <gi...@apache.org> on 2022/09/14 05:26:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37874: [SPARK-40421][PS] Make `spearman` correlation in `DataFrame.corr` support missing values and `min_periods` - posted by GitBox <gi...@apache.org> on 2022/09/14 05:55:39 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37875: [WIP][SPARK-40420][SQL] Sort error message parameters by names in the JSON formats - posted by GitBox <gi...@apache.org> on 2022/09/14 05:59:51 UTC, 0 replies.
- [GitHub] [spark] sarutak commented on a diff in pull request #37868: [SPARK-40397][BUILD] Upgrade `org.scalatestplus:selenium` to 3.12.13 - posted by GitBox <gi...@apache.org> on 2022/09/14 06:13:48 UTC, 2 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37862: [MINOR][SQL] Remove an unnecessary parameter of the PartitionedFileUtil.splitFiles - posted by GitBox <gi...@apache.org> on 2022/09/14 06:59:04 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37836: [SPARK-40339][SPARK-40342][SPARK-40345][SPARK-40348][PS] Implement quantile in Rolling/RollingGroupby/Expanding/ExpandingGroupby - posted by GitBox <gi...@apache.org> on 2022/09/14 07:30:52 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37875: [SPARK-40420][SQL] Sort error message parameters by names in the JSON formats - posted by GitBox <gi...@apache.org> on 2022/09/14 08:17:43 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37876: [SPARK-40175][CORE][SQL][MLLIB][STREAMING] Optimize the performance of `keys.zip(values).toMap` - posted by GitBox <gi...@apache.org> on 2022/09/14 08:33:55 UTC, 0 replies.
- [GitHub] [spark] Yikf closed pull request #35788: [SPARK-38482][SQL] Migrate legacy.keepCommandOutputSchema related to KeepLegacyOutputs - posted by GitBox <gi...@apache.org> on 2022/09/14 08:34:00 UTC, 0 replies.
- [GitHub] [spark] Yikf closed pull request #37254: [SPARK-39841][SQL] simplify conflict binary comparison - posted by GitBox <gi...@apache.org> on 2022/09/14 08:34:01 UTC, 0 replies.
- [GitHub] [spark] Yikf closed pull request #37177: [SPARK-39765][SQL] Logging the exception of detect JDBC table exist - posted by GitBox <gi...@apache.org> on 2022/09/14 08:34:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37876: [SPARK-40175][CORE][SQL][MLLIB][STREAMING] Optimize the performance of `keys.zip(values).toMap` - posted by GitBox <gi...@apache.org> on 2022/09/14 08:36:24 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37875: [SPARK-40420][SQL] Sort error message parameters by names in the JSON formats - posted by GitBox <gi...@apache.org> on 2022/09/14 08:52:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #37877: [SPARK-40423][K8S][TESTS] Add explicit YuniKorn queue submission test coverage - posted by GitBox <gi...@apache.org> on 2022/09/14 09:05:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37877: [SPARK-40423][K8S][TESTS] Add explicit YuniKorn queue submission test coverage - posted by GitBox <gi...@apache.org> on 2022/09/14 09:12:21 UTC, 2 replies.
- [GitHub] [spark] ucas010 commented on pull request #18748: [SPARK-20679][ML] Support recommending for a subset of users/items in ALSModel - posted by GitBox <gi...@apache.org> on 2022/09/14 09:17:20 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37678: [SPARK-40124][SQL][TEST][3.1] Update TPCDS v1.4 q32 for Plan Stability tests - posted by GitBox <gi...@apache.org> on 2022/09/14 09:40:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37678: [SPARK-40124][SQL][TEST][3.1] Update TPCDS v1.4 q32 for Plan Stability tests - posted by GitBox <gi...@apache.org> on 2022/09/14 09:40:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37877: [SPARK-40423][K8S][TESTS] Add explicit YuniKorn queue submission test coverage - posted by GitBox <gi...@apache.org> on 2022/09/14 09:41:24 UTC, 1 replies.
- [GitHub] [spark] ayushi-agarwal commented on pull request #37625: [SPARK-40177][SQL] Simplify condition of form (a==b) || (a==null&&b==null) to a<=>b - posted by GitBox <gi...@apache.org> on 2022/09/14 10:24:52 UTC, 0 replies.
- [GitHub] [spark] ayushi-agarwal closed pull request #37625: [SPARK-40177][SQL] Simplify condition of form (a==b) || (a==null&&b==null) to a<=>b - posted by GitBox <gi...@apache.org> on 2022/09/14 10:24:53 UTC, 0 replies.
- [GitHub] [spark] ayushi-agarwal opened a new pull request, #37625: [SPARK-40177][SQL] Simplify condition of form (a==b) || (a==null&&b==null) to a<=>b - posted by GitBox <gi...@apache.org> on 2022/09/14 10:25:20 UTC, 0 replies.
- [GitHub] [spark] sarutak commented on pull request #37868: [SPARK-40397][BUILD] Upgrade `org.scalatestplus:selenium` to 3.12.13 - posted by GitBox <gi...@apache.org> on 2022/09/14 11:08:04 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #18748: [SPARK-20679][ML] Support recommending for a subset of users/items in ALSModel - posted by GitBox <gi...@apache.org> on 2022/09/14 12:00:15 UTC, 1 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37876: [SPARK-40175][CORE][SQL][MLLIB][STREAMING] Optimize the performance of `keys.zip(values).toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/14 13:09:46 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37868: [SPARK-40397][BUILD] Upgrade `org.scalatestplus:selenium` to 3.12.13 - posted by GitBox <gi...@apache.org> on 2022/09/14 13:13:17 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37876: [SPARK-40175][CORE][SQL][MLLIB][STREAMING] Optimize the performance of `keys.zip(values).toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/14 13:14:02 UTC, 13 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37878: [SPARK-40424][CORE][TESTS] Refactor `ChromeUIHistoryServerSuite` to add UTs for RocksDB - posted by GitBox <gi...@apache.org> on 2022/09/14 13:14:58 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #37268: [SPARK-39853][CORE] Support stage level task resource profile for standalone cluster when dynamic allocation disabled - posted by GitBox <gi...@apache.org> on 2022/09/14 14:16:31 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37869: [SPARK-40370][SQL] Migrate type check fails to error classes in CAST - posted by GitBox <gi...@apache.org> on 2022/09/14 14:19:19 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37869: [SPARK-40370][SQL] Migrate type check fails to error classes in CAST - posted by GitBox <gi...@apache.org> on 2022/09/14 14:27:48 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #37879: [SPARK-40425][SQL] DROP TABLE does not need to do table lookup - posted by GitBox <gi...@apache.org> on 2022/09/14 14:48:43 UTC, 0 replies.
- [GitHub] [spark] shrprasa opened a new pull request, #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by GitBox <gi...@apache.org> on 2022/09/14 15:41:32 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37871: [SPARK-40426][SQL] Return a map from SparkThrowable.getMessageParameters - posted by GitBox <gi...@apache.org> on 2022/09/14 16:07:58 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37876: [SPARK-40175][CORE][SQL][MLLIB][STREAMING] Optimize the performance of `keys.zip(values).toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/14 16:10:21 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37853: [SPARK-40404][DOCS] Add precondition description for `spark.shuffle.service.db.backend` in `running-on-yarn.md` - posted by GitBox <gi...@apache.org> on 2022/09/14 16:23:12 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37877: [SPARK-40423][K8S][TESTS] Add explicit YuniKorn queue submission test coverage - posted by GitBox <gi...@apache.org> on 2022/09/14 16:29:15 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37840: [SPARK-40416][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/14 16:42:20 UTC, 0 replies.
- [GitHub] [spark] sunchao opened a new pull request, #37881: [SPARK-40169][SQL] Don't pushdown Parquet filters with no reference to data schema - posted by GitBox <gi...@apache.org> on 2022/09/14 16:51:38 UTC, 0 replies.
- [GitHub] [spark] sarutak opened a new pull request, #37882: [SPARK-38017][FOLLOWUP] Hide TimestampNTZ in the doc - posted by GitBox <gi...@apache.org> on 2022/09/14 17:00:25 UTC, 0 replies.
- [GitHub] [spark] sarutak opened a new pull request, #37883: [SPARK-38017][FOLLOWUP][3.2] Hide TimestampNTZ in the doc - posted by GitBox <gi...@apache.org> on 2022/09/14 17:04:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37882: [SPARK-38017][FOLLOWUP][3.3] Hide TimestampNTZ in the doc - posted by GitBox <gi...@apache.org> on 2022/09/14 17:12:15 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #37881: [SPARK-40169][SQL] Don't pushdown Parquet filters with no reference to data schema - posted by GitBox <gi...@apache.org> on 2022/09/14 17:19:31 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on pull request #37840: [SPARK-40416][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/14 17:26:36 UTC, 2 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #37884: [WIP][SPARK-40427][SQL] Move LIMIT/OFFSET CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/14 18:02:15 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #37811: [SPARK-40360] *_ALREADY_EXISTS and *_NOT_FOUND error - posted by GitBox <gi...@apache.org> on 2022/09/14 18:35:17 UTC, 1 replies.
- [GitHub] [spark] holdenk opened a new pull request, #37885: [SPARK-40428][K8S][CORE][WIP] Add a shutdown hook in the CoarseGrainedSchedulerBackend - posted by GitBox <gi...@apache.org> on 2022/09/14 18:38:37 UTC, 0 replies.
- [GitHub] [spark] huaxingao opened a new pull request, #37886: [SPARK-40429][SQL] Only set KeyGroupedPartitioning when the referenced column is in the output - posted by GitBox <gi...@apache.org> on 2022/09/14 20:25:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37885: [SPARK-40428][CORE][WIP] Add a shutdown hook in the CoarseGrainedSchedulerBackend - posted by GitBox <gi...@apache.org> on 2022/09/14 21:05:26 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #37840: [SPARK-40416][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/14 22:06:24 UTC, 5 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #37881: [SPARK-40169][SQL] Don't pushdown Parquet filters with no reference to data schema - posted by GitBox <gi...@apache.org> on 2022/09/14 22:16:35 UTC, 4 replies.
- [GitHub] [spark] rahulbhatia2702 commented on pull request #32397: [WIP][SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode - posted by GitBox <gi...@apache.org> on 2022/09/14 23:10:48 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #37886: [SPARK-40429][SQL] Only set KeyGroupedPartitioning when the referenced column is in the output - posted by GitBox <gi...@apache.org> on 2022/09/14 23:15:54 UTC, 1 replies.
- [GitHub] [spark] srielau closed pull request #37811: [SPARK-40360] *_ALREADY_EXISTS and *_NOT_FOUND error - posted by GitBox <gi...@apache.org> on 2022/09/14 23:36:36 UTC, 0 replies.
- [GitHub] [spark] srielau opened a new pull request, #37887: [SPARK-40360] ALREADY_EXISTS and NOT_FOUND exceptions - posted by GitBox <gi...@apache.org> on 2022/09/14 23:38:56 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #37777: [WIP][SPARK-40309][PYTHON][PS] Introduce `sql_conf` context manager for `pyspark.sql` - posted by GitBox <gi...@apache.org> on 2022/09/14 23:59:59 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37884: [WIP][SPARK-40427][SQL] Move LIMIT/OFFSET CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/15 00:00:25 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #37888: [SPARK-40196][PYTHON][PS] Consolidate `lit` function with NumPy scalar in sql and pandas module - posted by GitBox <gi...@apache.org> on 2022/09/15 00:15:15 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #37888: [SPARK-40196][PYTHON][PS] Consolidate `lit` function with NumPy scalar in sql and pandas module - posted by GitBox <gi...@apache.org> on 2022/09/15 00:16:34 UTC, 2 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36766: [SPARK-32184][SQL] Remove inferred predicate if it has InOrCorrelatedExistsSubquery - posted by GitBox <gi...@apache.org> on 2022/09/15 00:24:33 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36626: [SPARK-39249][SQL] Improve subexpression elimination for conditional expressions - posted by GitBox <gi...@apache.org> on 2022/09/15 00:24:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37883: [SPARK-38017][FOLLOWUP][3.2] Hide TimestampNTZ in the doc - posted by GitBox <gi...@apache.org> on 2022/09/15 00:27:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37882: [SPARK-38017][FOLLOWUP][3.3] Hide TimestampNTZ in the doc - posted by GitBox <gi...@apache.org> on 2022/09/15 00:27:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37883: [SPARK-38017][FOLLOWUP][3.2] Hide TimestampNTZ in the doc - posted by GitBox <gi...@apache.org> on 2022/09/15 00:27:43 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #37888: [SPARK-40196][PYTHON][PS] Consolidate `lit` function with NumPy scalar in sql and pandas module - posted by GitBox <gi...@apache.org> on 2022/09/15 00:27:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37882: [SPARK-38017][FOLLOWUP][3.3] Hide TimestampNTZ in the doc - posted by GitBox <gi...@apache.org> on 2022/09/15 00:28:27 UTC, 0 replies.
- [GitHub] [spark] zzccctv commented on pull request #31302: [SPARK-34210][SQL] After upgrading 3.0.1, Spark SQL access hive on HBase table access exception - posted by GitBox <gi...@apache.org> on 2022/09/15 00:35:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37886: [SPARK-40429][SQL] Only set KeyGroupedPartitioning when the referenced column is in the output - posted by GitBox <gi...@apache.org> on 2022/09/15 01:04:20 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37878: [SPARK-40424][CORE][TESTS] Refactor `ChromeUIHistoryServerSuite` to add UTs for RocksDB - posted by GitBox <gi...@apache.org> on 2022/09/15 01:33:17 UTC, 0 replies.
- [GitHub] [spark] sarutak closed pull request #37868: [SPARK-40397][BUILD] Upgrade `org.scalatestplus:selenium` to 3.12.13 - posted by GitBox <gi...@apache.org> on 2022/09/15 01:44:00 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #37886: [SPARK-40429][SQL] Only set KeyGroupedPartitioning when the referenced column is in the output - posted by GitBox <gi...@apache.org> on 2022/09/15 01:46:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37836: [SPARK-40339][SPARK-40342][SPARK-40345][SPARK-40348][PS] Implement quantile in Rolling/RollingGroupby/Expanding/ExpandingGroupby - posted by GitBox <gi...@apache.org> on 2022/09/15 01:47:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37871: [SPARK-40426][SQL] Return a map from SparkThrowable.getMessageParameters - posted by GitBox <gi...@apache.org> on 2022/09/15 01:48:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37871: [SPARK-40426][SQL] Return a map from SparkThrowable.getMessageParameters - posted by GitBox <gi...@apache.org> on 2022/09/15 01:49:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37874: [SPARK-40421][PS] Make `spearman` correlation in `DataFrame.corr` support missing values and `min_periods` - posted by GitBox <gi...@apache.org> on 2022/09/15 01:53:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37874: [SPARK-40421][PS] Make `spearman` correlation in `DataFrame.corr` support missing values and `min_periods` - posted by GitBox <gi...@apache.org> on 2022/09/15 01:54:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37876: [SPARK-40175][CORE][SQL][MLLIB][STREAMING] Optimize the performance of `keys.zip(values).toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/15 02:01:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37836: [SPARK-40339][SPARK-40342][SPARK-40345][SPARK-40348][PS] Implement quantile in Rolling/RollingGroupby/Expanding/ExpandingGroupby - posted by GitBox <gi...@apache.org> on 2022/09/15 02:04:27 UTC, 1 replies.
- [GitHub] [spark] Yikun commented on pull request #37836: [SPARK-40339][SPARK-40342][SPARK-40345][SPARK-40348][PS] Implement quantile in Rolling/RollingGroupby/Expanding/ExpandingGroupby - posted by GitBox <gi...@apache.org> on 2022/09/15 02:27:05 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #37889: [SPARK-40432][SS][PYTHON] Introduce GroupStateImpl and GroupStateTimeout in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/15 02:30:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37878: [SPARK-40424][CORE][TESTS] Refactor `ChromeUIHistoryServerSuite` to add UTs for RocksDB - posted by GitBox <gi...@apache.org> on 2022/09/15 02:31:56 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #37890: [SPARK-40339][SPARK-40342][PS][DOCS][FOLLOW-UP] Add Rolling.quantile and Expanding.quantile into PySpark documentation - posted by GitBox <gi...@apache.org> on 2022/09/15 02:36:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37889: [SPARK-40432][SS][PYTHON] Introduce GroupStateImpl and GroupStateTimeout in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/15 02:39:40 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #37891: [SPARK-40433][SS][PYTHON] Add toJVMRow in PythonSQLUtils to convert pickled PySpark Row to JVM Row - posted by GitBox <gi...@apache.org> on 2022/09/15 02:39:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37888: [SPARK-40196][PYTHON][PS] Consolidate `lit` function with NumPy scalar in sql and pandas module - posted by GitBox <gi...@apache.org> on 2022/09/15 03:07:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37892: [SPARK-40436][BUILD] Upgrade Scala to 2.12.17 - posted by GitBox <gi...@apache.org> on 2022/09/15 03:10:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37892: [SPARK-40436][BUILD] Upgrade Scala to 2.12.17 - posted by GitBox <gi...@apache.org> on 2022/09/15 03:11:10 UTC, 3 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37889: [SPARK-40432][SS][PYTHON] Introduce GroupStateImpl and GroupStateTimeout in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/15 03:15:24 UTC, 3 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #37889: [SPARK-40432][SS][PYTHON] Introduce GroupStateImpl and GroupStateTimeout in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/15 03:22:09 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37890: [SPARK-40339][SPARK-40342][PS][DOCS][FOLLOW-UP] Add Rolling.quantile and Expanding.quantile into PySpark documentation - posted by GitBox <gi...@apache.org> on 2022/09/15 03:26:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37890: [SPARK-40339][SPARK-40342][PS][DOCS][FOLLOW-UP] Add Rolling.quantile and Expanding.quantile into PySpark documentation - posted by GitBox <gi...@apache.org> on 2022/09/15 03:26:58 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37890: [SPARK-40339][SPARK-40342][PS][DOCS][FOLLOW-UP] Add Rolling.quantile and Expanding.quantile into PySpark documentation - posted by GitBox <gi...@apache.org> on 2022/09/15 03:44:57 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37885: [SPARK-40428][CORE][WIP] Add a shutdown hook in the CoarseGrainedSchedulerBackend - posted by GitBox <gi...@apache.org> on 2022/09/15 04:04:02 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #37893: [DRAFT][DO-NOT-MERGE][SPARK-40434][SS][PYTHON] Implement applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/15 04:10:59 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #37894: [DRAFT][DO-NOT-MERGE][SPARK-40435][SS][PYTHON] Add test suites for applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/15 04:22:57 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by GitBox <gi...@apache.org> on 2022/09/15 04:30:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37886: [SPARK-40429][SQL] Only set KeyGroupedPartitioning when the referenced column is in the output - posted by GitBox <gi...@apache.org> on 2022/09/15 06:06:35 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37886: [SPARK-40429][SQL] Only set KeyGroupedPartitioning when the referenced column is in the output - posted by GitBox <gi...@apache.org> on 2022/09/15 06:07:53 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37853: [SPARK-40404][DOCS] Add precondition description for `spark.shuffle.service.db.backend` in `running-on-yarn.md` - posted by GitBox <gi...@apache.org> on 2022/09/15 06:11:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37853: [SPARK-40404][DOCS] Add precondition description for `spark.shuffle.service.db.backend` in `running-on-yarn.md` - posted by GitBox <gi...@apache.org> on 2022/09/15 06:46:44 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37888: [SPARK-40196][PYTHON][PS] Consolidate `lit` function with NumPy scalar in sql and pandas module - posted by GitBox <gi...@apache.org> on 2022/09/15 07:14:03 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #37893: [DRAFT][DO-NOT-MERGE][SPARK-40434][SS][PYTHON] Implement applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/15 07:16:37 UTC, 1 replies.
- [GitHub] [spark] LucaCanali commented on pull request #35391: [SPARK-38098][PYTHON] Add support for ArrayType of nested StructType to arrow-based conversion - posted by GitBox <gi...@apache.org> on 2022/09/15 07:25:24 UTC, 1 replies.
- [GitHub] [spark] gaborgsomogyi commented on pull request #37558: [SPARK-38954][CORE] Implement sharing of cloud credentials among driver and executors - posted by GitBox <gi...@apache.org> on 2022/09/15 07:40:36 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37888: [SPARK-40196][PYTHON][PS] Consolidate `lit` function with NumPy scalar in sql and pandas module - posted by GitBox <gi...@apache.org> on 2022/09/15 07:41:02 UTC, 1 replies.
- [GitHub] [spark] Yikun opened a new pull request, #37895: [SPARK-40440][PS][DOC] Fix wrong reference and content in PS windows related doc - posted by GitBox <gi...@apache.org> on 2022/09/15 07:47:20 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37895: [SPARK-40440][PS][DOCS] Fix wrong reference and content in PS windows related doc - posted by GitBox <gi...@apache.org> on 2022/09/15 07:47:48 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37869: [SPARK-40370][SQL] Migrate type check fails to error classes in CAST - posted by GitBox <gi...@apache.org> on 2022/09/15 07:52:22 UTC, 0 replies.
- [GitHub] [spark] gaborgsomogyi commented on a diff in pull request #37558: [SPARK-38954][CORE] Implement sharing of cloud credentials among driver and executors - posted by GitBox <gi...@apache.org> on 2022/09/15 08:12:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #37896: Revert [SPARK-24544][SQL] Print actual failure cause when look up function failed - posted by GitBox <gi...@apache.org> on 2022/09/15 08:52:46 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37896: Revert [SPARK-24544][SQL] Print actual failure cause when look up function failed - posted by GitBox <gi...@apache.org> on 2022/09/15 08:54:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37896: Revert [SPARK-24544][SQL] Print actual failure cause when look up function failed - posted by GitBox <gi...@apache.org> on 2022/09/15 08:55:07 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37879: [SPARK-40425][SQL] DROP TABLE does not need to do table lookup - posted by GitBox <gi...@apache.org> on 2022/09/15 08:59:18 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37897: [SPARK-40445][PS] Refactor `Resampler` to make it consistent with `GroupBy` - posted by GitBox <gi...@apache.org> on 2022/09/15 09:04:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37898: [SPARK-40446][PS][DOC] Rename `_MissingPandasXXX` as `MissingPandasXXX` - posted by GitBox <gi...@apache.org> on 2022/09/15 09:08:39 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37895: [SPARK-40440][PS][DOCS] Fix wrong reference and content in PS windows related doc - posted by GitBox <gi...@apache.org> on 2022/09/15 09:14:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37895: [SPARK-40440][PS][DOCS] Fix wrong reference and content in PS windows related doc - posted by GitBox <gi...@apache.org> on 2022/09/15 09:15:08 UTC, 0 replies.
- [GitHub] [spark] thomasg19930417 commented on pull request #34464: [SPARK-37193][SQL] DynamicJoinSelection.shouldDemoteBroadcastHashJoin should not apply to outer joins - posted by GitBox <gi...@apache.org> on 2022/09/15 09:54:15 UTC, 4 replies.
- [GitHub] [spark] caican00 opened a new pull request, #37899: [SPARK-40455][CORE]Abort result stage directly when it failed caused by FetchFailedException - posted by GitBox <gi...@apache.org> on 2022/09/15 12:01:23 UTC, 0 replies.
- [GitHub] [spark] caican00 commented on pull request #37899: [SPARK-40455][CORE]Abort result stage directly when it failed caused by FetchFailedException - posted by GitBox <gi...@apache.org> on 2022/09/15 12:02:31 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37891: [SPARK-40433][SS][PYTHON] Add toJVMRow in PythonSQLUtils to convert pickled PySpark Row to JVM Row - posted by GitBox <gi...@apache.org> on 2022/09/15 12:49:13 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37891: [SPARK-40433][SS][PYTHON] Add toJVMRow in PythonSQLUtils to convert pickled PySpark Row to JVM Row - posted by GitBox <gi...@apache.org> on 2022/09/15 12:50:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37830: [SPARK-40387][SQL] Improve the implementation of Spark Decimal - posted by GitBox <gi...@apache.org> on 2022/09/15 12:51:17 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37898: [SPARK-40446][PS][DOC] Rename `_MissingPandasXXX` as `MissingPandasXXX` - posted by GitBox <gi...@apache.org> on 2022/09/15 13:36:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #37900: [SPARK-40456][SQL] PartitionIterator.hasNext should be cheap to call repeatedly - posted by GitBox <gi...@apache.org> on 2022/09/15 13:48:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37881: [SPARK-40169][SQL] Don't pushdown Parquet filters with no reference to data schema - posted by GitBox <gi...@apache.org> on 2022/09/15 14:20:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37881: [SPARK-40169][SQL] Don't pushdown Parquet filters with no reference to data schema - posted by GitBox <gi...@apache.org> on 2022/09/15 14:21:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37900: [SPARK-40456][SQL] PartitionIterator.hasNext should be cheap to call repeatedly - posted by GitBox <gi...@apache.org> on 2022/09/15 15:07:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37900: [SPARK-40456][SQL] PartitionIterator.hasNext should be cheap to call repeatedly - posted by GitBox <gi...@apache.org> on 2022/09/15 15:22:34 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37743: [SPARK-40294][SQL] Fix repeat calls to `PartitionReader.hasNext` timing out - posted by GitBox <gi...@apache.org> on 2022/09/15 15:23:47 UTC, 1 replies.
- [GitHub] [spark] ekoifman commented on pull request #34464: [SPARK-37193][SQL] DynamicJoinSelection.shouldDemoteBroadcastHashJoin should not apply to outer joins - posted by GitBox <gi...@apache.org> on 2022/09/15 15:25:40 UTC, 0 replies.
- [GitHub] [spark] huaxingao opened a new pull request, #37901: [SPARK-40429][SQL][3.3] Only set KeyGroupedPartitioning when the referenced column is in the output - posted by GitBox <gi...@apache.org> on 2022/09/15 16:07:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37901: [SPARK-40429][SQL][3.3] Only set KeyGroupedPartitioning when the referenced column is in the output - posted by GitBox <gi...@apache.org> on 2022/09/15 16:08:01 UTC, 1 replies.
- [GitHub] [spark] parthchandra commented on a diff in pull request #37558: [SPARK-38954][CORE] Implement sharing of cloud credentials among driver and executors - posted by GitBox <gi...@apache.org> on 2022/09/15 16:46:05 UTC, 0 replies.
- [GitHub] [spark] parthchandra commented on pull request #37558: [SPARK-38954][CORE] Implement sharing of cloud credentials among driver and executors - posted by GitBox <gi...@apache.org> on 2022/09/15 17:10:03 UTC, 3 replies.
- [GitHub] [spark] holdenk commented on pull request #37885: [SPARK-40428][CORE][WIP] Add a shutdown hook in the CoarseGrainedSchedulerBackend - posted by GitBox <gi...@apache.org> on 2022/09/15 18:08:55 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #37881: [SPARK-40169][SQL] Don't pushdown Parquet filters with no reference to data schema - posted by GitBox <gi...@apache.org> on 2022/09/15 18:18:07 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37901: [SPARK-40429][SQL][3.3] Only set KeyGroupedPartitioning when the referenced column is in the output - posted by GitBox <gi...@apache.org> on 2022/09/15 18:28:40 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #37901: [SPARK-40429][SQL][3.3] Only set KeyGroupedPartitioning when the referenced column is in the output - posted by GitBox <gi...@apache.org> on 2022/09/15 18:33:03 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37902: [WIP][SPARK-40359][SQL] Migrate type check fails in CSV/JSON expressions to error classes - posted by GitBox <gi...@apache.org> on 2022/09/15 19:22:19 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37889: [SPARK-40432][SS][PYTHON] Introduce GroupStateImpl and GroupStateTimeout in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/15 20:32:01 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #37879: [SPARK-40425][SQL] DROP TABLE does not need to do table lookup - posted by GitBox <gi...@apache.org> on 2022/09/15 22:00:06 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #37903: [SPARK-40459][K8S] `recoverDiskStore` should not stop by existing recomputed files - posted by GitBox <gi...@apache.org> on 2022/09/16 00:22:31 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36766: [SPARK-32184][SQL] Remove inferred predicate if it has InOrCorrelatedExistsSubquery - posted by GitBox <gi...@apache.org> on 2022/09/16 00:25:25 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36626: [SPARK-39249][SQL] Improve subexpression elimination for conditional expressions - posted by GitBox <gi...@apache.org> on 2022/09/16 00:25:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37897: [SPARK-40445][PS] Refactor `Resampler` for consistency and simplicity - posted by GitBox <gi...@apache.org> on 2022/09/16 00:26:36 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37903: [SPARK-40459][K8S] `recoverDiskStore` should not stop by existing recomputed files - posted by GitBox <gi...@apache.org> on 2022/09/16 00:35:09 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37897: [SPARK-40445][PS] Refactor `Resampler` for consistency and simplicity - posted by GitBox <gi...@apache.org> on 2022/09/16 00:37:38 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37898: [SPARK-40446][PS][DOC] Rename `_MissingPandasXXX` as `MissingPandasXXX` - posted by GitBox <gi...@apache.org> on 2022/09/16 00:39:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37898: [SPARK-40446][PS][DOC] Rename `_MissingPandasXXX` as `MissingPandasXXX` - posted by GitBox <gi...@apache.org> on 2022/09/16 00:39:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #37904: [SPARK-40461][INFRA] Set upperbound for pyzmq 24.0.0 for linters - posted by GitBox <gi...@apache.org> on 2022/09/16 00:46:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37904: [SPARK-40461][INFRA] Set upperbound for pyzmq 24.0.0 for linters - posted by GitBox <gi...@apache.org> on 2022/09/16 00:46:50 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #37903: [SPARK-40459][K8S] `recoverDiskStore` should not stop by existing recomputed files - posted by GitBox <gi...@apache.org> on 2022/09/16 00:48:10 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37897: [SPARK-40445][PS] Refactor `Resampler` for consistency and simplicity - posted by GitBox <gi...@apache.org> on 2022/09/16 00:57:40 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #37888: [SPARK-40196][PYTHON][PS] Consolidate `lit` function with NumPy scalar in sql and pandas module - posted by GitBox <gi...@apache.org> on 2022/09/16 01:02:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37903: [SPARK-40459][K8S] `recoverDiskStore` should not stop by existing recomputed files - posted by GitBox <gi...@apache.org> on 2022/09/16 01:04:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37904: [SPARK-40461][INFRA] Set upperbound for pyzmq 24.0.0 for Python linter - posted by GitBox <gi...@apache.org> on 2022/09/16 01:09:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37904: [SPARK-40461][INFRA] Set upperbound for pyzmq 24.0.0 for Python linter - posted by GitBox <gi...@apache.org> on 2022/09/16 01:14:31 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37893: [SPARK-40434][SS][PYTHON] Implement applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/16 01:35:46 UTC, 4 replies.
- [GitHub] [spark] Yaohua628 opened a new pull request, #37905: [SPARK-40460][SS] Fix streaming metrics when selecting `_metadata` - posted by GitBox <gi...@apache.org> on 2022/09/16 02:10:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37879: [SPARK-40425][SQL] DROP TABLE does not need to do table lookup - posted by GitBox <gi...@apache.org> on 2022/09/16 02:41:12 UTC, 6 replies.
- [GitHub] [spark] wangyum opened a new pull request, #37906: [SPARK-40463][INFRA] Update gpg's keyserver - posted by GitBox <gi...@apache.org> on 2022/09/16 02:58:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37898: [SPARK-40446][PS][DOC] Rename `_MissingPandasXXX` as `MissingPandasXXX` - posted by GitBox <gi...@apache.org> on 2022/09/16 03:10:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37897: [SPARK-40445][PS] Refactor `Resampler` for consistency and simplicity - posted by GitBox <gi...@apache.org> on 2022/09/16 03:13:44 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #37907: [SPARK-40467][SS] Split FlatMapGroupsWithState down to multiple test suites - posted by GitBox <gi...@apache.org> on 2022/09/16 03:21:43 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #37907: [SPARK-40467][SS] Split FlatMapGroupsWithState down to multiple test suites - posted by GitBox <gi...@apache.org> on 2022/09/16 03:26:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37888: [SPARK-40196][PYTHON][PS] Consolidate `lit` function with NumPy scalar in sql and pandas module - posted by GitBox <gi...@apache.org> on 2022/09/16 03:46:07 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37843: [SPARK-40398][CORE][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/16 03:51:06 UTC, 1 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37906: [SPARK-40463][INFRA] Update gpg's keyserver - posted by GitBox <gi...@apache.org> on 2022/09/16 03:53:28 UTC, 0 replies.
- [GitHub] [spark] chong0929 commented on a diff in pull request #37721: [SPARK-40272][CORE]Support service port custom with range - posted by GitBox <gi...@apache.org> on 2022/09/16 03:53:43 UTC, 1 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37843: [SPARK-40398][CORE][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/16 03:58:34 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37876: [SPARK-40175][CORE][SQL][MLLIB][DSTREAM][R] Optimize the performance of `keys.zip(values).toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/16 04:18:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37908: [SPARK-40196][PS][FOLLOWUP] `SF.lit` -> `F.lit` in `Groupby.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/16 04:34:38 UTC, 0 replies.
- [GitHub] [spark] pralabhkumar commented on a diff in pull request #37417: [SPARK-33782][K8S][CORE]Place spark.files, spark.jars and spark.files under the current working directory on the driver in K8S cluster mode - posted by GitBox <gi...@apache.org> on 2022/09/16 04:35:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37876: [SPARK-40175][CORE][SQL][MLLIB][DSTREAM][R] Optimize the performance of `keys.zip(values).toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/16 04:44:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37876: [SPARK-40175][CORE][SQL][MLLIB][DSTREAM][R] Optimize the performance of `keys.zip(values).toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/16 04:45:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37908: [SPARK-40196][PS][FOLLOWUP] `SF.lit` -> `F.lit` in `window.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/16 04:47:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37908: [SPARK-40196][PS][FOLLOWUP] `SF.lit` -> `F.lit` in `window.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/16 04:48:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37908: [SPARK-40196][PS][FOLLOWUP] `SF.lit` -> `F.lit` in `window.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/16 04:48:40 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 commented on pull request #37905: [SPARK-40460][SS] Fix streaming metrics when selecting `_metadata` - posted by GitBox <gi...@apache.org> on 2022/09/16 05:10:33 UTC, 1 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #37909: [SPARK-40468] Fix column pruning in CSV when _corrupt_record is selected - posted by GitBox <gi...@apache.org> on 2022/09/16 05:20:23 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #37909: [SPARK-40468][SQL] Fix column pruning in CSV when _corrupt_record is selected - posted by GitBox <gi...@apache.org> on 2022/09/16 05:27:40 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #37905: [SPARK-40460][SS] Fix streaming metrics when selecting `_metadata` - posted by GitBox <gi...@apache.org> on 2022/09/16 06:10:35 UTC, 8 replies.
- [GitHub] [spark] wangyum opened a new pull request, #37910: [SPARK-40469][CORE] Avoid creating directory failures - posted by GitBox <gi...@apache.org> on 2022/09/16 06:24:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37905: [SPARK-40460][SS] Fix streaming metrics when selecting `_metadata` - posted by GitBox <gi...@apache.org> on 2022/09/16 06:26:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37625: [SPARK-40177][SQL] Simplify condition of form (a==b) || (a==null&&b==null) to a<=>b - posted by GitBox <gi...@apache.org> on 2022/09/16 06:33:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37906: [SPARK-40463][INFRA] Update gpg's keyserver - posted by GitBox <gi...@apache.org> on 2022/09/16 06:46:22 UTC, 1 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #37911: [SPARK-40470] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function - posted by GitBox <gi...@apache.org> on 2022/09/16 06:49:29 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #37906: [SPARK-40463][INFRA] Update gpg's keyserver - posted by GitBox <gi...@apache.org> on 2022/09/16 06:50:18 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37888: [SPARK-40196][PYTHON][PS] Consolidate `lit` function with NumPy scalar in sql and pandas module - posted by GitBox <gi...@apache.org> on 2022/09/16 06:53:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37906: [SPARK-40463][INFRA] Update gpg's keyserver - posted by GitBox <gi...@apache.org> on 2022/09/16 06:58:51 UTC, 3 replies.
- [GitHub] [spark] wangyum commented on pull request #37906: [SPARK-40463][INFRA] Update gpg's keyserver - posted by GitBox <gi...@apache.org> on 2022/09/16 07:01:06 UTC, 2 replies.
- [GitHub] [spark] Yikun opened a new pull request, #37912: [SPARK-40196][PYTHON][PS][FOLLOWUP] SparkFunctionsTests.test_repeat - posted by GitBox <gi...@apache.org> on 2022/09/16 07:05:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37912: [SPARK-40196][PYTHON][PS][FOLLOWUP] Skip SparkFunctionsTests.test_repeat - posted by GitBox <gi...@apache.org> on 2022/09/16 07:08:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37912: [SPARK-40196][PYTHON][PS][FOLLOWUP] Skip SparkFunctionsTests.test_repeat - posted by GitBox <gi...@apache.org> on 2022/09/16 07:10:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37906: [SPARK-40463][INFRA] Update gpg's keyserver - posted by GitBox <gi...@apache.org> on 2022/09/16 07:12:16 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37906: [SPARK-40463][INFRA] Update gpg's keyserver - posted by GitBox <gi...@apache.org> on 2022/09/16 07:13:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37913: [SPARK-40447][PS] Implement `kendall` correlation in `DataFrame.corr` - posted by GitBox <gi...@apache.org> on 2022/09/16 07:37:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37914: [SPARK-40471][BUILD] Upgrade RoaringBitmap to 0.9.32 - posted by GitBox <gi...@apache.org> on 2022/09/16 07:58:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37914: [SPARK-40471][BUILD] Upgrade RoaringBitmap to 0.9.32 - posted by GitBox <gi...@apache.org> on 2022/09/16 07:58:47 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #37407: [SPARK-39876][SQL] Add UNPIVOT to SQL syntax - posted by GitBox <gi...@apache.org> on 2022/09/16 08:57:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37914: [SPARK-40471][BUILD] Upgrade RoaringBitmap to 0.9.32 - posted by GitBox <gi...@apache.org> on 2022/09/16 09:26:14 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #37915: [SPARK-40465][SQL] Refactor Decimal so as we can use other underlying implementation - posted by GitBox <gi...@apache.org> on 2022/09/16 09:39:28 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37907: [SPARK-40467][SS] Split FlatMapGroupsWithState down to multiple test suites - posted by GitBox <gi...@apache.org> on 2022/09/16 09:40:02 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37907: [SPARK-40467][SS] Split FlatMapGroupsWithState down to multiple test suites - posted by GitBox <gi...@apache.org> on 2022/09/16 09:42:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37905: [SPARK-40460][SS] Fix streaming metrics when selecting `_metadata` - posted by GitBox <gi...@apache.org> on 2022/09/16 09:48:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37916: [WIP][SQL] Migrate parsing errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/16 09:56:42 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37743: [SPARK-40294][SQL] Fix repeat calls to `PartitionReader.hasNext` timing out - posted by GitBox <gi...@apache.org> on 2022/09/16 10:09:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37893: [SPARK-40434][SS][PYTHON] Implement applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/16 11:00:39 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/16 11:41:38 UTC, 7 replies.
- [GitHub] [spark] martin-g commented on pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/16 12:40:32 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37911: [SPARK-40470][SQL] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function - posted by GitBox <gi...@apache.org> on 2022/09/16 12:55:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37911: [SPARK-40470][SQL] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function - posted by GitBox <gi...@apache.org> on 2022/09/16 13:04:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37911: [SPARK-40470][SQL] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function - posted by GitBox <gi...@apache.org> on 2022/09/16 13:05:20 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37843: [SPARK-40398][CORE][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/16 13:29:38 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37843: [SPARK-40398][CORE][SQL] Use Loop instead of Arrays.stream api - posted by GitBox <gi...@apache.org> on 2022/09/16 13:29:55 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37909: [SPARK-40468][SQL] Fix column pruning in CSV when _corrupt_record is selected - posted by GitBox <gi...@apache.org> on 2022/09/16 13:51:08 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37909: [SPARK-40468][SQL] Fix column pruning in CSV when _corrupt_record is selected - posted by GitBox <gi...@apache.org> on 2022/09/16 14:00:37 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37862: [MINOR][SQL] Remove an unnecessary parameter of the PartitionedFileUtil.splitFiles - posted by GitBox <gi...@apache.org> on 2022/09/16 14:16:02 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37862: [MINOR][SQL] Remove an unnecessary parameter of the PartitionedFileUtil.splitFiles - posted by GitBox <gi...@apache.org> on 2022/09/16 14:39:52 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37916: [SPARK-40473][SQL] Migrate parsing errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/16 14:57:58 UTC, 4 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37905: [SPARK-40460][SS] Fix streaming metrics when selecting `_metadata` - posted by GitBox <gi...@apache.org> on 2022/09/16 15:39:01 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #37916: [SPARK-40473][SQL] Migrate parsing errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/16 16:09:52 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #37879: [SPARK-40425][SQL] DROP TABLE does not need to do table lookup - posted by GitBox <gi...@apache.org> on 2022/09/16 16:32:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37916: [SPARK-40473][SQL] Migrate parsing errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/16 16:41:26 UTC, 7 replies.
- [GitHub] [spark] huanliwang-db opened a new pull request, #37917: [WIP][SPARK-40466][SS] Improve the error message when DSv2 is disabled whi… - posted by GitBox <gi...@apache.org> on 2022/09/16 16:42:17 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 commented on a diff in pull request #37905: [SPARK-40460][SS] Fix streaming metrics when selecting `_metadata` - posted by GitBox <gi...@apache.org> on 2022/09/16 17:22:44 UTC, 3 replies.
- [GitHub] [spark] sunchao closed pull request #37881: [SPARK-40169][SQL] Don't pushdown Parquet filters with no reference to data schema - posted by GitBox <gi...@apache.org> on 2022/09/16 17:47:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37881: [SPARK-40169][SQL] Don't pushdown Parquet filters with no reference to data schema - posted by GitBox <gi...@apache.org> on 2022/09/16 18:51:14 UTC, 0 replies.
- [GitHub] [spark] huanliwang-db commented on pull request #37917: [SPARK-40466][SS] Improve the error message when DSv2 is disabled whi… - posted by GitBox <gi...@apache.org> on 2022/09/16 19:03:33 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37899: [SPARK-40455][CORE]Abort result stage directly when it failed caused by FetchFailedException - posted by GitBox <gi...@apache.org> on 2022/09/16 19:15:45 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37899: [SPARK-40455][CORE]Abort result stage directly when it failed caused by FetchFailedException - posted by GitBox <gi...@apache.org> on 2022/09/16 22:24:14 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #37533: [SPARK-40096]Fix finalize shuffle stage slow due to connection creation slow - posted by GitBox <gi...@apache.org> on 2022/09/16 22:40:59 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37897: [SPARK-40445][PS] Refactor `Resampler` for consistency and simplicity - posted by GitBox <gi...@apache.org> on 2022/09/16 23:29:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37913: [SPARK-40447][PS] Implement `kendall` correlation in `DataFrame.corr` - posted by GitBox <gi...@apache.org> on 2022/09/16 23:30:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37913: [SPARK-40447][PS] Implement `kendall` correlation in `DataFrame.corr` - posted by GitBox <gi...@apache.org> on 2022/09/16 23:31:14 UTC, 0 replies.
- [GitHub] [spark] alex-balikov commented on a diff in pull request #37893: [SPARK-40434][SS][PYTHON] Implement applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/17 00:12:36 UTC, 7 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37348: [SPARK-39854][SQL] replaceWithAliases should keep the original children for Generate - posted by GitBox <gi...@apache.org> on 2022/09/17 00:13:56 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37918: [SPARK-40476][ML][SQL] Reduce the shuffle size of ALS - posted by GitBox <gi...@apache.org> on 2022/09/17 00:22:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37918: [SPARK-40476][ML][SQL] Reduce the shuffle size of ALS - posted by GitBox <gi...@apache.org> on 2022/09/17 00:25:41 UTC, 3 replies.
- [GitHub] [spark] viirya commented on pull request #37348: [SPARK-39854][SQL] replaceWithAliases should keep the original children for Generate - posted by GitBox <gi...@apache.org> on 2022/09/17 00:43:35 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37348: [SPARK-39854][SQL] replaceWithAliases should keep the original children for Generate - posted by GitBox <gi...@apache.org> on 2022/09/17 00:45:19 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #37896: Revert [SPARK-24544][SQL] Print actual failure cause when look up function failed - posted by GitBox <gi...@apache.org> on 2022/09/17 02:32:34 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37887: [SPARK-40360] [WIP] ALREADY_EXISTS and NOT_FOUND exceptions - posted by GitBox <gi...@apache.org> on 2022/09/17 03:22:53 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37914: [SPARK-40471][BUILD] Upgrade RoaringBitmap to 0.9.32 - posted by GitBox <gi...@apache.org> on 2022/09/17 05:26:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37892: [SPARK-40436][BUILD] Upgrade Scala to 2.12.17 - posted by GitBox <gi...@apache.org> on 2022/09/17 05:35:29 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #37896: Revert [SPARK-24544][SQL] Print actual failure cause when look up function failed - posted by GitBox <gi...@apache.org> on 2022/09/17 05:36:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37879: [SPARK-40425][SQL] DROP TABLE does not need to do table lookup - posted by GitBox <gi...@apache.org> on 2022/09/17 05:47:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37885: [SPARK-40428][CORE][WIP] Fix shutdown hook in the CoarseGrainedSchedulerBackend - posted by GitBox <gi...@apache.org> on 2022/09/17 05:53:54 UTC, 0 replies.
- [GitHub] [spark] ayushi-agarwal commented on a diff in pull request #37625: [SPARK-40177][SQL] Simplify condition of form (a==b) || (a==null&&b==null) to a<=>b - posted by GitBox <gi...@apache.org> on 2022/09/17 06:22:38 UTC, 0 replies.
- [GitHub] [spark] jiaji-wu commented on a diff in pull request #37348: [SPARK-39854][SQL] replaceWithAliases should keep the original children for Generate - posted by GitBox <gi...@apache.org> on 2022/09/17 07:23:46 UTC, 2 replies.
- [GitHub] [spark] MaxGekk closed pull request #37909: [SPARK-40468][SQL] Fix column pruning in CSV when _corrupt_record is selected - posted by GitBox <gi...@apache.org> on 2022/09/17 08:00:14 UTC, 0 replies.
- [GitHub] [spark] wbo4958 commented on pull request #37855: [SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/17 10:49:24 UTC, 7 replies.
- [GitHub] [spark] wbo4958 commented on a diff in pull request #37855: [SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/17 10:50:34 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37910: [SPARK-40469][CORE] Avoid creating directory failures - posted by GitBox <gi...@apache.org> on 2022/09/17 14:08:10 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37609: [SPARK-40175][SQL]Speed up conversion of Tuple2 to Scala Map - posted by GitBox <gi...@apache.org> on 2022/09/17 14:39:22 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37910: [SPARK-40469][CORE] Avoid creating directory failures - posted by GitBox <gi...@apache.org> on 2022/09/17 17:10:09 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37634: [SPARK-40199][SQL] Provide useful error when projecting a non-null column encounters null value - posted by GitBox <gi...@apache.org> on 2022/09/18 00:07:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36790: [SPARK-39402][SQL] Optimize ReplaceCTERefWithRepartition to support coalesce partitions - posted by GitBox <gi...@apache.org> on 2022/09/18 00:24:59 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #37919: [SPARK-40478][DOCS] Add create datasource table options docs - posted by GitBox <gi...@apache.org> on 2022/09/18 01:45:42 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37917: [SPARK-40466][SS] Improve the error message when DSv2 is disabled whi… - posted by GitBox <gi...@apache.org> on 2022/09/18 03:04:52 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 opened a new pull request, #37920: [SPARK-40413] fix `Column.isin` return null - posted by GitBox <gi...@apache.org> on 2022/09/18 04:47:40 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37921: [WIP][SQL] Migrate unexpected input type error to an error class - posted by GitBox <gi...@apache.org> on 2022/09/18 08:45:47 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #37922: [SPARK-40480][SHUFFLE]Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/09/18 14:25:54 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37919: [SPARK-40478][DOCS] Add create datasource table options docs - posted by GitBox <gi...@apache.org> on 2022/09/18 14:57:55 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37885: [SPARK-40428][CORE][WIP] Fix shutdown hook in the CoarseGrainedSchedulerBackend - posted by GitBox <gi...@apache.org> on 2022/09/18 17:55:51 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37853: [SPARK-40404][DOCS] Add precondition description for `spark.shuffle.service.db.backend` in `running-on-yarn.md` - posted by GitBox <gi...@apache.org> on 2022/09/18 17:57:24 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37855: [SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/18 18:07:33 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37922: [SPARK-40480][SHUFFLE]Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/09/18 18:10:14 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37922: [SPARK-40480][SHUFFLE]Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/09/18 18:49:32 UTC, 0 replies.
- [GitHub] [spark] ayudovin opened a new pull request, #37923: [SPARK-40334] - Implement `GroupBy.prod` - posted by GitBox <gi...@apache.org> on 2022/09/18 19:53:15 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 opened a new pull request, #37924: [SPARK-40481][CORE] Ignore stage fetch failure caused by decommissioned executor - posted by GitBox <gi...@apache.org> on 2022/09/18 22:06:23 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #37911: [SPARK-40470][SQL] Handle GetArrayStructFields and GetMapValue in "arrays_zip" function - posted by GitBox <gi...@apache.org> on 2022/09/18 23:05:29 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #37893: [SPARK-40434][SS][PYTHON] Implement applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/18 23:40:24 UTC, 35 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37918: [SPARK-40476][ML][SQL] Reduce the shuffle size of ALS - posted by GitBox <gi...@apache.org> on 2022/09/18 23:45:57 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36790: [SPARK-39402][SQL] Optimize ReplaceCTERefWithRepartition to support coalesce partitions - posted by GitBox <gi...@apache.org> on 2022/09/19 00:23:14 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #37924: [SPARK-40481][CORE] Ignore stage fetch failure caused by decommissioned executor - posted by GitBox <gi...@apache.org> on 2022/09/19 00:38:26 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37924: [SPARK-40481][CORE] Ignore stage fetch failure caused by decommissioned executor - posted by GitBox <gi...@apache.org> on 2022/09/19 00:58:05 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37924: [SPARK-40481][CORE] Ignore stage fetch failure caused by decommissioned executor - posted by GitBox <gi...@apache.org> on 2022/09/19 00:59:02 UTC, 18 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37878: [SPARK-40424][CORE][TESTS] Refactor `ChromeUIHistoryServerSuite` to add UTs for RocksDB - posted by GitBox <gi...@apache.org> on 2022/09/19 01:13:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37878: [SPARK-40424][CORE][TESTS] Refactor `ChromeUIHistoryServerSuite` to add UTs for RocksDB - posted by GitBox <gi...@apache.org> on 2022/09/19 01:29:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37878: [SPARK-40424][CORE][TESTS] Refactor `ChromeUIHistoryServerSuite` to add UTs for RocksDB - posted by GitBox <gi...@apache.org> on 2022/09/19 01:30:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37896: Revert [SPARK-24544][SQL] Print actual failure cause when look up function failed - posted by GitBox <gi...@apache.org> on 2022/09/19 01:31:49 UTC, 0 replies.
- [GitHub] [spark] HuwCampbell commented on a diff in pull request #36441: [SPARK-39091][SQL] Updating specific SQL Expression traits that don't compose when multiple are extended due to nodePatterns being final. - posted by GitBox <gi...@apache.org> on 2022/09/19 01:35:40 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #37919: [SPARK-40478][DOCS] Add create datasource table options docs - posted by GitBox <gi...@apache.org> on 2022/09/19 01:51:37 UTC, 0 replies.
- [GitHub] [spark] weixiuli commented on a diff in pull request #37924: [SPARK-40481][CORE] Ignore stage fetch failure caused by decommissioned executor - posted by GitBox <gi...@apache.org> on 2022/09/19 01:56:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37853: [SPARK-40404][DOCS] Add precondition description for `spark.shuffle.service.db.backend` in `running-on-yarn.md` - posted by GitBox <gi...@apache.org> on 2022/09/19 02:24:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37853: [SPARK-40404][DOCS] Add precondition description for `spark.shuffle.service.db.backend` in `running-on-yarn.md` - posted by GitBox <gi...@apache.org> on 2022/09/19 02:24:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37634: [SPARK-40199][SQL] Provide useful error when projecting a non-null column encounters null value - posted by GitBox <gi...@apache.org> on 2022/09/19 02:47:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37923: [SPARK-40334][PS] Implement `GroupBy.prod` - posted by GitBox <gi...@apache.org> on 2022/09/19 03:00:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37917: [SPARK-40466][SS] Improve the error message when DSv2 is disabled while DSv1 is not avaliable - posted by GitBox <gi...@apache.org> on 2022/09/19 03:04:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37850: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 7, ~30 functions) - posted by GitBox <gi...@apache.org> on 2022/09/19 03:07:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37850: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (part 7, ~30 functions) - posted by GitBox <gi...@apache.org> on 2022/09/19 03:08:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #37925: [SPARK-40483][CONNECT][INFRA] Add `CONNECT` label - posted by GitBox <gi...@apache.org> on 2022/09/19 03:19:39 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #37917: [SPARK-40466][SS] Improve the error message when DSv2 is disabled while DSv1 is not avaliable - posted by GitBox <gi...@apache.org> on 2022/09/19 03:21:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37893: [SPARK-40434][SS][PYTHON] Implement applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/19 03:28:01 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37896: [SPARK-40482][SQL] Revert `SPARK-24544 Print actual failure cause when look up function failed` - posted by GitBox <gi...@apache.org> on 2022/09/19 04:55:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37896: [SPARK-40482][SQL] Revert `SPARK-24544 Print actual failure cause when look up function failed` - posted by GitBox <gi...@apache.org> on 2022/09/19 04:56:13 UTC, 1 replies.
- [GitHub] [spark] wankunde commented on pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/09/19 05:02:37 UTC, 2 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/09/19 05:04:44 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #37896: [SPARK-40482][SQL] Revert `SPARK-24544 Print actual failure cause when look up function failed` - posted by GitBox <gi...@apache.org> on 2022/09/19 05:10:23 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37905: [SPARK-40460][SS] Fix streaming metrics when selecting `_metadata` - posted by GitBox <gi...@apache.org> on 2022/09/19 05:29:34 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37905: [SPARK-40460][SS] Fix streaming metrics when selecting `_metadata` - posted by GitBox <gi...@apache.org> on 2022/09/19 05:32:58 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/09/19 05:38:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37923: [SPARK-40334][PS] Implement `GroupBy.prod` - posted by GitBox <gi...@apache.org> on 2022/09/19 05:45:02 UTC, 2 replies.
- [GitHub] [spark] martin-g commented on pull request #37844: [DON'T MERGE][BUILD] Upgrade slf4j to 2.0.0 - posted by GitBox <gi...@apache.org> on 2022/09/19 06:05:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37926: [SPARK-40484][BUILD] Upgrade log4j2 to 2.19.0 - posted by GitBox <gi...@apache.org> on 2022/09/19 06:13:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37926: [SPARK-40484][BUILD] Upgrade log4j2 to 2.19.0 - posted by GitBox <gi...@apache.org> on 2022/09/19 06:13:53 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37927: [SPARK-40447][PS][FOLLOWUP] Fix doc of `DataFrame.corr` - posted by GitBox <gi...@apache.org> on 2022/09/19 06:50:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37927: [SPARK-40447][PS][FOLLOWUP] Fix doc of `DataFrame.corr` - posted by GitBox <gi...@apache.org> on 2022/09/19 06:51:00 UTC, 1 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #37923: [SPARK-40334][PS] Implement `GroupBy.prod` - posted by GitBox <gi...@apache.org> on 2022/09/19 07:24:17 UTC, 4 replies.
- [GitHub] [spark] itholic commented on pull request #37873: [SPARK-40419][SQL][TESTS] Integrate Grouped Aggregate Pandas UDFs into *.sql test cases - posted by GitBox <gi...@apache.org> on 2022/09/19 07:34:20 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #37743: [SPARK-40294][SQL] Fix repeat calls to `PartitionReader.hasNext` timing out - posted by GitBox <gi...@apache.org> on 2022/09/19 07:53:55 UTC, 0 replies.
- [GitHub] [spark] LucaCanali opened a new pull request, #37928: [SPARK-40485][SQL] Extend the partitioning options of the JDBC data source - posted by GitBox <gi...@apache.org> on 2022/09/19 08:27:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37879: [SPARK-40425][SQL] DROP TABLE does not need to do table lookup - posted by GitBox <gi...@apache.org> on 2022/09/19 08:47:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37634: [SPARK-40199][SQL] Provide useful error when projecting a non-null column encounters null value - posted by GitBox <gi...@apache.org> on 2022/09/19 08:59:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37634: [SPARK-40199][SQL] Provide useful error when projecting a non-null column encounters null value - posted by GitBox <gi...@apache.org> on 2022/09/19 09:17:16 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37925: [SPARK-40483][CONNECT][INFRA] Add `CONNECT` label - posted by GitBox <gi...@apache.org> on 2022/09/19 09:29:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37927: [SPARK-40447][PS][FOLLOWUP] Fix doc of `DataFrame.corr` - posted by GitBox <gi...@apache.org> on 2022/09/19 09:29:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37925: [SPARK-40483][CONNECT][INFRA] Add `CONNECT` label - posted by GitBox <gi...@apache.org> on 2022/09/19 09:29:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37927: [SPARK-40447][PS][FOLLOWUP] Fix doc of `DataFrame.corr` - posted by GitBox <gi...@apache.org> on 2022/09/19 09:30:06 UTC, 0 replies.
- [GitHub] [spark] clementguillot commented on pull request #33154: [SPARK-35949][CORE]Add `keep-spark-context-alive` arg for to prevent closing spark context after invoking main for some case - posted by GitBox <gi...@apache.org> on 2022/09/19 09:37:48 UTC, 1 replies.
- [GitHub] [spark] sunpe commented on pull request #33154: [SPARK-35949][CORE]Add `keep-spark-context-alive` arg for to prevent closing spark context after invoking main for some case - posted by GitBox <gi...@apache.org> on 2022/09/19 09:47:11 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37929: [SPARK-40486][PS] Implement `spearman` and `kendall` in `DataFrame.corrwith` - posted by GitBox <gi...@apache.org> on 2022/09/19 10:00:29 UTC, 0 replies.
- [GitHub] [spark] xclyfe opened a new pull request, #37930: [SPARK-40487][SQL] Make defaultJoin in BroadcastNestedLoopJoinExec running in parallel - posted by GitBox <gi...@apache.org> on 2022/09/19 10:35:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37873: [SPARK-40419][SQL][TESTS] Integrate Grouped Aggregate Pandas UDFs into *.sql test cases - posted by GitBox <gi...@apache.org> on 2022/09/19 10:46:06 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37873: [SPARK-40419][SQL][TESTS] Integrate Grouped Aggregate Pandas UDFs into *.sql test cases - posted by GitBox <gi...@apache.org> on 2022/09/19 10:51:11 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 opened a new pull request, #37931: [WIP][SPARK-40488] Do not wrap exceptions thrown in FileFormatWriter.write with SparkException - posted by GitBox <gi...@apache.org> on 2022/09/19 10:52:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37873: [SPARK-40419][SQL][TESTS] Integrate Grouped Aggregate Pandas UDFs into *.sql test cases - posted by GitBox <gi...@apache.org> on 2022/09/19 10:53:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37873: [SPARK-40419][SQL][TESTS] Integrate Grouped Aggregate Pandas UDFs into *.sql test cases - posted by GitBox <gi...@apache.org> on 2022/09/19 10:53:32 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #37918: [SPARK-40476][ML][SQL] Reduce the shuffle size of ALS - posted by GitBox <gi...@apache.org> on 2022/09/19 12:33:51 UTC, 3 replies.
- [GitHub] [spark] srowen commented on pull request #37743: [SPARK-40294][SQL] Fix repeat calls to `PartitionReader.hasNext` timing out - posted by GitBox <gi...@apache.org> on 2022/09/19 12:36:55 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37743: [SPARK-40294][SQL] Fix repeat calls to `PartitionReader.hasNext` timing out - posted by GitBox <gi...@apache.org> on 2022/09/19 12:36:56 UTC, 0 replies.
- [GitHub] [spark] ryan-johnson-databricks commented on a diff in pull request #37879: [SPARK-40425][SQL] DROP TABLE does not need to do table lookup - posted by GitBox <gi...@apache.org> on 2022/09/19 12:47:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37916: [SPARK-40473][SQL] Migrate parsing errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/19 12:47:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37916: [SPARK-40473][SQL] Migrate parsing errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/19 12:49:10 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #37900: [SPARK-40456][SQL] PartitionIterator.hasNext should be cheap to call repeatedly - posted by GitBox <gi...@apache.org> on 2022/09/19 12:53:30 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 commented on pull request #37920: [SPARK-40413][SQL] Fix `Column.isin` return null - posted by GitBox <gi...@apache.org> on 2022/09/19 13:09:20 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #37855: [SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/19 13:09:41 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/19 14:30:40 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on a diff in pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/19 14:43:21 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37920: [SPARK-40413][SQL] Fix `Column.isin` return null - posted by GitBox <gi...@apache.org> on 2022/09/19 15:07:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/19 15:57:43 UTC, 34 replies.
- [GitHub] [spark] huanliwang-db commented on a diff in pull request #37917: [SPARK-40466][SS] Improve the error message when DSv2 is disabled while DSv1 is not avaliable - posted by GitBox <gi...@apache.org> on 2022/09/19 16:14:54 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #37926: [SPARK-40484][BUILD] Upgrade log4j2 to 2.19.0 - posted by GitBox <gi...@apache.org> on 2022/09/19 16:48:25 UTC, 0 replies.
- [GitHub] [spark] viirya closed pull request #37926: [SPARK-40484][BUILD] Upgrade log4j2 to 2.19.0 - posted by GitBox <gi...@apache.org> on 2022/09/19 16:48:59 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/19 16:57:13 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 opened a new pull request, #37932: [SPARK-40460][SS][3.3] Fix streaming metrics when selecting _metadata - posted by GitBox <gi...@apache.org> on 2022/09/19 17:14:16 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 commented on pull request #37932: [SPARK-40460][SS][3.3] Fix streaming metrics when selecting _metadata - posted by GitBox <gi...@apache.org> on 2022/09/19 17:15:02 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on a diff in pull request #37634: [SPARK-40199][SQL] Provide useful error when projecting a non-null column encounters null value - posted by GitBox <gi...@apache.org> on 2022/09/19 17:23:05 UTC, 0 replies.
- [GitHub] [spark] xiaonanyang-db opened a new pull request, #37933: SPARK-40474 Infer columns with mixed date and timestamp as String in CSV schema inference - posted by GitBox <gi...@apache.org> on 2022/09/19 17:24:22 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37930: [SPARK-40487][SQL] Make defaultJoin in BroadcastNestedLoopJoinExec running in parallel - posted by GitBox <gi...@apache.org> on 2022/09/19 17:28:15 UTC, 0 replies.
- [GitHub] [spark] ayudovin commented on a diff in pull request #37923: [SPARK-40334][PS] Implement `GroupBy.prod` - posted by GitBox <gi...@apache.org> on 2022/09/19 17:40:57 UTC, 5 replies.
- [GitHub] [spark] mridulm commented on pull request #37922: [WIP][SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/09/19 17:51:45 UTC, 0 replies.
- [GitHub] [spark] shrprasa commented on pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by GitBox <gi...@apache.org> on 2022/09/19 19:04:42 UTC, 2 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37928: [SPARK-40485][SQL] Extend the partitioning options of the JDBC data source - posted by GitBox <gi...@apache.org> on 2022/09/19 19:17:47 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/19 19:30:06 UTC, 68 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37924: [SPARK-40481][CORE] Ignore stage fetch failure caused by decommissioned executor - posted by GitBox <gi...@apache.org> on 2022/09/19 19:37:12 UTC, 0 replies.
- [GitHub] [spark] kazuyukitanimura opened a new pull request, #37934: [SPARK-40477][SQL] Support `NullType` in `ColumnarBatchRow` - posted by GitBox <gi...@apache.org> on 2022/09/19 19:56:37 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #37912: [SPARK-40196][PYTHON][PS][FOLLOWUP] Skip SparkFunctionsTests.test_repeat - posted by GitBox <gi...@apache.org> on 2022/09/19 20:04:31 UTC, 0 replies.
- [GitHub] [spark] kazuyukitanimura commented on pull request #37934: [SPARK-40477][SQL] Support `NullType` in `ColumnarBatchRow` - posted by GitBox <gi...@apache.org> on 2022/09/19 20:05:38 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #37908: [SPARK-40196][PS][FOLLOWUP] `SF.lit` -> `F.lit` in `window.quantile` - posted by GitBox <gi...@apache.org> on 2022/09/19 20:05:39 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37921: [SPARK-40479][SQL] Migrate unexpected input type error to an error class - posted by GitBox <gi...@apache.org> on 2022/09/19 20:44:06 UTC, 2 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37924: [SPARK-40481][CORE] Ignore stage fetch failure caused by decommissioned executor - posted by GitBox <gi...@apache.org> on 2022/09/19 21:10:22 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37923: [SPARK-40334][PS] Implement `GroupBy.prod` - posted by GitBox <gi...@apache.org> on 2022/09/19 21:10:25 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37922: [WIP][SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/09/19 21:10:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37424: [SPARK-39991][SQL][AQE] Use available column statistics from completed query stages - posted by GitBox <gi...@apache.org> on 2022/09/19 21:18:35 UTC, 0 replies.
- [GitHub] [spark] chaoqin-li1123 opened a new pull request, #37935: Do maintenance before streaming StateStore unload - posted by GitBox <gi...@apache.org> on 2022/09/19 21:49:50 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #37934: [SPARK-40477][SQL] Support `NullType` in `ColumnarBatchRow` - posted by GitBox <gi...@apache.org> on 2022/09/19 22:21:07 UTC, 2 replies.
- [GitHub] [spark] WweiL opened a new pull request, #37936: Add additional tests to StreamingSessionWindowSuite - posted by GitBox <gi...@apache.org> on 2022/09/19 23:02:35 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37917: [SPARK-40466][SS] Improve the error message when DSv2 is disabled while DSv1 is not avaliable - posted by GitBox <gi...@apache.org> on 2022/09/19 23:20:19 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37917: [SPARK-40466][SS] Improve the error message when DSv2 is disabled while DSv1 is not avaliable - posted by GitBox <gi...@apache.org> on 2022/09/19 23:22:13 UTC, 0 replies.
- [GitHub] [spark] kazuyukitanimura commented on a diff in pull request #37934: [SPARK-40477][SQL] Support `NullType` in `ColumnarBatchRow` - posted by GitBox <gi...@apache.org> on 2022/09/19 23:51:41 UTC, 3 replies.
- [GitHub] [spark] chaoqin-li1123 commented on pull request #37935: Do maintenance before streaming StateStore unload - posted by GitBox <gi...@apache.org> on 2022/09/20 00:05:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37923: [SPARK-40334][PS] Implement `GroupBy.prod` - posted by GitBox <gi...@apache.org> on 2022/09/20 00:10:25 UTC, 5 replies.
- [GitHub] [spark] warrenzhu25 commented on a diff in pull request #37924: [SPARK-40481][CORE] Ignore stage fetch failure caused by decommissioned executor - posted by GitBox <gi...@apache.org> on 2022/09/20 00:49:00 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37929: [SPARK-40486][PS] Implement `spearman` and `kendall` in `DataFrame.corrwith` - posted by GitBox <gi...@apache.org> on 2022/09/20 02:10:31 UTC, 1 replies.
- [GitHub] [spark] Yikun commented on pull request #36087: [SPARK-38802][K8S][TESTS] Add Support for `spark.kubernetes.test.(driver|executor)RequestCores` - posted by GitBox <gi...@apache.org> on 2022/09/20 02:19:40 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37918: [SPARK-40476][ML][SQL] Reduce the shuffle size of ALS - posted by GitBox <gi...@apache.org> on 2022/09/20 02:52:05 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #37937: [SPARK-40491][SQL] Expose a jdbcRDD function in SparkContext - posted by GitBox <gi...@apache.org> on 2022/09/20 03:10:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37938: [SPARK-40490][YARN][TESTS] Ensure `YarnShuffleIntegrationSuite` tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/20 03:10:46 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37938: [SPARK-40490][YARN][TESTS] Ensure `YarnShuffleIntegrationSuite` tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/20 03:11:20 UTC, 6 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37938: [SPARK-40490][YARN][TESTS] Ensure `YarnShuffleIntegrationSuite` tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/20 03:14:07 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37932: [SPARK-40460][SS][3.3] Fix streaming metrics when selecting _metadata - posted by GitBox <gi...@apache.org> on 2022/09/20 03:46:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37932: [SPARK-40460][SS][3.3] Fix streaming metrics when selecting _metadata - posted by GitBox <gi...@apache.org> on 2022/09/20 03:47:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #37939: [MINOR][DOCS][PYTHON] Document datetime.timedelta <> DayTimeIntervalType - posted by GitBox <gi...@apache.org> on 2022/09/20 04:34:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #36087: [SPARK-38802][K8S][TESTS] Add Support for `spark.kubernetes.test.(driver|executor)RequestCores` - posted by GitBox <gi...@apache.org> on 2022/09/20 05:02:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #36096: [SPARK-38803][K8S][TESTS] Lower minio cpu to 250m (0.25) from 1 in K8s IT - posted by GitBox <gi...@apache.org> on 2022/09/20 05:03:46 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37934: [SPARK-40477][SQL] Support `NullType` in `ColumnarBatchRow` - posted by GitBox <gi...@apache.org> on 2022/09/20 05:12:31 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #37937: [SPARK-40491][SQL] Expose a jdbcRDD function in SparkContext - posted by GitBox <gi...@apache.org> on 2022/09/20 05:19:15 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37940: [SPARK-40494][CORE][SQL][MLLIB] Optimize the performance of `keys.zipWithIndex.toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/20 05:35:53 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #37933: [SPARK-40474][SQL] Infer columns with mixed date and timestamp as String in CSV schema inference - posted by GitBox <gi...@apache.org> on 2022/09/20 05:43:45 UTC, 9 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37940: [SPARK-40494][CORE][SQL][MLLIB] Optimize the performance of `keys.zipWithIndex.toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/20 05:46:23 UTC, 0 replies.
- [GitHub] [spark] xiaonanyang-db commented on pull request #37933: [SPARK-40474][SQL] Infer columns with mixed date and timestamp as String in CSV schema inference - posted by GitBox <gi...@apache.org> on 2022/09/20 05:46:48 UTC, 2 replies.
- [GitHub] [spark] xiaonanyang-db commented on a diff in pull request #37933: [SPARK-40474][SQL] Infer columns with mixed date and timestamp as String in CSV schema inference - posted by GitBox <gi...@apache.org> on 2022/09/20 05:49:46 UTC, 23 replies.
- [GitHub] [spark] wankunde commented on pull request #37922: [WIP][SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/09/20 05:59:47 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #37941: add PushProjectionThroughLimit for Optimizer - posted by GitBox <gi...@apache.org> on 2022/09/20 06:02:23 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37930: [SPARK-40487][SQL] Make defaultJoin in BroadcastNestedLoopJoinExec running in parallel - posted by GitBox <gi...@apache.org> on 2022/09/20 06:02:56 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37844: [DON'T MERGE][BUILD] Upgrade slf4j to 2.0.0 - posted by GitBox <gi...@apache.org> on 2022/09/20 06:21:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37939: [MINOR][DOCS][PYTHON] Document datetime.timedelta <> DayTimeIntervalType - posted by GitBox <gi...@apache.org> on 2022/09/20 06:27:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37939: [MINOR][DOCS][PYTHON] Document datetime.timedelta <> DayTimeIntervalType - posted by GitBox <gi...@apache.org> on 2022/09/20 06:27:37 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #37933: [SPARK-40474][SQL] Infer columns with mixed date and timestamp as String in CSV schema inference - posted by GitBox <gi...@apache.org> on 2022/09/20 06:42:53 UTC, 1 replies.
- [GitHub] [spark] WweiL commented on pull request #37936: [SPARK-40495] [SQL] [TESTS] Add additional tests to StreamingSessionWindowSuite - posted by GitBox <gi...@apache.org> on 2022/09/20 06:43:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37929: [SPARK-40486][PS] Implement `spearman` and `kendall` in `DataFrame.corrwith` - posted by GitBox <gi...@apache.org> on 2022/09/20 06:50:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37930: [SPARK-40487][SQL] Make defaultJoin in BroadcastNestedLoopJoinExec running in parallel - posted by GitBox <gi...@apache.org> on 2022/09/20 07:07:55 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37930: [SPARK-40487][SQL] Make defaultJoin in BroadcastNestedLoopJoinExec running in parallel - posted by GitBox <gi...@apache.org> on 2022/09/20 07:08:46 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37933: [SPARK-40474][SQL] Infer columns with mixed date and timestamp as String in CSV schema inference - posted by GitBox <gi...@apache.org> on 2022/09/20 07:19:41 UTC, 2 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #37942: [SPARK-40496][SQL] Fix configs to control "enableDateTimeParsingFallback" - posted by GitBox <gi...@apache.org> on 2022/09/20 07:29:08 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #37942: [SPARK-40496][SQL] Fix configs to control "enableDateTimeParsingFallback" - posted by GitBox <gi...@apache.org> on 2022/09/20 07:30:51 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37943: [SPARK-40497][BUILD] Upgrade Scala to 2.13.9 - posted by GitBox <gi...@apache.org> on 2022/09/20 07:52:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37943: [SPARK-40497][BUILD] Upgrade Scala to 2.13.9 - posted by GitBox <gi...@apache.org> on 2022/09/20 07:53:40 UTC, 0 replies.
- [GitHub] [spark] baibaichen commented on pull request #37915: [SPARK-40465][SQL] Refactor Decimal so as we can use other underlying implementation - posted by GitBox <gi...@apache.org> on 2022/09/20 08:01:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37937: [SPARK-40491][SQL] Expose a jdbcRDD function in SparkContext - posted by GitBox <gi...@apache.org> on 2022/09/20 08:14:51 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #37937: [SPARK-40491][SQL] Expose a jdbcRDD function in SparkContext - posted by GitBox <gi...@apache.org> on 2022/09/20 08:21:46 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #37915: [SPARK-40465][SQL] Refactor Decimal so as we can use other underlying implementation - posted by GitBox <gi...@apache.org> on 2022/09/20 08:22:59 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #37944: [SQL][MINOR] Re-generate equals/hashCode of IdentifierImpl with non-null optimization - posted by GitBox <gi...@apache.org> on 2022/09/20 08:38:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37944: [SQL][MINOR] Re-generate equals/hashCode of IdentifierImpl with non-null optimization - posted by GitBox <gi...@apache.org> on 2022/09/20 08:39:05 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37945: [SPARK-40498][PS] Implement `kendall` and `min_periods` in `Series.corr` - posted by GitBox <gi...@apache.org> on 2022/09/20 08:53:46 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #37946: [SPARK-40419][SQL][TESTS][FOLLOW-UP] Fix test to skip when package is unavailable - posted by GitBox <gi...@apache.org> on 2022/09/20 09:05:58 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #37946: [SPARK-40419][SQL][TESTS][FOLLOW-UP] Fix test to skip when package is unavailable - posted by GitBox <gi...@apache.org> on 2022/09/20 09:10:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37946: [SPARK-40419][SQL][TESTS][FOLLOW-UP] Fix test to skip when package is unavailable - posted by GitBox <gi...@apache.org> on 2022/09/20 09:19:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37946: [SPARK-40419][SQL][TESTS][FOLLOW-UP] Fix test to skip when package is unavailable - posted by GitBox <gi...@apache.org> on 2022/09/20 09:20:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37947: [SPARK-40500][PS] Use `pd.items` instead of `pd.iteritems` - posted by GitBox <gi...@apache.org> on 2022/09/20 09:23:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37948: [SPARK-40327][PS][DOCS] Add resampling to API references - posted by GitBox <gi...@apache.org> on 2022/09/20 10:14:27 UTC, 0 replies.
- [GitHub] [spark] xingchaozh commented on a diff in pull request #37930: [SPARK-40487][SQL] Make defaultJoin in BroadcastNestedLoopJoinExec running in parallel - posted by GitBox <gi...@apache.org> on 2022/09/20 10:23:51 UTC, 2 replies.
- [GitHub] [spark] zhengchenyu opened a new pull request, #37949: [SPARK-40504][YARN] Make yarn appmaster load config from client - posted by GitBox <gi...@apache.org> on 2022/09/20 10:42:21 UTC, 1 replies.
- [GitHub] [spark] zhengchenyu closed pull request #37949: [SPARK-40504][YARN] Make yarn appmaster load config from client - posted by GitBox <gi...@apache.org> on 2022/09/20 10:53:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37948: [SPARK-40327][PS][DOCS] Add resampling to API references - posted by GitBox <gi...@apache.org> on 2022/09/20 11:03:40 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37943: [WIP][SPARK-40497][BUILD] Upgrade Scala to 2.13.9 - posted by GitBox <gi...@apache.org> on 2022/09/20 11:07:15 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37937: [SPARK-40491][SQL] Remove too old TODO for JdbcRDD - posted by GitBox <gi...@apache.org> on 2022/09/20 11:21:56 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37941: [SPARK-40501][SQL] Add PushProjectionThroughLimit for Optimizer - posted by GitBox <gi...@apache.org> on 2022/09/20 11:32:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37943: [WIP][SPARK-40497][BUILD] Upgrade Scala to 2.13.9 - posted by GitBox <gi...@apache.org> on 2022/09/20 11:36:53 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #37941: [SPARK-40501][SQL] Add PushProjectionThroughLimit for Optimizer - posted by GitBox <gi...@apache.org> on 2022/09/20 12:04:40 UTC, 8 replies.
- [GitHub] [spark] bryanck opened a new pull request, #37950: [SPARK-40505][K8S] Remove min heap setting for executor in entrypoint.sh - posted by GitBox <gi...@apache.org> on 2022/09/20 12:09:39 UTC, 0 replies.
- [GitHub] [spark] Kwafoor opened a new pull request, #37951: [SPARK-40506]Spark Streaming metrics name doesn't need application name - posted by GitBox <gi...@apache.org> on 2022/09/20 12:37:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37915: [SPARK-40465][SQL] Refactor Decimal so as we can use other underlying implementation - posted by GitBox <gi...@apache.org> on 2022/09/20 13:24:06 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on a diff in pull request #37938: [SPARK-40490][YARN][TESTS] Ensure `YarnShuffleIntegrationSuite` tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/20 13:51:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37921: [SPARK-40479][SQL] Migrate unexpected input type error to an error class - posted by GitBox <gi...@apache.org> on 2022/09/20 14:15:08 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/20 16:20:44 UTC, 5 replies.
- [GitHub] [spark] tedyu opened a new pull request, #37952: Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/20 16:21:05 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #37952: Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/20 16:21:24 UTC, 3 replies.
- [GitHub] [spark] sunchao commented on pull request #37952: Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/20 16:29:14 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/20 16:54:57 UTC, 1 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #37822: [SPARK-40381][DEPLOY] Support standalone worker recommission - posted by GitBox <gi...@apache.org> on 2022/09/20 17:42:19 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37943: [WIP][SPARK-40497][BUILD] Upgrade Scala to 2.13.9 - posted by GitBox <gi...@apache.org> on 2022/09/20 17:51:51 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37822: [SPARK-40381][DEPLOY] Support standalone worker recommission - posted by GitBox <gi...@apache.org> on 2022/09/20 17:54:23 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #37840: [SPARK-40416][SQL] Move subquery expression CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/20 18:25:20 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #35391: [SPARK-38098][PYTHON] Add support for ArrayType of nested StructType to arrow-based conversion - posted by GitBox <gi...@apache.org> on 2022/09/20 18:44:56 UTC, 1 replies.
- [GitHub] [spark] kazuyukitanimura closed pull request #37934: [SPARK-40477][SQL] Support `NullType` in `ColumnarBatchRow` - posted by GitBox <gi...@apache.org> on 2022/09/20 19:58:07 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #37952: [SPARK-40508] Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/20 20:23:24 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #37952: [SPARK-40508][SQL] Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/20 20:36:36 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37951: [SPARK-40506]Spark Streaming metrics name doesn't need application name - posted by GitBox <gi...@apache.org> on 2022/09/20 20:51:17 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37950: [SPARK-40505][K8S] Remove min heap setting for executor in entrypoint.sh - posted by GitBox <gi...@apache.org> on 2022/09/20 20:51:20 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37949: [SPARK-40504][YARN] Make yarn appmaster load config from client - posted by GitBox <gi...@apache.org> on 2022/09/20 20:51:23 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #37952: [SPARK-40508][SQL] Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/20 23:20:00 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37929: [SPARK-40486][PS] Implement `spearman` and `kendall` in `DataFrame.corrwith` - posted by GitBox <gi...@apache.org> on 2022/09/21 00:06:25 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37923: [SPARK-40334][PS] Implement `GroupBy.prod` - posted by GitBox <gi...@apache.org> on 2022/09/21 00:27:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37947: [SPARK-40500][PS] Use `pd.items` instead of `pd.iteritems` - posted by GitBox <gi...@apache.org> on 2022/09/21 01:05:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37947: [SPARK-40500][PS] Use `pd.items` instead of `pd.iteritems` - posted by GitBox <gi...@apache.org> on 2022/09/21 01:16:06 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 commented on pull request #37931: [SPARK-40488] Do not wrap exceptions thrown when datasource write fails - posted by GitBox <gi...@apache.org> on 2022/09/21 01:33:52 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #37937: [SPARK-40491][SQL] Remove too old TODO for JdbcRDD - posted by GitBox <gi...@apache.org> on 2022/09/21 01:44:06 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37947: [SPARK-40500][PS] Deprecate `iteritems` in DataFrame and Seriese - posted by GitBox <gi...@apache.org> on 2022/09/21 02:06:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37947: [SPARK-40500][PS] Deprecate `iteritems` in DataFrame and Seriese - posted by GitBox <gi...@apache.org> on 2022/09/21 02:06:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37947: [SPARK-40500][PS] Deprecate `iteritems` in DataFrame and Seriese - posted by GitBox <gi...@apache.org> on 2022/09/21 02:07:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37947: [SPARK-40500][PS] Deprecate `iteritems` in DataFrame and Seriese - posted by GitBox <gi...@apache.org> on 2022/09/21 02:09:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37929: [SPARK-40486][PS] Implement `spearman` and `kendall` in `DataFrame.corrwith` - posted by GitBox <gi...@apache.org> on 2022/09/21 02:20:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37937: [SPARK-40491][SQL] Remove too old TODO for JdbcRDD - posted by GitBox <gi...@apache.org> on 2022/09/21 02:24:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37937: [SPARK-40491][SQL] Remove too old TODO for JdbcRDD - posted by GitBox <gi...@apache.org> on 2022/09/21 02:24:56 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #37937: [SPARK-40491][SQL] Remove too old TODO for JdbcRDD - posted by GitBox <gi...@apache.org> on 2022/09/21 02:25:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37945: [SPARK-40498][PS] Implement `kendall` and `min_periods` in `Series.corr` - posted by GitBox <gi...@apache.org> on 2022/09/21 02:47:28 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #37944: [SQL][MINOR] Re-generate equals/hashCode of IdentifierImpl with non-null optimization - posted by GitBox <gi...@apache.org> on 2022/09/21 02:50:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37942: [SPARK-40496][SQL] Fix configs to control "enableDateTimeParsingFallback" - posted by GitBox <gi...@apache.org> on 2022/09/21 02:53:15 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37855: [SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/21 02:58:11 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37940: [SPARK-40494][CORE][SQL][ML][MLLIB] Optimize the performance of `keys.zipWithIndex.toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/21 03:10:47 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37940: [SPARK-40494][CORE][SQL][ML][MLLIB] Optimize the performance of `keys.zipWithIndex.toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/21 03:17:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37940: [SPARK-40494][CORE][SQL][ML][MLLIB] Optimize the performance of `keys.zipWithIndex.toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/21 03:20:28 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37855: [SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/21 03:35:14 UTC, 0 replies.
- [GitHub] [spark] Kwafoor commented on pull request #37951: [SPARK-40506]Spark Streaming metrics name doesn't need application name - posted by GitBox <gi...@apache.org> on 2022/09/21 03:37:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37953: [SPARK-40510][PS] Implement `ddof` in `Series.cov` - posted by GitBox <gi...@apache.org> on 2022/09/21 03:40:52 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #37941: [SPARK-40501][SQL] Enhance 'SpecialLimits' to support project(..., limit(...)) - posted by GitBox <gi...@apache.org> on 2022/09/21 03:49:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37954: [SPARK-40332][PS][DOCS][FOLLOWUP] Fix wrong underline length - posted by GitBox <gi...@apache.org> on 2022/09/21 04:03:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37948: [SPARK-40327][PS][DOCS] Add resampling to API references - posted by GitBox <gi...@apache.org> on 2022/09/21 04:20:30 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37954: [SPARK-40332][PS][DOCS][FOLLOWUP] Fix wrong underline length - posted by GitBox <gi...@apache.org> on 2022/09/21 04:20:57 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #37942: [SPARK-40496][SQL] Fix configs to control "enableDateTimeParsingFallback" - posted by GitBox <gi...@apache.org> on 2022/09/21 05:17:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37931: [SPARK-40488] Do not wrap exceptions thrown when datasource write fails - posted by GitBox <gi...@apache.org> on 2022/09/21 05:28:29 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37945: [SPARK-40498][PS] Implement `kendall` and `min_periods` in `Series.corr` - posted by GitBox <gi...@apache.org> on 2022/09/21 05:47:45 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37948: [SPARK-40327][PS][DOCS] Add resampling to API references - posted by GitBox <gi...@apache.org> on 2022/09/21 05:50:05 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #37948: [SPARK-40327][PS][DOCS] Add resampling to API references - posted by GitBox <gi...@apache.org> on 2022/09/21 05:53:49 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #37955: [SPARK-40512][PS][INFRA] Upgrade pandas to 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/21 06:06:34 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37844: [SPARK-40511][BUILD][CORE] Upgrade slf4j to 2.0.2 - posted by GitBox <gi...@apache.org> on 2022/09/21 06:10:11 UTC, 2 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37953: [SPARK-40510][PS] Implement `ddof` in `Series.cov` - posted by GitBox <gi...@apache.org> on 2022/09/21 06:29:46 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37955: [SPARK-40512][PS][INFRA] Upgrade pandas to 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/21 06:32:11 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37954: [SPARK-40332][PS][DOCS][FOLLOWUP] Fix wrong underline length - posted by GitBox <gi...@apache.org> on 2022/09/21 06:34:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37945: [SPARK-40498][PS] Implement `kendall` and `min_periods` in `Series.corr` - posted by GitBox <gi...@apache.org> on 2022/09/21 06:39:26 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37916: [SPARK-40473][SQL] Migrate parsing errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/21 06:45:06 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37953: [SPARK-40510][PS] Implement `ddof` in `Series.cov` - posted by GitBox <gi...@apache.org> on 2022/09/21 06:47:42 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #37955: [SPARK-40512][PS][INFRA] Upgrade pandas to 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/21 06:55:36 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37894: [DO-NOT-MERGE][SPARK-40435][SS][PYTHON] Add test suites for applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/21 07:05:22 UTC, 3 replies.
- [GitHub] [spark] xiaonanyang-db commented on pull request #37894: [DO-NOT-MERGE][SPARK-40435][SS][PYTHON] Add test suites for applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/21 07:07:07 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #37930: [SPARK-40487][SQL] Make defaultJoin in BroadcastNestedLoopJoinExec running in parallel - posted by GitBox <gi...@apache.org> on 2022/09/21 07:07:14 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37933: [SPARK-40474][SQL] Infer columns with mixed date and timestamp as String in CSV schema inference - posted by GitBox <gi...@apache.org> on 2022/09/21 07:14:46 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #37941: [SPARK-40501][SQL] Enhance 'SpecialLimits' to support project(..., limit(...)) - posted by GitBox <gi...@apache.org> on 2022/09/21 07:14:47 UTC, 3 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37936: [SPARK-40495] [SQL] [TESTS] Add additional tests to StreamingSessionWindowSuite - posted by GitBox <gi...@apache.org> on 2022/09/21 07:20:51 UTC, 4 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #37825: [SPARK-40382][SQL] Group distinct aggregate expressions by semantically equivalent children in `RewriteDistinctAggregates` - posted by GitBox <gi...@apache.org> on 2022/09/21 07:22:24 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37941: [SPARK-40501][SQL] Enhance 'SpecialLimits' to support project(..., limit(...)) - posted by GitBox <gi...@apache.org> on 2022/09/21 07:43:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37945: [SPARK-40498][PS] Implement `kendall` and `min_periods` in `Series.corr` - posted by GitBox <gi...@apache.org> on 2022/09/21 08:16:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37956: [SPARK-40514][TESTS] Make python related tests check python minimum support version - posted by GitBox <gi...@apache.org> on 2022/09/21 08:25:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37940: [SPARK-40494][CORE][SQL][ML][MLLIB] Optimize the performance of `keys.zipWithIndex.toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/21 08:35:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37940: [SPARK-40494][CORE][SQL][ML][MLLIB] Optimize the performance of `keys.zipWithIndex.toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/21 08:36:26 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37825: [SPARK-40382][SQL] Group distinct aggregate expressions by semantically equivalent children in `RewriteDistinctAggregates` - posted by GitBox <gi...@apache.org> on 2022/09/21 08:50:07 UTC, 3 replies.
- [GitHub] [spark] Yikf commented on a diff in pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/21 09:11:44 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37609: [SPARK-40175][SQL]Speed up conversion of Tuple2 to Scala Map - posted by GitBox <gi...@apache.org> on 2022/09/21 11:55:48 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37936: [SPARK-40495] [SQL] [TESTS] Add additional tests to StreamingSessionWindowSuite - posted by GitBox <gi...@apache.org> on 2022/09/21 12:16:22 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37935: [SPARK-40492][SS] Do maintenance before streaming StateStore unload - posted by GitBox <gi...@apache.org> on 2022/09/21 12:16:26 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #37938: [SPARK-40490][YARN][TESTS] Ensure `YarnShuffleIntegrationSuite` tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/21 13:05:47 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #37938: [SPARK-40490][YARN][TESTS] Ensure `YarnShuffleIntegrationSuite` tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/21 13:06:08 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37956: [SPARK-40514][CORE][SQL][YARN][PYTHON][TESTS] Make python related tests check python minimum support version - posted by GitBox <gi...@apache.org> on 2022/09/21 13:31:54 UTC, 2 replies.
- [GitHub] [spark] srowen commented on pull request #37955: [SPARK-40512][PS][INFRA] Upgrade pandas to 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/21 13:32:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/21 14:00:17 UTC, 10 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37933: [SPARK-40474][SQL] Infer columns with mixed date and timestamp as String in CSV schema inference - posted by GitBox <gi...@apache.org> on 2022/09/21 14:04:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37956: [SPARK-40514][CORE][SQL][YARN][PYTHON][TESTS] Make python related tests check python minimum support version - posted by GitBox <gi...@apache.org> on 2022/09/21 14:19:48 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37955: [SPARK-40512][PS][INFRA] Upgrade pandas to 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/21 16:22:14 UTC, 0 replies.
- [GitHub] [spark] sunchao closed pull request #37952: [SPARK-40508][SQL] Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/21 16:41:36 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #37952: [SPARK-40508][SQL] Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/21 16:43:21 UTC, 1 replies.
- [GitHub] [spark] entong commented on a diff in pull request #37916: [SPARK-40473][SQL] Migrate parsing errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/21 16:47:22 UTC, 0 replies.
- [GitHub] [spark] tedyu opened a new pull request, #37957: [SPARK-40508][SQL] Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/21 16:58:10 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #37957: [SPARK-40508][SQL] Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/21 16:58:28 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #37844: [SPARK-40511][BUILD][CORE] Upgrade slf4j to 2.0.2 - posted by GitBox <gi...@apache.org> on 2022/09/21 17:24:25 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #37957: [SPARK-40508][3.3][SQL] Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/21 17:29:10 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #37958: [SPARK-40522][BUILD] Upgrade `kafka` from 3.2.1 to 3.2.2 - posted by GitBox <gi...@apache.org> on 2022/09/21 17:41:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37957: [SPARK-40508][3.3][SQL] Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/21 17:52:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37938: [SPARK-40490][YARN][TESTS] Ensure `YarnShuffleIntegrationSuite` tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/21 17:54:17 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #37958: [SPARK-40522][BUILD] Upgrade `kafka` from 3.2.1 to 3.2.2 - posted by GitBox <gi...@apache.org> on 2022/09/21 18:03:44 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #37887: [SPARK-40360] ALREADY_EXISTS and NOT_FOUND exceptions - posted by GitBox <gi...@apache.org> on 2022/09/21 20:03:59 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37902: [SPARK-40359][SQL] Migrate type check fails in CSV/JSON expressions to error classes - posted by GitBox <gi...@apache.org> on 2022/09/21 20:30:23 UTC, 0 replies.
- [GitHub] [spark] dtenedor closed pull request #37884: [WIP][SPARK-40427][SQL] Move LIMIT/OFFSET CheckAnalysis error messages to use the new error framework - posted by GitBox <gi...@apache.org> on 2022/09/21 21:43:01 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 commented on a diff in pull request #37922: [WIP][SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/09/21 21:48:12 UTC, 2 replies.
- [GitHub] [spark] sunchao closed pull request #37957: [SPARK-40508][SQL][3.3] Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/21 21:51:14 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #37957: [SPARK-40508][SQL][3.3] Treat unknown partitioning as UnknownPartitioning - posted by GitBox <gi...@apache.org> on 2022/09/21 21:51:20 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #37933: [SPARK-40474][SQL] Correct CSV schema inference and data parsing behavior on columns with mixed dates and timestamps - posted by GitBox <gi...@apache.org> on 2022/09/21 22:54:23 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37933: [SPARK-40474][SQL] Correct CSV schema inference and data parsing behavior on columns with mixed dates and timestamps - posted by GitBox <gi...@apache.org> on 2022/09/22 00:16:00 UTC, 6 replies.
- [GitHub] [spark] xiaonanyang-db commented on a diff in pull request #37933: [SPARK-40474][SQL] Correct CSV schema inference and data parsing behavior on columns with mixed dates and timestamps - posted by GitBox <gi...@apache.org> on 2022/09/22 00:22:00 UTC, 7 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36844: Update ExecutorClassLoader.scala - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:13 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36798: [SPARK-39408][SQL] Update the buildKeys for DynamicPruningSubquery.withNewPlan - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36751: [WIP][SPARK-39366][CORE] Do not release write locks on task end. - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:15 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36678: [SPARK-39297][CORE][UI] bugfix: spark.ui.proxyBase contains proxy or history - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:17 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36668: [SPARK-39291][CORE] Fetch blocks and open stream should not respond a closed channel - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:19 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36665: [SPARK-39287][CORE] TaskSchedulerImpl should quickly ignore task finished event if its task was finished state. - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:20 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36613: [WIP][SPARK-30983] Support typed select in Datasets up to the max tuple size - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:21 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36551: [SPARK-38463][CORE] Use error classes in org.apache.spark.input - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:22 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36548: [SPARK-38470][CORE] Use error classes in org.apache.spark.partial - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:23 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36540: [SPARK-38466][CORE] Use error classes in org.apache.spark.mapred - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:25 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36495: [SPARK-39136][SQL] JDBCTable support table properties - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36485: [SPARK-39128][SQL][HIVE] Log cost time for getting FileStatus in HadoopTableReader - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:27 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36483: [SPARK-39126][SQL] After eliminating join to one side, that side should take advantage of LocalShuffleRead optimization - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:29 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36453: SPARK-39103: SparkContext.addFiles trigger backend exception if it tr… - posted by GitBox <gi...@apache.org> on 2022/09/22 00:23:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #37959: [SPARK-40142][PYTHON][DOCS][FOLLOW-UP] Remove non-ANSI compliant example in element_at - posted by GitBox <gi...@apache.org> on 2022/09/22 00:29:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37855: [SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/22 00:46:59 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37959: [SPARK-40142][PYTHON][DOCS][FOLLOW-UP] Remove non-ANSI compliant example in element_at - posted by GitBox <gi...@apache.org> on 2022/09/22 00:50:36 UTC, 1 replies.
- [GitHub] [spark] ukby1234 opened a new pull request, #37960: [SPARK-39200][CORE] Make Fallback Storage readFully on content - posted by GitBox <gi...@apache.org> on 2022/09/22 01:23:08 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #37955: [SPARK-40512][PS][INFRA] Upgrade pandas to 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/22 01:29:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #35391: [SPARK-38098][PYTHON] Add support for ArrayType of nested StructType to arrow-based conversion - posted by GitBox <gi...@apache.org> on 2022/09/22 01:59:56 UTC, 0 replies.
- [GitHub] [spark] YetiCuzMountain commented on pull request #37280: [SPARK-39862][SQL] Fix two bugs in existence DEFAULT value lookups - posted by GitBox <gi...@apache.org> on 2022/09/22 02:11:29 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37960: [SPARK-39200][CORE] Make Fallback Storage readFully on content - posted by GitBox <gi...@apache.org> on 2022/09/22 02:13:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37915: [SPARK-40465][SQL] Refactor Decimal so as we can use other underlying implementation - posted by GitBox <gi...@apache.org> on 2022/09/22 02:14:54 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #37915: [SPARK-40465][SQL] Refactor Decimal so as we can use other underlying implementation - posted by GitBox <gi...@apache.org> on 2022/09/22 02:33:31 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37959: [SPARK-40142][PYTHON][DOCS][FOLLOW-UP] Remove non-ANSI compliant example in element_at - posted by GitBox <gi...@apache.org> on 2022/09/22 02:44:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37837: [SPARK-40385][SQL] Fix interpreted path for companion object constructor - posted by GitBox <gi...@apache.org> on 2022/09/22 02:46:48 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #37961: [BUILD] Upgrade Scala to 2.13.9 - posted by GitBox <gi...@apache.org> on 2022/09/22 02:48:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37962: [SPARK-40490][YARN][TESTS][3.3] Ensure YarnShuffleIntegrationSuite tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/22 02:55:07 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37963: [SPARK-40490][YARN][TESTS][3.2] Ensure YarnShuffleIntegrationSuite tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/22 02:55:48 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37779: [wip][SPARK-40320][Core] Executor should exit when it failed to initialize for fatal error - posted by GitBox <gi...@apache.org> on 2022/09/22 02:59:21 UTC, 6 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #37960: [SPARK-39200][CORE] Make Fallback Storage readFully on content - posted by GitBox <gi...@apache.org> on 2022/09/22 03:00:09 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37962: [SPARK-40490][YARN][TESTS][3.3] Ensure YarnShuffleIntegrationSuite tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/22 03:02:04 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37963: [SPARK-40490][YARN][TESTS][3.2] Ensure YarnShuffleIntegrationSuite tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/22 03:02:24 UTC, 2 replies.
- [GitHub] [spark] wangyum closed pull request #37930: [SPARK-40487][SQL] Make defaultJoin in BroadcastNestedLoopJoinExec running in parallel - posted by GitBox <gi...@apache.org> on 2022/09/22 03:14:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37893: [SPARK-40434][SS][PYTHON] Implement applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/22 03:35:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37963: [SPARK-40490][YARN][TESTS][3.2] Ensure YarnShuffleIntegrationSuite tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/22 03:39:29 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37961: [WIP][SPARK-40526][BUILD] Upgrade Scala to 2.13.9 - posted by GitBox <gi...@apache.org> on 2022/09/22 03:42:46 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #37961: [WIP][SPARK-40526][BUILD] Upgrade Scala to 2.13.9 - posted by GitBox <gi...@apache.org> on 2022/09/22 03:46:31 UTC, 0 replies.
- [GitHub] [spark] panbingkun closed pull request #37961: [WIP][SPARK-40526][BUILD] Upgrade Scala to 2.13.9 - posted by GitBox <gi...@apache.org> on 2022/09/22 03:46:44 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #37964: [SPARK-40434][SS][PYTHON][FOLLOWUP] Address review comments - posted by GitBox <gi...@apache.org> on 2022/09/22 04:05:59 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37964: [SPARK-40434][SS][PYTHON][FOLLOWUP] Address review comments - posted by GitBox <gi...@apache.org> on 2022/09/22 04:06:14 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37863: [WIP][DO-NOT-MERGE] Reference PR for flatMapGroupsWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/22 04:54:19 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37863: [WIP][DO-NOT-MERGE] Reference PR for flatMapGroupsWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/22 04:54:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37963: [SPARK-40490][YARN][TESTS][3.2] Ensure YarnShuffleIntegrationSuite tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/22 05:02:38 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37960: [SPARK-39200][CORE] Make Fallback Storage readFully on content - posted by GitBox <gi...@apache.org> on 2022/09/22 05:05:23 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37960: [SPARK-39200][CORE] Make Fallback Storage readFully on content - posted by GitBox <gi...@apache.org> on 2022/09/22 05:06:05 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37948: [SPARK-40327][PS][DOCS] Add resampling to API references - posted by GitBox <gi...@apache.org> on 2022/09/22 05:08:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37941: [SPARK-40501][SQL] Enhance 'SpecialLimits' to support project(..., limit(...)) - posted by GitBox <gi...@apache.org> on 2022/09/22 05:08:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37958: [SPARK-40522][BUILD] Upgrade `kafka` to 3.2.3 - posted by GitBox <gi...@apache.org> on 2022/09/22 05:09:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37953: [SPARK-40510][PS] Implement `ddof` in `Series.cov` - posted by GitBox <gi...@apache.org> on 2022/09/22 05:12:27 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37964: [SPARK-40434][SS][PYTHON][FOLLOWUP] Address review comments - posted by GitBox <gi...@apache.org> on 2022/09/22 05:26:52 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37894: [SPARK-40435][SS][PYTHON] Add test suites for applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/22 06:36:10 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37894: [SPARK-40435][SS][PYTHON] Add test suites for applyInPandasWithState in PySpark - posted by GitBox <gi...@apache.org> on 2022/09/22 06:37:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37963: [SPARK-40490][YARN][TESTS][3.2] Ensure YarnShuffleIntegrationSuite tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/22 06:51:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37941: [SPARK-40501][SQL] Enhance 'SpecialLimits' to support project(..., limit(...)) - posted by GitBox <gi...@apache.org> on 2022/09/22 07:30:51 UTC, 2 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #37965: [SPARK-40527][SQL] Keep struct field names or map keys in CreateStruct - posted by GitBox <gi...@apache.org> on 2022/09/22 07:47:48 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #37965: [SPARK-40527][SQL] Keep struct field names or map keys in CreateStruct - posted by GitBox <gi...@apache.org> on 2022/09/22 07:50:47 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37902: [SPARK-40359][SQL] Migrate type check fails in CSV/JSON expressions to error classes - posted by GitBox <gi...@apache.org> on 2022/09/22 07:50:49 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #37916: [SPARK-40473][SQL] Migrate parsing errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/22 08:24:58 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #37966: [SPARK-40462][PYTHON] Support np.ndarray for `functions.lit`. - posted by GitBox <gi...@apache.org> on 2022/09/22 08:45:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37966: [SPARK-40462][PYTHON] Support np.ndarray for `functions.lit`. - posted by GitBox <gi...@apache.org> on 2022/09/22 08:50:12 UTC, 0 replies.
- [GitHub] [spark] EvgenyZamyatin opened a new pull request, #37967: [WIP] Scalable SkipGram-Word2Vec implementation - posted by GitBox <gi...@apache.org> on 2022/09/22 09:02:45 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37962: [SPARK-40490][YARN][TESTS][3.3] Ensure YarnShuffleIntegrationSuite tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/22 09:33:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37962: [SPARK-40490][YARN][TESTS][3.3] Ensure YarnShuffleIntegrationSuite tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/22 09:35:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37962: [SPARK-40490][YARN][TESTS][3.3] Ensure YarnShuffleIntegrationSuite tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/22 09:43:11 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37964: [SPARK-40434][SS][PYTHON][FOLLOWUP] Address review comments - posted by GitBox <gi...@apache.org> on 2022/09/22 09:50:10 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37953: [SPARK-40510][PS] Implement `ddof` in `Series.cov` - posted by GitBox <gi...@apache.org> on 2022/09/22 09:52:20 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37902: [SPARK-40359][SQL] Migrate type check fails in CSV/JSON expressions to error classes - posted by GitBox <gi...@apache.org> on 2022/09/22 09:57:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37968: [SPARK-40529][PS] Remove `pyspark.pandas.ml` - posted by GitBox <gi...@apache.org> on 2022/09/22 10:04:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37968: [SPARK-40529][PS] Remove `pyspark.pandas.ml` - posted by GitBox <gi...@apache.org> on 2022/09/22 10:05:05 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37918: [SPARK-40476][ML][SQL] Reduce the shuffle size of ALS - posted by GitBox <gi...@apache.org> on 2022/09/22 10:16:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37963: [SPARK-40490][YARN][TESTS][3.2] Ensure YarnShuffleIntegrationSuite tests registeredExecFile reload scenarios - posted by GitBox <gi...@apache.org> on 2022/09/22 11:15:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #37969: [SPARK-40530][SQL] Add error-related developer APIs - posted by GitBox <gi...@apache.org> on 2022/09/22 11:15:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37969: [SPARK-40530][SQL] Add error-related developer APIs - posted by GitBox <gi...@apache.org> on 2022/09/22 11:16:34 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37969: [SPARK-40530][SQL] Add error-related developer APIs - posted by GitBox <gi...@apache.org> on 2022/09/22 11:16:37 UTC, 7 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #37970: [WIP] Upgrade zstd-jni from 1.5.2-3 to 1.5.2-4 - posted by GitBox <gi...@apache.org> on 2022/09/22 11:42:07 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37967: [WIP] Scalable SkipGram-Word2Vec implementation - posted by GitBox <gi...@apache.org> on 2022/09/22 11:53:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37968: [SPARK-40529][PS] Remove `pyspark.pandas.ml` - posted by GitBox <gi...@apache.org> on 2022/09/22 12:04:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #37943: [WIP][SPARK-40497][BUILD] Upgrade Scala to 2.13.9 - posted by GitBox <gi...@apache.org> on 2022/09/22 12:12:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37971: [MINOR][YARN][TESTS] Rename `logConfFile` in `BaseYarnClusterSuite` from `log4j.properties` to `log4j2.properties` - posted by GitBox <gi...@apache.org> on 2022/09/22 12:39:59 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37965: [SPARK-40527][SQL] Keep struct field names or map keys in CreateStruct - posted by GitBox <gi...@apache.org> on 2022/09/22 12:48:04 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on pull request #37779: [wip][SPARK-40320][Core] Executor should exit when it failed to initialize for fatal error - posted by GitBox <gi...@apache.org> on 2022/09/22 12:56:24 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37931: [SPARK-40488] Do not wrap exceptions thrown when datasource write fails - posted by GitBox <gi...@apache.org> on 2022/09/22 12:57:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37931: [SPARK-40488] Do not wrap exceptions thrown when datasource write fails - posted by GitBox <gi...@apache.org> on 2022/09/22 12:58:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37855: [SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/22 12:59:18 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #34474: [SPARK-37203][SQL] Fix NotSerializableException when observe with TypedImperativeAggregate - posted by GitBox <gi...@apache.org> on 2022/09/22 13:07:49 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37933: [SPARK-40474][SQL] Correct CSV schema inference and data parsing behavior on columns with mixed dates and timestamps - posted by GitBox <gi...@apache.org> on 2022/09/22 13:09:17 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #37918: [SPARK-40476][ML][SQL] Reduce the shuffle size of ALS - posted by GitBox <gi...@apache.org> on 2022/09/22 13:28:23 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37918: [SPARK-40476][ML][SQL] Reduce the shuffle size of ALS - posted by GitBox <gi...@apache.org> on 2022/09/22 13:28:23 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #36844: Update ExecutorClassLoader.scala - posted by GitBox <gi...@apache.org> on 2022/09/22 13:29:16 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37960: [SPARK-39200][CORE] Make Fallback Storage readFully on content - posted by GitBox <gi...@apache.org> on 2022/09/22 14:36:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37969: [SPARK-40530][SQL] Add error-related developer APIs - posted by GitBox <gi...@apache.org> on 2022/09/22 16:30:30 UTC, 2 replies.
- [GitHub] [spark] BryanCutler closed pull request #35391: [SPARK-38098][PYTHON] Add support for ArrayType of nested StructType to arrow-based conversion - posted by GitBox <gi...@apache.org> on 2022/09/22 17:10:16 UTC, 0 replies.
- [GitHub] [spark] BryanCutler commented on pull request #35391: [SPARK-38098][PYTHON] Add support for ArrayType of nested StructType to arrow-based conversion - posted by GitBox <gi...@apache.org> on 2022/09/22 17:10:36 UTC, 0 replies.
- [GitHub] [spark] SandishKumarHN opened a new pull request, #37972: Protobuf support for Spark - from_proto AND to_proto - posted by GitBox <gi...@apache.org> on 2022/09/22 17:26:16 UTC, 0 replies.
- [GitHub] [spark] bluesmoon commented on a diff in pull request #16497: [SPARK-19118] [SQL] Percentile support for frequency distribution table - posted by GitBox <gi...@apache.org> on 2022/09/22 18:21:41 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37973: [WIP][SPARK-40540][SQL] Migrate compilation errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/22 19:39:36 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #37969: [SPARK-40530][SQL] Add error-related developer APIs - posted by GitBox <gi...@apache.org> on 2022/09/22 20:39:03 UTC, 3 replies.
- [GitHub] [spark] ukby1234 commented on a diff in pull request #37960: [SPARK-39200][CORE] Make Fallback Storage readFully on content - posted by GitBox <gi...@apache.org> on 2022/09/22 21:55:43 UTC, 4 replies.
- [GitHub] [spark] srowen commented on pull request #37970: [SPARK-40531][BUILD] Upgrade zstd-jni from 1.5.2-3 to 1.5.2-4 - posted by GitBox <gi...@apache.org> on 2022/09/22 23:10:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37655: [SPARK-40218][SQL] GROUPING SETS should preserve the grouping columns - posted by GitBox <gi...@apache.org> on 2022/09/23 00:00:58 UTC, 11 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37974: [SPARK-40542][PS] Make `ddof` in `DataFrame.std` and `Series.std` accept arbitary integers - posted by GitBox <gi...@apache.org> on 2022/09/23 00:16:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36856: [SPARK-39455][SQL] Improve expression non-codegen code path performance by cache data type matching - posted by GitBox <gi...@apache.org> on 2022/09/23 00:27:48 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36823: [SPARK-39429][SQL] Convert Inner Join With Aggregation to Semi Join - posted by GitBox <gi...@apache.org> on 2022/09/23 00:27:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36798: [SPARK-39408][SQL] Update the buildKeys for DynamicPruningSubquery.withNewPlan - posted by GitBox <gi...@apache.org> on 2022/09/23 00:27:52 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36751: [WIP][SPARK-39366][CORE] Do not release write locks on task end. - posted by GitBox <gi...@apache.org> on 2022/09/23 00:27:53 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36678: [SPARK-39297][CORE][UI] bugfix: spark.ui.proxyBase contains proxy or history - posted by GitBox <gi...@apache.org> on 2022/09/23 00:27:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36668: [SPARK-39291][CORE] Fetch blocks and open stream should not respond a closed channel - posted by GitBox <gi...@apache.org> on 2022/09/23 00:27:55 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36665: [SPARK-39287][CORE] TaskSchedulerImpl should quickly ignore task finished event if its task was finished state. - posted by GitBox <gi...@apache.org> on 2022/09/23 00:27:56 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36613: [WIP][SPARK-30983] Support typed select in Datasets up to the max tuple size - posted by GitBox <gi...@apache.org> on 2022/09/23 00:27:58 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36495: [SPARK-39136][SQL] JDBCTable support table properties - posted by GitBox <gi...@apache.org> on 2022/09/23 00:28:00 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36485: [SPARK-39128][SQL][HIVE] Log cost time for getting FileStatus in HadoopTableReader - posted by GitBox <gi...@apache.org> on 2022/09/23 00:28:01 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36483: [SPARK-39126][SQL] After eliminating join to one side, that side should take advantage of LocalShuffleRead optimization - posted by GitBox <gi...@apache.org> on 2022/09/23 00:28:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36453: SPARK-39103: SparkContext.addFiles trigger backend exception if it tr… - posted by GitBox <gi...@apache.org> on 2022/09/23 00:28:03 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36438: [SPARK-39092][SQL] Propagate Empty Partitions - posted by GitBox <gi...@apache.org> on 2022/09/23 00:28:04 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36378: [SPARK-39022][SQL] Fix combination of HAVING and SORT not being resolved correctly - posted by GitBox <gi...@apache.org> on 2022/09/23 00:28:06 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36305: [SPARK-38987][shuffle] Handle fallback when merged shuffle blocks are corrupted and spark.shuffle.detectCorrupt is set to true - posted by GitBox <gi...@apache.org> on 2022/09/23 00:28:08 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36304: [SPARK-38959][SQL] DS V2: Support runtime group filtering in row-level commands - posted by GitBox <gi...@apache.org> on 2022/09/23 00:28:10 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36301: [SPARK-21697][SQL] NPE & ExceptionInInitializerError trying to load UDF from HDFS - posted by GitBox <gi...@apache.org> on 2022/09/23 00:28:12 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36279: [WIP][SPARK-38965][SHUFFLE]Optimize RemoteBlockPushResolver with a memory pool - posted by GitBox <gi...@apache.org> on 2022/09/23 00:28:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36234: [SPARK-38409][CORE] Do not export gauges with null values in prometheus metric snapshots - posted by GitBox <gi...@apache.org> on 2022/09/23 00:28:16 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #37970: [SPARK-40531][BUILD] Upgrade zstd-jni from 1.5.2-3 to 1.5.2-4 - posted by GitBox <gi...@apache.org> on 2022/09/23 00:33:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37965: [SPARK-40527][SQL] Keep struct field names or map keys in CreateStruct - posted by GitBox <gi...@apache.org> on 2022/09/23 01:07:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37965: [SPARK-40527][SQL] Keep struct field names or map keys in CreateStruct - posted by GitBox <gi...@apache.org> on 2022/09/23 01:08:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37970: [SPARK-40531][BUILD] Upgrade zstd-jni from 1.5.2-3 to 1.5.2-4 - posted by GitBox <gi...@apache.org> on 2022/09/23 01:08:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37970: [SPARK-40531][BUILD] Upgrade zstd-jni from 1.5.2-3 to 1.5.2-4 - posted by GitBox <gi...@apache.org> on 2022/09/23 01:08:57 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #37533: [SPARK-40096]Fix finalize shuffle stage slow due to connection creation slow - posted by GitBox <gi...@apache.org> on 2022/09/23 01:31:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37974: [SPARK-40542][PS][SQL] Make `ddof` in `DataFrame.std` and `Series.std` accept arbitary integers - posted by GitBox <gi...@apache.org> on 2022/09/23 02:14:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37974: [SPARK-40542][PS][SQL] Make `ddof` in `DataFrame.std` and `Series.std` accept arbitary integers - posted by GitBox <gi...@apache.org> on 2022/09/23 02:14:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37933: [SPARK-40474][SQL] Correct CSV schema inference and data parsing behavior on columns with mixed dates and timestamps - posted by GitBox <gi...@apache.org> on 2022/09/23 02:30:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37975: [SPARK-40543][PS][SQL] Make `ddof` in `DataFrame.var` and `Series.var` accept arbitary integers - posted by GitBox <gi...@apache.org> on 2022/09/23 02:30:56 UTC, 0 replies.
- [GitHub] [spark] brkyvz commented on a diff in pull request #37933: [SPARK-40474][SQL] Correct CSV schema inference and data parsing behavior on columns with mixed dates and timestamps - posted by GitBox <gi...@apache.org> on 2022/09/23 02:33:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37971: [MINOR][YARN][TESTS] Rename `logConfFile` in `BaseYarnClusterSuite` from `log4j.properties` to `log4j2.properties` - posted by GitBox <gi...@apache.org> on 2022/09/23 02:38:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37971: [MINOR][YARN][TESTS] Rename `logConfFile` in `BaseYarnClusterSuite` from `log4j.properties` to `log4j2.properties` - posted by GitBox <gi...@apache.org> on 2022/09/23 02:39:05 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37971: [MINOR][YARN][TESTS] Rename `logConfFile` in `BaseYarnClusterSuite` from `log4j.properties` to `log4j2.properties` - posted by GitBox <gi...@apache.org> on 2022/09/23 02:39:28 UTC, 0 replies.
- [GitHub] [spark] SandishKumarHN commented on pull request #37972: Protobuf support for Spark - from_proto AND to_proto - posted by GitBox <gi...@apache.org> on 2022/09/23 03:07:51 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37976: [DON'T MERGE][SQL][TESTS] Restore the file appender log level threshold of the hive UTs to info - posted by GitBox <gi...@apache.org> on 2022/09/23 03:14:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37976: [DON'T MERGE][SQL][TESTS] Restore the file appender log level threshold of the hive UTs to info - posted by GitBox <gi...@apache.org> on 2022/09/23 03:22:05 UTC, 2 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #37966: [SPARK-40462][PYTHON] Support np.ndarray for `functions.lit`. - posted by GitBox <gi...@apache.org> on 2022/09/23 03:41:27 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37935: [SPARK-40492][SS] Do maintenance before streaming StateStore unload - posted by GitBox <gi...@apache.org> on 2022/09/23 03:44:53 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #34474: [SPARK-37203][SQL] Fix NotSerializableException when observe with TypedImperativeAggregate - posted by GitBox <gi...@apache.org> on 2022/09/23 04:11:24 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #37966: [SPARK-40462][PYTHON] Support np.ndarray for `functions.lit` - posted by GitBox <gi...@apache.org> on 2022/09/23 04:12:47 UTC, 0 replies.
- [GitHub] [spark] itholic closed pull request #37966: [SPARK-40462][PYTHON] Support np.ndarray for `functions.lit` - posted by GitBox <gi...@apache.org> on 2022/09/23 04:12:48 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #37941: [SPARK-40501][SQL] Add PushProjectionThroughLimit for Optimizer - posted by GitBox <gi...@apache.org> on 2022/09/23 04:53:00 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37655: [SPARK-40218][SQL] GROUPING SETS should preserve the grouping columns - posted by GitBox <gi...@apache.org> on 2022/09/23 05:09:36 UTC, 3 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #37935: [SPARK-40492][SS] Do maintenance before streaming StateStore unload - posted by GitBox <gi...@apache.org> on 2022/09/23 05:14:52 UTC, 1 replies.
- [GitHub] [spark] chaoqin-li1123 commented on a diff in pull request #37935: [SPARK-40492][SS] Do maintenance before streaming StateStore unload - posted by GitBox <gi...@apache.org> on 2022/09/23 05:22:38 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #37977: [SPARK-37203][SQL][FOLLOWUP] Fix bug the buffer of AggregatingAccumulator will not be created if the input rows is empty - posted by GitBox <gi...@apache.org> on 2022/09/23 05:48:05 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #37977: [SPARK-37203][SQL][FOLLOWUP] Fix bug the buffer of AggregatingAccumulator will not be created if the input rows is empty - posted by GitBox <gi...@apache.org> on 2022/09/23 05:49:30 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #37787: [SPARK-40323][BUILD] Update ORC to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/09/23 06:44:05 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37978: [SPARK-40330][PS] Implement `Series.searchsorted` - posted by GitBox <gi...@apache.org> on 2022/09/23 06:44:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37975: [SPARK-40543][PS][SQL] Make `ddof` in `DataFrame.var` and `Series.var` accept arbitary integers - posted by GitBox <gi...@apache.org> on 2022/09/23 06:54:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37975: [SPARK-40543][PS][SQL] Make `ddof` in `DataFrame.var` and `Series.var` accept arbitary integers - posted by GitBox <gi...@apache.org> on 2022/09/23 06:54:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37979: [SPARK-40545][SQL][TESTS] Clean up `metastorePath` after `SparkSQLEnvSuite` execution - posted by GitBox <gi...@apache.org> on 2022/09/23 06:56:32 UTC, 0 replies.
- [GitHub] [spark] ukby1234 commented on pull request #37960: [SPARK-39200][CORE] Make Fallback Storage readFully on content - posted by GitBox <gi...@apache.org> on 2022/09/23 07:02:00 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37978: [SPARK-40330][PS] Implement `Series.searchsorted` - posted by GitBox <gi...@apache.org> on 2022/09/23 07:44:52 UTC, 3 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #37655: [SPARK-40218][SQL] GROUPING SETS should preserve the grouping columns - posted by GitBox <gi...@apache.org> on 2022/09/23 07:50:47 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #37933: [SPARK-40474][SQL] Correct CSV schema inference and data parsing behavior on columns with mixed dates and timestamps - posted by GitBox <gi...@apache.org> on 2022/09/23 07:56:15 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #37980: [SPARK-40322][DOCS] Fix all dead links in the docs - posted by GitBox <gi...@apache.org> on 2022/09/23 08:13:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37941: [SPARK-40501][SQL] Add PushProjectionThroughLimit for Optimizer - posted by GitBox <gi...@apache.org> on 2022/09/23 08:31:55 UTC, 2 replies.
- [GitHub] [spark] wangyum closed pull request #37980: [SPARK-40322][DOCS] Fix all dead links in the docs - posted by GitBox <gi...@apache.org> on 2022/09/23 08:39:36 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #37981: [SPARK-40322][DOCS] Fix all dead links in the docs - posted by GitBox <gi...@apache.org> on 2022/09/23 08:48:52 UTC, 0 replies.
- [GitHub] [spark] huleilei commented on pull request #33754: [SPARK-36526][SQL] DSV2 Index Support: Add supportsIndex interface - posted by GitBox <gi...@apache.org> on 2022/09/23 08:54:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #36027: [SPARK-38717][SQL] Handle Hive's bucket spec case preserving behaviour - posted by GitBox <gi...@apache.org> on 2022/09/23 09:30:17 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #36027: [SPARK-38717][SQL] Handle Hive's bucket spec case preserving behaviour - posted by GitBox <gi...@apache.org> on 2022/09/23 09:30:42 UTC, 0 replies.
- [GitHub] [spark] EvgenyZamyatin commented on pull request #37967: [WIP] Scalable SkipGram-Word2Vec implementation - posted by GitBox <gi...@apache.org> on 2022/09/23 09:51:26 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37977: [SPARK-40535][SQL] Fix bug the buffer of AggregatingAccumulator will not be created if the input rows is empty - posted by GitBox <gi...@apache.org> on 2022/09/23 11:21:49 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37977: [SPARK-40535][SQL] Fix bug the buffer of AggregatingAccumulator will not be created if the input rows is empty - posted by GitBox <gi...@apache.org> on 2022/09/23 11:23:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37960: [SPARK-39200][CORE] Make Fallback Storage readFully on content - posted by GitBox <gi...@apache.org> on 2022/09/23 11:23:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37787: [SPARK-40323][BUILD] Update ORC to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/09/23 11:40:06 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #37977: [SPARK-40535][SQL] Fix bug the buffer of AggregatingAccumulator will not be created if the input rows is empty - posted by GitBox <gi...@apache.org> on 2022/09/23 12:01:15 UTC, 0 replies.
- [GitHub] [spark] bryanck commented on pull request #37950: [SPARK-40505][K8S] Remove min heap setting for executor in entrypoint.sh - posted by GitBox <gi...@apache.org> on 2022/09/23 12:07:40 UTC, 3 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #35789: [SPARK-32268][SQL] Row-level Runtime Filtering - posted by GitBox <gi...@apache.org> on 2022/09/23 12:15:23 UTC, 2 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #37982: [SPARK-38717][SQL][3.3] Handle Hive's bucket spec case preserving behaviour - posted by GitBox <gi...@apache.org> on 2022/09/23 12:54:34 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #37982: [SPARK-38717][SQL][3.3] Handle Hive's bucket spec case preserving behaviour - posted by GitBox <gi...@apache.org> on 2022/09/23 12:55:15 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37972: [WIP] : Protobuf support for Spark - from_proto AND to_proto - posted by GitBox <gi...@apache.org> on 2022/09/23 13:23:00 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37973: [SPARK-40540][SQL] Migrate compilation errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/23 13:56:49 UTC, 1 replies.
- [GitHub] [spark] wangyum opened a new pull request, #37983: [MINOR][DOCS] Fix dead links in sparkr-vignettes.Rmd - posted by GitBox <gi...@apache.org> on 2022/09/23 14:05:26 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #37918: [SPARK-40476][ML][SQL] Reduce the shuffle size of ALS - posted by GitBox <gi...@apache.org> on 2022/09/23 14:05:37 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37983: [MINOR][DOCS] Fix dead links in sparkr-vignettes.Rmd - posted by GitBox <gi...@apache.org> on 2022/09/23 14:05:47 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37983: [MINOR][DOCS] Fix dead links in sparkr-vignettes.Rmd - posted by GitBox <gi...@apache.org> on 2022/09/23 14:45:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37950: [SPARK-40505][K8S] Remove min heap setting for executor in entrypoint.sh - posted by GitBox <gi...@apache.org> on 2022/09/23 14:49:00 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37950: [SPARK-40505][K8S] Remove min heap setting for executor in entrypoint.sh - posted by GitBox <gi...@apache.org> on 2022/09/23 14:49:21 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37983: [SPARK-40547][DOCS] Fix dead links in sparkr-vignettes.Rmd - posted by GitBox <gi...@apache.org> on 2022/09/23 15:03:23 UTC, 0 replies.
- [GitHub] [spark] sigmod commented on a diff in pull request #35789: [SPARK-32268][SQL] Row-level Runtime Filtering - posted by GitBox <gi...@apache.org> on 2022/09/23 15:54:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37983: [SPARK-40547][DOCS] Fix dead links in sparkr-vignettes.Rmd - posted by GitBox <gi...@apache.org> on 2022/09/23 15:57:46 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37983: [SPARK-40547][DOCS] Fix dead links in sparkr-vignettes.Rmd - posted by GitBox <gi...@apache.org> on 2022/09/23 15:58:20 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37981: [SPARK-40322][DOCS] Fix all dead links in the docs - posted by GitBox <gi...@apache.org> on 2022/09/23 16:00:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37981: [SPARK-40322][DOCS] Fix all dead links in the docs - posted by GitBox <gi...@apache.org> on 2022/09/23 16:00:36 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #37973: [SPARK-40540][SQL] Migrate compilation errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/23 20:19:42 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #37982: [SPARK-38717][SQL][3.3] Handle Hive's bucket spec case preserving behaviour - posted by GitBox <gi...@apache.org> on 2022/09/23 20:31:59 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37973: [SPARK-40540][SQL] Migrate compilation errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/23 20:44:44 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #37982: [SPARK-38717][SQL][3.3] Handle Hive's bucket spec case preserving behaviour - posted by GitBox <gi...@apache.org> on 2022/09/23 20:44:49 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #37941: [SPARK-40501][SQL] Add PushProjectionThroughLimit for Optimizer - posted by GitBox <gi...@apache.org> on 2022/09/23 23:32:26 UTC, 4 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36859: DTW: new distance measure for clustering - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:23 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36856: [SPARK-39455][SQL] Improve expression non-codegen code path performance by cache data type matching - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:24 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36823: [SPARK-39429][SQL] Convert Inner Join With Aggregation to Semi Join - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:25 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36789: [SPARK-39403] Add SPARK_SUBMIT_OPTS in spark-env.sh.template - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36770: [SPARK-39382][WEBUI] UI show the duration of the failed task when the executor lost - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:27 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36700: [SPARK-39318][SQL] Remove tpch-plan-stability WithStats golden files - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:28 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36658: [SPARK-39278][CORE] Fix backward compatibility of alternative configs of Hadoop Filesystems to access - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:29 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36551: [SPARK-38463][CORE] Use error classes in org.apache.spark.input - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:32 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36548: [SPARK-38470][CORE] Use error classes in org.apache.spark.partial - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:33 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36540: [SPARK-38466][CORE] Use error classes in org.apache.spark.mapred - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:34 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36438: [SPARK-39092][SQL] Propagate Empty Partitions - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:35 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36279: [WIP][SPARK-38965][SHUFFLE]Optimize RemoteBlockPushResolver with a memory pool - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:36 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36234: [SPARK-38409][CORE] Do not export gauges with null values in prometheus metric snapshots - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:37 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36208: [SPARK-38911][CORE] Fix the potential resource profile id mess up issue - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:38 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36180: [SPARK-38887][SQL] Support switch inner join side for sort merge join - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:39 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36151: WIP: [SPARK-27998] [SQL] Add support for double-quoted named expressions - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:41 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36128: [SPARK-34444][SQL] Pushdown scalar-subquery filter to FileSourceScan - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:42 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36126: [SPARK-38843][SQL] Fix translate metadata col filters - posted by GitBox <gi...@apache.org> on 2022/09/24 00:27:43 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #36789: [SPARK-39403] Add SPARK_SUBMIT_OPTS in spark-env.sh.template - posted by GitBox <gi...@apache.org> on 2022/09/24 05:19:40 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #37984: [SPARK-40322][DOCS][3.3] Fix all dead links in the docs - posted by GitBox <gi...@apache.org> on 2022/09/24 06:37:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37984: [SPARK-40322][DOCS][3.3] Fix all dead links in the docs - posted by GitBox <gi...@apache.org> on 2022/09/24 08:11:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37984: [SPARK-40322][DOCS][3.3] Fix all dead links in the docs - posted by GitBox <gi...@apache.org> on 2022/09/24 08:12:49 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #37985: [BUILD] Upgrade rocksdbjni from 7.5.3 to 7.6.0 - posted by GitBox <gi...@apache.org> on 2022/09/24 13:22:06 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #37976: [SPARK-40544][SQL][TESTS] Restore the file appender log level threshold of the hive UTs to info - posted by GitBox <gi...@apache.org> on 2022/09/24 15:27:00 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37976: [SPARK-40544][SQL][TESTS] Restore the file appender log level threshold of the hive UTs to info - posted by GitBox <gi...@apache.org> on 2022/09/24 15:27:22 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37985: [SPARK-40548][BUILD] Upgrade rocksdbjni from 7.5.3 to 7.6.0 - posted by GitBox <gi...@apache.org> on 2022/09/24 15:50:53 UTC, 0 replies.
- [GitHub] [spark] lvshaokang opened a new pull request, #37986: [SPARK-40357][SQL] Migrate window type check failures onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/24 16:27:42 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37986: [SPARK-40357][SQL] Migrate window type check failures onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/24 19:40:28 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36874: [SPARK-39475][SQL] Pull out complex join keys for shuffled join - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:43 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36789: [SPARK-39403] Add SPARK_SUBMIT_OPTS in spark-env.sh.template - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:45 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36859: DTW: new distance measure for clustering - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:45 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36770: [SPARK-39382][WEBUI] UI show the duration of the failed task when the executor lost - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:46 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36658: [SPARK-39278][CORE] Fix backward compatibility of alternative configs of Hadoop Filesystems to access - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:47 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36700: [SPARK-39318][SQL] Remove tpch-plan-stability WithStats golden files - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:47 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36378: [SPARK-39022][SQL] Fix combination of HAVING and SORT not being resolved correctly - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:48 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36305: [SPARK-38987][shuffle] Handle fallback when merged shuffle blocks are corrupted and spark.shuffle.detectCorrupt is set to true - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:48 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36304: [SPARK-38959][SQL] DS V2: Support runtime group filtering in row-level commands - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36126: [SPARK-38843][SQL] Fix translate metadata col filters - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:50 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36088: [SPARK-38805][SHUFFLE] Automatically remove an expired indexFilePath from the ESS shuffleIndexCache or the PBS indexCache to save memory. - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:51 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36052: [SPARK-38777][YARN] Add `bin/spark-submit --kill / --status` support for yarn - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:52 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36046: [SPARK-38771][SQL] Adaptive Bloom filter Join - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:53 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36030: Draft: [SPARK-38715] Configurable client ID for Kafka Spark SQL producer - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36005: [SPARK-38506][SQL] Push partial aggregation through join - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:55 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35990: [SPARK-38639][SQL] Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:56 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35969: [SPARK-38651][SQL] Add configuration to support writing out empty schemas in supported filebased datasources - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:57 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35927: [WIP] Simplify the rule of auto-generated alias name - posted by GitBox <gi...@apache.org> on 2022/09/25 00:23:59 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #37979: [SPARK-40545][SQL][TESTS] Clean up `metastorePath` after `SparkSQLEnvSuite` execution - posted by GitBox <gi...@apache.org> on 2022/09/25 01:19:53 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37979: [SPARK-40545][SQL][TESTS] Clean up `metastorePath` after `SparkSQLEnvSuite` execution - posted by GitBox <gi...@apache.org> on 2022/09/25 01:20:13 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37986: [SPARK-40357][SQL] Migrate window type check failures onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/25 02:48:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/25 05:56:10 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] opened a new pull request, #37987: Bump protobuf from 4.21.5 to 4.21.6 in /dev - posted by GitBox <gi...@apache.org> on 2022/09/25 05:57:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37987: Bump protobuf from 4.21.5 to 4.21.6 in /dev - posted by GitBox <gi...@apache.org> on 2022/09/25 05:58:24 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] commented on pull request #37987: Bump protobuf from 4.21.5 to 4.21.6 in /dev - posted by GitBox <gi...@apache.org> on 2022/09/25 05:58:25 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov opened a new pull request, #37988: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (FINAL) - posted by GitBox <gi...@apache.org> on 2022/09/25 08:20:03 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37988: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (FINAL) - posted by GitBox <gi...@apache.org> on 2022/09/25 08:21:39 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on pull request #37988: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (FINAL) - posted by GitBox <gi...@apache.org> on 2022/09/25 09:32:57 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #37989: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Explicitly check the element and length - posted by GitBox <gi...@apache.org> on 2022/09/25 11:39:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37989: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Explicitly check the element and length - posted by GitBox <gi...@apache.org> on 2022/09/25 11:40:39 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37989: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Explicitly check the element and length - posted by GitBox <gi...@apache.org> on 2022/09/25 11:41:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37988: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (FINAL) - posted by GitBox <gi...@apache.org> on 2022/09/25 11:41:57 UTC, 0 replies.
- [GitHub] [spark] cchantep commented on pull request #36030: Draft: [SPARK-38715] Configurable client ID for Kafka Spark SQL producer - posted by GitBox <gi...@apache.org> on 2022/09/25 12:20:47 UTC, 0 replies.
- [GitHub] [spark] EvgenyZamyatin commented on pull request #37967: Scalable SkipGram-Word2Vec implementation - posted by GitBox <gi...@apache.org> on 2022/09/25 13:17:12 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37985: [SPARK-40548][BUILD] Upgrade rocksdbjni from 7.5.3 to 7.6.0 - posted by GitBox <gi...@apache.org> on 2022/09/25 13:21:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37985: [SPARK-40548][BUILD] Upgrade rocksdbjni from 7.5.3 to 7.6.0 - posted by GitBox <gi...@apache.org> on 2022/09/25 13:21:17 UTC, 0 replies.
- [GitHub] [spark] lvshaokang commented on pull request #37986: [SPARK-40357][SQL] Migrate window type check failures onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/25 14:22:39 UTC, 1 replies.
- [GitHub] [spark] attilapiros opened a new pull request, #37990: [WIP][SPARK-40458] Bump Kubernetes Client Version to 6.1.1 - posted by GitBox <gi...@apache.org> on 2022/09/25 17:22:33 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #37990: [WIP][SPARK-40458] Bump Kubernetes Client Version to 6.1.1 - posted by GitBox <gi...@apache.org> on 2022/09/25 17:30:38 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #37990: [WIP][SPARK-40458][K8S] Bump Kubernetes Client Version to 6.1.1 - posted by GitBox <gi...@apache.org> on 2022/09/25 17:34:49 UTC, 10 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #37991: [SPARK-40552][] Upgrade `protobuf-python` to 4.21.6 - posted by GitBox <gi...@apache.org> on 2022/09/25 17:40:45 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37988: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (FINAL) - posted by GitBox <gi...@apache.org> on 2022/09/25 17:45:46 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37988: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (FINAL) - posted by GitBox <gi...@apache.org> on 2022/09/25 17:45:50 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37991: [SPARK-40552][BUILD] Upgrade `protobuf-python` to 4.21.6 - posted by GitBox <gi...@apache.org> on 2022/09/25 18:23:44 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #37991: [SPARK-40552][BUILD] Upgrade `protobuf-python` to 4.21.6 - posted by GitBox <gi...@apache.org> on 2022/09/25 20:19:21 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #37988: [SPARK-40142][PYTHON][SQL][FOLLOW-UP] Make pyspark.sql.functions examples self-contained (FINAL) - posted by GitBox <gi...@apache.org> on 2022/09/25 22:10:52 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37989: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Explicitly check the element and length - posted by GitBox <gi...@apache.org> on 2022/09/25 23:05:48 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37989: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Explicitly check the element and length - posted by GitBox <gi...@apache.org> on 2022/09/25 23:05:54 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37923: [SPARK-40334][PS] Implement `GroupBy.prod` - posted by GitBox <gi...@apache.org> on 2022/09/26 00:00:50 UTC, 0 replies.
- [GitHub] [spark] nkronenfeld commented on pull request #36613: [WIP][SPARK-30983] Support typed select in Datasets up to the max tuple size - posted by GitBox <gi...@apache.org> on 2022/09/26 00:15:16 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37978: [SPARK-40330][PS] Implement `Series.searchsorted` - posted by GitBox <gi...@apache.org> on 2022/09/26 00:17:24 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36874: [SPARK-39475][SQL] Pull out complex join keys for shuffled join - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:50 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36829: [SPARK-39438][SQL] Add a threshold to not in line CTE - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:51 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36301: [SPARK-21697][SQL] NPE & ExceptionInInitializerError trying to load UDF from HDFS - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:52 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36265: [SPARK-38951][SQL] Aggregate aliases override field names in ResolveAggregateFunctions - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:53 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36180: [SPARK-38887][SQL] Support switch inner join side for sort merge join - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36208: [SPARK-38911][CORE] Fix the potential resource profile id mess up issue - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36151: WIP: [SPARK-27998] [SQL] Add support for double-quoted named expressions - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:55 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36128: [SPARK-34444][SQL] Pushdown scalar-subquery filter to FileSourceScan - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:56 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36088: [SPARK-38805][SHUFFLE] Automatically remove an expired indexFilePath from the ESS shuffleIndexCache or the PBS indexCache to save memory. - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:57 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35867: [SPARK-38559][SQL][WEBUI]Display the number of empty partitions on spark ui - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:58 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35927: [WIP] Simplify the rule of auto-generated alias name - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:58 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35858: [SPARK-38448] [YARN] [CORE] Sending Available Resources in Yarn Cluster Information to Spark Driver - posted by GitBox <gi...@apache.org> on 2022/09/26 00:25:59 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35845: [SPARK-38520][SQL] ANSI interval overflow when reading CSV - posted by GitBox <gi...@apache.org> on 2022/09/26 00:26:00 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35808: [WIP][SPARK-38512] Rebased traversal order from "pre-order" to "post-order" for `ResolveFunctions` Rule - posted by GitBox <gi...@apache.org> on 2022/09/26 00:26:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35806: [SPARK-38505][SQL] Make partial aggregation adaptive - posted by GitBox <gi...@apache.org> on 2022/09/26 00:26:03 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35799: [SPARK-38498][STREAM] Support customized StreamingListener by configuration - posted by GitBox <gi...@apache.org> on 2022/09/26 00:26:05 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35764: [SPARK-38444][SQL]Automatically calculate the upper and lower bounds of partitions when no specified partition related params - posted by GitBox <gi...@apache.org> on 2022/09/26 00:26:06 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35763: [SPARK-38433][BUILD] change the shell code style with shellcheck - posted by GitBox <gi...@apache.org> on 2022/09/26 00:26:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37991: [SPARK-40552][BUILD] Upgrade `protobuf-python` to 4.21.6 - posted by GitBox <gi...@apache.org> on 2022/09/26 00:52:47 UTC, 0 replies.
- [GitHub] [spark] Kwafoor closed pull request #37951: [SPARK-40506]Spark Streaming metrics name doesn't need application name - posted by GitBox <gi...@apache.org> on 2022/09/26 01:20:21 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37989: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Explicitly check the element and length - posted by GitBox <gi...@apache.org> on 2022/09/26 01:58:44 UTC, 0 replies.
- [GitHub] [spark] weixiuli commented on a diff in pull request #37922: [WIP][SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/09/26 02:04:39 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37285: [POC][PYTHON][SS] Arbitrary stateful processing in Structured Streaming with Python - posted by GitBox <gi...@apache.org> on 2022/09/26 02:39:16 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37285: [POC][PYTHON][SS] Arbitrary stateful processing in Structured Streaming with Python - posted by GitBox <gi...@apache.org> on 2022/09/26 02:39:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37979: [SPARK-40545][SQL][TESTS] Clean up `metastorePath` after `SparkSQLEnvSuite` execution - posted by GitBox <gi...@apache.org> on 2022/09/26 02:44:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37976: [SPARK-40544][SQL][TESTS] Restore the file appender log level threshold of the hive UTs to info - posted by GitBox <gi...@apache.org> on 2022/09/26 02:44:24 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37935: [SPARK-40492][SS] Do maintenance before streaming StateStore unload - posted by GitBox <gi...@apache.org> on 2022/09/26 02:45:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37992: [SPARK-40554][PS] Make `ddof` in `DataFrame.sem` and `Series.sem` accept arbitary integers - posted by GitBox <gi...@apache.org> on 2022/09/26 03:13:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37978: [SPARK-40330][PS] Implement `Series.searchsorted` - posted by GitBox <gi...@apache.org> on 2022/09/26 03:55:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37978: [SPARK-40330][PS] Implement `Series.searchsorted` - posted by GitBox <gi...@apache.org> on 2022/09/26 03:55:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37978: [SPARK-40330][PS] Implement `Series.searchsorted` - posted by GitBox <gi...@apache.org> on 2022/09/26 03:59:02 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #37993: [Cleanup] Update generated proto files for Spark Connect - posted by GitBox <gi...@apache.org> on 2022/09/26 04:02:21 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #37825: [SPARK-40382][SQL] Group distinct aggregate expressions by semantically equivalent children in `RewriteDistinctAggregates` - posted by GitBox <gi...@apache.org> on 2022/09/26 04:10:03 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37993: [CONNECT] [Cleanup] Update generated proto files for Spark Connect - posted by GitBox <gi...@apache.org> on 2022/09/26 04:20:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #36700: [SPARK-39318][SQL] Remove tpch-plan-stability WithStats golden files - posted by GitBox <gi...@apache.org> on 2022/09/26 04:54:39 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #36700: [SPARK-39318][SQL] Remove tpch-plan-stability WithStats golden files - posted by GitBox <gi...@apache.org> on 2022/09/26 04:54:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #36265: [SPARK-38951][SQL] Aggregate aliases override field names in ResolveAggregateFunctions - posted by GitBox <gi...@apache.org> on 2022/09/26 05:03:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37982: [SPARK-38717][SQL][3.3] Handle Hive's bucket spec case preserving behaviour - posted by GitBox <gi...@apache.org> on 2022/09/26 05:07:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37982: [SPARK-38717][SQL][3.3] Handle Hive's bucket spec case preserving behaviour - posted by GitBox <gi...@apache.org> on 2022/09/26 05:10:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #35789: [SPARK-32268][SQL] Row-level Runtime Filtering - posted by GitBox <gi...@apache.org> on 2022/09/26 05:20:52 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #37994: [SPARK-40454] Initial DSL framework for protobuf testing - posted by GitBox <gi...@apache.org> on 2022/09/26 05:53:08 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #37994: [SPARK-40454][Connect]Initial DSL framework for protobuf testing - posted by GitBox <gi...@apache.org> on 2022/09/26 05:54:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37995: [SPARK-40556][PS][SQL] Clean the intermediate cached datasets created in `AttachDistributedSequenceExec` - posted by GitBox <gi...@apache.org> on 2022/09/26 06:01:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37994: [SPARK-40454][CONNECT] Initial DSL framework for protobuf testing - posted by GitBox <gi...@apache.org> on 2022/09/26 06:04:50 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on pull request #37993: [SPARK-40557] [CONNECT] [Cleanup] Update generated proto files for Spark Connect - posted by GitBox <gi...@apache.org> on 2022/09/26 06:19:26 UTC, 0 replies.
- [GitHub] [spark] mskapilks opened a new pull request, #37996: [SPARK-40558][SQL] Add Reusable Exchange in Bloom creation side plan - posted by GitBox <gi...@apache.org> on 2022/09/26 06:37:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #37990: [WIP][SPARK-40458][K8S] Bump Kubernetes Client Version to 6.1.1 - posted by GitBox <gi...@apache.org> on 2022/09/26 07:16:56 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37992: [SPARK-40554][PS] Make `ddof` in `DataFrame.sem` and `Series.sem` accept arbitary integers - posted by GitBox <gi...@apache.org> on 2022/09/26 07:23:01 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37992: [SPARK-40554][PS] Make `ddof` in `DataFrame.sem` and `Series.sem` accept arbitary integers - posted by GitBox <gi...@apache.org> on 2022/09/26 07:23:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37990: [WIP][SPARK-40458][K8S] Bump Kubernetes Client Version to 6.1.1 - posted by GitBox <gi...@apache.org> on 2022/09/26 07:23:47 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #37973: [SPARK-40540][SQL] Migrate compilation errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/26 07:26:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37969: [SPARK-40530][SQL] Add error-related developer APIs - posted by GitBox <gi...@apache.org> on 2022/09/26 07:35:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #35764: [SPARK-38444][SQL]Automatically calculate the upper and lower bounds of partitions when no specified partition related params - posted by GitBox <gi...@apache.org> on 2022/09/26 07:39:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #35969: [SPARK-38651][SQL] Add configuration to support writing out empty schemas in supported filebased datasources - posted by GitBox <gi...@apache.org> on 2022/09/26 07:45:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37941: [SPARK-40501][SQL] Add PushProjectionThroughLimit for Optimizer - posted by GitBox <gi...@apache.org> on 2022/09/26 07:50:04 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #37941: [SPARK-40501][SQL] Add PushProjectionThroughLimit for Optimizer - posted by GitBox <gi...@apache.org> on 2022/09/26 07:50:23 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #37997: [SPARK-40560][SQL] Rename `message` to `messageFormat` in the `STANDARD` format of errors - posted by GitBox <gi...@apache.org> on 2022/09/26 08:21:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37997: [SPARK-40560][SQL] Rename `message` to `messageFormat` in the `STANDARD` format of errors - posted by GitBox <gi...@apache.org> on 2022/09/26 08:21:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #37998: [SPARK-40561][PS] Implement `min_count` in `GroupBy.min` - posted by GitBox <gi...@apache.org> on 2022/09/26 08:26:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37999: [SPARK-39146][CORE] Introduce `JacksonUtils` to use singleton Jackson ObjectMapper - posted by GitBox <gi...@apache.org> on 2022/09/26 08:47:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37999: [SPARK-39146][CORE] Introduce `JacksonUtils` to use singleton Jackson ObjectMapper - posted by GitBox <gi...@apache.org> on 2022/09/26 09:01:32 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38000: [WIP][SPARK-40540][SQL] Migrate compilation errors onto error classes: _LEGACY_ERROR_TEMP_1100-1199 - posted by GitBox <gi...@apache.org> on 2022/09/26 09:19:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #38001: [SPARK-40562][SQL] Add `spark.sql.legacy.groupingIdWithAppendedUserGroupBy` - posted by GitBox <gi...@apache.org> on 2022/09/26 09:28:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38001: [SPARK-40562][SQL] Add `spark.sql.legacy.groupingIdWithAppendedUserGroupBy` - posted by GitBox <gi...@apache.org> on 2022/09/26 09:40:49 UTC, 5 replies.
- [GitHub] [spark] mskapilks commented on pull request #37996: [WIP][SPARK-40558][SQL] Add Reusable Exchange in Bloom creation side plan - posted by GitBox <gi...@apache.org> on 2022/09/26 09:53:17 UTC, 0 replies.
- [GitHub] [spark] mskapilks commented on pull request #37996: [SPARK-40558][SQL] Add Reusable Exchange in Bloom creation side plan - posted by GitBox <gi...@apache.org> on 2022/09/26 09:57:53 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37985: [SPARK-40548][BUILD] Upgrade rocksdbjni from 7.5.3 to 7.6.0 - posted by GitBox <gi...@apache.org> on 2022/09/26 11:16:47 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37997: [SPARK-40560][SQL] Rename `message` to `messageTemplate` in the `STANDARD` format of errors - posted by GitBox <gi...@apache.org> on 2022/09/26 11:31:28 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37997: [SPARK-40560][SQL] Rename `message` to `messageTemplate` in the `STANDARD` format of errors - posted by GitBox <gi...@apache.org> on 2022/09/26 11:33:01 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37996: [SPARK-40558][SQL] Add Reusable Exchange in Bloom creation side plan - posted by GitBox <gi...@apache.org> on 2022/09/26 11:34:19 UTC, 2 replies.
- [GitHub] [spark] panbingkun commented on pull request #37985: [SPARK-40548][BUILD] Upgrade rocksdbjni from 7.5.3 to 7.6.0 - posted by GitBox <gi...@apache.org> on 2022/09/26 11:38:24 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #36700: [SPARK-39318][SQL] Remove tpch-plan-stability WithStats golden files - posted by GitBox <gi...@apache.org> on 2022/09/26 11:45:11 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #37919: [SPARK-40478][DOCS] Add create datasource table options docs - posted by GitBox <gi...@apache.org> on 2022/09/26 11:45:57 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #38002: [WIP][Do not merge] Spark 40465 refactor - posted by GitBox <gi...@apache.org> on 2022/09/26 12:05:54 UTC, 0 replies.
- [GitHub] [spark] lvshaokang commented on a diff in pull request #37986: [SPARK-40357][SQL] Migrate window type check failures onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/26 12:48:08 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38001: [SPARK-40562][SQL] Add `spark.sql.legacy.groupingIdWithAppendedUserGroupBy` - posted by GitBox <gi...@apache.org> on 2022/09/26 13:01:29 UTC, 1 replies.
- [GitHub] [spark] Kimahriman opened a new pull request, #38003: [SPARK-40565][SQL] Don't push non-deterministic filters to V2 file sources - posted by GitBox <gi...@apache.org> on 2022/09/26 13:02:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #37956: [SPARK-40514][CORE][SQL][YARN][PYTHON][TESTS] Make python related tests check python minimum support version - posted by GitBox <gi...@apache.org> on 2022/09/26 13:26:02 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37919: [SPARK-40478][DOCS] Add create datasource table options docs - posted by GitBox <gi...@apache.org> on 2022/09/26 14:31:53 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37919: [SPARK-40478][DOCS] Add create datasource table options docs - posted by GitBox <gi...@apache.org> on 2022/09/26 14:32:37 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37999: [SPARK-39146][CORE][SQL][K8S] Introduce `JacksonUtils` to use singleton Jackson ObjectMapper - posted by GitBox <gi...@apache.org> on 2022/09/26 14:38:47 UTC, 2 replies.
- [GitHub] [spark] srowen commented on pull request #37991: [SPARK-40552][BUILD][INFRA] Upgrade `protobuf-python` to 4.21.6 - posted by GitBox <gi...@apache.org> on 2022/09/26 14:40:04 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37991: [SPARK-40552][BUILD][INFRA] Upgrade `protobuf-python` to 4.21.6 - posted by GitBox <gi...@apache.org> on 2022/09/26 14:40:57 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37999: [SPARK-39146][CORE][SQL][K8S] Introduce `JacksonUtils` to use singleton Jackson ObjectMapper - posted by GitBox <gi...@apache.org> on 2022/09/26 14:50:05 UTC, 5 replies.
- [GitHub] [spark] Ngone51 commented on pull request #37899: [SPARK-40455][CORE]Abort result stage directly when it failed caused by FetchFailedException - posted by GitBox <gi...@apache.org> on 2022/09/26 14:51:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #32987: [SPARK-35564][SQL] Support subexpression elimination for conditionally evaluated expressions - posted by GitBox <gi...@apache.org> on 2022/09/26 14:58:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37887: [SPARK-40360] ALREADY_EXISTS and NOT_FOUND exceptions - posted by GitBox <gi...@apache.org> on 2022/09/26 15:03:34 UTC, 3 replies.
- [GitHub] [spark] Kimahriman commented on pull request #32987: [SPARK-35564][SQL] Support subexpression elimination for conditionally evaluated expressions - posted by GitBox <gi...@apache.org> on 2022/09/26 15:20:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37994: [SPARK-40454][CONNECT] Initial DSL framework for protobuf testing - posted by GitBox <gi...@apache.org> on 2022/09/26 15:36:18 UTC, 13 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37986: [SPARK-40357][SQL] Migrate window type check failures onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/26 16:25:27 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #37986: [SPARK-40357][SQL] Migrate window type check failures onto error classes - posted by GitBox <gi...@apache.org> on 2022/09/26 16:26:32 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #38001: [SPARK-40562][SQL] Add `spark.sql.legacy.groupingIdWithAppendedUserGroupBy` - posted by GitBox <gi...@apache.org> on 2022/09/26 16:54:52 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #37994: [SPARK-40454][CONNECT] Initial DSL framework for protobuf testing - posted by GitBox <gi...@apache.org> on 2022/09/26 17:13:01 UTC, 16 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37996: [SPARK-40558][SQL] Add Reusable Exchange in Bloom creation side plan - posted by GitBox <gi...@apache.org> on 2022/09/26 19:09:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38003: [SPARK-40565][SQL] Don't push non-deterministic filters to V2 file sources - posted by GitBox <gi...@apache.org> on 2022/09/26 19:28:30 UTC, 1 replies.
- [GitHub] [spark] thiyaga commented on pull request #38001: [SPARK-40562][SQL] Add `spark.sql.legacy.groupingIdWithAppendedUserGroupBy` - posted by GitBox <gi...@apache.org> on 2022/09/26 19:29:39 UTC, 0 replies.
- [GitHub] [spark] amaliujia closed pull request #37750: [SPARK-40296] Error class for DISTINCT function not found - posted by GitBox <gi...@apache.org> on 2022/09/26 19:29:50 UTC, 0 replies.
- [GitHub] [spark] sigmod commented on pull request #37996: [SPARK-40558][SQL] Add Reusable Exchange in Bloom creation side plan - posted by GitBox <gi...@apache.org> on 2022/09/26 19:33:48 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #36304: [SPARK-38959][SQL] DS V2: Support runtime group filtering in row-level commands - posted by GitBox <gi...@apache.org> on 2022/09/26 19:50:39 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37994: [SPARK-40454][CONNECT] Initial DSL framework for protobuf testing - posted by GitBox <gi...@apache.org> on 2022/09/26 20:04:42 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #37993: [SPARK-40557] [CONNECT] [Cleanup] Update generated proto files for Spark Connect - posted by GitBox <gi...@apache.org> on 2022/09/26 20:04:45 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #36304: [SPARK-38959][SQL] DS V2: Support runtime group filtering in row-level commands - posted by GitBox <gi...@apache.org> on 2022/09/26 20:50:28 UTC, 1 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #38004: [SPARK-40551][SQL] DataSource V2: Add APIs for delta-based row-level operations - posted by GitBox <gi...@apache.org> on 2022/09/26 21:10:52 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #38004: [SPARK-40551][SQL] DataSource V2: Add APIs for delta-based row-level operations - posted by GitBox <gi...@apache.org> on 2022/09/26 21:12:19 UTC, 12 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #38005: [SPARK-40550][SQL] Handle DELETE commands for delta-based sources - posted by GitBox <gi...@apache.org> on 2022/09/26 21:19:48 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #38004: [SPARK-40551][SQL] DataSource V2: Add APIs for delta-based row-level operations - posted by GitBox <gi...@apache.org> on 2022/09/26 21:21:43 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38006: [SPARK-40536] Make Spark Connect port configurable - posted by GitBox <gi...@apache.org> on 2022/09/26 22:53:50 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38006: [SPARK-40536] Make Spark Connect port configurable - posted by GitBox <gi...@apache.org> on 2022/09/26 22:54:36 UTC, 0 replies.
- [GitHub] [spark] Kimahriman commented on pull request #38003: [SPARK-40565][SQL] Don't push non-deterministic filters to V2 file sources - posted by GitBox <gi...@apache.org> on 2022/09/26 23:14:04 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38006: [SPARK-40536][CONNECT] Make Spark Connect port configurable - posted by GitBox <gi...@apache.org> on 2022/09/26 23:54:16 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38006: [SPARK-40536][CONNECT] Make Spark Connect port configurable - posted by GitBox <gi...@apache.org> on 2022/09/26 23:55:31 UTC, 11 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36889: [SPARK-21195][CORE] Dynamically register metrics from sources as they are reported - posted by GitBox <gi...@apache.org> on 2022/09/27 00:27:52 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36829: [SPARK-39438][SQL] Add a threshold to not in line CTE - posted by GitBox <gi...@apache.org> on 2022/09/27 00:27:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36046: [SPARK-38771][SQL] Adaptive Bloom filter Join - posted by GitBox <gi...@apache.org> on 2022/09/27 00:27:55 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36030: Draft: [SPARK-38715] Configurable client ID for Kafka Spark SQL producer - posted by GitBox <gi...@apache.org> on 2022/09/27 00:27:56 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36005: [SPARK-38506][SQL] Push partial aggregation through join - posted by GitBox <gi...@apache.org> on 2022/09/27 00:27:57 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35858: [SPARK-38448] [YARN] [CORE] Sending Available Resources in Yarn Cluster Information to Spark Driver - posted by GitBox <gi...@apache.org> on 2022/09/27 00:27:58 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35845: [SPARK-38520][SQL] ANSI interval overflow when reading CSV - posted by GitBox <gi...@apache.org> on 2022/09/27 00:27:58 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35808: [WIP][SPARK-38512] Rebased traversal order from "pre-order" to "post-order" for `ResolveFunctions` Rule - posted by GitBox <gi...@apache.org> on 2022/09/27 00:27:59 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35806: [SPARK-38505][SQL] Make partial aggregation adaptive - posted by GitBox <gi...@apache.org> on 2022/09/27 00:28:00 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35799: [SPARK-38498][STREAM] Support customized StreamingListener by configuration - posted by GitBox <gi...@apache.org> on 2022/09/27 00:28:01 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35751: [SPARK-38433][BUILD] Add shell code style check Actions - posted by GitBox <gi...@apache.org> on 2022/09/27 00:28:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35763: [SPARK-38433][BUILD] change the shell code style with shellcheck - posted by GitBox <gi...@apache.org> on 2022/09/27 00:28:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35748: [SPARK-38431][SQL]Support to delete matched rows from jdbc tables - posted by GitBox <gi...@apache.org> on 2022/09/27 00:28:03 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35744: [SPARK-37383][SQL][WEBUI]Show the parsing time for each phase of a SQL on spark ui - posted by GitBox <gi...@apache.org> on 2022/09/27 00:28:05 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35734: [SPARK-32432][SQL] Add support for reading ORC/Parquet files of SymlinkTextInputFormat table And Fix Analyze for SymlinkTextInputFormat table - posted by GitBox <gi...@apache.org> on 2022/09/27 00:28:06 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35638: [SPARK-38296][SQL] Support error class AnalysisExceptions in FunctionRegistry - posted by GitBox <gi...@apache.org> on 2022/09/27 00:28:07 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35608: [SPARK-32838][SQL] Static partition overwrite could use staging dir insert - posted by GitBox <gi...@apache.org> on 2022/09/27 00:28:08 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35594: [SPARK-38270][SQL] Spark SQL CLI's AM should keep same exit code with client side - posted by GitBox <gi...@apache.org> on 2022/09/27 00:28:10 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35569: [SPARK-38250][CORE] Check existence before deleting stagingDir in HadoopMapReduceCommitProtocol - posted by GitBox <gi...@apache.org> on 2022/09/27 00:28:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37679: [SPARK-35242][SQL] Support changing session catalog's default database - posted by GitBox <gi...@apache.org> on 2022/09/27 00:29:03 UTC, 0 replies.
- [GitHub] [spark] Kimahriman commented on a diff in pull request #38003: [SPARK-40565][SQL] Don't push non-deterministic filters to V2 file sources - posted by GitBox <gi...@apache.org> on 2022/09/27 00:58:38 UTC, 1 replies.
- [GitHub] [spark] huleilei opened a new pull request, #38007: [SPARK-40566][SQL]Add showIndex function - posted by GitBox <gi...@apache.org> on 2022/09/27 01:26:49 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #37990: [WIP][SPARK-40458][K8S] Bump Kubernetes Client Version to 6.1.1 - posted by GitBox <gi...@apache.org> on 2022/09/27 01:29:43 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38001: [SPARK-40562][SQL] Add `spark.sql.legacy.groupingIdWithAppendedUserGroupBy` - posted by GitBox <gi...@apache.org> on 2022/09/27 01:37:37 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #37990: [SPARK-40458][K8S] Bump Kubernetes Client Version to 6.1.1 - posted by GitBox <gi...@apache.org> on 2022/09/27 01:45:34 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37998: [SPARK-40561][PS] Implement `min_count` in `GroupBy.min` - posted by GitBox <gi...@apache.org> on 2022/09/27 01:53:25 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37998: [SPARK-40561][PS] Implement `min_count` in `GroupBy.min` - posted by GitBox <gi...@apache.org> on 2022/09/27 01:53:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37993: [SPARK-40557][CONNECT] Update generated proto files for Spark Connect - posted by GitBox <gi...@apache.org> on 2022/09/27 01:55:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37993: [SPARK-40557][CONNECT] Update generated proto files for Spark Connect - posted by GitBox <gi...@apache.org> on 2022/09/27 01:56:00 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #38008: [SPARK-40571][SS][TESTS] Construct a new test case for applyInPandasWithState to verify fault-tolerance semantic with random python worker failures - posted by GitBox <gi...@apache.org> on 2022/09/27 01:59:31 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37993: [SPARK-40557][CONNECT] Update generated proto files for Spark Connect - posted by GitBox <gi...@apache.org> on 2022/09/27 02:00:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37967: Scalable SkipGram-Word2Vec implementation - posted by GitBox <gi...@apache.org> on 2022/09/27 02:12:42 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #38008: [SPARK-40571][SS][TESTS] Construct a new test case for applyInPandasWithState to verify fault-tolerance semantic with random python worker failures - posted by GitBox <gi...@apache.org> on 2022/09/27 02:14:00 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38008: [SPARK-40571][SS][TESTS] Construct a new test case for applyInPandasWithState to verify fault-tolerance semantic with random python worker failures - posted by GitBox <gi...@apache.org> on 2022/09/27 02:14:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38008: [SPARK-40571][SS][TESTS] Construct a new test case for applyInPandasWithState to verify fault-tolerance semantic with random python worker failures - posted by GitBox <gi...@apache.org> on 2022/09/27 02:15:40 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38006: [SPARK-40536][CONNECT] Make Spark Connect port configurable - posted by GitBox <gi...@apache.org> on 2022/09/27 02:21:56 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38009: [SPARK-40573][PS] Make `ddof` in `GroupBy.std`, `GroupBy.var` and `GroupBy.sem` accept arbitrary integers - posted by GitBox <gi...@apache.org> on 2022/09/27 02:56:00 UTC, 0 replies.
- [GitHub] [spark] srowen opened a new pull request, #38010: [MINOR] Clarify that xxhash64 seed is 42 - posted by GitBox <gi...@apache.org> on 2022/09/27 03:13:42 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #38010: [MINOR] Clarify that xxhash64 seed is 42 - posted by GitBox <gi...@apache.org> on 2022/09/27 03:14:41 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #38011: [SPARK-40574][DOCS] Enhance DROP TABLE documentation - posted by GitBox <gi...@apache.org> on 2022/09/27 03:36:34 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37989: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Explicitly check the element and length - posted by GitBox <gi...@apache.org> on 2022/09/27 04:27:43 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #38008: [SPARK-40571][SS][TESTS] Construct a new test case for applyInPandasWithState to verify fault-tolerance semantic with random python worker failures - posted by GitBox <gi...@apache.org> on 2022/09/27 04:27:43 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38008: [SPARK-40571][SS][TESTS] Construct a new test case for applyInPandasWithState to verify fault-tolerance semantic with random python worker failures - posted by GitBox <gi...@apache.org> on 2022/09/27 04:29:03 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38012: [DO-NOT-MERGE][TEST] Pandas 1.5 Test - posted by GitBox <gi...@apache.org> on 2022/09/27 05:23:36 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #35764: [SPARK-38444][SQL]Automatically calculate the upper and lower bounds of partitions when no specified partition related params - posted by GitBox <gi...@apache.org> on 2022/09/27 05:51:59 UTC, 2 replies.
- [GitHub] [spark] caican00 commented on pull request #37876: [SPARK-40175][CORE][SQL][MLLIB][DSTREAM][R] Optimize the performance of `keys.zip(values).toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/27 06:07:01 UTC, 10 replies.
- [GitHub] [spark] chaoqin-li1123 opened a new pull request, #38013: [SPARK-40509][SS][PYTHON] add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/27 06:08:33 UTC, 0 replies.
- [GitHub] [spark] chaoqin-li1123 commented on pull request #38013: [SPARK-40509][SS][PYTHON] add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/27 06:10:14 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37876: [SPARK-40175][CORE][SQL][MLLIB][DSTREAM][R] Optimize the performance of `keys.zip(values).toMap` code pattern - posted by GitBox <gi...@apache.org> on 2022/09/27 06:21:35 UTC, 7 replies.
- [GitHub] [spark] itholic closed pull request #38012: [DO-NOT-MERGE][TEST] Pandas 1.5 Test - posted by GitBox <gi...@apache.org> on 2022/09/27 06:26:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38014: [SPARK-40575][DOCS] Add badges for PySpark downloads - posted by GitBox <gi...@apache.org> on 2022/09/27 06:27:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38009: [SPARK-40573][PS] Make `ddof` in `GroupBy.std`, `GroupBy.var` and `GroupBy.sem` accept arbitrary integers - posted by GitBox <gi...@apache.org> on 2022/09/27 06:40:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38009: [SPARK-40573][PS] Make `ddof` in `GroupBy.std`, `GroupBy.var` and `GroupBy.sem` accept arbitrary integers - posted by GitBox <gi...@apache.org> on 2022/09/27 06:41:15 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38015: [SPARK-40577][PS] Fix CategoricalIndex.append to match pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/27 06:57:19 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38013: [SPARK-40509][SS][PYTHON] add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/27 07:21:46 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38001: [SPARK-40562][SQL] Add `spark.sql.legacy.groupingIdWithAppendedUserGroupBy` - posted by GitBox <gi...@apache.org> on 2022/09/27 07:34:26 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38016: [SPARK-40577][PS] Fix `IndexesTest.test_to_frame` when pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/27 07:35:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38017: [SPARK-40579][PS] `GroupBy.first` should skip NULLs - posted by GitBox <gi...@apache.org> on 2022/09/27 07:35:10 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #38013: [SPARK-40509][SS][PYTHON] add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/27 07:37:51 UTC, 3 replies.
- [GitHub] [spark] Yikf commented on a diff in pull request #38007: [SPARK-40566][SQL] Add showIndex function - posted by GitBox <gi...@apache.org> on 2022/09/27 07:45:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38014: [SPARK-40575][DOCS] Add badges for PySpark downloads - posted by GitBox <gi...@apache.org> on 2022/09/27 08:00:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38016: [SPARK-40578][PS] Fix `IndexesTest.test_to_frame` when pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/27 08:02:23 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38015: [SPARK-40577][PS] Fix `CategoricalIndex.append` to match pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/27 08:13:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38016: [SPARK-40578][PS] Fix `IndexesTest.test_to_frame` when pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/27 08:15:14 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38018: [SPARK-40580] Update the document for `DataFrame.to_orc`. - posted by GitBox <gi...@apache.org> on 2022/09/27 08:17:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38010: [MINOR] Clarify that xxhash64 seed is 42 - posted by GitBox <gi...@apache.org> on 2022/09/27 08:17:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38010: [MINOR] Clarify that xxhash64 seed is 42 - posted by GitBox <gi...@apache.org> on 2022/09/27 08:17:59 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38018: [SPARK-40580][PS][DOCS] Update the document for `DataFrame.to_orc`. - posted by GitBox <gi...@apache.org> on 2022/09/27 08:19:00 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38007: [SPARK-40566][SQL] Add showIndex function - posted by GitBox <gi...@apache.org> on 2022/09/27 08:19:42 UTC, 0 replies.
- [GitHub] [spark] huleilei commented on a diff in pull request #38007: [SPARK-40566][SQL] Add showIndex function - posted by GitBox <gi...@apache.org> on 2022/09/27 08:22:15 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38006: [SPARK-40536][CONNECT] Make Spark Connect port configurable - posted by GitBox <gi...@apache.org> on 2022/09/27 08:25:05 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38008: [SPARK-40571][SS][TESTS] Construct a new test case for applyInPandasWithState to verify fault-tolerance semantic with random python worker failures - posted by GitBox <gi...@apache.org> on 2022/09/27 08:40:44 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38008: [SPARK-40571][SS][TESTS] Construct a new test case for applyInPandasWithState to verify fault-tolerance semantic with random python worker failures - posted by GitBox <gi...@apache.org> on 2022/09/27 08:41:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37770: [SPARK-40314][SQL][PYTHON] Add scala and python bindings for inline and inline_outer - posted by GitBox <gi...@apache.org> on 2022/09/27 08:43:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37770: [SPARK-40314][SQL][PYTHON] Add scala and python bindings for inline and inline_outer - posted by GitBox <gi...@apache.org> on 2022/09/27 08:44:24 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38015: [SPARK-40577][PS] Fix `CategoricalIndex.append` to match pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/27 08:46:03 UTC, 3 replies.
- [GitHub] [spark] huleilei commented on pull request #38007: [SPARK-40566][SQL] Add showIndex function - posted by GitBox <gi...@apache.org> on 2022/09/27 08:48:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37761: [SPARK-40311][SQL][PYTHON] Add withColumnsRenamed to scala and pyspark API - posted by GitBox <gi...@apache.org> on 2022/09/27 08:50:33 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38018: [SPARK-40580][PS][DOCS] Update the document for `DataFrame.to_orc`. - posted by GitBox <gi...@apache.org> on 2022/09/27 08:54:11 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37995: [SPARK-40556][PS][SQL][WIP] Unpersist the intermediate datasets cached in `AttachDistributedSequenceExec` - posted by GitBox <gi...@apache.org> on 2022/09/27 08:56:34 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38016: [SPARK-40578][PS] Fix `IndexesTest.test_to_frame` when pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/27 09:01:34 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37759: [SPARK-40306][SQL]Support more than Integer.MAX_VALUE of the same join key - posted by GitBox <gi...@apache.org> on 2022/09/27 09:08:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37995: [SPARK-40556][PS][SQL] Unpersist the intermediate datasets cached in `AttachDistributedSequenceExec` - posted by GitBox <gi...@apache.org> on 2022/09/27 09:13:20 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38001: [SPARK-40562][SQL] Add `spark.sql.legacy.groupingIdWithAppendedUserGroupBy` - posted by GitBox <gi...@apache.org> on 2022/09/27 09:15:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38018: [SPARK-40580][PS][DOCS] Update the document for `DataFrame.to_orc`. - posted by GitBox <gi...@apache.org> on 2022/09/27 09:19:10 UTC, 4 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38013: [SPARK-40509][SS][PYTHON] add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/27 09:46:52 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38018: [SPARK-40580][PS][DOCS] Update the document for `DataFrame.to_orc`. - posted by GitBox <gi...@apache.org> on 2022/09/27 10:27:10 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37710: [SPARK-40448][CONNECT] Spark Connect build as Driver Plugin with Shaded Dependencies - posted by GitBox <gi...@apache.org> on 2022/09/27 11:10:57 UTC, 3 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #38019: [SPARK-38864][FOLLOWUP] Add tests unpivoting struct expressions - posted by GitBox <gi...@apache.org> on 2022/09/27 12:25:52 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #38020: [SPARK-39877][FOLLOW-UP] PySpark DataFrame.unpivot allows for column names only - posted by GitBox <gi...@apache.org> on 2022/09/27 13:03:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38013: [SPARK-40509][SS][PYTHON] add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/27 13:30:05 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38020: [SPARK-39877][FOLLOW-UP] PySpark DataFrame.unpivot allows for column names only - posted by GitBox <gi...@apache.org> on 2022/09/27 13:35:49 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #37999: [SPARK-39146][CORE][SQL]Introduce `JacksonUtils` to use singleton Jackson ObjectMapper - posted by GitBox <gi...@apache.org> on 2022/09/27 14:58:38 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #35751: [SPARK-38433][BUILD] Add shell code style check Actions - posted by GitBox <gi...@apache.org> on 2022/09/27 15:00:53 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38007: [SPARK-40566][SQL] Add showIndex function - posted by GitBox <gi...@apache.org> on 2022/09/27 15:14:04 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38006: [SPARK-40536][CONNECT] Make Spark Connect port configurable - posted by GitBox <gi...@apache.org> on 2022/09/27 15:14:08 UTC, 0 replies.
- [GitHub] [spark] danitico opened a new pull request, #38021: [SPARK-40583] Fixing artifact Id name for cloud integration - posted by GitBox <gi...@apache.org> on 2022/09/27 15:35:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38006: [SPARK-40536][CONNECT] Make Spark Connect port configurable - posted by GitBox <gi...@apache.org> on 2022/09/27 15:48:02 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38004: [SPARK-40551][SQL] DataSource V2: Add APIs for delta-based row-level operations - posted by GitBox <gi...@apache.org> on 2022/09/27 15:59:18 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37999: [SPARK-39146][CORE][SQL] Introduce local singleton for `ObjectMapper` that may be reused - posted by GitBox <gi...@apache.org> on 2022/09/27 16:05:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38021: [SPARK-40583] Fixing artifactId name for cloud integration - posted by GitBox <gi...@apache.org> on 2022/09/27 16:38:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #37826: [SPARK-40364][CORE] Use the unified `DBProvider#initDB ` method - posted by GitBox <gi...@apache.org> on 2022/09/27 16:50:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #37344: [SPARK-39924][CORE] Extract `constructUnsupportedMergeException` method to deduplicate code in `AccumulatorV2` - posted by GitBox <gi...@apache.org> on 2022/09/27 16:51:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38003: [SPARK-40565][SQL] Don't push non-deterministic filters to V2 file sources - posted by GitBox <gi...@apache.org> on 2022/09/27 17:46:29 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38003: [SPARK-40565][SQL] Don't push non-deterministic filters to V2 file sources - posted by GitBox <gi...@apache.org> on 2022/09/27 17:59:06 UTC, 0 replies.
- [GitHub] [spark] srielau opened a new pull request, #38022: [SPARK-40585] Double quoted identifiers - posted by GitBox <gi...@apache.org> on 2022/09/27 18:22:42 UTC, 0 replies.
- [GitHub] [spark] chaoqin-li1123 commented on a diff in pull request #38013: [SPARK-40509][SS][PYTHON] add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/27 18:54:18 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on pull request #37993: [SPARK-40557][CONNECT] Update generated proto files for Spark Connect - posted by GitBox <gi...@apache.org> on 2022/09/27 18:56:02 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38003: [SPARK-40565][SQL] Don't push non-deterministic filters to V2 file sources - posted by GitBox <gi...@apache.org> on 2022/09/27 19:02:56 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38007: [SPARK-40566][SQL] Add showIndex function - posted by GitBox <gi...@apache.org> on 2022/09/27 19:31:17 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38000: [SPARK-40540][SQL] Migrate compilation errors onto error classes: _LEGACY_ERROR_TEMP_1100-1199 - posted by GitBox <gi...@apache.org> on 2022/09/27 19:41:04 UTC, 2 replies.
- [GitHub] [spark] Kimahriman commented on pull request #37770: [SPARK-40314][SQL][PYTHON] Add scala and python bindings for inline and inline_outer - posted by GitBox <gi...@apache.org> on 2022/09/27 20:25:36 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38004: [SPARK-40551][SQL] DataSource V2: Add APIs for delta-based row-level operations - posted by GitBox <gi...@apache.org> on 2022/09/27 21:52:21 UTC, 2 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38018: [SPARK-40580][PS][DOCS] Update the document for `DataFrame.to_orc`. - posted by GitBox <gi...@apache.org> on 2022/09/27 21:59:47 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #38018: [SPARK-40580][PS][DOCS] Update the document for `DataFrame.to_orc`. - posted by GitBox <gi...@apache.org> on 2022/09/27 22:01:42 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #38015: [SPARK-40577][PS] Fix `CategoricalIndex.append` to match pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/27 22:13:04 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38023: [SPARK-40587][CONNECT] Support SELECT * in an explicit way by connect proto - posted by GitBox <gi...@apache.org> on 2022/09/27 22:13:10 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38023: [SPARK-40587][CONNECT] Support SELECT * in an explicit way in connect proto - posted by GitBox <gi...@apache.org> on 2022/09/27 22:16:20 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on pull request #38023: [SPARK-40587][CONNECT] Support SELECT * in an explicit way in connect proto - posted by GitBox <gi...@apache.org> on 2022/09/27 22:20:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38021: [SPARK-40583][DOCS] Fixing artifactId name in `cloud-integration.md` - posted by GitBox <gi...@apache.org> on 2022/09/27 22:25:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38021: [SPARK-40583][DOCS] Fixing artifactId name in `cloud-integration.md` - posted by GitBox <gi...@apache.org> on 2022/09/27 22:26:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38011: [SPARK-40574][DOCS] Enhance DROP TABLE documentation - posted by GitBox <gi...@apache.org> on 2022/09/27 22:32:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36889: [SPARK-21195][CORE] Dynamically register metrics from sources as they are reported - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:28 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35990: [SPARK-38639][SQL] Support ignoreCorruptRecord flag to ensure querying broken sequence file table smoothly - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:29 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35867: [SPARK-38559][SQL][WEBUI]Display the number of empty partitions on spark ui - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:30 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35744: [SPARK-37383][SQL][WEBUI]Show the parsing time for each phase of a SQL on spark ui - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:31 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35748: [SPARK-38431][SQL]Support to delete matched rows from jdbc tables - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:31 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35734: [SPARK-32432][SQL] Add support for reading ORC/Parquet files of SymlinkTextInputFormat table And Fix Analyze for SymlinkTextInputFormat table - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:32 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35638: [SPARK-38296][SQL] Support error class AnalysisExceptions in FunctionRegistry - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:33 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35608: [SPARK-32838][SQL] Static partition overwrite could use staging dir insert - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:33 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35594: [SPARK-38270][SQL] Spark SQL CLI's AM should keep same exit code with client side - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:34 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35550: [SPARK-38238][SQL]Contains Join for Spark SQL - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:35 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35569: [SPARK-38250][CORE] Check existence before deleting stagingDir in HadoopMapReduceCommitProtocol - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:35 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35549: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions in most cases - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:36 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35548: [SPARK-38234] [SQL] [SS] Added structured streaming monitoring APIs. - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:37 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35371: [WIP][SPARK-37946][SQL] Use error classes in the execution errors related to partitions - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:39 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35337: [SPARK-37840][SQL] Dynamic Update of UDF - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:40 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35319: [SPARK-36571][SQL] Add new SQLPathHadoopMapReduceCommitProtocol resolve conflict when write into partition table's different partition - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:41 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35088: [SPARK-37758][PYTHON][BUILD] Enable PySpark test scheduled job on ARM runner - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:42 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #34903: [SPARK-37650][PYTHON] Tell spark-env.sh the python interpreter - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:44 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #34856: [SPARK-37602][CORE] Add config property to set default Spark listeners - posted by GitBox <gi...@apache.org> on 2022/09/28 00:31:45 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37638: [SPARK-33573][SHUFFLE][YARN] Shuffle server side metrics for Push-based shuffle - posted by GitBox <gi...@apache.org> on 2022/09/28 01:16:29 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37638: [SPARK-33573][SHUFFLE][YARN] Shuffle server side metrics for Push-based shuffle - posted by GitBox <gi...@apache.org> on 2022/09/28 01:16:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38016: [SPARK-40578][PS] Fix `IndexesTest.test_to_frame` when pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/28 01:23:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38017: [SPARK-40579][PS] `GroupBy.first` should skip NULLs - posted by GitBox <gi...@apache.org> on 2022/09/28 01:23:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38016: [SPARK-40578][PS] Fix `IndexesTest.test_to_frame` when pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/28 01:23:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38017: [SPARK-40579][PS] `GroupBy.first` should skip NULLs - posted by GitBox <gi...@apache.org> on 2022/09/28 01:24:19 UTC, 0 replies.
- [GitHub] [spark] Yikun closed pull request #35088: [SPARK-37758][PYTHON][BUILD] Enable PySpark test scheduled job on ARM runner - posted by GitBox <gi...@apache.org> on 2022/09/28 01:24:54 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38017: [SPARK-40579][PS] `GroupBy.first` should skip NULLs - posted by GitBox <gi...@apache.org> on 2022/09/28 01:26:20 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #38024: [SPARK-40591][SQL] Fix data loss caused by ignoreCorruptFiles - posted by GitBox <gi...@apache.org> on 2022/09/28 01:54:07 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #38024: [SPARK-40591][SQL] Fix data loss caused by ignoreCorruptFiles - posted by GitBox <gi...@apache.org> on 2022/09/28 01:59:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38024: [SPARK-40591][SQL] Fix data loss caused by ignoreCorruptFiles - posted by GitBox <gi...@apache.org> on 2022/09/28 02:45:21 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #37855: [SPARK-40407][SQL] Fix the potential data skew caused by df.repartition - posted by GitBox <gi...@apache.org> on 2022/09/28 02:52:58 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu opened a new pull request, #35594: [SPARK-38270][SQL] Spark SQL CLI's AM should keep same exit code with client side - posted by GitBox <gi...@apache.org> on 2022/09/28 02:55:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #35594: [SPARK-38270][SQL] Spark SQL CLI's AM should keep same exit code with client side - posted by GitBox <gi...@apache.org> on 2022/09/28 02:55:57 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #35594: [SPARK-38270][SQL] Spark SQL CLI's AM should keep same exit code with client side - posted by GitBox <gi...@apache.org> on 2022/09/28 03:21:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38014: [SPARK-40575][DOCS] Add badges for PySpark downloads - posted by GitBox <gi...@apache.org> on 2022/09/28 03:22:38 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #37654: [SPARK-40216][SQL] Extract common `ParquetUtils.prepareWrite` method to deduplicate code in `ParquetFileFormat` and `ParquetWrite` - posted by GitBox <gi...@apache.org> on 2022/09/28 03:32:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37654: [SPARK-40216][SQL] Extract common `ParquetUtils.prepareWrite` method to deduplicate code in `ParquetFileFormat` and `ParquetWrite` - posted by GitBox <gi...@apache.org> on 2022/09/28 03:37:48 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37654: [SPARK-40216][SQL] Extract common `ParquetUtils.prepareWrite` method to deduplicate code in `ParquetFileFormat` and `ParquetWrite` - posted by GitBox <gi...@apache.org> on 2022/09/28 03:37:53 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #38024: [SPARK-40591][SQL] Fix data loss caused by ignoreCorruptFiles - posted by GitBox <gi...@apache.org> on 2022/09/28 03:40:16 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #37995: [SPARK-40556][PS][SQL] Unpersist the intermediate datasets cached in `AttachDistributedSequenceExec` - posted by GitBox <gi...@apache.org> on 2022/09/28 03:49:06 UTC, 3 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38025: [MINOR][SQL] Skip warning if JOB_SUMMARY_LEVEL is set to NONE for ParquetWrite - posted by GitBox <gi...@apache.org> on 2022/09/28 03:50:26 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38025: [MINOR][SQL] Skip warning if JOB_SUMMARY_LEVEL is set to NONE for ParquetWrite - posted by GitBox <gi...@apache.org> on 2022/09/28 03:51:27 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38004: [SPARK-40551][SQL] DataSource V2: Add APIs for delta-based row-level operations - posted by GitBox <gi...@apache.org> on 2022/09/28 04:14:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38026: [SPARK-40592][PS] Implement `min_count` in `GroupBy.max` - posted by GitBox <gi...@apache.org> on 2022/09/28 04:40:33 UTC, 0 replies.
- [GitHub] [spark] chaoqin-li1123 commented on a diff in pull request #38013: [SPARK-40509][SS][PYTHON] Add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/28 05:33:23 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38013: [SPARK-40509][SS][PYTHON] Add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/28 05:41:48 UTC, 10 replies.
- [GitHub] [spark] MaxGekk closed pull request #38000: [SPARK-40540][SQL] Migrate compilation errors onto error classes: _LEGACY_ERROR_TEMP_1100-1199 - posted by GitBox <gi...@apache.org> on 2022/09/28 05:53:10 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38014: [SPARK-40575][DOCS] Add badges for PySpark downloads - posted by GitBox <gi...@apache.org> on 2022/09/28 06:00:43 UTC, 0 replies.
- [GitHub] [spark] wayneguow commented on a diff in pull request #38024: [SPARK-40591][SQL] Fix data loss caused by ignoreCorruptFiles - posted by GitBox <gi...@apache.org> on 2022/09/28 06:25:47 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #37654: [SPARK-40216][SQL] Extract common `ParquetUtils.prepareWrite` method to deduplicate code in `ParquetFileFormat` and `ParquetWrite` - posted by GitBox <gi...@apache.org> on 2022/09/28 06:26:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37654: [SPARK-40216][SQL] Extract common `ParquetUtils.prepareWrite` method to deduplicate code in `ParquetFileFormat` and `ParquetWrite` - posted by GitBox <gi...@apache.org> on 2022/09/28 06:44:02 UTC, 2 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38027: [WIP][SPARK-40540][SQL] Migrate compilation errors onto error classes: _LEGACY_ERROR_TEMP_1200-1299 - posted by GitBox <gi...@apache.org> on 2022/09/28 07:17:23 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38004: [SPARK-40551][SQL] DataSource V2: Add APIs for delta-based row-level operations - posted by GitBox <gi...@apache.org> on 2022/09/28 07:22:42 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38013: [SPARK-40509][SS][PYTHON] Add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/28 07:36:56 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38013: [SPARK-40509][SS][PYTHON] Add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/28 07:38:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38028: [SPARK-40435][SQL][TESTS][FOLLOWUP] Correct test precondition of `PythonUDFSuite` and `ContinuousSuite` - posted by GitBox <gi...@apache.org> on 2022/09/28 07:50:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38028: [SPARK-40435][SQL][TESTS][FOLLOWUP] Correct test precondition of `PythonUDFSuite` and `ContinuousSuite` - posted by GitBox <gi...@apache.org> on 2022/09/28 07:51:19 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38023: [SPARK-40587][CONNECT] Support SELECT * in an explicit way in connect proto - posted by GitBox <gi...@apache.org> on 2022/09/28 07:57:15 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #35594: [SPARK-38270][SQL] Spark SQL CLI's AM should keep same exit code with client side - posted by GitBox <gi...@apache.org> on 2022/09/28 08:23:53 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #35799: [SPARK-38498][STREAM] Support customized StreamingListener by configuration - posted by GitBox <gi...@apache.org> on 2022/09/28 08:24:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38026: [SPARK-40592][PS] Implement `min_count` in `GroupBy.max` - posted by GitBox <gi...@apache.org> on 2022/09/28 08:28:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38026: [SPARK-40592][PS] Implement `min_count` in `GroupBy.max` - posted by GitBox <gi...@apache.org> on 2022/09/28 08:28:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38029: [SPARK-40595][SQL] Improve error message for unused CTE relations - posted by GitBox <gi...@apache.org> on 2022/09/28 08:32:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38029: [SPARK-40595][SQL] Improve error message for unused CTE relations - posted by GitBox <gi...@apache.org> on 2022/09/28 08:32:35 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 opened a new pull request, #38030: [SPARK-40596] Populate ExecutorDecommission with messages in ExecutorDecommissionInfo - posted by GitBox <gi...@apache.org> on 2022/09/28 10:49:15 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38031: [SPARK-40589][PS][TEST] Fix test for `DataFrame.corr_with` skip the pandas regression - posted by GitBox <gi...@apache.org> on 2022/09/28 11:01:33 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38031: [SPARK-40589][PS][TEST] Fix test for `DataFrame.corr_with` skip the pandas regression - posted by GitBox <gi...@apache.org> on 2022/09/28 11:03:06 UTC, 3 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #38032: [WIP][SPARK-40597][CORE] local mode should respect TASK_MAX_FAILURES like all other cluster managers - posted by GitBox <gi...@apache.org> on 2022/09/28 12:05:40 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38033: [SPARK-40598][PS] Fix plotting features work properly with pandas 1.5.0. - posted by GitBox <gi...@apache.org> on 2022/09/28 12:36:01 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #38024: [SPARK-40591][CORE][SQL] Fix data loss caused by ignoreCorruptFiles - posted by GitBox <gi...@apache.org> on 2022/09/28 12:44:16 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on pull request #38030: [SPARK-40596][CORE] Populate ExecutorDecommission with messages in ExecutorDecommissionInfo - posted by GitBox <gi...@apache.org> on 2022/09/28 12:44:19 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #38024: [SPARK-40591][CORE][SQL] Fix data loss caused by ignoreCorruptFiles - posted by GitBox <gi...@apache.org> on 2022/09/28 12:50:44 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38032: [WIP][SPARK-40597][CORE] local mode should respect TASK_MAX_FAILURES like all other cluster managers - posted by GitBox <gi...@apache.org> on 2022/09/28 12:53:04 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37779: [SPARK-40320][Core] Executor should exit when initialization failed for fatal error - posted by GitBox <gi...@apache.org> on 2022/09/28 12:54:21 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #38034: [SPARK-40599][SQL] Add multiTransform methods to TreeNode to generate alternatives - posted by GitBox <gi...@apache.org> on 2022/09/28 13:20:52 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #38035: [WIP][SQL] Improve constraint generation - posted by GitBox <gi...@apache.org> on 2022/09/28 13:46:39 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #38036: [SPARK-40601] Assert key size when cogrouping groups - posted by GitBox <gi...@apache.org> on 2022/09/28 13:57:39 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38036: [SPARK-40601] Assert key size when cogrouping groups - posted by GitBox <gi...@apache.org> on 2022/09/28 13:59:21 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38037: [CONNECT][SPARK-40537] Enable mypy for Spark Connect Python Client - posted by GitBox <gi...@apache.org> on 2022/09/28 14:07:03 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #38038: [WIP][SQL] Refactor BroadcastHashJoinExec output partitioning calculation - posted by GitBox <gi...@apache.org> on 2022/09/28 14:31:48 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #38034: [SPARK-40599][SQL] Add multiTransform methods to TreeNode to generate alternatives - posted by GitBox <gi...@apache.org> on 2022/09/28 14:36:38 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38023: [SPARK-40587][CONNECT] Support SELECT * in an explicit way in connect proto - posted by GitBox <gi...@apache.org> on 2022/09/28 15:12:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38039: [SPARK-40603][SQL] Throw the original error from catalog implementations - posted by GitBox <gi...@apache.org> on 2022/09/28 16:08:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38039: [SPARK-40603][SQL] Throw the original error from catalog implementations - posted by GitBox <gi...@apache.org> on 2022/09/28 16:09:09 UTC, 0 replies.
- [GitHub] [spark] LucaCanali commented on pull request #33559: [SPARK-34265][PYTHON][SQL] Instrument Python UDFs using SQL metrics - posted by GitBox <gi...@apache.org> on 2022/09/28 18:00:00 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38029: [SPARK-40595][SQL] Improve error message for unused CTE relations - posted by GitBox <gi...@apache.org> on 2022/09/28 18:19:44 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38029: [SPARK-40595][SQL] Improve error message for unused CTE relations - posted by GitBox <gi...@apache.org> on 2022/09/28 18:21:05 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on a diff in pull request #37885: [SPARK-40428][CORE][WIP] Fix shutdown hook in the CoarseGrainedSchedulerBackend - posted by GitBox <gi...@apache.org> on 2022/09/28 20:24:11 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #37641: [SPARK-40201][SQL][TESTS] Improve v1 write test coverage - posted by GitBox <gi...@apache.org> on 2022/09/28 21:24:52 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38037: [CONNECT][SPARK-40537] Enable mypy for Spark Connect Python Client - posted by GitBox <gi...@apache.org> on 2022/09/28 21:27:42 UTC, 6 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #37654: [SPARK-40216][SQL] Extract common `ParquetUtils.prepareWrite` method to deduplicate code in `ParquetFileFormat` and `ParquetWrite` - posted by GitBox <gi...@apache.org> on 2022/09/28 21:35:38 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38030: [SPARK-40596][CORE] Populate ExecutorDecommission with messages in ExecutorDecommissionInfo - posted by GitBox <gi...@apache.org> on 2022/09/28 21:38:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38030: [SPARK-40596][CORE] Populate ExecutorDecommission with messages in ExecutorDecommissionInfo - posted by GitBox <gi...@apache.org> on 2022/09/28 21:40:33 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38037: [CONNECT][SPARK-40537] Enable mypy for Spark Connect Python Client - posted by GitBox <gi...@apache.org> on 2022/09/28 22:25:57 UTC, 1 replies.
- [GitHub] [spark] rdblue commented on pull request #36304: [SPARK-38959][SQL] DS V2: Support runtime group filtering in row-level commands - posted by GitBox <gi...@apache.org> on 2022/09/28 23:09:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #38025: [MINOR][SQL] Skip warning if JOB_SUMMARY_LEVEL is set to NONE for ParquetWrite - posted by GitBox <gi...@apache.org> on 2022/09/29 00:11:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38028: [SPARK-40435][SQL][SS][TESTS][FOLLOWUP] Correct test precondition of `PythonUDFSuite` and `ContinuousSuite` - posted by GitBox <gi...@apache.org> on 2022/09/29 00:22:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38028: [SPARK-40435][SQL][SS][TESTS][FOLLOWUP] Correct test precondition of `PythonUDFSuite` and `ContinuousSuite` - posted by GitBox <gi...@apache.org> on 2022/09/29 00:23:13 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36872: [SPARK-36252][CORE]: Add log files rolling policy for driver running in cluster mode with spark standalone cluster - posted by GitBox <gi...@apache.org> on 2022/09/29 00:31:53 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35550: [SPARK-38238][SQL]Contains Join for Spark SQL - posted by GitBox <gi...@apache.org> on 2022/09/29 00:31:56 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35549: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions in most cases - posted by GitBox <gi...@apache.org> on 2022/09/29 00:31:57 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35548: [SPARK-38234] [SQL] [SS] Added structured streaming monitoring APIs. - posted by GitBox <gi...@apache.org> on 2022/09/29 00:31:58 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35371: [WIP][SPARK-37946][SQL] Use error classes in the execution errors related to partitions - posted by GitBox <gi...@apache.org> on 2022/09/29 00:31:59 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35337: [SPARK-37840][SQL] Dynamic Update of UDF - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:00 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35319: [SPARK-36571][SQL] Add new SQLPathHadoopMapReduceCommitProtocol resolve conflict when write into partition table's different partition - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:01 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34903: [SPARK-37650][PYTHON] Tell spark-env.sh the python interpreter - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #34829: [SPARK-23607][CORE] Use HDFS extended attributes to store application summary information in SHS - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:03 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34856: [SPARK-37602][CORE] Add config property to set default Spark listeners - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:03 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #34791: [SPARK-37528][SQL][CORE] Schedule Tasks By Input Size - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:05 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #34695: [SPARK-32446][CORE] Add percentile distribution REST API & UI of peak memory metrics for all executors - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:07 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #34672: [SPARK-37394][CORE] Skip registering with external shuffle server if a customized shuffle manager is configured - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:09 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #34500: [WIP][SPARK-33574][CORE] Improve locality for push-based shuffle especially for join-like operations - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:10 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #34367: [SPARK-37099][SQL] Introduce a rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:12 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #34326: [SPARK-37053][CORE] Add metrics to SparkHistoryServer - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #34212: [SPARK-36402][PYTHON] Implement Series.combine - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:16 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #33941: [SPARK-36699][Core] Reuse compatible executors for stage-level scheduling - posted by GitBox <gi...@apache.org> on 2022/09/29 00:32:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38018: [SPARK-40580][PS][DOCS] Update the document for `DataFrame.to_orc`. - posted by GitBox <gi...@apache.org> on 2022/09/29 00:34:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38015: [SPARK-40577][PS] Fix `CategoricalIndex.append` to match pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/29 00:35:13 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38018: [SPARK-40580][PS][DOCS] Update the document for `DataFrame.to_orc`. - posted by GitBox <gi...@apache.org> on 2022/09/29 00:35:25 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38039: [SPARK-40603][SQL] Throw the original error from catalog implementations - posted by GitBox <gi...@apache.org> on 2022/09/29 00:43:07 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 commented on a diff in pull request #38030: [SPARK-40596][CORE] Populate ExecutorDecommission with messages in ExecutorDecommissionInfo - posted by GitBox <gi...@apache.org> on 2022/09/29 00:46:08 UTC, 1 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #37779: [SPARK-40320][Core] Executor should exit when initialization failed for fatal error - posted by GitBox <gi...@apache.org> on 2022/09/29 01:41:39 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #38024: [SPARK-40591][CORE][SQL] Fix data loss caused by ignoreCorruptFiles - posted by GitBox <gi...@apache.org> on 2022/09/29 01:46:04 UTC, 2 replies.
- [GitHub] [spark] Ngone51 commented on pull request #37268: [SPARK-39853][CORE] Support stage level task resource profile for standalone cluster when dynamic allocation disabled - posted by GitBox <gi...@apache.org> on 2022/09/29 01:49:03 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38040: [SPARK-40604][PS] Verify the temporary column names in PS - posted by GitBox <gi...@apache.org> on 2022/09/29 01:58:16 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38037: [CONNECT][SPARK-40537] Enable mypy for Spark Connect Python Client - posted by GitBox <gi...@apache.org> on 2022/09/29 02:05:50 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38036: [SPARK-40601] Assert identical key size when cogrouping groups - posted by GitBox <gi...@apache.org> on 2022/09/29 02:05:53 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #38030: [SPARK-40596][CORE] Populate ExecutorDecommission with messages in ExecutorDecommissionInfo - posted by GitBox <gi...@apache.org> on 2022/09/29 02:30:28 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38041: [SPARK-40605][CONNECT] Change to use `log4j2.properties` to configure test log output - posted by GitBox <gi...@apache.org> on 2022/09/29 02:56:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38041: [SPARK-40605][CONNECT] Change to use `log4j2.properties` to configure test log output - posted by GitBox <gi...@apache.org> on 2022/09/29 02:59:31 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38041: [SPARK-40605][CONNECT] Change to use `log4j2.properties` to configure test log output - posted by GitBox <gi...@apache.org> on 2022/09/29 03:05:34 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38040: [SPARK-40604][PS] Verify the temporary column names - posted by GitBox <gi...@apache.org> on 2022/09/29 05:31:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38040: [SPARK-40604][PS] Verify the temporary column names - posted by GitBox <gi...@apache.org> on 2022/09/29 05:31:40 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by GitBox <gi...@apache.org> on 2022/09/29 05:40:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38042: [SPARK-40606][PS][TEST] Eliminate `to_pandas` warnings in test - posted by GitBox <gi...@apache.org> on 2022/09/29 05:50:16 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on a diff in pull request #35636: [SPARK-31357][SQL][WIP] Catalog API for view metadata - posted by GitBox <gi...@apache.org> on 2022/09/29 06:16:41 UTC, 1 replies.
- [GitHub] [spark] lsyldliu commented on pull request #36046: [SPARK-38771][SQL] Adaptive Bloom filter Join - posted by GitBox <gi...@apache.org> on 2022/09/29 06:26:14 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by GitBox <gi...@apache.org> on 2022/09/29 06:41:56 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #36046: [SPARK-38771][SQL] Adaptive Bloom filter Join - posted by GitBox <gi...@apache.org> on 2022/09/29 07:50:17 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #36588: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2022/09/29 08:13:27 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38027: [WIP][SPARK-40540][SQL] Migrate compilation errors onto error classes: _LEGACY_ERROR_TEMP_1200-1299 - posted by GitBox <gi...@apache.org> on 2022/09/29 08:15:04 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #37936: [SPARK-40495] [SQL] [TESTS] Add additional tests to StreamingSessionWindowSuite - posted by GitBox <gi...@apache.org> on 2022/09/29 08:35:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38041: [SPARK-40605][CONNECT] Change to use `log4j2.properties` to configure test log output - posted by GitBox <gi...@apache.org> on 2022/09/29 08:50:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38041: [SPARK-40605][CONNECT] Change to use `log4j2.properties` to configure test log output - posted by GitBox <gi...@apache.org> on 2022/09/29 08:50:47 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38043: [SPARK-40607][CORE][SQL] Remove redundant string interpolator operations - posted by GitBox <gi...@apache.org> on 2022/09/29 08:55:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38043: [SPARK-40607][CORE][SQL][SS] Remove redundant string interpolator operations - posted by GitBox <gi...@apache.org> on 2022/09/29 08:56:18 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #34829: [SPARK-23607][CORE] Use HDFS extended attributes to store application summary information in SHS - posted by GitBox <gi...@apache.org> on 2022/09/29 09:43:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38044: [SPARK-40556][PS][SQL][WIP] Make `AttachDistributedSequenceExec` a `BinaryExecNode` - posted by GitBox <gi...@apache.org> on 2022/09/29 09:46:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38006: [SPARK-40536][CONNECT] Make Spark Connect port configurable - posted by GitBox <gi...@apache.org> on 2022/09/29 10:46:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38037: [CONNECT][SPARK-40537] Enable mypy for Spark Connect Python Client - posted by GitBox <gi...@apache.org> on 2022/09/29 10:46:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38006: [SPARK-40536][CONNECT] Make Spark Connect port configurable - posted by GitBox <gi...@apache.org> on 2022/09/29 10:46:53 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38027: [SPARK-40540][SQL] Migrate compilation errors onto error classes: _LEGACY_ERROR_TEMP_1200-1299 - posted by GitBox <gi...@apache.org> on 2022/09/29 10:59:31 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37999: [SPARK-39146][CORE][SQL] Introduce local singleton for `ObjectMapper` that may be reused - posted by GitBox <gi...@apache.org> on 2022/09/29 11:08:35 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38045: [SPARK-40540][SQL][TESTS][FOLLOW-UP] Use a regex for List and ArrayBuffer for 'Struct Star Expansion' test - posted by GitBox <gi...@apache.org> on 2022/09/29 11:12:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37844: [SPARK-40511][BUILD][CORE] Upgrade slf4j to 2.0.3 - posted by GitBox <gi...@apache.org> on 2022/09/29 11:38:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38013: [SPARK-40509][SS][PYTHON] Add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/29 12:10:47 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38020: [SPARK-39877][FOLLOW-UP] PySpark DataFrame.unpivot allows for column names only - posted by GitBox <gi...@apache.org> on 2022/09/29 12:24:35 UTC, 0 replies.
- [GitHub] [spark] EnricoMi closed pull request #38020: [SPARK-39877][FOLLOW-UP] PySpark DataFrame.unpivot allows for column names only - posted by GitBox <gi...@apache.org> on 2022/09/29 12:24:36 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37999: [SPARK-39146][CORE][SQL] Introduce local singleton for `ObjectMapper` that may be reused - posted by GitBox <gi...@apache.org> on 2022/09/29 12:37:28 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #37999: [SPARK-39146][CORE][SQL] Introduce local singleton for `ObjectMapper` that may be reused - posted by GitBox <gi...@apache.org> on 2022/09/29 12:37:34 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #38046: [WIP][SPARK-40611][SQL] Improve the performance of `setInterval` & `getInterval` for `UnsafeRow` - posted by GitBox <gi...@apache.org> on 2022/09/29 12:39:18 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #38047: [SPARK-40609][SQL] Casts types according to bucket info for Equality expressions - posted by GitBox <gi...@apache.org> on 2022/09/29 12:39:38 UTC, 0 replies.
- [GitHub] [spark] attilapiros opened a new pull request, #38048: [WIP][SPARK-40612] Fixing the principal used for delegation token renewal on non-YARN resource managers - posted by GitBox <gi...@apache.org> on 2022/09/29 12:58:00 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #38049: [SPARK-40613][BUILD] Upgrade sbt-protoc to 1.0.6 - posted by GitBox <gi...@apache.org> on 2022/09/29 13:02:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38045: [SPARK-40540][SQL][TESTS][FOLLOW-UP] Use a regex for List and ArrayBuffer for 'Struct Star Expansion' test - posted by GitBox <gi...@apache.org> on 2022/09/29 14:04:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38045: [SPARK-40540][SQL][TESTS][FOLLOW-UP] Use a regex for List and ArrayBuffer for 'Struct Star Expansion' test - posted by GitBox <gi...@apache.org> on 2022/09/29 14:05:52 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #37972: [WIP] : Protobuf support for Spark - from_proto AND to_proto - posted by GitBox <gi...@apache.org> on 2022/09/29 16:00:46 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #38050: [SPARK-40615][SQL] Check unsupported data types when decorrelating subqueries - posted by GitBox <gi...@apache.org> on 2022/09/29 16:03:00 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #38041: [SPARK-40605][CONNECT][TESTS] Change to use `log4j2.properties` to configure test log output - posted by GitBox <gi...@apache.org> on 2022/09/29 16:05:20 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #38041: [SPARK-40605][CONNECT][TESTS] Change to use `log4j2.properties` to configure test log output - posted by GitBox <gi...@apache.org> on 2022/09/29 16:05:30 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38037: [CONNECT][SPARK-40537] Enable mypy for Spark Connect Python Client - posted by GitBox <gi...@apache.org> on 2022/09/29 16:09:43 UTC, 0 replies.
- [GitHub] [spark] mposdev21 commented on pull request #37972: [WIP] : Protobuf support for Spark - from_proto AND to_proto - posted by GitBox <gi...@apache.org> on 2022/09/29 16:50:50 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #37972: [WIP] : Protobuf support for Spark - from_proto AND to_proto - posted by GitBox <gi...@apache.org> on 2022/09/29 16:51:58 UTC, 5 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38051: [WIP][SQL] Eliminate error sub-classes - posted by GitBox <gi...@apache.org> on 2022/09/29 17:27:41 UTC, 0 replies.
- [GitHub] [spark] ljfgem commented on a diff in pull request #35636: [SPARK-31357][SQL][WIP] Catalog API for view metadata - posted by GitBox <gi...@apache.org> on 2022/09/29 17:40:33 UTC, 3 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38031: [SPARK-40589][PS][TEST] Fix test for `DataFrame.corr_with` skip the pandas regression - posted by GitBox <gi...@apache.org> on 2022/09/29 17:40:51 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38047: [SPARK-40609][SQL] Casts types according to bucket info for Equality expressions - posted by GitBox <gi...@apache.org> on 2022/09/29 17:43:08 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38050: [SPARK-40615][SQL] Check unsupported data types when decorrelating subqueries - posted by GitBox <gi...@apache.org> on 2022/09/29 17:48:18 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38033: [SPARK-40598][PS] Fix plotting features work properly with pandas 1.5.0. - posted by GitBox <gi...@apache.org> on 2022/09/29 17:55:11 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on pull request #38041: [SPARK-40605][CONNECT][TESTS] Change to use `log4j2.properties` to configure test log output - posted by GitBox <gi...@apache.org> on 2022/09/29 18:29:14 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #38042: [SPARK-40606][PS][TEST] Eliminate `to_pandas` warnings in test - posted by GitBox <gi...@apache.org> on 2022/09/29 18:34:18 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #31871: [SPARK-34779][CORE] ExecutorMetricsPoller should keep stage entry in stageTCMP until a heartbeat occurs - posted by GitBox <gi...@apache.org> on 2022/09/29 18:41:43 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #37990: [SPARK-40458][K8S] Bump Kubernetes Client Version to 6.1.1 - posted by GitBox <gi...@apache.org> on 2022/09/29 19:19:43 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38049: [SPARK-40613][BUILD] Upgrade sbt-protoc to 1.0.6 - posted by GitBox <gi...@apache.org> on 2022/09/29 19:23:32 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38049: [SPARK-40613][BUILD] Upgrade sbt-protoc to 1.0.6 - posted by GitBox <gi...@apache.org> on 2022/09/29 19:25:11 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38023: [SPARK-40587][CONNECT] Support SELECT * in an explicit way in connect proto - posted by GitBox <gi...@apache.org> on 2022/09/29 19:43:29 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38022: [SPARK-40585] Double quoted identifiers - posted by GitBox <gi...@apache.org> on 2022/09/29 19:43:32 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38020: [SPARK-39877][FOLLOW-UP] PySpark DataFrame.unpivot allows for column names only - posted by GitBox <gi...@apache.org> on 2022/09/29 19:43:35 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38019: [SPARK-38864][FOLLOW-UP] Add tests unpivoting struct expressions - posted by GitBox <gi...@apache.org> on 2022/09/29 19:43:38 UTC, 0 replies.
- [GitHub] [spark] mposdev21 commented on a diff in pull request #37972: [WIP] : Protobuf support for Spark - from_proto AND to_proto - posted by GitBox <gi...@apache.org> on 2022/09/29 19:53:06 UTC, 11 replies.
- [GitHub] [spark] attilapiros commented on pull request #38048: [SPARK-40612] Fixing the principal used for delegation token renewal on non-YARN resource managers - posted by GitBox <gi...@apache.org> on 2022/09/29 21:41:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38048: [SPARK-40612] Fixing the principal used for delegation token renewal on non-YARN resource managers - posted by GitBox <gi...@apache.org> on 2022/09/29 21:42:38 UTC, 1 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #38052: [SPARK-40618][SQL] Fix bug in MergeScalarSubqueries rule with nested subqueries - posted by GitBox <gi...@apache.org> on 2022/09/29 23:45:33 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #38052: [SPARK-40618][SQL] Fix bug in MergeScalarSubqueries rule with nested subqueries - posted by GitBox <gi...@apache.org> on 2022/09/29 23:49:59 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #38052: [SPARK-40618][SQL] Fix bug in MergeScalarSubqueries rule with nested subqueries - posted by GitBox <gi...@apache.org> on 2022/09/30 00:16:35 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36872: [SPARK-36252][CORE]: Add log files rolling policy for driver running in cluster mode with spark standalone cluster - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:17 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36831: [SPARK-39126][SQL] After eliminating join to one side, that side should take advantage of LocalShuffleRead optimization - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:18 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34829: [SPARK-23607][CORE] Use HDFS extended attributes to store application summary information in SHS - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:20 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34791: [SPARK-37528][SQL][CORE] Schedule Tasks By Input Size - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:21 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34695: [SPARK-32446][CORE] Add percentile distribution REST API & UI of peak memory metrics for all executors - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:21 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34672: [SPARK-37394][CORE] Skip registering with external shuffle server if a customized shuffle manager is configured - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:22 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34500: [WIP][SPARK-33574][CORE] Improve locality for push-based shuffle especially for join-like operations - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:23 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34367: [SPARK-37099][SQL] Introduce a rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:24 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34326: [SPARK-37053][CORE] Add metrics to SparkHistoryServer - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:24 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34212: [SPARK-36402][PYTHON] Implement Series.combine - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:25 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #33941: [SPARK-36699][Core] Reuse compatible executors for stage-level scheduling - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #33893: [SPARK-36638][SQL] Generalize OptimizeSkewedJoin - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #33828: [SPARK-36579][CORE][SQL] Make spark source stagingDir can be customized - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:27 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #33675: [SPARK-27997][K8S] Add support for kubernetes OAuth Token refresh - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:29 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #32477: [SPARK-35348][SQL] Support the utils for escapse the regex for ANSI SQL: SIMILAR TO … ESCAPE syntax - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:30 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #31774: [SPARK-34659] Fix that Web UI always correctly get appId - posted by GitBox <gi...@apache.org> on 2022/09/30 00:36:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38031: [SPARK-40589][PS][TEST] Fix test for `DataFrame.corr_with` skip the pandas regression - posted by GitBox <gi...@apache.org> on 2022/09/30 00:48:38 UTC, 1 replies.
- [GitHub] [spark] Ngone51 closed pull request #37268: [SPARK-39853][CORE] Support stage level task resource profile for standalone cluster when dynamic allocation disabled - posted by GitBox <gi...@apache.org> on 2022/09/30 01:04:57 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38031: [SPARK-40589][PS][TEST] Fix test for `DataFrame.corr_with` skip the pandas regression - posted by GitBox <gi...@apache.org> on 2022/09/30 01:12:39 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #38033: [SPARK-40598][PS] Fix plotting features work properly with pandas 1.5.0. - posted by GitBox <gi...@apache.org> on 2022/09/30 01:22:57 UTC, 2 replies.
- [GitHub] [spark] xuanyuanking commented on a diff in pull request #38030: [SPARK-40596][CORE] Populate ExecutorDecommission with messages in ExecutorDecommissionInfo - posted by GitBox <gi...@apache.org> on 2022/09/30 01:23:47 UTC, 0 replies.
- [GitHub] [spark] wForget opened a new pull request, #38053: [SPARK-40600] Support recursiveFileLookup for partitioned datasource - posted by GitBox <gi...@apache.org> on 2022/09/30 01:27:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #37770: [SPARK-40314][SQL][PYTHON] Add scala and python bindings for inline and inline_outer - posted by GitBox <gi...@apache.org> on 2022/09/30 01:42:55 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38054: [WIP][BUILD] Update test plugins to latest versions - posted by GitBox <gi...@apache.org> on 2022/09/30 01:45:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38031: [SPARK-40589][PS][TEST] Fix test for `DataFrame.corr_with` skip the pandas regression - posted by GitBox <gi...@apache.org> on 2022/09/30 01:46:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38031: [SPARK-40589][PS][TEST] Fix test for `DataFrame.corr_with` skip the pandas regression - posted by GitBox <gi...@apache.org> on 2022/09/30 01:46:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38015: [SPARK-40577][PS] Fix `CategoricalIndex.append` to match pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/30 01:48:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38042: [SPARK-40606][PS][TEST] Eliminate `to_pandas` warnings in test - posted by GitBox <gi...@apache.org> on 2022/09/30 01:51:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38033: [SPARK-40598][PS] Fix plotting features work properly with pandas 1.5.0. - posted by GitBox <gi...@apache.org> on 2022/09/30 01:59:32 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38042: [SPARK-40606][PS][TEST] Eliminate `to_pandas` warnings in test - posted by GitBox <gi...@apache.org> on 2022/09/30 02:00:45 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38015: [SPARK-40577][PS] Fix `CategoricalIndex.append` to match pandas 1.5.0 - posted by GitBox <gi...@apache.org> on 2022/09/30 02:04:25 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38054: [WIP] Investigate the root cause for SPARK-40165 - posted by GitBox <gi...@apache.org> on 2022/09/30 02:09:19 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38055: [SPARK-40590][TEST] Fix `ps.read_parquet` when `pandas_metadata` is `True` - posted by GitBox <gi...@apache.org> on 2022/09/30 02:17:40 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38055: [SPARK-40590][TEST] Fix `ps.read_parquet` when `pandas_metadata` is `True` - posted by GitBox <gi...@apache.org> on 2022/09/30 02:18:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38041: [SPARK-40605][CONNECT][TESTS] Change to use `log4j2.properties` to configure test log output - posted by GitBox <gi...@apache.org> on 2022/09/30 02:59:58 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38028: [SPARK-40435][SQL][SS][TESTS][FOLLOWUP] Correct test precondition of `PythonUDFSuite` and `ContinuousSuite` - posted by GitBox <gi...@apache.org> on 2022/09/30 03:02:04 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38049: [SPARK-40613][BUILD] Upgrade sbt-protoc to 1.0.6 - posted by GitBox <gi...@apache.org> on 2022/09/30 03:29:58 UTC, 0 replies.
- [GitHub] [spark] attilapiros opened a new pull request, #38056: [WIP][SPARK-40617] Fix race condition at the handling of ExecutorMetricsPoller's stageTCMP entries - posted by GitBox <gi...@apache.org> on 2022/09/30 03:39:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38057: [SPARK-39869][SQL][TESTS] Add explicitly gc for `HivePartitionFilteringSuite` - posted by GitBox <gi...@apache.org> on 2022/09/30 04:02:36 UTC, 0 replies.
- [GitHub] [spark] chaoqin-li1123 commented on pull request #38013: [SPARK-40509][SS][PYTHON] Add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/09/30 04:29:23 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #38046: [SPARK-40611][SQL] Improve the performance of `setInterval` & `getInterval` for `UnsafeRow` - posted by GitBox <gi...@apache.org> on 2022/09/30 04:49:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38042: [SPARK-40606][PS][TEST] Eliminate `to_pandas` warnings in test - posted by GitBox <gi...@apache.org> on 2022/09/30 05:45:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38057: [SPARK-39869][SQL][TESTS] Add explicitly gc for `HivePartitionFilteringSuite` - posted by GitBox <gi...@apache.org> on 2022/09/30 05:49:23 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #38027: [SPARK-40540][SQL] Migrate compilation errors onto error classes: _LEGACY_ERROR_TEMP_1200-1299 - posted by GitBox <gi...@apache.org> on 2022/09/30 06:23:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38043: [SPARK-40607][CORE][DSTREAM][GRAPHX][K8S][MESOS][ML][MLLIB][PYTHON][R][SQL][SS][YARN] Remove redundant string interpolator operations - posted by GitBox <gi...@apache.org> on 2022/09/30 06:48:07 UTC, 1 replies.
- [GitHub] [spark] khalidmammadov opened a new pull request, #38058: [SPARK-40620] [CORE] Simplify make offers - posted by GitBox <gi...@apache.org> on 2022/09/30 07:17:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38057: [SPARK-40619][SQL][TESTS] Add explicitly gc for `HivePartitionFilteringSuite` - posted by GitBox <gi...@apache.org> on 2022/09/30 07:19:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38057: [SPARK-40619][SQL][TESTS] Add explicitly gc for `HivePartitionFilteringSuite` - posted by GitBox <gi...@apache.org> on 2022/09/30 07:20:09 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38057: [SPARK-40619][SQL][TESTS] Add explicitly gc for `HivePartitionFilteringSuite` - posted by GitBox <gi...@apache.org> on 2022/09/30 07:32:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #38043: [SPARK-40607][CORE][DSTREAM][GRAPHX][K8S][MESOS][ML][MLLIB][PYTHON][R][SQL][SS][YARN] Remove redundant string interpolator operations - posted by GitBox <gi...@apache.org> on 2022/09/30 07:36:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38043: [SPARK-40607][CORE][DSTREAM][GRAPHX][K8S][MESOS][ML][MLLIB][PYTHON][R][SQL][SS][YARN] Remove redundant string interpolator operations - posted by GitBox <gi...@apache.org> on 2022/09/30 07:37:01 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38059: [WIP] Try to implement attach_distributed_sequence_column with dataframe operations - posted by GitBox <gi...@apache.org> on 2022/09/30 07:56:52 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #38052: [SPARK-40618][SQL] Fix bug in MergeScalarSubqueries rule with nested subqueries - posted by GitBox <gi...@apache.org> on 2022/09/30 08:25:39 UTC, 2 replies.
- [GitHub] [spark] yabola commented on pull request #37779: [SPARK-40320][Core] Executor should exit when initialization failed for fatal error - posted by GitBox <gi...@apache.org> on 2022/09/30 08:36:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38060: [SPARK-40621][PS] Implement `numeric_only` and `min_count` in `GroupBy.sum` - posted by GitBox <gi...@apache.org> on 2022/09/30 09:08:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38060: [SPARK-40621][PS] Implement `numeric_only` and `min_count` in `GroupBy.sum` - posted by GitBox <gi...@apache.org> on 2022/09/30 09:11:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38049: [SPARK-40613][BUILD] Upgrade sbt-protoc to 1.0.6 - posted by GitBox <gi...@apache.org> on 2022/09/30 09:36:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38045: [SPARK-40540][SQL][TESTS][FOLLOW-UP] Use a regex for List and ArrayBuffer for 'Struct Star Expansion' test - posted by GitBox <gi...@apache.org> on 2022/09/30 09:38:01 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on a diff in pull request #38037: [SPARK-40537][CONNECT] Enable mypy for Spark Connect Python Client - posted by GitBox <gi...@apache.org> on 2022/09/30 09:42:37 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38054: [WIP] Investigate the root cause for SPARK-40165 - posted by GitBox <gi...@apache.org> on 2022/09/30 10:22:21 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38054: [SPARK-40165][BUILD] Update test plugins to latest versions - posted by GitBox <gi...@apache.org> on 2022/09/30 10:26:55 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38060: [SPARK-40621][PS] Implement `numeric_only` and `min_count` in `GroupBy.sum` - posted by GitBox <gi...@apache.org> on 2022/09/30 10:40:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38060: [SPARK-40621][PS] Implement `numeric_only` and `min_count` in `GroupBy.sum` - posted by GitBox <gi...@apache.org> on 2022/09/30 10:40:32 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #38054: [SPARK-40165][BUILD] Update test plugins to latest versions - posted by GitBox <gi...@apache.org> on 2022/09/30 11:00:15 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #38054: [SPARK-40165][BUILD] Update test plugins to latest versions - posted by GitBox <gi...@apache.org> on 2022/09/30 11:00:20 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38054: [SPARK-40165][BUILD] Update test plugins to latest versions - posted by GitBox <gi...@apache.org> on 2022/09/30 11:14:51 UTC, 8 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38054: [SPARK-40165][BUILD] Update test plugins to latest versions - posted by GitBox <gi...@apache.org> on 2022/09/30 11:24:27 UTC, 7 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #37993: [SPARK-40557][CONNECT] Update generated proto files for Spark Connect - posted by GitBox <gi...@apache.org> on 2022/09/30 12:14:06 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #38061: [SPARK-40448][CONNECT][FOLLOWUP] Use more suitable message name. - posted by GitBox <gi...@apache.org> on 2022/09/30 12:24:18 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #38056: [SPARK-40617] Fix race condition at the handling of ExecutorMetricsPoller's stageTCMP entries - posted by GitBox <gi...@apache.org> on 2022/09/30 12:49:14 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38056: [SPARK-40617] Fix race condition at the handling of ExecutorMetricsPoller's stageTCMP entries - posted by GitBox <gi...@apache.org> on 2022/09/30 13:26:31 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #38054: [SPARK-40165][BUILD] Update test plugins to latest versions - posted by GitBox <gi...@apache.org> on 2022/09/30 13:40:43 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on pull request #38061: [SPARK-40448][CONNECT][FOLLOWUP] Use more suitable message name. - posted by GitBox <gi...@apache.org> on 2022/09/30 16:49:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38062: [WIP][SPARK-40540][SQL] Migrate compilation errors onto error classes: _LEGACY_ERROR_TEMP_1300-1319 - posted by GitBox <gi...@apache.org> on 2022/09/30 17:01:23 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38037: [SPARK-40537][CONNECT] Enable mypy for Spark Connect Python Client - posted by GitBox <gi...@apache.org> on 2022/09/30 17:08:18 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38037: [SPARK-40537][CONNECT] Enable mypy for Spark Connect Python Client - posted by GitBox <gi...@apache.org> on 2022/09/30 17:26:08 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #38052: [SPARK-40618][SQL] Fix bug in MergeScalarSubqueries rule with nested subqueries - posted by GitBox <gi...@apache.org> on 2022/09/30 17:59:08 UTC, 3 replies.
- [GitHub] [spark] wypoon commented on pull request #38056: [SPARK-40617] Fix race condition at the handling of ExecutorMetricsPoller's stageTCMP entries - posted by GitBox <gi...@apache.org> on 2022/09/30 18:39:51 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38058: [SPARK-40620] [CORE] Simplify make offers - posted by GitBox <gi...@apache.org> on 2022/09/30 18:54:30 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #37821: [SPARK-40379][K8S] Propagate decommission executor loss reason in K8s - posted by GitBox <gi...@apache.org> on 2022/09/30 19:23:08 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38056: [SPARK-40617] Fix race condition at the handling of ExecutorMetricsPoller's stageTCMP entries - posted by GitBox <gi...@apache.org> on 2022/09/30 19:44:15 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #38052: [SPARK-40618][SQL] Fix bug in MergeScalarSubqueries rule with nested subqueries - posted by GitBox <gi...@apache.org> on 2022/09/30 20:27:58 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37779: [SPARK-40320][Core] Executor should exit when initialization failed for fatal error - posted by GitBox <gi...@apache.org> on 2022/09/30 20:55:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38048: [SPARK-40612][CORE] Fixing the principal used for delegation token renewal on non-YARN resource managers - posted by GitBox <gi...@apache.org> on 2022/09/30 21:52:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38048: [SPARK-40612][CORE] Fixing the principal used for delegation token renewal on non-YARN resource managers - posted by GitBox <gi...@apache.org> on 2022/09/30 21:52:56 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #36304: [SPARK-38959][SQL] DS V2: Support runtime group filtering in row-level commands - posted by GitBox <gi...@apache.org> on 2022/09/30 22:18:13 UTC, 6 replies.
- [GitHub] [spark] attilapiros commented on pull request #37821: [SPARK-40379][K8S] Propagate decommission executor loss reason in K8s - posted by GitBox <gi...@apache.org> on 2022/09/30 22:43:05 UTC, 0 replies.