You are viewing a plain text version of this content. The canonical link for it is here.
- [GitHub] [spark] HyukjinKwon closed pull request #42251: [SPARK-44617] Support comparison between lists of Rows - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 00:00:28 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42235: [SPARK-44599][CONNECT][PYTHON] Python client for reattaching to existing execute in Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/01 00:13:26 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #42174: [SPARK-44503][SQL] Add analysis and planning for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/01 00:19:24 UTC, 3 replies.
- [GitHub] [spark] dtenedor commented on pull request #42174: [SPARK-44503][SQL] Add analysis and planning for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/01 00:21:15 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40824: [SPARK-32064][SQL] Support temporary table - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/01 00:22:37 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40897: [SPARK-43228][SQL] Join keys also match PartitioningCollection in CoalesceBucketsInJoin - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/01 00:22:38 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40790: [SPARK-43116][SQL] Fix Cast.forceNullable - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/01 00:22:39 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40616: [SPARK-42991][SQL] Disable string type +/- interval in ANSI mode - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/01 00:22:40 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40266: [SPARK-42660][SQL] Infer filters for Join produced by IN and EXISTS clause (RewritePredicateSubquery rule) - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/01 00:22:41 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42246: [SPARK-44611][CONNECT] Do not exclude scala-xml - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/01 01:01:26 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42246: [SPARK-44611][CONNECT] Do not exclude scala-xml - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/01 01:02:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42220: [SPARK-44577][SQL] Fix INSERT BY NAME returns nonsensical error message - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/01 01:17:00 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #42199: [SPARK-44579][SQL] Support Interrupt On Cancel in SQLExecution - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/08/01 01:42:02 UTC, 0 replies.
- [GitHub] [spark] ulysses-you closed pull request #42199: [SPARK-44579][SQL] Support Interrupt On Cancel in SQLExecution - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/08/01 01:45:20 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42220: [SPARK-44577][SQL] Fix INSERT BY NAME returns nonsensical error message - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/01 01:58:42 UTC, 1 replies.
- [GitHub] [spark] yaooqinn closed pull request #42205: [SPARK-44583][DOC] `spark.*.io.connectionCreationTimeout` parameter documentation - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/01 01:59:33 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42205: [SPARK-44583][DOC] `spark.*.io.connectionCreationTimeout` parameter documentation - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/01 02:00:25 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #41628: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/08/01 02:02:25 UTC, 3 replies.
- [GitHub] [spark] itholic opened a new pull request, #42252: [SPARK-43611][SPARK-44602][SQL][PS][CONNCECT][3.5] Make `ExtractWindowExpressions` & `WidenSetOperationTypes` retain the `PLAN_ID_TAG` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 02:08:05 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42252: [SPARK-43611][SPARK-44602][SQL][PS][CONNCECT][3.5] Make `ExtractWindowExpressions` & `WidenSetOperationTypes` retain the `PLAN_ID_TAG` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 02:09:24 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42230: [SPARK-44602][SQL][CONNECT][PS] Make `WidenSetOperationTypes` retain the `Plan_ID_TAG` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 02:09:54 UTC, 0 replies.
- [GitHub] [spark] itholic closed pull request #41587: [WIP][SPARK-43654][SPARK-43655][CONNECT] Ensure that Spark Connect assigns correct column name - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 02:10:36 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42136: [SPARK-43100][CORE] Mismatch of field name in log event writer and parser for push shuffle metrics - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/01 02:11:27 UTC, 1 replies.
- [GitHub] [spark] liuzqt commented on a diff in pull request #41782: [SPARK-44239][SQL] Free memory allocated by large vectors when vectors are reset - posted by "liuzqt (via GitHub)" <gi...@apache.org> on 2023/08/01 02:11:41 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #41723: [SPARK-44179][CORE]Fix the number of executors is calculated incorrctly when the task fails and it is speculated that the task is still executing - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/01 02:15:22 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42174: [SPARK-44503][SQL] Add analysis and planning for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/01 02:25:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42210: [SPARK-44586][INFRA][ML][PYTHON] `TorchDistributor` should install cpu-only Torch for testing - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 02:27:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42210: [SPARK-44586][INFRA][ML][PYTHON] `TorchDistributor` should install cpu-only Torch for testing - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 02:28:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42253: [SPARK-44619][INFRA] Free up disk space for container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 02:37:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42253: [SPARK-44619][INFRA] Free up disk space for container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 02:38:04 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42253: [SPARK-44619][INFRA] Free up disk space for pyspark container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 02:44:25 UTC, 4 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42085: [SPARK-44490][WEBUI] Remove `TaskPagedTable` in StagePage - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/01 02:44:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42009: [SPARK-44422][CONNECT] Spark Connect fine grained interrupt - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 02:54:31 UTC, 4 replies.
- [GitHub] [spark] LuciferYang closed pull request #42247: [SPARK-44615][CONNECT][TESTS] rename spark connect client suites to avoid conflict - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 03:02:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42247: [SPARK-44615][CONNECT][TESTS] rename spark connect client suites to avoid conflict - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 03:03:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42228: [SPARK-44421][SPARK-44423][CONNECT] Reattachable execution in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 03:42:06 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42228: [SPARK-44421][SPARK-44423][CONNECT] Reattachable execution in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 03:42:25 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42228: [SPARK-44421][SPARK-44423][CONNECT] Reattachable execution in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 03:42:26 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #42009: [SPARK-44422][CONNECT] Spark Connect fine grained interrupt - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/08/01 03:43:46 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #42066: [SPARK-44480][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/01 03:52:47 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #42233: [SPARK-44340][SQL][FOLLOWUP][3.5] Set partition index correctly for WindowGroupLimitExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/01 04:00:27 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #42244: [SPARK-44591][CONNECT][WEBUI] Use jobTags in SparkListenerSQLExecutionStart to link SQL Execution IDs for Spark UI Connect page - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/01 04:14:59 UTC, 4 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41755: [SPARK-43999][SQL][CORE] Support force finish useless stage when AQE on - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/01 04:29:06 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X closed pull request #41755: [SPARK-43999][SQL][CORE] Support force finish useless stage when AQE on - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/01 04:29:09 UTC, 0 replies.
- [GitHub] [spark] jasonli-db commented on a diff in pull request #42244: [SPARK-44591][CONNECT][WEBUI] Use jobTags in SparkListenerSQLExecutionStart to link SQL Execution IDs for Spark UI Connect page - posted by "jasonli-db (via GitHub)" <gi...@apache.org> on 2023/08/01 04:31:16 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42228: [SPARK-44421][SPARK-44423][CONNECT] Reattachable execution in Spark Connect - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/01 04:31:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42254: [SPARK-44421][SPARK-44423][CONNECT][FOLLOW-UP][3.5] Extends Logging to allow SparkConnectService to use logging - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 04:48:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42254: [SPARK-44421][SPARK-44423][CONNECT][FOLLOW-UP][3.5] Extends Logging to allow SparkConnectService to use logging - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 04:49:22 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42254: [SPARK-44421][SPARK-44423][CONNECT][FOLLOW-UP][3.5] Extends Logging to allow SparkConnectService to use logging - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 04:49:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42203: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 05:00:06 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 05:11:40 UTC, 5 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41466: [SPARK-43646][PROTOBUF][BUILD] Split `protobuf-assembly` module from `protobuf` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 05:25:22 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #41466: [SPARK-43646][PROTOBUF][BUILD] Split `protobuf-assembly` module from `protobuf` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 05:25:23 UTC, 1 replies.
- [GitHub] [spark] anishshri-db commented on a diff in pull request #42066: [SPARK-44480][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/01 05:27:27 UTC, 10 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42232: [SPARK-44604][BUILD] Upgrade Netty to 4.1.96.Final - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 05:28:32 UTC, 0 replies.
- [GitHub] [spark] ericm-db commented on a diff in pull request #42066: [SPARK-44480][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2023/08/01 05:30:37 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42238: Clear some unused codes in "***Errors" and extract some common logic. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 05:31:18 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42180: [SPARK-44562][SQL] Add OptimizeOneRowRelationSubquery in batch of Subquery - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/01 05:42:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42252: [SPARK-43611][SPARK-44602][SQL][PS][CONNCECT][3.5] Make `ExtractWindowExpressions` & `WidenSetOperationTypes` retain the `PLAN_ID_TAG` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 05:44:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42252: [SPARK-43611][SPARK-44602][SQL][PS][CONNCECT][3.5] Make `ExtractWindowExpressions` & `WidenSetOperationTypes` retain the `PLAN_ID_TAG` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 05:44:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42168: [SPARK-44556][SQL] Reuse `OrcTail` when enable vectorizedReader - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 05:52:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42168: [SPARK-44556][SQL] Reuse `OrcTail` when enable vectorizedReader - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 05:55:03 UTC, 0 replies.
- [GitHub] [spark] advancedxy opened a new pull request, #42255: [SPARK-40178][SQL] Support string parameters in hint method - posted by "advancedxy (via GitHub)" <gi...@apache.org> on 2023/08/01 06:09:34 UTC, 0 replies.
- [GitHub] [spark] advancedxy commented on a diff in pull request #42255: [SPARK-40178][SQL] Support string parameters in hint method - posted by "advancedxy (via GitHub)" <gi...@apache.org> on 2023/08/01 06:11:21 UTC, 9 replies.
- [GitHub] [spark] advancedxy commented on pull request #42255: [SPARK-40178][SQL] Support string parameters in hint method - posted by "advancedxy (via GitHub)" <gi...@apache.org> on 2023/08/01 06:11:49 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42245: [SPARK-29497][CONNECT] Throw error when UDF is not deserializable. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 06:32:25 UTC, 0 replies.
- [GitHub] [spark] sarutak commented on pull request #42085: [SPARK-44490][WEBUI] Remove unused `TaskPagedTable` in StagePage - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/08/01 06:36:00 UTC, 0 replies.
- [GitHub] [spark] sarutak closed pull request #42085: [SPARK-44490][WEBUI] Remove unused `TaskPagedTable` in StagePage - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/08/01 06:37:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 06:37:59 UTC, 19 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42255: [SPARK-40178][SQL] Support string parameters in hint method - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 06:38:49 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #41782: [SPARK-44239][SQL] Free memory allocated by large vectors when vectors are reset - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/01 06:53:56 UTC, 7 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42245: [SPARK-29497][CONNECT] Throw error when UDF is not deserializable. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/01 06:58:51 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42255: [SPARK-40178][SQL] Support string parameters in hint method - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 07:09:37 UTC, 3 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42244: [SPARK-44591][CONNECT][WEBUI] Use jobTags in SparkListenerSQLExecutionStart to link SQL Execution IDs for Spark UI Connect page - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/01 07:14:50 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #42244: [SPARK-44591][CONNECT][WEBUI] Use jobTags in SparkListenerSQLExecutionStart to link SQL Execution IDs for Spark UI Connect page - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/01 07:15:18 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42256: [SPARK-41532][CONNECT][FOLLOWUP] Make the scala client using the same error class as python client. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/01 07:15:44 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42221: [SPARK-44595][CONNECT] Make the user session cache number and cache time be configurable in spark connect service - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/01 07:23:04 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41943: [SPARK-44376][BUILD] Fix maven build using scala 2.13 and Java 11 or later - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 07:24:55 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42241: [SPARK-44618][INFRA] Free up disk space for non-container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 07:38:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42241: [SPARK-44618][INFRA] Free up disk space for non-container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 07:38:21 UTC, 0 replies.
- [GitHub] [spark] eejbyfeldt commented on pull request #41943: [SPARK-44376][BUILD] Fix maven build using scala 2.13 and Java 11 or later - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/08/01 07:47:20 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42228: [SPARK-44421][SPARK-44423][CONNECT] Reattachable execution in Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/01 07:57:49 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42257: [SPARK-44614][PYTHON][CONNECT][3.5] Add missing packages in setup.py - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/01 08:04:31 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42248: [SPARK-44614][PYTHON][CONNECT] Add missing packages in setup.py - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/01 08:07:09 UTC, 1 replies.
- [GitHub] [spark] ueshin closed pull request #42248: [SPARK-44614][PYTHON][CONNECT] Add missing packages in setup.py - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/01 08:08:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42258: Test remove graphx/src/main/scala/org/apache/spark/graphx/util/BytecodeUtils.scala - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 08:25:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42030: [SPARK-44452][CONNECT][TESTS] Move `test` function from `RemoteSparkSession` to `ConnectFunSuite` and ignore `ArrowEncoderSuite` for Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 08:33:20 UTC, 0 replies.
- [GitHub] [spark] zhaomin1423 commented on a diff in pull request #42221: [SPARK-44595][CONNECT] Make the user session cache number and cache time be configurable in spark connect service - posted by "zhaomin1423 (via GitHub)" <gi...@apache.org> on 2023/08/01 08:35:36 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/01 09:01:03 UTC, 22 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 09:03:08 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42240: [SPARK-44608][SQL] Remove unused definitions from `DataTypeExpression` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 09:12:35 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42259: [MINOR][CONNECT] Fix some typos in connect server module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 09:21:00 UTC, 0 replies.
- [GitHub] [spark] zhaomin1423 commented on pull request #42221: [SPARK-44595][CONNECT] Make the user session cache number and cache time be configurable in spark connect service - posted by "zhaomin1423 (via GitHub)" <gi...@apache.org> on 2023/08/01 09:32:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42253: [WIP][SPARK-44619][INFRA] Free up disk space for pyspark container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 09:32:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42260: [SPARK-44601][BUILD] Add `jackson-mapper-asl` as test dependency to make `hive-thriftserver` module test pass using Maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 09:38:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42203: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 10:22:47 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42203: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 10:23:14 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42261: [SPARK-44620][SQL][PS][CONNECT] Make `ResolvePivot` retain the `Plan_ID_TAG` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 10:48:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42261: [SPARK-44620][SQL][PS][CONNECT] Make `ResolvePivot` retain the `Plan_ID_TAG` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 10:49:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42262: [SPARK-42497][FOLLOWUPS][TESTS] Add missing UTs to `modules.py` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/01 11:12:04 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #42253: [SPARK-44619][INFRA] Free up disk space for pyspark container jobs - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/08/01 11:12:42 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42260: [SPARK-44601][BUILD] Add `jackson-mapper-asl` as test dependency to `hive-thriftserver` module to make Maven test pass - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 11:16:42 UTC, 3 replies.
- [GitHub] [spark] beliefer commented on pull request #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/01 11:25:19 UTC, 7 replies.
- [GitHub] [spark] jeanlyn commented on a diff in pull request #41628: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions - posted by "jeanlyn (via GitHub)" <gi...@apache.org> on 2023/08/01 11:31:09 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42263: [SPARK-44421][FOLLOWUP] Reenable test in SparkThrowableSuite - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/01 11:33:22 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42264: [SPARK-44613][CONNECT] Add Encoders object - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/01 11:37:04 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42245: [SPARK-29497][CONNECT] Throw error when UDF is not deserializable. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/01 12:00:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42258: Test remove BytecodeUtils.scala and HadoopUtils.scala - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 12:10:38 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #42256: [SPARK-41532][CONNECT][FOLLOWUP] Make the scala client using the same error class as python client. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/01 12:32:20 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42265: [SPARK-41636][SQL] Make sure `selectFilters` returns predicates in deterministic order - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/01 12:32:28 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41156: [SPARK-40129][SQL] Fix Decimal multiply can produce the wrong answer - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/01 13:09:28 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42174: [SPARK-44503][SQL] Add analysis and planning for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/01 13:12:58 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42174: [SPARK-44503][SQL] Add analysis and planning for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/01 13:23:35 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42221: [SPARK-44595][CONNECT] Make the user session cache number and cache time be configurable in spark connect service - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/01 13:35:54 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41156: [SPARK-40129][SQL] Fix Decimal multiply can produce the wrong answer - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/01 13:38:12 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on pull request #42260: [SPARK-44601][BUILD] Add `jackson-mapper-asl` as test dependency to `hive-thriftserver` module to make Maven test pass - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/01 13:48:36 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42263: [SPARK-44421][FOLLOWUP] Reenable test in SparkThrowableSuite - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 13:48:57 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #41943: [SPARK-44376][BUILD] Fix maven build using scala 2.13 and Java 11 or later - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/01 13:52:30 UTC, 2 replies.
- [GitHub] [spark] srowen closed pull request #42232: [SPARK-44604][BUILD] Upgrade Netty to 4.1.96.Final - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/01 13:55:38 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42232: [SPARK-44604][BUILD] Upgrade Netty to 4.1.96.Final - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/01 13:55:57 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42263: [SPARK-44421][FOLLOWUP] Reenable test in SparkThrowableSuite - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/01 13:56:14 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/01 14:10:52 UTC, 3 replies.
- [GitHub] [spark] heyihong opened a new pull request, #42266: [SPARK-44575] Implement basic error translation - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/01 14:14:51 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/01 14:17:07 UTC, 8 replies.
- [GitHub] [spark] itholic opened a new pull request, #42267: [SPARK-43606][PS] Remove `Int64Index` & `Float64Index` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 14:28:33 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41876: [SPARK-44311[CONNECT][SQL] Improved support for UDFs on value classes - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/01 14:49:22 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #41876: [SPARK-44311[CONNECT][SQL] Improved support for UDFs on value classes - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/01 14:50:44 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41156: [SPARK-40129][SQL] Fix Decimal multiply can produce the wrong answer - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/01 14:51:54 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42235: [SPARK-44424][CONNECT][PYTHON] Python client for reattaching to existing execute in Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/01 14:58:00 UTC, 8 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/01 15:10:01 UTC, 39 replies.
- [GitHub] [spark] ueshin commented on pull request #42257: [SPARK-44614][PYTHON][CONNECT][3.5] Add missing packages in setup.py - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/01 15:15:27 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42257: [SPARK-44614][PYTHON][CONNECT][3.5] Add missing packages in setup.py - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/01 15:16:44 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42268: [SPARK-43562][SPARK-43870][PS] Remove APIs from `DataFrame` and `Series` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 15:45:40 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40658: [SPARK-XXXXX][PS] Matching the behavior of pandas API on Spark to pandas 2.0.0 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 15:50:51 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #42240: [SPARK-44608][SQL] Remove unused definitions from `DataTypeExpression` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/08/01 16:02:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42269: [SPARK-44623][BUILD] Upgrade commons-lang3 to 3.13.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/01 16:08:01 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42270: [SPARK-43567][PS] Support `use_na_sentinel` for `factorize` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 16:12:03 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42271: [SPARK-43245][SPARK-43705][PS] Type match for `DatetimeIndex`/`TimedeltaIndex` with pandas 2 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 16:39:49 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42272: [SPARK-44508][PYTHON][DOCS] Add user guide for Python user-defined table functions - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/01 17:04:52 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42273: [SPARK-43568][SPARK-43633][PS] Support `Categorical` APIs for pandas 2 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 17:24:14 UTC, 0 replies.
- [GitHub] [spark] anchovYu commented on a diff in pull request #42177: [SPARK-44059] Add better error messages for SQL named argumnts - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/08/01 17:32:39 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40991: [Spark-42330] Assign name to _LEGACY_ERROR_TEMP_2175: RULE_ID_NOT_FOUND - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/01 17:49:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40991: [Spark-42330] Assign name to _LEGACY_ERROR_TEMP_2175: RULE_ID_NOT_FOUND - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/01 17:51:02 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42250: [SPARK-42941][SS][CONNECT][3.5] Python StreamingQueryListener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/01 17:52:40 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42272: [SPARK-44508][PYTHON][DOCS] Add user guide for Python user-defined table functions - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 17:54:21 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42272: [SPARK-44508][PYTHON][DOCS] Add user guide for Python user-defined table functions - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/01 17:56:12 UTC, 0 replies.
- [GitHub] [spark] learningchess2003 commented on pull request #42177: [SPARK-44059] Add better error messages for SQL named argumnts - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/08/01 17:57:35 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42263: [SPARK-44421][FOLLOWUP] Fix doc test in SparkThrowableSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 17:58:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42263: [SPARK-44421][FOLLOWUP] Fix doc test in SparkThrowableSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 17:58:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42272: [SPARK-44508][PYTHON][DOCS] Add user guide for Python user-defined table functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 17:59:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42269: [SPARK-44623][BUILD] Upgrade `commons-lang3` to 3.13.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/01 18:38:47 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42264: [SPARK-44613][CONNECT] Add Encoders object - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/01 18:39:15 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42264: [SPARK-44613][CONNECT] Add Encoders object - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/01 18:39:48 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42245: [SPARK-29497][CONNECT] Throw error when UDF is not deserializable. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/01 18:53:39 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42245: [SPARK-29497][CONNECT] Throw error when UDF is not deserializable. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/01 18:54:27 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42066: [SPARK-44480][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/01 19:41:21 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42269: [SPARK-44623][BUILD] Upgrade `commons-lang3` to 3.13.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/01 19:46:38 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #42066: [SPARK-44480][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/01 19:52:58 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #42235: [SPARK-44424][CONNECT][PYTHON] Python client for reattaching to existing execute in Spark Connect - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/01 20:08:56 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on pull request #42206: [SPARK-44582][SQL] Skip iterator on SMJ if it was cleaned up - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/08/01 20:20:41 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #42270: [SPARK-43567][PS] Support `use_na_sentinel` for `factorize` - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/01 20:22:02 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42274: [SPARK-44624][CONNECT] Wrap retries around initial server-streaming GRPC call - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/01 20:46:30 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42274: [SPARK-44624][CONNECT] Wrap retries around initial server-streaming GRPC call - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/01 20:46:51 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42263: [SPARK-44421][FOLLOWUP] Fix doc test in SparkThrowableSuite - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 20:54:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42260: [SPARK-44601][BUILD] Add `jackson-mapper-asl` as test dependency to `hive-thriftserver` module to make Maven test pass - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 20:56:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42181: [SPARK-44563][BUILD] Upgrade Arrow to 13.0.0 & Netty to 4.1.95.Final - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/01 21:16:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42181: [SPARK-44247][BUILD] Upgrade Arrow to 13.0.0 & Netty to 4.1.95.Final - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/01 21:19:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42181: [SPARK-44247][BUILD] Upgrade Arrow to 13.0.0 & Netty to 4.1.95.Final - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/01 21:21:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42214: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/01 21:26:53 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42214: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/01 21:33:27 UTC, 0 replies.
- [GitHub] [spark] henrymai commented on pull request #42214: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/08/01 21:46:05 UTC, 1 replies.
- [GitHub] [spark] henrymai opened a new pull request, #42275: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks -- Backport to 3.3 - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/08/01 22:06:00 UTC, 0 replies.
- [GitHub] [spark] anchovYu opened a new pull request, #42276: [WIP] Throw a more tailored error message for having - window - LCA unresolved case - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/08/01 22:06:07 UTC, 0 replies.
- [GitHub] [spark] henrymai closed pull request #42275: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks -- Backport to 3.3 - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/08/01 22:06:53 UTC, 0 replies.
- [GitHub] [spark] henrymai opened a new pull request, #42277: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks -- Backport to 3.3 - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/08/01 22:07:52 UTC, 0 replies.
- [GitHub] [spark] henrymai opened a new pull request, #42278: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks -- Backport to 3.4 - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/08/01 22:10:32 UTC, 0 replies.
- [GitHub] [spark] henrymai closed pull request #42278: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks -- Backport to 3.4 - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/08/01 22:11:38 UTC, 0 replies.
- [GitHub] [spark] henrymai opened a new pull request, #42279: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks -- Backport to 3.4 - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/08/01 22:12:16 UTC, 0 replies.
- [GitHub] [spark] henrymai commented on pull request #42277: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks -- Backport to 3.3 - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/08/01 22:14:32 UTC, 0 replies.
- [GitHub] [spark] henrymai commented on pull request #42279: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks -- Backport to 3.4 - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/08/01 22:15:12 UTC, 0 replies.
- [GitHub] [spark] bogao007 opened a new pull request, #42280: [SPARK-44626][SS][CONNECT] Followup on streaming query termination when client session is timed out for Spark Connect - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/08/01 22:57:10 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 commented on pull request #42243: [SPARK-38475][CORE] Use error class in org.apache.spark.serializer - posted by "bozhang2820 (via GitHub)" <gi...@apache.org> on 2023/08/01 23:15:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42279: [SPARK-44588][CORE][3.4] Fix double encryption issue for migrated shuffle blocks - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/01 23:29:31 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42277: [SPARK-44588][CORE][3.3] Fix double encryption issue for migrated shuffle blocks - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/01 23:30:22 UTC, 1 replies.
- [GitHub] [spark] henrymai commented on pull request #41413: [SPARK-43905][CORE] Consolidate BlockId parsing and creation - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/08/01 23:34:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42196: [SPARK-44218][PYTHON] Customize diff log in assertDataFrameEqual error message format - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 23:40:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42196: [SPARK-44218][PYTHON] Customize diff log in assertDataFrameEqual error message format - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/01 23:41:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42250: [SPARK-42941][SS][CONNECT][3.5] Python StreamingQueryListener - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 00:01:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42250: [SPARK-42941][SS][CONNECT][3.5] Python StreamingQueryListener - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 00:02:36 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42270: [SPARK-43567][PS] Support `use_na_sentinel` for `factorize` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/02 00:18:50 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40912: [SPARK-43238][CORE] Support only decommission idle workers in standalone - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/02 00:19:27 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42281: [SPARK-44421][CONNECT][FOLLOWUP] Minor comment improvements - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/02 00:29:18 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42274: [SPARK-44624][CONNECT] Retry ExecutePlan in case initial request didn't reach server - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/02 00:39:07 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42282: [SPARK-44624][CONNECT] Retry ExecutePlan in case initial request didn't reach server - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/02 00:51:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42262: [SPARK-42497][FOLLOWUPS][TESTS] Add missing UTs to `modules.py` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/02 00:54:28 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42253: [SPARK-44619][INFRA] Free up disk space for pyspark container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/02 00:58:04 UTC, 2 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42283: [SPARK-44433][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with `removeListener` and improvements - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/02 01:11:21 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42283: [SPARK-44433][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with `removeListener` and improvements - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/02 01:11:42 UTC, 2 replies.
- [GitHub] [spark] panbingkun commented on pull request #42238: [SPARK-44628][SQL] Clear some unused codes in "***Errors" and extract some common logic - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/02 01:31:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42262: [SPARK-42497][FOLLOWUPS][TESTS] Add missing UTs to `modules.py` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/02 01:39:23 UTC, 0 replies.
- [GitHub] [spark] liuzqt commented on pull request #41782: [SPARK-44239][SQL] Free memory allocated by large vectors when vectors are reset - posted by "liuzqt (via GitHub)" <gi...@apache.org> on 2023/08/02 02:09:09 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42284: [SPARK-44629] Publish PySpark Test Guidelines webpage - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/08/02 02:09:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42235: [SPARK-44424][CONNECT][PYTHON] Python client for reattaching to existing execute in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 02:16:08 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40690: [SPARK-43043][CORE] Improve the performance of MapOutputTracker.updateMapOutput - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/02 02:30:08 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42285: [SPARK-44630][CORE][3.4] Revert "[SPARK-43043][CORE] Improve the performance of MapOutputTracker.updateMapOutput" - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/02 02:47:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42259: [MINOR][CONNECT] Fix some typos in connect server module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/02 02:52:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42259: [MINOR][CONNECT] Fix some typos in connect server module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/02 02:53:40 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42285: [SPARK-44630][CORE][3.4] Revert "[SPARK-43043][CORE] Improve the performance of MapOutputTracker.updateMapOutput" - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/02 02:54:58 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42253: [SPARK-44619][INFRA] Free up disk space for pyspark container jobs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 02:58:58 UTC, 0 replies.
- [GitHub] [spark] liuzqt opened a new pull request, #42286: rename shouldBroadcast to isDynamicPruning - posted by "liuzqt (via GitHub)" <gi...@apache.org> on 2023/08/02 03:15:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42239: [SPARK-44607][SQL] Remove unused function `containsNestedColumn` from `Filter` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/02 03:18:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42239: [SPARK-44607][SQL] Remove unused function `containsNestedColumn` from `Filter` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/02 03:18:41 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42276: [WIP] Throw a more tailored error message for having - window - LCA unresolved case - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/02 03:40:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42281: [SPARK-44421][CONNECT][FOLLOWUP] Minor comment improvements - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 03:49:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42281: [SPARK-44421][CONNECT][FOLLOWUP] Minor comment improvements - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 03:49:22 UTC, 0 replies.
- [GitHub] [spark] henrymai commented on pull request #42279: [SPARK-44588][CORE][3.4] Fix double encryption issue for migrated shuffle blocks - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/08/02 03:58:12 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42279: [SPARK-44588][CORE][3.4] Fix double encryption issue for migrated shuffle blocks - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/02 04:07:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42277: [SPARK-44588][CORE][3.3] Fix double encryption issue for migrated shuffle blocks - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/02 04:08:14 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42287: [SPARK-44632][Core] DiskBlockManager should check and be able to handle stale directories - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/02 04:23:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42288: [SPARK-44618][INFRA][FOLLOWUPS] Make `free_disk_space` uninstall two unused packages completely - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/02 04:31:46 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #41156: [SPARK-40129][SQL] Fix Decimal multiply can produce the wrong answer - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/02 04:39:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42256: [SPARK-41532][CONNECT][FOLLOWUP] Make the scala client using the same error class as python client. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/02 04:49:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42256: [SPARK-41532][CONNECT][FOLLOWUP] Make the scala client using the same error class as python client. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/02 04:49:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42289: [SPARK-44631][CONNECT][CORE] Remove session-based directory when the isolated session cache is evicted - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 04:50:14 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40690: [SPARK-43043][CORE] Improve the performance of MapOutputTracker.updateMapOutput - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/02 04:50:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42289: [SPARK-44631][CONNECT][CORE] Remove session-based directory when the isolated session cache is evicted - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 04:51:15 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #42287: [SPARK-44632][CORE] DiskBlockManager should check and be able to handle stale directories - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/02 04:54:50 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42290: [SPARK-44559][PYTHON][3.5] Improve error messages for Python UDTF arrow cast - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/02 05:36:07 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #42206: [SPARK-44582][SQL] Skip iterator on SMJ if it was cleaned up - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/02 05:44:50 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42291: Test maven test repl module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/02 05:47:07 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42287: [SPARK-44632][CORE] DiskBlockManager should check and be able to handle stale directories - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/02 05:48:59 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42169: [SPARK-44555][SQL] Use checkError() to check Exception in command Suite & assign some error class names - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/02 05:50:27 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42169: [SPARK-44555][SQL] Use checkError() to check Exception in command Suite & assign some error class names - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/02 05:51:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42285: [SPARK-44630][CORE][3.4] Revert "[SPARK-43043][CORE] Improve the performance of MapOutputTracker.updateMapOutput" - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 06:15:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42285: [SPARK-44630][CORE][3.4] Revert "[SPARK-43043][CORE] Improve the performance of MapOutputTracker.updateMapOutput" - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 06:15:58 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42169: [SPARK-44555][SQL] Use checkError() to check Exception in command Suite & assign some error class names - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/02 06:17:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42292: [SPARK-44572][INFRA] Clean up unused installers ASAP - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/02 06:44:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41843: [SPARK-44280][SQL] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/02 06:47:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41843: [SPARK-44280][SQL] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/02 06:48:09 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42109: [SPARK-44404][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1009,1010,1013,1015,1016,1278] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/02 06:56:17 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42288: [SPARK-44618][INFRA][FOLLOWUPS] Make `free_disk_space` uninstall two unused packages completely - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/02 06:56:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42288: [SPARK-44618][INFRA][FOLLOWUPS] Make `free_disk_space` uninstall two unused packages completely - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/02 06:56:55 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42022: [WIP][SPARK-44355][SQL] Move `WithCTE` into command queries - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/02 06:57:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42289: [SPARK-44631][CONNECT][CORE] Remove session-based directory when the isolated session cache is evicted - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 07:45:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42293: re-gen sql-error-conditions-connect-error-class.md - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/02 07:51:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42293: [SPARK-41532][DOCS][FOLLOWUP] Regenerate `sql-error-conditions-connect-error-class.md` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/02 08:01:10 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #42293: [SPARK-41532][DOCS][FOLLOWUP] Regenerate `sql-error-conditions-connect-error-class.md` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/02 08:09:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42293: [SPARK-41532][DOCS][FOLLOWUP] Regenerate `sql-error-conditions-connect-error-class.md` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/02 08:11:15 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #42180: [SPARK-44562][SQL] Add OptimizeOneRowRelationSubquery in batch of Subquery - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/02 08:52:59 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42180: [SPARK-44562][SQL] Add OptimizeOneRowRelationSubquery in batch of Subquery - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/02 08:53:17 UTC, 0 replies.
- [GitHub] [spark] amoylan2 opened a new pull request, #42294: [MINOR][BUG-FIX] Fix one unit mistake related to spark.eventLog.buffer.kb - posted by "amoylan2 (via GitHub)" <gi...@apache.org> on 2023/08/02 09:06:46 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/02 09:10:18 UTC, 8 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41088: [SPARK-43402][SQL] FileSourceScanExec supports push down data filter with scalar subquery - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/02 09:13:46 UTC, 10 replies.
- [GitHub] [spark] LuciferYang closed pull request #42293: [SPARK-41532][DOCS][FOLLOWUP] Regenerate `sql-error-conditions-connect-error-class.md` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/02 09:37:18 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #41088: [SPARK-43402][SQL] FileSourceScanExec supports push down data filter with scalar subquery - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/08/02 10:18:55 UTC, 6 replies.
- [GitHub] [spark] liangyu-1 opened a new pull request, #42295: [SPARK-44581][YARN]Fix the bug that ShutdownHookManager get wrong hadoop user group information - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/08/02 10:22:28 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 opened a new pull request, #42296: [SPARK-44635][CORE] Handle shuffle fetch failures in decommissions - posted by "bozhang2820 (via GitHub)" <gi...@apache.org> on 2023/08/02 11:01:49 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42181: [SPARK-44247][BUILD] Upgrade Arrow to 13.0.0 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/02 11:39:25 UTC, 2 replies.
- [GitHub] [spark] alibiyeslambek opened a new pull request, #42297: [SPARK-44609][KUBERNETES] Remove executor pod from PodsAllocator if it was removed from scheduler backend - posted by "alibiyeslambek (via GitHub)" <gi...@apache.org> on 2023/08/02 12:15:42 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42299: [SPARK-44637][CONNECT] Synchronize accesses to ExecuteResponseObserver - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/02 16:22:11 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #40474: [SPARK-42849] [SQL] Session Variables - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/02 16:25:21 UTC, 21 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42300: [SPARK-44421][FOLLOWU] Minor rename of ResponseComplete to ResultComplete - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/02 16:28:38 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on a diff in pull request #42280: [SPARK-44626][SS][CONNECT] Followup on streaming query termination when client session is timed out for Spark Connect - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/08/02 17:48:20 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42174: [SPARK-44503][SQL] Add analysis and planning for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/02 17:53:22 UTC, 0 replies.
- [GitHub] [spark] Kimahriman opened a new pull request, #42301: [SPARK-44639][SQL] Add option to use the Java tmp dir for local RocksDB state storage - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/08/02 18:57:19 UTC, 0 replies.
- [GitHub] [spark] Kimahriman commented on pull request #42301: [SPARK-44639][SS] Add option to use the Java tmp dir for local RocksDB state storage - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/08/02 18:59:33 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42302: [SPARK-44640][PYTHON] Improve error messages for Python UDTF returning non Iterable - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/02 19:01:59 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42299: [SPARK-44637][CONNECT] Synchronize accesses to ExecuteResponseObserver - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/02 19:07:22 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42299: [SPARK-44637][CONNECT] Synchronize accesses to ExecuteResponseObserver - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/02 19:07:34 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42300: [SPARK-44421][FOLLOWUP] Minor rename of ResponseComplete to ResultComplete - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/02 19:13:53 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42300: [SPARK-44421][FOLLOWUP] Minor rename of ResponseComplete to ResultComplete - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/02 19:17:52 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42300: [SPARK-44421][FOLLOWUP] Minor rename of ResponseComplete to ResultComplete - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/02 19:18:18 UTC, 0 replies.
- [GitHub] [spark] zhouyejoe commented on pull request #42136: [SPARK-43100][CORE] Mismatch of field name in log event writer and parser for push shuffle metrics - posted by "zhouyejoe (via GitHub)" <gi...@apache.org> on 2023/08/02 19:34:07 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42303: [SPARK-44643][SQL][PYTHON] Fix Row.__repr__ for the case the field is empty Row - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/02 20:15:59 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #42266: [SPARK-44575][SQL][CONNECT] Implement basic error translation - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/08/02 20:17:16 UTC, 5 replies.
- [GitHub] [spark] ueshin commented on pull request #42303: [SPARK-44643][SQL][PYTHON] Fix Row.__repr__ for the case the field is empty Row - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/02 20:21:45 UTC, 0 replies.
- [GitHub] [spark] ion-elgreco commented on pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by "ion-elgreco (via GitHub)" <gi...@apache.org> on 2023/08/02 20:34:15 UTC, 5 replies.
- [GitHub] [spark] heyihong commented on a diff in pull request #42266: [SPARK-44575][SQL][CONNECT] Implement basic error translation - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/02 20:37:33 UTC, 9 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42302: [SPARK-44640][PYTHON] Improve error messages for Python UDTF returning non Iterable - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/02 21:15:09 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42302: [SPARK-44640][PYTHON] Improve error messages for Python UDTF returning non Iterable - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/02 21:16:05 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #39952: [SPARK-40770][PYTHON][FOLLOW-UP] Improved error messages for mapInPandas for schema mismatch - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/02 21:21:16 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #42272: [SPARK-44508][PYTHON][DOCS] Add user guide for Python user-defined table functions - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/02 21:24:31 UTC, 8 replies.
- [GitHub] [spark] ueshin commented on pull request #42290: [SPARK-44559][PYTHON][3.5] Improve error messages for Python UDTF arrow cast - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/02 21:27:21 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42290: [SPARK-44559][PYTHON][3.5] Improve error messages for Python UDTF arrow cast - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/02 21:29:21 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42298: [SPARK-44636][CONNECT] Leave no dangling iterators - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/02 21:41:29 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42298: [SPARK-44636][CONNECT] Leave no dangling iterators - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/02 21:41:56 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42304: [SPARK-44642] ReleaseExecute in ExecutePlanResponseReattachableIterator after it gets error from server - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/02 21:47:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42280: [SPARK-44626][SS][CONNECT] Followup on streaming query termination when client session is timed out for Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 21:53:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42280: [SPARK-44626][SS][CONNECT] Followup on streaming query termination when client session is timed out for Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 21:54:03 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42305: [SPARK-44645] Update assertDataFrameEqual docs error example output - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/08/02 22:22:58 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski closed pull request #42274: [SPARK-44624][CONNECT] Retry ExecutePlan in case initial request didn't reach server overkill - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/02 22:26:22 UTC, 0 replies.
- [GitHub] [spark] szehon-ho opened a new pull request, #42306: [SQL][SPARK-44647] Support SPJ where join keys are less than cluster keys - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/08/02 22:43:18 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #42177: [SPARK-44059][SQL] Add better error messages for SQL named argumnts - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/02 22:49:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42235: [SPARK-44424][CONNECT][PYTHON] Python client for reattaching to existing execute in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 23:10:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42235: [SPARK-44424][CONNECT][PYTHON] Python client for reattaching to existing execute in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 23:11:13 UTC, 0 replies.
- [GitHub] [spark] pegasas opened a new pull request, #42307: [SPARK-42730][CONNECT][DOCS] Update Spark Standalone Mode - Starting … - posted by "pegasas (via GitHub)" <gi...@apache.org> on 2023/08/02 23:15:18 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 opened a new pull request, #42308: [Spark Ticket][WIP]Added a warning to pop up in the case the user doesn't use gpus - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/08/02 23:18:49 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42309: [SPARK-44644][PYTHON] Improve error messages for creating Python UDTFs with pickling errors - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/02 23:21:06 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42309: [SPARK-44644][PYTHON] Improve error messages for creating Python UDTFs with pickling errors - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/02 23:21:44 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42310: [SPARK-44561][PYTHON] Fix AssertionError when converting UDTF output to a complex type - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/02 23:41:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42311: [SPARK-44424][PYTHON][CONNECT][FOLLOW-UP] Import Connect related libraries after checking dependencies - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 23:41:43 UTC, 0 replies.
- [GitHub] [spark] agubichev commented on a diff in pull request #41301: [SPARK-43780][SQL] Support correlated references in join predicates - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2023/08/02 23:48:30 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42304: [SPARK-44642] ReleaseExecute in ExecutePlanResponseReattachableIterator after it gets error from server - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 23:50:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42311: [SPARK-44424][PYTHON][CONNECT][FOLLOW-UP] Import Connect related libraries after checking dependencies - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/02 23:56:37 UTC, 0 replies.
- [GitHub] [spark-connect-go] zhengruifeng closed pull request #13: [SPARK-44368] Support Repartition and RepartitionByRange in Spark Connect Go Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/03 00:13:10 UTC, 0 replies.
- [GitHub] [spark-connect-go] zhengruifeng commented on pull request #13: [SPARK-44368] Support Repartition and RepartitionByRange in Spark Connect Go Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/03 00:13:39 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40930: [DO NOT MERGE] File constant metadata extractors split - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/03 00:20:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40912: [SPARK-43238][CORE] Support only decommission idle workers in standalone - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/03 00:20:17 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38171: [SPARK-9213] [SQL] Improve regular expression performance (via joni) - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/03 00:20:17 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42310: [SPARK-44561][PYTHON] Fix AssertionError when converting UDTF output to a complex type - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/03 00:45:48 UTC, 2 replies.
- [GitHub] [spark] sandip-db commented on pull request #41832: [SPARK-44265][SQL] Built-in XML data source support - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2023/08/03 00:48:55 UTC, 2 replies.
- [GitHub] [spark] itholic opened a new pull request, #42312: [SPARK-43476][SPARK-43477][SPARK-43478][PS] Support `StringMethods` for pandas 2.0.0 and above - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/03 01:46:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42286: [MINOR][SQL] Rename shouldBroadcast to isDynamicPruning in InSubqueryExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/03 01:54:53 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #42296: [SPARK-44635][CORE] Handle shuffle fetch failures in decommissions - posted by "Ngone51 (via GitHub)" <gi...@apache.org> on 2023/08/03 02:56:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42261: [SPARK-44620][SQL][PS][CONNECT] Make `ResolvePivot` retain the `Plan_ID_TAG` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 02:57:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42261: [SPARK-44620][SQL][PS][CONNECT] Make `ResolvePivot` retain the `Plan_ID_TAG` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 02:57:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42309: [SPARK-44644][PYTHON] Improve error messages for Python UDTFs with pickling errors - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:02:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42303: [SPARK-44643][SQL][PYTHON] Fix Row.__repr__ for the case the field is empty Row - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:03:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42303: [SPARK-44643][SQL][PYTHON] Fix Row.__repr__ for the case the field is empty Row - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:05:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42177: [SPARK-44059][SQL] Add better error messages for SQL named argumnts - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:06:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42177: [SPARK-44059][SQL] Add better error messages for SQL named argumnts - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:07:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42083: [SPARK-44488][SQL] Support deserializing long types when creating `Metadata` object from JObject - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:08:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42083: [SPARK-44488][SQL] Support deserializing long types when creating `Metadata` object from JObject - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:08:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39952: [SPARK-40770][PYTHON][FOLLOW-UP] Improved error messages for mapInPandas for schema mismatch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:09:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42307: [SPARK-42730][CONNECT][DOCS] Update Spark Standalone Mode page - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:15:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42307: [SPARK-42730][CONNECT][DOCS] Update Spark Standalone Mode page - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:15:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42305: [SPARK-44645][PYTHON][DOCS] Update assertDataFrameEqual docs error example output - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:18:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42305: [SPARK-44645][PYTHON][DOCS] Update assertDataFrameEqual docs error example output - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:18:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42294: [MINOR][BUG-FIX] Fix one unit mistake related to spark.eventLog.buffer.kb - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:21:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41832: [SPARK-44265][SQL] Built-in XML data source support - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 03:34:50 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #41832: [SPARK-44265][SQL] Built-in XML data source support - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/03 03:37:20 UTC, 0 replies.
- [GitHub] [spark] liangyu-1 commented on pull request #42058: [SPARK-42972][DSTREAM]ExecutorAllocationManager cannot allocate new instances when all executors down - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/08/03 03:40:26 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on pull request #41832: [SPARK-44265][SQL] Built-in XML data source support - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/03 03:45:37 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #41606: [SPARK-44061][PYTHON] Add assertDataFrameEqual util function - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/03 04:01:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42292: [SPARK-44572][INFRA] Clean up unused installers ASAP - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/03 04:05:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42292: [SPARK-44572][INFRA] Clean up unused installers ASAP - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/03 04:06:13 UTC, 0 replies.
- [GitHub] [spark] sandip-db commented on a diff in pull request #41832: [SPARK-44265][SQL] Built-in XML data source support - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2023/08/03 04:10:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #35214: [SPARK-37915][SQL] Combine unions if there is a project between them - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/03 04:12:17 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42296: [SPARK-44635][CORE] Handle shuffle fetch failures in decommissions - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/03 04:38:01 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #42313: [SPARK-44650][CORE] `spark.executor.defaultJavaOptions` Check illegal java options - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/08/03 04:48:43 UTC, 0 replies.
- [GitHub] [spark] anjakefala commented on a diff in pull request #41711: [SPARK-44155] Adding a dev utility to improve error messages based on LLM - posted by "anjakefala (via GitHub)" <gi...@apache.org> on 2023/08/03 05:35:43 UTC, 2 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42314: [SPARK-44652] Raise error when only one df is None - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/08/03 05:47:58 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40991: [SPARK-42330][SQL] Assign the name `RULE_ID_NOT_FOUND` to the error class `_LEGACY_ERROR_TEMP_2175` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/03 05:56:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40991: [SPARK-42330][SQL] Assign the name `RULE_ID_NOT_FOUND` to the error class `_LEGACY_ERROR_TEMP_2175` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/03 05:57:39 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42255: [SPARK-40178][SQL] Support string parameters in hint method - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/03 06:13:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42315: [SPARK-44653][SQL] Non-trivial DataFrame unions should not break caching - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/03 06:17:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42315: [SPARK-44653][SQL] Non-trivial DataFrame unions should not break caching - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/03 06:19:38 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42112: [SPARK-44493][SQL] Support for translating catalyst expressions into partial datasource filters - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/03 06:39:13 UTC, 1 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39952: [SPARK-40770][PYTHON][FOLLOW-UP] Improved error messages for mapInPandas for schema mismatch - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/08/03 07:18:33 UTC, 1 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #42316: [SPARK-40770][PYTHON][FOLLOW-UP][3.5] Improved error messages for mapInPandas for schema mismatch - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/08/03 07:27:16 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #42316: [SPARK-40770][PYTHON][FOLLOW-UP][3.5] Improved error messages for mapInPandas for schema mismatch - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/08/03 07:28:41 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42315: [SPARK-44653][SQL] Non-trivial DataFrame unions should not break caching - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/03 07:47:52 UTC, 2 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42317: [SPARK-44649][SQL] Runtime Filter supports passing equivalent creation side expressions - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/03 08:48:58 UTC, 0 replies.
- [GitHub] [spark] liangyu-1 commented on pull request #42295: [SPARK-44581][YARN]Fix the bug that ShutdownHookManager get wrong hadoop user group information - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/08/03 09:15:47 UTC, 2 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42318: [SPARK-44655][SQL] Make the code cleaner about static and dynamic data/partition filters - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/03 09:25:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42253: [SPARK-44619][INFRA] Free up disk space for container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/03 09:33:45 UTC, 2 replies.
- [GitHub] [spark] ever4Kenny commented on a diff in pull request #34492: [SPARK-37216][SQL] Add the Hive macro functionality to SparkSQL - posted by "ever4Kenny (via GitHub)" <gi...@apache.org> on 2023/08/03 10:54:38 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #40370: [SPARK-42620][PS] Add `inclusive` parameter for (DataFrame|Series).between_time - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/03 10:57:42 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #42319: [SPARK-43873][PS] Enabling `FrameDescribeTests` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/03 11:02:00 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42319: [SPARK-43873][PS] Enabling `FrameDescribeTests` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/03 11:02:49 UTC, 0 replies.
- [GitHub] [spark] cdkrot opened a new pull request, #42320: [SPARK-44656] Close Iterators in SparkResult as well. - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/08/03 11:27:20 UTC, 0 replies.
- [GitHub] [spark] cdkrot commented on pull request #42320: [SPARK-44656] Close Iterators in SparkResult as well. - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/08/03 11:37:44 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42320: [SPARK-44656] Close Iterators in SparkResult as well. - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/03 12:00:16 UTC, 0 replies.
- [GitHub] [spark] vicennial opened a new pull request, #42321: [SPARK-44657] Fix incorrect limit handling in ArrowBatchWithSchemaIterator and config parsing of CONNECT_GRPC_ARROW_MAX_BATCH_SIZE - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/08/03 13:09:33 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #42321: [SPARK-44657] Fix incorrect limit handling in ArrowBatchWithSchemaIterator and config parsing of CONNECT_GRPC_ARROW_MAX_BATCH_SIZE - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/08/03 13:11:25 UTC, 0 replies.
- [GitHub] [spark] heyihong commented on a diff in pull request #42321: [SPARK-44657][CONNECT] Fix incorrect limit handling in ArrowBatchWithSchemaIterator and config parsing of CONNECT_GRPC_ARROW_MAX_BATCH_SIZE - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/03 13:20:31 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #42321: [SPARK-44657][CONNECT] Fix incorrect limit handling in ArrowBatchWithSchemaIterator and config parsing of CONNECT_GRPC_ARROW_MAX_BATCH_SIZE - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/08/03 13:24:31 UTC, 3 replies.
- [GitHub] [spark] tgravescs commented on pull request #42058: [SPARK-42972][DSTREAM]ExecutorAllocationManager cannot allocate new instances when all executors down - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2023/08/03 13:29:47 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #41606: [SPARK-44061][PYTHON] Add assertDataFrameEqual util function - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/08/03 13:52:17 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on a diff in pull request #41301: [SPARK-43780][SQL] Support correlated references in join predicates - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/08/03 14:02:08 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42318: [SPARK-44655][SQL] Make the code cleaner about static and dynamic data/partition filters - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/03 14:19:14 UTC, 6 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42181: [SPARK-44247][BUILD] Upgrade Arrow to 13.0.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/03 14:24:14 UTC, 7 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42318: [SPARK-44655][SQL] Make the code cleaner about static and dynamic data/partition filters - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/03 14:41:24 UTC, 0 replies.
- [GitHub] [spark] aakshintala commented on a diff in pull request #42321: [SPARK-44657][CONNECT] Fix incorrect limit handling in ArrowBatchWithSchemaIterator and config parsing of CONNECT_GRPC_ARROW_MAX_BATCH_SIZE - posted by "aakshintala (via GitHub)" <gi...@apache.org> on 2023/08/03 14:54:25 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41855: [SPARK-44262][SQL] Add `dropTable` and `getInsertStatement` to JdbcDialect - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/03 15:09:58 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42318: [SPARK-44655][SQL] Make the code cleaner about static and dynamic data/partition filters - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/03 15:50:49 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42304: [SPARK-44642] ReleaseExecute in ExecutePlanResponseReattachableIterator after it gets error from server - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/03 16:29:51 UTC, 2 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42315: [SPARK-44653][SQL] Non-trivial DataFrame unions should not break caching - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/03 16:50:07 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42310: [SPARK-44561][PYTHON] Fix AssertionError when converting UDTF output to a complex type - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/03 16:54:47 UTC, 1 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #42308: [Spark Ticket][WIP]Added a warning to pop up in the case the user doesn't use gpus - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/08/03 16:57:18 UTC, 3 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #42308: [Spark Ticket][WIP]Added a warning to pop up in the case the user doesn't use gpus - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/08/03 16:59:53 UTC, 0 replies.
- [GitHub] [spark] sdruzkin opened a new pull request, #42322: [MINOR][DOC] Fix a typo in ResolveReferencesInUpdate scaladoc - posted by "sdruzkin (via GitHub)" <gi...@apache.org> on 2023/08/03 17:00:24 UTC, 0 replies.
- [GitHub] [spark] liuzqt commented on a diff in pull request #42286: [MINOR][SQL] Rename shouldBroadcast to isDynamicPruning in InSubqueryExec - posted by "liuzqt (via GitHub)" <gi...@apache.org> on 2023/08/03 17:48:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42323: [SPARK-44658][CORE] ShuffleStatus.getMapStatus should return None - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/03 17:51:36 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/08/03 18:08:17 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/08/03 18:11:27 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42310: [SPARK-44561][PYTHON] Fix AssertionError when converting UDTF output to a complex type - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/03 18:22:43 UTC, 0 replies.
- [GitHub] [spark] sunchao opened a new pull request, #42324: [SPARK-44641][SQL] Incorrect result in certain scenarios when SPJ is not triggered - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/03 18:33:19 UTC, 0 replies.
- [GitHub] [spark] sunchao opened a new pull request, #42325: [SPARK-44659][SQL] Include keyGroupedPartitioning in StoragePartitionJoinParams equality check - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/03 18:35:33 UTC, 0 replies.
- [GitHub] [spark] maddiedawson commented on a diff in pull request #42308: [Spark Ticket][WIP]Added a warning to pop up in the case the user doesn't use gpus - posted by "maddiedawson (via GitHub)" <gi...@apache.org> on 2023/08/03 18:37:19 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #42322: [MINOR][DOC] Fix a typo in ResolveReferencesInUpdate scaladoc - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/03 19:30:34 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #42322: [MINOR][DOC] Fix a typo in ResolveReferencesInUpdate scaladoc - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/03 19:36:39 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42326: [SPARK-44661][CORE][TESTS] `getMapOutputLocation` should not throw NPE - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/03 19:42:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42323: [SPARK-44658][CORE] `ShuffleStatus.getMapStatus` should return `None` instead of `Some(null)` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/03 20:02:48 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42324: [SPARK-44641][SQL] Incorrect result in certain scenarios when SPJ is not triggered - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/03 20:06:15 UTC, 1 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #42324: [SPARK-44641][SQL] Incorrect result in certain scenarios when SPJ is not triggered - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/03 20:19:01 UTC, 5 replies.
- [GitHub] [spark] sunchao commented on pull request #42324: [SPARK-44641][SQL] Incorrect result in certain scenarios when SPJ is not triggered - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/03 20:27:14 UTC, 2 replies.
- [GitHub] [spark] sunchao commented on pull request #42194: [SPARK-41471][SQL] Reduce Spark shuffle when only one side of a join is KeyGroupedPartitioning - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/03 20:31:35 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42323: [SPARK-44658][CORE] `ShuffleStatus.getMapStatus` should return `None` instead of `Some(null)` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/03 21:18:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42326: [SPARK-44661][CORE][TESTS] `getMapOutputLocation` should not throw NPE - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/03 21:19:35 UTC, 4 replies.
- [GitHub] [spark] viirya commented on pull request #42326: [SPARK-44661][CORE][TESTS] `getMapOutputLocation` should not throw NPE - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/03 21:36:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42326: [SPARK-44661][CORE][TESTS] `getMapOutputLocation` should not throw NPE - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/03 21:41:14 UTC, 0 replies.
- [GitHub] [spark] gbloisi-openaire opened a new pull request, #42327: [SPARK-44634][SQL] Encoders.bean does no longer support nested beans with type arguments - posted by "gbloisi-openaire (via GitHub)" <gi...@apache.org> on 2023/08/03 21:51:41 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42328: [SPARK-43967][SQL][PYTHON] Add memory limits for Python UDTF analyzer - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/03 22:33:48 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42329: [SPARK-44663][PYTHON] Disable arrow optimization by default for Python UDTFs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/03 22:34:27 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42328: [SPARK-43967][SQL][PYTHON] Add memory limits for Python UDTF analyzer - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/03 22:34:43 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42272: [SPARK-44508][PYTHON][DOCS] Add user guide for Python user-defined table functions - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/03 22:48:55 UTC, 14 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42302: [SPARK-44640][PYTHON] Improve error messages for Python UDTF returning non Iterable - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/03 22:56:11 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42283: [SPARK-44433][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with `removeListener` and improvements - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/03 23:16:37 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42314: [SPARK-44652] Raise error when only one df is None - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 23:42:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42314: [SPARK-44652] Raise error when only one df is None - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 23:43:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42304: [SPARK-44642][CONNECT] ReleaseExecute in ExecutePlanResponseReattachableIterator after it gets error from server - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 23:45:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42304: [SPARK-44642][CONNECT] ReleaseExecute in ExecutePlanResponseReattachableIterator after it gets error from server - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 23:45:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42320: [SPARK-44656][CONNECT][FOLLOWUP] Close Iterators in SparkResult as well. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/03 23:46:01 UTC, 0 replies.
- [GitHub] [spark] sdruzkin commented on pull request #42322: [MINOR][DOC] Fix a typo in ResolveReferencesInUpdate scaladoc - posted by "sdruzkin (via GitHub)" <gi...@apache.org> on 2023/08/03 23:50:38 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #42322: [MINOR][DOC] Fix a typo in ResolveReferencesInUpdate scaladoc - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/03 23:53:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42330: [SPARK-44664][PYTHON][CONNECT] Release the execute when closing the iterator in Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 00:06:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42330: [SPARK-44664][PYTHON][CONNECT] Release the execute when closing the iterator in Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 00:08:32 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42331: [SPARK-44656][CONNECT] Make all iterators CloseableIterators - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/04 00:13:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42283: [SPARK-44433][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with `removeListener` and improvements - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 00:13:56 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42283: [SPARK-44433][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with `removeListener` and improvements - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 00:14:16 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42320: [SPARK-44656][CONNECT][FOLLOWUP] Close Iterators in SparkResult as well. - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/04 00:14:48 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40930: [DO NOT MERGE] File constant metadata extractors split - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/04 00:20:00 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40929: [SPARK-43264][SQL] Avoid allocation of unwritten ColumnVector in Spark Vectorized Reader - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/04 00:20:01 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40918: [WIP][CORE] Add shuffle sort merge joins to RDD API - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/04 00:20:03 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40665: [SPARK-42621][PS] Add inclusive parameter for pd.date_range - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/04 00:20:06 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40629: [SPARK-42980][CORE] Implement a lightweight SmallBroadcast - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/04 00:20:07 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38171: [SPARK-9213] [SQL] Improve regular expression performance (via joni) - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/04 00:20:09 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42332: [SPARK-44665] Add support for pandas DataFrame assertDataFrameEqual - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/08/04 00:29:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41466: [SPARK-43646][PROTOBUF][BUILD] Split `protobuf-assembly` module from `protobuf` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/04 00:30:30 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42309: [SPARK-44644][PYTHON] Improve error messages for Python UDTFs with pickling errors - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/04 00:50:40 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42315: [SPARK-44653][SQL] Non-trivial DataFrame unions should not break caching - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/04 01:18:36 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42302: [SPARK-44640][PYTHON] Improve error messages for Python UDTF returning non Iterable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 01:22:39 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42302: [SPARK-44640][PYTHON] Improve error messages for Python UDTF returning non Iterable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 01:22:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42284: [SPARK-44629] Publish PySpark Test Guidelines webpage - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 01:29:02 UTC, 8 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42319: [SPARK-43873][PS] Enabling `FrameDescribeTests` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 01:34:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42319: [SPARK-43873][PS] Enabling `FrameDescribeTests` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 01:35:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42268: [SPARK-43562][SPARK-43870][PS] Remove APIs from `DataFrame` and `Series` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 01:35:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42268: [SPARK-43562][SPARK-43870][PS] Remove APIs from `DataFrame` and `Series` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 01:36:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42316: [SPARK-40770][PYTHON][FOLLOW-UP][3.5] Improved error messages for mapInPandas for schema mismatch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 01:37:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42332: [SPARK-44665] Add support for pandas DataFrame assertDataFrameEqual - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 01:37:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42316: [SPARK-40770][PYTHON][FOLLOW-UP][3.5] Improved error messages for mapInPandas for schema mismatch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 01:38:46 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #42318: [SPARK-44655][SQL] Make the code cleaner about static and dynamic data/partition filters - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/08/04 01:42:53 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42253: [SPARK-44619][INFRA] Free up disk space for container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 01:51:53 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #42332: [SPARK-44665] Add support for pandas DataFrame assertDataFrameEqual - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/08/04 02:05:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42282: [SPARK-44624][CONNECT] Retry ExecutePlan in case initial request didn't reach server - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 03:04:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42330: [SPARK-44664][PYTHON][CONNECT] Release the execute when closing the iterator in Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 03:04:05 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42295: [SPARK-44581][YARN]Fix the bug that ShutdownHookManager get wrong hadoop user group information - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/04 03:04:19 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42282: [SPARK-44624][CONNECT] Retry ExecutePlan in case initial request didn't reach server - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 03:05:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42118: [SPARK-44264][PYTHON]E2E Testing for Deepspeed - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 03:23:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42315: [SPARK-44653][SQL] Non-trivial DataFrame unions should not break caching - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/04 03:26:47 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42333: [SPARK-44618][INFRA] Uninstall CodeQL/Go/Node in non-container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 03:39:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42334: [SPARK-44667][INFRA] Uninstall large ML libraries for non-ML jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 03:42:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42332: [SPARK-44665][PYTHON] Add support for pandas DataFrame assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 03:48:56 UTC, 5 replies.
- [GitHub] [spark] 7mming7 opened a new pull request, #42335: [SPARK-44654][SQL]Optimize InSubquery Partition pruning - posted by "7mming7 (via GitHub)" <gi...@apache.org> on 2023/08/04 03:49:57 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #42284: [SPARK-44629] Publish PySpark Test Guidelines webpage - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/08/04 03:50:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42332: [SPARK-44665][PYTHON] Add support for pandas DataFrame assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 03:51:35 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42118: [SPARK-44264][PYTHON]E2E Testing for Deepspeed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 03:52:41 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #42336: [SPARK-44669][SQL][HIVE] Parquet/ORC files written using Hive Serde should has file extension - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/04 04:14:32 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42337: [SPARK-44640][PYTHON][3.5] Improve error messages for Python UDTF returning non Iterable - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/04 04:16:01 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #42336: [SPARK-44669][SQL][HIVE] Parquet/ORC files written using Hive Serde should has file extension - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/04 04:21:22 UTC, 4 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42331: [SPARK-44656][CONNECT] Make all iterators CloseableIterators - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/04 04:26:37 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42338: [SPARK-44671][PYTHON][CONNECT] Retry ExecutePlan in case initial request didn't reach server in Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 04:41:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42338: [SPARK-44671][PYTHON][CONNECT] Retry ExecutePlan in case initial request didn't reach server in Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 04:50:23 UTC, 1 replies.
- [GitHub] [spark] Madhukar98 opened a new pull request, #42339: [SPARK-44670][PYTHON] Fix the tests for python3.7 - posted by "Madhukar98 (via GitHub)" <gi...@apache.org> on 2023/08/04 05:15:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42206: [SPARK-44582][SQL] Skip iterator on SMJ if it was cleaned up - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 05:23:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42206: [SPARK-44582][SQL] Skip iterator on SMJ if it was cleaned up - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 05:24:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42339: [SPARK-44670][PYTHON][TESTS][PS][3.4] Fix 'test_dataframe_conversion.DataFrameConversionTest.get_excel_dfs' test to work with Python 3.7 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 05:28:28 UTC, 2 replies.
- [GitHub] [spark] ukby1234 commented on pull request #42155: [SPARK-44547][CORE] Ignore fallback storage for cached RDD migration - posted by "ukby1234 (via GitHub)" <gi...@apache.org> on 2023/08/04 05:41:56 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40474: [SPARK-42849] [SQL] Session Variables - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/04 06:09:00 UTC, 15 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42284: [SPARK-44629] Publish PySpark Test Guidelines webpage - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 06:22:09 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #42313: [SPARK-44650][CORE] `spark.executor.defaultJavaOptions` Check illegal java options - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/08/04 06:50:09 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42328: [SPARK-43967][SQL][PYTHON] Add memory limits for Python UDTF analyzer - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 06:58:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42328: [SPARK-43967][SQL][PYTHON] Add memory limits for Python UDTF analyzer - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 06:59:55 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42291: [SPARK-44600][INFRA] Make `repl` module to pass Maven daily testing - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/04 07:00:57 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42313: [SPARK-44650][CORE] `spark.executor.defaultJavaOptions` Check illegal java options - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/04 07:02:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42337: [SPARK-44640][PYTHON][3.5] Improve error messages for Python UDTF returning non Iterable - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 07:07:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42337: [SPARK-44640][PYTHON][3.5] Improve error messages for Python UDTF returning non Iterable - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 07:08:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42333: [SPARK-44618][INFRA] Uninstall CodeQL/Go/Node in non-container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 07:16:53 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42340: [SPARK-44433][3.5][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with removeListener and improvements - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/04 07:19:51 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42340: [SPARK-44433][3.5][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with removeListener and improvements - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/04 07:24:30 UTC, 1 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42341: finish - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/04 07:28:40 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42341: [SPARK-44433][PYTHON][CONNECT][SS][FOLLOWUP]Set back USE_DAEMON after creating streaming python processes - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/04 07:29:56 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42112: [SPARK-44493][SQL] Support for translating catalyst expressions into partial datasource filters - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/04 07:35:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42333: [SPARK-44618][INFRA] Uninstall CodeQL/Go/Node in non-container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 07:35:42 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42333: [SPARK-44666][INFRA] Uninstall CodeQL/Go/Node in non-container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 07:37:34 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/04 07:47:18 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42342: [MINOR] Fix gitignore rules related to Antlr. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/04 08:21:34 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #20634: [SPARK-23456][SPARK-21783] Turn on `native` ORC impl and PPD by default - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/04 08:23:03 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42342: [SPARK-44672][INFRA] Fix git ignore rules related to Antlr. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/04 08:30:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42334: [SPARK-44667][INFRA] Uninstall large ML libraries for non-ML jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/04 09:08:18 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #42336: [SPARK-44669][SQL][HIVE] Parquet/ORC files written using Hive Serde should has file extension - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/04 09:14:09 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #42291: [SPARK-44600][INFRA] Make `repl` module to pass Maven daily testing - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/04 09:18:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42258: Test remove BytecodeUtils.scala and HadoopUtils.scala - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/04 09:20:42 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42336: [SPARK-44669][SQL][HIVE] Parquet/ORC files written using Hive Serde should has file extension - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/04 09:49:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42343: [SPARK-44674][GRAPHX] Remove `BytecodeUtils` from `graphx` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/04 10:10:38 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42344: [SPARK-44675][INFRA] Increase ReservedCodeCacheSize for release build - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/04 10:20:16 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42344: [SPARK-44675][INFRA] Increase ReservedCodeCacheSize for release build - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/04 10:22:16 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42318: [SPARK-44655][SQL] Make the code cleaner about static and dynamic data/partition filters - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/04 11:24:03 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42318: [SPARK-44655][SQL] Make the code cleaner about static and dynamic data/partition filters - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/04 11:24:51 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39977: [SPARK-42323][SQL] Assign name to `_LEGACY_ERROR_TEMP_2332` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/04 11:52:36 UTC, 2 replies.
- [GitHub] [spark] wangyum closed pull request #42344: [SPARK-44675][INFRA] Increase ReservedCodeCacheSize for release build - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/04 11:54:59 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42344: [SPARK-44675][INFRA] Increase ReservedCodeCacheSize for release build - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/04 11:55:39 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on pull request #41711: [SPARK-44155] Adding a dev utility to improve error messages based on LLM - posted by "zero323 (via GitHub)" <gi...@apache.org> on 2023/08/04 12:52:32 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #42313: [SPARK-44650][CORE] `spark.executor.defaultJavaOptions` Check illegal java options - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/04 12:56:45 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on pull request #33428: [SPARK-36220][PYTHON] Fix pyspark.sql.types.Row type annotation - posted by "zero323 (via GitHub)" <gi...@apache.org> on 2023/08/04 13:02:08 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #42317: [SPARK-44649][SQL] Runtime Filter supports passing equivalent creation side expressions - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/04 13:29:13 UTC, 2 replies.
- [GitHub] [spark] hvanhovell closed pull request #42331: [SPARK-44656][CONNECT] Make all iterators CloseableIterators - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/04 14:07:08 UTC, 0 replies.
- [GitHub] [spark] Don-Burns commented on pull request #33428: [SPARK-36220][PYTHON] Fix pyspark.sql.types.Row type annotation - posted by "Don-Burns (via GitHub)" <gi...@apache.org> on 2023/08/04 14:22:40 UTC, 0 replies.
- [GitHub] [spark] dzhigimont commented on pull request #40370: [SPARK-42620][PS] Add `inclusive` parameter for (DataFrame|Series).between_time - posted by "dzhigimont (via GitHub)" <gi...@apache.org> on 2023/08/04 14:24:11 UTC, 3 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42338: [SPARK-44671][PYTHON][CONNECT] Retry ExecutePlan in case initial request didn't reach server in Python client - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/04 14:31:14 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42331: [SPARK-44656][CONNECT] Make all iterators CloseableIterators - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/04 14:42:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42343: [SPARK-44674][GRAPHX] Remove `BytecodeUtils` from `graphx` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/04 14:48:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42343: [SPARK-44674][GRAPHX] Remove `BytecodeUtils` from `graphx` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/04 14:48:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #20634: [SPARK-23456][SPARK-21783] Turn on `native` ORC impl and PPD by default - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/04 14:52:51 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42343: [SPARK-44674][GRAPHX] Remove `BytecodeUtils` from `graphx` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/04 15:01:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42342: [SPARK-44672][INFRA] Fix git ignore rules related to Antlr - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/04 15:10:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42342: [SPARK-44672][INFRA] Fix git ignore rules related to Antlr - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/04 15:10:43 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42336: [SPARK-44669][SQL][HIVE] Parquet/ORC files written using Hive Serde should has file extension - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/04 15:13:23 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42327: [SPARK-44634][SQL] Encoders.bean does no longer support nested beans with type arguments - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/04 15:19:32 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42327: [SPARK-44634][SQL] Encoders.bean does no longer support nested beans with type arguments - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/04 15:27:33 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42338: [SPARK-44671][PYTHON][CONNECT] Retry ExecutePlan in case initial request didn't reach server in Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 15:54:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42338: [SPARK-44671][PYTHON][CONNECT] Retry ExecutePlan in case initial request didn't reach server in Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/04 15:58:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42345: [SPARK-44678][BUILD][3.5] Downgrade Hadoop to 3.3.4 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/04 16:09:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42345: [SPARK-44678][BUILD][3.5] Downgrade Hadoop to 3.3.4 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/04 16:27:06 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #41711: [SPARK-44155] Adding a dev utility to improve error messages based on LLM - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/04 17:22:22 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42346: [SPARK-44433][3.5][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with removeListener and improvements - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/04 17:25:26 UTC, 0 replies.
- [GitHub] [spark] WweiL closed pull request #42346: [SPARK-44433][3.5][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with removeListener and improvements - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/04 17:26:37 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #42342: [SPARK-44672][INFRA] Fix git ignore rules related to Antlr - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/08/04 17:39:45 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 opened a new pull request, #42347: [Spark Ticket][Python] Adding Deepspeed To The Test Dockerfile - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/08/04 17:44:19 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on pull request #42347: [SPARK-44264][Python] Adding Deepspeed To The Test Dockerfile - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/08/04 17:45:24 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42341: [SPARK-44433][PYTHON][CONNECT][SS][FOLLOWUP]Set back USE_DAEMON after creating streaming python processes - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/04 17:52:05 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42340: [SPARK-44433][3.5][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with removeListener and improvements - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/04 17:55:00 UTC, 1 replies.
- [GitHub] [spark] szehon-ho commented on a diff in pull request #42324: [SPARK-44641][SQL] Incorrect result in certain scenarios when SPJ is not triggered - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/08/04 18:14:42 UTC, 0 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #42308: [Spark Ticket][WIP]Added a warning to pop up in the case the user doesn't use gpus - posted by "lu-wang-dl (via GitHub)" <gi...@apache.org> on 2023/08/04 18:17:07 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #42324: [SPARK-44641][SQL] Incorrect result in certain scenarios when SPJ is not triggered - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/04 18:28:10 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #42345: [SPARK-44678][BUILD][3.5] Downgrade Hadoop to 3.3.4 - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/04 18:29:25 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42348: [SPARK-44682] Make pandas error class message_parameters strings - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/08/04 18:40:49 UTC, 0 replies.
- [GitHub] [spark-connect-go] hiboyang opened a new pull request, #14: [SPARK-44681] Fix issues when writing Go application code using Spark Connect Go client library - posted by "hiboyang (via GitHub)" <gi...@apache.org> on 2023/08/04 18:43:48 UTC, 0 replies.
- [GitHub] [spark-connect-go] hiboyang commented on pull request #14: [SPARK-44681] Fix issues when writing Go application code using Spark Connect Go client library - posted by "hiboyang (via GitHub)" <gi...@apache.org> on 2023/08/04 18:48:03 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #42284: [SPARK-44629][PYTHON][DOCS] Publish PySpark Test Guidelines webpage - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/08/04 18:59:49 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #42284: [SPARK-44629][PYTHON][DOCS] Publish PySpark Test Guidelines webpage - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/08/04 19:03:35 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42272: [SPARK-44508][PYTHON][DOCS] Add user guide for Python user-defined table functions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/04 19:22:36 UTC, 1 replies.
- [GitHub] [spark] gbloisi-openaire commented on a diff in pull request #42327: [SPARK-44634][SQL] Encoders.bean does no longer support nested beans with type arguments - posted by "gbloisi-openaire (via GitHub)" <gi...@apache.org> on 2023/08/04 19:43:43 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on pull request #42309: [SPARK-44644][PYTHON] Improve error messages for Python UDTFs with pickling errors - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/04 20:15:26 UTC, 1 replies.
- [GitHub] [spark] ueshin closed pull request #42309: [SPARK-44644][PYTHON] Improve error messages for Python UDTFs with pickling errors - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/04 20:16:15 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42349: [SPARK-44644][PYTHON][3.5] Improve error messages for Python UDTFs with pickling errors - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/04 20:57:31 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42349: [SPARK-44644][PYTHON][3.5] Improve error messages for Python UDTFs with pickling errors - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/04 20:58:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42345: [SPARK-44678][BUILD][3.5] Downgrade Hadoop to 3.3.4 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/04 21:21:14 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42332: [SPARK-44665][PYTHON] Add support for pandas DataFrame assertDataFrameEqual - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/04 22:55:31 UTC, 3 replies.
- [GitHub] [spark] ahshahid opened a new pull request, #42350: [SPARK-44662] Perf improvement in BroadcastHashJoin queries with stream side join key on non partition columns - posted by "ahshahid (via GitHub)" <gi...@apache.org> on 2023/08/04 22:58:15 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #42351: [SPARK-44503][SQL] Project any PARTITION BY expressions not already returned from Python UDTF TABLE arguments - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/04 22:59:31 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #42351: [SPARK-44503][SQL] Project any PARTITION BY expressions not already returned from Python UDTF TABLE arguments - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/04 23:01:46 UTC, 0 replies.
- [GitHub] [spark] ahshahid commented on pull request #42350: [SPARK-44662] Perf improvement in BroadcastHashJoin queries with stream side join key on non partition columns - posted by "ahshahid (via GitHub)" <gi...@apache.org> on 2023/08/04 23:02:21 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42284: [SPARK-44629][PYTHON][DOCS] Publish PySpark Test Guidelines webpage - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/04 23:11:08 UTC, 0 replies.
- [GitHub] [spark] pkotikalapudi opened a new pull request, #42352: [WIP][SPARK-24815] [CORE] Trigger Interval based DRA for Structured Streaming - posted by "pkotikalapudi (via GitHub)" <gi...@apache.org> on 2023/08/04 23:30:22 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42329: [SPARK-44663][PYTHON] Disable arrow optimization by default for Python UDTFs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/04 23:38:02 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42329: [SPARK-44663][PYTHON] Disable arrow optimization by default for Python UDTFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/04 23:42:07 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42329: [SPARK-44663][PYTHON] Disable arrow optimization by default for Python UDTFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/04 23:44:11 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #42332: [SPARK-44665][PYTHON] Add support for pandas DataFrame assertDataFrameEqual - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/08/05 00:01:55 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42334: [SPARK-44667][INFRA] Uninstall large ML libraries for non-ML jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/05 00:05:42 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on pull request #42348: [SPARK-44682] Make pandas error class message_parameters strings - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/08/05 00:08:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42345: [SPARK-44678][BUILD][3.5] Downgrade Hadoop to 3.3.4 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/05 00:11:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42342: [SPARK-44672][INFRA] Fix git ignore rules related to Antlr - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/05 00:12:30 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40960: [SPARK-43180][PYTHON-INFRA]: Upgrade mypy and pytest-mypypplugins packages - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/05 00:19:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40929: [SPARK-43264][SQL] Avoid allocation of unwritten ColumnVector in Spark Vectorized Reader - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/05 00:19:58 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40918: [WIP][CORE] Add shuffle sort merge joins to RDD API - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/05 00:19:59 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40665: [SPARK-42621][PS] Add inclusive parameter for pd.date_range - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/05 00:19:59 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40629: [SPARK-42980][CORE] Implement a lightweight SmallBroadcast - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/05 00:20:00 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40171: [SPARK-42598][TEST] Refactor TPCH schema to separate file similar to TPCDS for code reuse - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/05 00:20:01 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42353: [SPARK-44005][PYTHON] Improve error messages for regular Python UDTFs that return non-tuple values - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/05 00:31:01 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42341: [SPARK-44433][PYTHON][CONNECT][SS][FOLLOWUP]Set back USE_DAEMON after creating streaming python processes - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/05 00:40:03 UTC, 0 replies.
- [GitHub] [spark] siying opened a new pull request, #42354: [SPARK-44683][SS] Logging level isn't passed to RocksDB state store provider correctly - posted by "siying (via GitHub)" <gi...@apache.org> on 2023/08/05 00:40:17 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42341: [SPARK-44433][PYTHON][CONNECT][SS][FOLLOWUP]Set back USE_DAEMON after creating streaming python processes - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/05 00:41:38 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42355: [WIP][CONNECT] Return from executePlan / reattachExecute handler and process stream on different thread - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/05 01:15:52 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42340: [SPARK-44433][3.5][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with removeListener and improvements - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/05 02:40:14 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42340: [SPARK-44433][3.5][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with removeListener and improvements - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/05 02:43:34 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42356: [SPARK-44685][SQL] Remove deprecated Catalog#createExternalTable - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/05 03:04:23 UTC, 0 replies.
- [GitHub] [spark] shuwang21 opened a new pull request, #42357: [WIP][SPARK-44306][YARN] Group FileStatus with few RPC calls within Yarn Client - posted by "shuwang21 (via GitHub)" <gi...@apache.org> on 2023/08/05 03:55:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42284: [SPARK-44629][PYTHON][DOCS] Publish PySpark Test Guidelines webpage - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/05 04:01:32 UTC, 6 replies.
- [GitHub] [spark] shuwang21 commented on a diff in pull request #41489: [SPARK-43987][Shuffle] Separate finalizeShuffleMerge Processing to Dedicated Thread Pools - posted by "shuwang21 (via GitHub)" <gi...@apache.org> on 2023/08/05 04:15:39 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #42323: [SPARK-44658][CORE] `ShuffleStatus.getMapStatus` should return `None` instead of `Some(null)` - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/05 05:35:46 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42093: [SPARK-44497][WEBUI] Show task partition id in Task table - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/05 05:41:33 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #42295: [SPARK-44581][YARN]Fix the bug that ShutdownHookManager get wrong hadoop user group information - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/05 05:45:39 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42352: [WIP][SPARK-24815] [CORE] Trigger Interval based DRA for Structured Streaming - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/05 05:51:51 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #41489: [SPARK-43987][Shuffle] Separate finalizeShuffleMerge Processing to Dedicated Thread Pools - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/05 05:58:13 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42358: [SPARK-44687] Fix mima check for Scala 2.13 after SPARK-44198 merged - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/05 06:33:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42359: Execute after verifying the existence of `./dev/free_disk_space` and `./dev/free_disk_space_container` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/05 07:29:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42359: [SPARK-44688][INFRA] Add a file existence check before executing `free_disk_space` and `free_disk_space_container` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/05 07:49:35 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42360: [CONNECT] Make the exception handling of function `SparkConnectPlanner#unpackScalarScalaUDF` compatible with Java 17. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/05 09:17:09 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42360: [CONNECT] Make the exception handling of function `SparkConnectPlanner#unpackScalarScalaUDF` compatible with Java 17. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/05 09:17:50 UTC, 0 replies.
- [GitHub] [spark] dzhigimont commented on pull request #40665: [SPARK-42621][PS] Add inclusive parameter for pd.date_range - posted by "dzhigimont (via GitHub)" <gi...@apache.org> on 2023/08/05 11:25:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42361: Rename the `object Catalyst` in SparkBuild to `object SqlApi` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/05 13:14:56 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on a diff in pull request #42313: [SPARK-44650][CORE] `spark.executor.defaultJavaOptions` Check illegal java options - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/08/05 14:19:06 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42356: [SPARK-44685][SQL] Remove deprecated Catalog#createExternalTable - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/05 14:57:14 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #42339: [SPARK-44670][PYTHON][TESTS][PS][3.4] Fix 'test_dataframe_conversion.DataFrameConversionTest.get_excel_dfs' test to work with Python 3.7 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/05 14:59:45 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #41746: [SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/05 15:04:00 UTC, 1 replies.
- [GitHub] [spark] Madhukar98 commented on pull request #42339: [SPARK-44670][PYTHON][TESTS][PS][3.4] Fix 'test_dataframe_conversion.DataFrameConversionTest.get_excel_dfs' test to work with Python 3.7 - posted by "Madhukar98 (via GitHub)" <gi...@apache.org> on 2023/08/05 15:41:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41746: [SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/05 16:05:04 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #42358: [SPARK-44687][BUILD] Fix mima check for Scala 2.13 after SPARK-44198 merged - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/05 17:13:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42358: [SPARK-44687][BUILD] Fix mima check for Scala 2.13 after SPARK-44198 merged - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/05 17:14:30 UTC, 0 replies.
- [GitHub] [spark] pkotikalapudi commented on pull request #42352: [WIP][SPARK-24815] [CORE] Trigger Interval based DRA for Structured Streaming - posted by "pkotikalapudi (via GitHub)" <gi...@apache.org> on 2023/08/05 19:58:37 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40960: [SPARK-43180][PYTHON-INFRA]: Upgrade mypy and pytest-mypypplugins packages - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/06 00:19:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40171: [SPARK-42598][TEST] Refactor TPCH schema to separate file similar to TPCDS for code reuse - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/06 00:19:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42339: [SPARK-44670][PYTHON][TESTS][PS][3.4] Fix 'test_dataframe_conversion.DataFrameConversionTest.get_excel_dfs' test to work with Python 3.7 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/06 01:24:28 UTC, 0 replies.
- [GitHub] [spark] dzhigimont opened a new pull request, #40665: [SPARK-42621][PS] Add inclusive parameter for pd.date_range - posted by "dzhigimont (via GitHub)" <gi...@apache.org> on 2023/08/06 01:26:20 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40665: [SPARK-42621][PS] Add inclusive parameter for pd.date_range - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/06 01:26:26 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42359: [SPARK-44688][INFRA] Add a file existence check before executing `free_disk_space` and `free_disk_space_container` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/06 01:29:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42284: [SPARK-44629][PYTHON][DOCS] Publish PySpark Test Guidelines webpage - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/06 01:30:55 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42362: [BUILD][3.5] Downgrade Scala from 2.13.11 to 2.13.8 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/06 02:11:34 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42356: [SPARK-44685][SQL] Remove deprecated Catalog#createExternalTable - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/06 02:18:18 UTC, 2 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42265: [SPARK-41636][SQL] Make sure `selectFilters` returns predicates in deterministic order - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/06 04:12:38 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42018: [SPARK-42321][SQL] Assign name to _LEGACY_ERROR_TEMP_2133 - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/06 04:13:04 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42362: [SPARK-44690][BUILD][3.5] Downgrade Scala to 2.13.8 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/06 07:00:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42362: [SPARK-44690][BUILD][3.5] Downgrade Scala to 2.13.8 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/06 07:46:23 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42362: [SPARK-44690][BUILD][3.5] Downgrade Scala to 2.13.8 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/06 07:46:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42362: [SPARK-44690][BUILD][3.5] Downgrade Scala to 2.13.8 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/06 07:47:00 UTC, 6 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42243: [SPARK-38475][CORE] Use error class in org.apache.spark.serializer - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/06 09:32:43 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #42359: [SPARK-44688][INFRA] Add a file existence check before executing `free_disk_space` and `free_disk_space_container` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/06 09:41:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42359: [SPARK-44688][INFRA] Add a file existence check before executing `free_disk_space` and `free_disk_space_container` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/06 09:42:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42238: [SPARK-44628][SQL] Clear some unused codes in "***Errors" and extract some common logic - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/06 10:15:41 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42018: [SPARK-42321][SQL] Assign name to _LEGACY_ERROR_TEMP_2133 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/06 10:50:28 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/06 10:56:49 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42018: [SPARK-42321][SQL] Assign name to _LEGACY_ERROR_TEMP_2133 - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/06 11:59:17 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #42313: [SPARK-44650][CORE] `spark.executor.defaultJavaOptions` Check illegal java options - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/06 13:25:04 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #42313: [SPARK-44650][CORE] `spark.executor.defaultJavaOptions` Check illegal java options - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/06 13:25:04 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42362: [SPARK-44690][BUILD][3.5] Downgrade Scala to 2.13.8 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/06 13:53:45 UTC, 1 replies.
- [GitHub] [spark] heyihong opened a new pull request, #42363: [SPARK-44691][SQL][CONNECT] Move Subclasses of AnalysisException to sql/api - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/06 13:58:09 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42364: [SPARK-44690][BUILD] Downgrade Scala to 2.13.8 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/06 14:17:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42364: [SPARK-44690][SPARK-44376][BUILD] Downgrade Scala to 2.13.8 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/06 14:26:49 UTC, 6 replies.
- [GitHub] [spark] srowen commented on pull request #42364: [SPARK-44690][SPARK-44376][BUILD] Downgrade Scala to 2.13.8 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/06 14:45:19 UTC, 3 replies.
- [GitHub] [spark] wangyum commented on pull request #42038: [SPARK-42500][SQL] ConstantPropagation support more case - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/06 15:15:23 UTC, 1 replies.
- [GitHub] [spark] wangyum closed pull request #42038: [SPARK-42500][SQL] ConstantPropagation support more case - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/06 15:22:04 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42365: [WIP][SPARK-44680][SQL] Improve the error of parameters in `DEFAULT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/06 18:27:42 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42366: [SPARK-44686][CONNECT][SQL] Add the ability to create a RowEncoder in Encoders.scala. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/06 19:19:54 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42366: [SPARK-44686][CONNECT][SQL] Add the ability to create a RowEncoder in Encoders.scala. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/06 19:20:51 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42363: [SPARK-44691][SQL][CONNECT] Move Subclasses of AnalysisException to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/06 19:22:43 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42355: [WIP][CONNECT] Return from executePlan / reattachExecute handler and process stream on different thread - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/06 19:27:28 UTC, 7 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42355: [WIP][CONNECT] Return from executePlan / reattachExecute handler and process stream on different thread - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/06 19:43:09 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42327: [SPARK-44634][SQL] Encoders.bean does no longer support nested beans with type arguments - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/06 19:48:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42366: [SPARK-44686][CONNECT][SQL] Add the ability to create a RowEncoder in Encoders.scala. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/06 23:32:25 UTC, 2 replies.
- [GitHub] [spark] cdkrot commented on pull request #42320: [SPARK-44656][CONNECT][FOLLOWUP] Close Iterators in SparkResult as well. - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/08/06 23:45:23 UTC, 0 replies.
- [GitHub] [spark] cdkrot closed pull request #42320: [SPARK-44656][CONNECT][FOLLOWUP] Close Iterators in SparkResult as well. - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/08/06 23:45:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42284: [SPARK-44629][PYTHON][DOCS] Publish PySpark Test Guidelines webpage - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/06 23:55:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42356: [SPARK-44685][SQL] Remove deprecated Catalog#createExternalTable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/06 23:59:51 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42350: [SPARK-44662] Perf improvement in BroadcastHashJoin queries with stream side join key on non partition columns - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 00:03:43 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42265: [SPARK-41636][SQL] Make sure `selectFilters` returns predicates in deterministic order - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 00:07:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42265: [SPARK-41636][SQL] Make sure `selectFilters` returns predicates in deterministic order - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 00:07:37 UTC, 1 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #42356: [SPARK-44685][SQL] Remove deprecated Catalog#createExternalTable - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/07 00:14:02 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42359: [SPARK-44688][INFRA] Add a file existence check before executing `free_disk_space` and `free_disk_space_container` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/07 00:18:48 UTC, 0 replies.
- [GitHub] [spark] yangjiandan commented on pull request #42295: [SPARK-44581][YARN]Fix the bug that ShutdownHookManager get wrong hadoop user group information - posted by "yangjiandan (via GitHub)" <gi...@apache.org> on 2023/08/07 01:21:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42356: [SPARK-44685][SQL] Remove deprecated Catalog#createExternalTable - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/07 01:49:46 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #42255: [SPARK-40178][SQL] Support string parameters in hint method - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/08/07 01:58:34 UTC, 2 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42367: [SPARK-43429][CONNECT] Add Default & Active SparkSession for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/07 02:06:33 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42367: [SPARK-43429][CONNECT] Add Default & Active SparkSession for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/07 02:07:27 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42265: [SPARK-41636][SQL] Make sure `selectFilters` returns predicates in deterministic order - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/07 02:10:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42367: [SPARK-43429][CONNECT] Add Default & Active SparkSession for Scala Client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 02:11:15 UTC, 4 replies.
- [GitHub] [spark] LuciferYang closed pull request #42364: [SPARK-44690][SPARK-44376][BUILD] Downgrade Scala to 2.13.8 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/07 02:14:37 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42367: [SPARK-43429][CONNECT] Add Default & Active SparkSession for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/07 02:20:15 UTC, 3 replies.
- [GitHub] [spark] lvkaihua commented on pull request #42295: [SPARK-44581][YARN]Fix the bug that ShutdownHookManager get wrong hadoop user group information - posted by "lvkaihua (via GitHub)" <gi...@apache.org> on 2023/08/07 02:23:01 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42368: [SPARK-44692][CONNECT][SQL] Move Trigger(s) to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/07 02:24:13 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40370: [SPARK-42620][PS] Add `inclusive` parameter for (DataFrame|Series).between_time - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/07 02:25:41 UTC, 2 replies.
- [GitHub] [spark] itholic commented on pull request #40436: [SPARK-42619][PS] Add `show_counts` parameter for DataFrame.info - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/07 02:26:41 UTC, 2 replies.
- [GitHub] [spark] LuciferYang closed pull request #42361: [SPARK-44693][BUILD] Rename the `object Catalyst` in SparkBuild to `object SqlApi` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/07 02:28:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42361: [SPARK-44693][BUILD] Rename the `object Catalyst` in SparkBuild to `object SqlApi` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/07 02:28:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42360: [SPARK-44689][CONNECT] Make the exception handling of function `SparkConnectPlanner#unpackScalarScalaUDF` more universal - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/07 02:30:17 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42347: [SPARK-44264][PYTHON][ML][TESTS][FOLLOWUP] Adding Deepspeed To The Test Dockerfile - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/07 02:57:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42347: [SPARK-44264][PYTHON][ML][TESTS][FOLLOWUP] Adding Deepspeed To The Test Dockerfile - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/07 02:58:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42348: [SPARK-44682][PS] Make pandas error class message_parameters strings - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 03:01:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42348: [SPARK-44682][PS] Make pandas error class message_parameters strings - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 03:01:55 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42369: [SPARK-44695][PYTHON] Improve error message for `DataFrame.toDF` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/07 03:34:43 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41554: [SPARK-43781][SQL] Fix IllegalStateException when cogrouping two datasets derived from the same source - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/07 03:41:08 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42370: Using `ThreadLocalRandom` instead of `RandomUtils` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/07 03:57:09 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42332: [SPARK-44665][PYTHON] Add support for pandas DataFrame assertDataFrameEqual - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/07 03:59:51 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40665: [SPARK-42621][PS] Add inclusive parameter for pd.date_range - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/07 04:05:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42371: [SPARK-44694][PYTHON][CONNECT] Refactor active sessions and expose them as an API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 04:50:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42371: [SPARK-44694][PYTHON][CONNECT] Refactor active sessions and expose them as an API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 04:52:46 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42349: [SPARK-44644][PYTHON][3.5] Improve error messages for Python UDTFs with pickling errors - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/07 05:03:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42349: [SPARK-44644][PYTHON][3.5] Improve error messages for Python UDTFs with pickling errors - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/07 05:03:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42356: [SPARK-44685][SQL] Remove deprecated Catalog#createExternalTable - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/07 05:54:35 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42371: [SPARK-44694][PYTHON][CONNECT] Refactor active sessions and expose them as an API - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/07 05:56:27 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42371: [SPARK-44694][PYTHON][CONNECT] Refactor active sessions and expose them as an API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 06:19:34 UTC, 4 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42324: [SPARK-44641][SQL] Incorrect result in certain scenarios when SPJ is not triggered - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/07 06:54:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42238: [SPARK-44628][SQL] Clear some unused codes in "***Errors" and extract some common logic - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/07 07:01:13 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42238: [SPARK-44628][SQL] Clear some unused codes in "***Errors" and extract some common logic - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/07 07:02:01 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42220: [SPARK-44577][SQL] Fix INSERT BY NAME returns nonsensical error message - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/07 07:09:22 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42018: [SPARK-42321][SQL] Assign name to _LEGACY_ERROR_TEMP_2133 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/07 07:13:23 UTC, 0 replies.
- [GitHub] [spark] shuyouZZ opened a new pull request, #42372: [SPARK-44699][CORE] Add log when finished write events to file in EventLogFileWriter.closeWriter - posted by "shuyouZZ (via GitHub)" <gi...@apache.org> on 2023/08/07 07:14:03 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42373: [MINOR][UI] Increasing the number of significant digits for Fraction Cached of RDD - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/07 07:14:09 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42018: [SPARK-42321][SQL] Assign name to _LEGACY_ERROR_TEMP_2133 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/07 07:14:10 UTC, 0 replies.
- [GitHub] [spark] anchovYu commented on a diff in pull request #42276: [WIP] Throw a more tailored error message for having - window - LCA unresolved case - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/08/07 07:21:03 UTC, 1 replies.
- [GitHub] [spark] zhuqi-lucas opened a new pull request, #42374: [SPARK-44698][SQL] Create table like other table should also copy tab… - posted by "zhuqi-lucas (via GitHub)" <gi...@apache.org> on 2023/08/07 07:25:26 UTC, 0 replies.
- [GitHub] [spark] zhuqi-lucas commented on pull request #42374: [SPARK-44698][SQL] Create table like other table should also copy tab… - posted by "zhuqi-lucas (via GitHub)" <gi...@apache.org> on 2023/08/07 07:30:20 UTC, 1 replies.
- [GitHub] [spark] zhuqi-lucas closed pull request #42374: [SPARK-44698][SQL] Create table like other table should also copy tab… - posted by "zhuqi-lucas (via GitHub)" <gi...@apache.org> on 2023/08/07 07:44:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42266: [SPARK-44575][SQL][CONNECT] Implement basic error translation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 08:01:00 UTC, 8 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42220: [SPARK-44577][SQL] Fix INSERT BY NAME returns nonsensical error message - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/07 08:15:41 UTC, 16 replies.
- [GitHub] [spark] VVKot commented on a diff in pull request #42297: [SPARK-44609][KUBERNETES] Remove executor pod from PodsAllocator if it was removed from scheduler backend - posted by "VVKot (via GitHub)" <gi...@apache.org> on 2023/08/07 08:18:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42375: [MINOR][PYTHON][TESTS] Skip `ClassificationTestsOnConnect` when `torch` is not installed - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/07 08:20:09 UTC, 0 replies.
- [GitHub] [spark] dzhigimont commented on a diff in pull request #40370: [SPARK-42620][PS] Add `inclusive` parameter for (DataFrame|Series).between_time - posted by "dzhigimont (via GitHub)" <gi...@apache.org> on 2023/08/07 08:31:22 UTC, 2 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42373: [MINOR][UI] Increasing the number of significant digits for Fraction Cached of RDD - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/07 09:06:53 UTC, 0 replies.
- [GitHub] [spark] monkeyboy123 opened a new pull request, #42376: [SPARK-44700][SQL] Rule OptimizeCsvJsonExprs should not be applied to expression like from_json(regexp_replace) - posted by "monkeyboy123 (via GitHub)" <gi...@apache.org> on 2023/08/07 09:20:37 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42365: [SPARK-44680][SQL] Improve the error for parameters in `DEFAULT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/07 09:30:19 UTC, 1 replies.
- [GitHub] [spark] liangyu-1 commented on pull request #42295: [SPARK-44581][YARN] Fix the bug that ShutdownHookManager get wrong hadoop user group information - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/08/07 09:55:11 UTC, 4 replies.
- [GitHub] [spark] heyihong opened a new pull request, #42377: [SPARK-44622][SQL][CONNECT] Add GetErrorInfo RPC - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/07 10:01:02 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42295: [SPARK-44581][YARN] Fix the bug that ShutdownHookManager get wrong hadoop user group information - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/07 10:03:23 UTC, 4 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #42355: [WIP][CONNECT] Return from executePlan / reattachExecute handler and process stream on different thread - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/07 10:40:09 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42355: [WIP][CONNECT] Return from executePlan / reattachExecute handler and process stream on different thread - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/07 10:48:30 UTC, 0 replies.
- [GitHub] [spark] shuyouZZ opened a new pull request, #42378: SPARK-44703. Log eventLog rewrite duration when compact old event log files - posted by "shuyouZZ (via GitHub)" <gi...@apache.org> on 2023/08/07 11:00:18 UTC, 0 replies.
- [GitHub] [spark] xkrt commented on pull request #32289: [SPARK-33357][K8S] Support Spark application managing with SparkAppHandle on Kubernetes - posted by "xkrt (via GitHub)" <gi...@apache.org> on 2023/08/07 11:11:31 UTC, 0 replies.
- [GitHub] [spark] melihsozdinler commented on a diff in pull request #42365: [SPARK-44680][SQL] Improve the error for parameters in `DEFAULT` - posted by "melihsozdinler (via GitHub)" <gi...@apache.org> on 2023/08/07 11:34:47 UTC, 0 replies.
- [GitHub] [spark] shuyouZZ commented on pull request #42378: [SPARK-44703][CORE] Log eventLog rewrite duration when compact old event log files - posted by "shuyouZZ (via GitHub)" <gi...@apache.org> on 2023/08/07 11:40:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42375: [SPARK-44701][PYTHON][TESTS] Skip `ClassificationTestsOnConnect` when `torch` is not installed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 11:46:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42375: [SPARK-44701][PYTHON][TESTS] Skip `ClassificationTestsOnConnect` when `torch` is not installed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 11:46:49 UTC, 0 replies.
- [GitHub] [spark] gbloisi-openaire opened a new pull request, #42379: [SPARK-44634][SQL][3.4] Encoders.bean does no longer support nested beans with type arguments - posted by "gbloisi-openaire (via GitHub)" <gi...@apache.org> on 2023/08/07 11:57:46 UTC, 0 replies.
- [GitHub] [spark] gbloisi-openaire commented on pull request #42327: [SPARK-44634][SQL] Encoders.bean does no longer support nested beans with type arguments - posted by "gbloisi-openaire (via GitHub)" <gi...@apache.org> on 2023/08/07 11:59:34 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/07 12:35:26 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41779: [SPARK-44236][SQL] Disable WholeStageCodegen when set `spark.sql.codegen.factoryMode` to NO_CODEGEN - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/07 12:39:16 UTC, 0 replies.
- [GitHub] [spark] melihsozdinler commented on a diff in pull request #42336: [SPARK-44669][SQL][HIVE] Parquet/ORC files written using Hive Serde should has file extension - posted by "melihsozdinler (via GitHub)" <gi...@apache.org> on 2023/08/07 12:48:06 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #42366: [SPARK-44686][CONNECT][SQL] Add the ability to create a RowEncoder in Encoders.scala. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/07 13:10:09 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42379: [SPARK-44634][SQL][3.4] Encoders.bean does no longer support nested beans with type arguments - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/07 13:19:07 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42379: [SPARK-44634][SQL][3.4] Encoders.bean does no longer support nested beans with type arguments - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/07 13:19:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42255: [SPARK-40178][SQL] Support string parameters in hint method - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/07 13:33:29 UTC, 2 replies.
- [GitHub] [spark] melihsozdinler commented on a diff in pull request #40502: [SPARK-42829] [UI] add repeat identifier to cached RDD on stage page - posted by "melihsozdinler (via GitHub)" <gi...@apache.org> on 2023/08/07 13:39:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42370: [SPARK-44697][CORE] Clean up the deprecated usage of `o.a.commons.lang3.RandomUtils` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/07 13:41:49 UTC, 2 replies.
- [GitHub] [spark] LuciferYang closed pull request #42370: [SPARK-44697][CORE] Clean up the deprecated usage of `o.a.commons.lang3.RandomUtils` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/07 13:43:19 UTC, 0 replies.
- [GitHub] [spark] melihsozdinler commented on a diff in pull request #40474: [SPARK-42849] [SQL] Session Variables - posted by "melihsozdinler (via GitHub)" <gi...@apache.org> on 2023/08/07 13:53:10 UTC, 2 replies.
- [GitHub] [spark] advancedxy commented on pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "advancedxy (via GitHub)" <gi...@apache.org> on 2023/08/07 14:45:08 UTC, 0 replies.
- [GitHub] [spark] monkeyboy123 commented on pull request #42376: [SPARK-44700][SQL] Rule OptimizeCsvJsonExprs should not be applied to expression like from_json(regexp_replace) - posted by "monkeyboy123 (via GitHub)" <gi...@apache.org> on 2023/08/07 15:36:05 UTC, 7 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #41746: [SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/08/07 16:18:41 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #42363: [SPARK-44691][SQL][CONNECT] Move Subclasses of AnalysisException to sql/api - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/08/07 16:31:21 UTC, 1 replies.
- [GitHub] [spark] heyihong commented on a diff in pull request #42363: [SPARK-44691][SQL][CONNECT] Move Subclasses of AnalysisException to sql/api - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/07 16:34:46 UTC, 7 replies.
- [GitHub] [spark] lvyanquan opened a new pull request, #42380: [SPARK-44696][SQL] Support different timestamp precise for `from_json` function - posted by "lvyanquan (via GitHub)" <gi...@apache.org> on 2023/08/07 16:37:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42167: [SPARK-44554][INFRA] Make Python linter related checks pass of branch-3.3/3.4 daily testing - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/07 16:37:22 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42365: [SPARK-44680][SQL] Improve the error for parameters in `DEFAULT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/07 17:04:19 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42243: [SPARK-38475][CORE] Use error class in org.apache.spark.serializer - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/07 17:09:08 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42243: [SPARK-38475][CORE] Use error class in org.apache.spark.serializer - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/07 17:10:14 UTC, 0 replies.
- [GitHub] [spark] juanvisoler opened a new pull request, #40608: [SPARK-35198][CONNECT][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "juanvisoler (via GitHub)" <gi...@apache.org> on 2023/08/07 17:18:53 UTC, 2 replies.
- [GitHub] [spark] holdenk commented on pull request #40608: [SPARK-35198][CONNECT][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/08/07 17:20:22 UTC, 0 replies.
- [GitHub] [spark] holdenk opened a new pull request, #31553: [WIP][SPARK-34425][SQL] Clarify error message - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/08/07 17:21:16 UTC, 0 replies.
- [GitHub] [spark] holdenk closed pull request #31553: [WIP][SPARK-34425][SQL] Clarify error message - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/08/07 17:22:23 UTC, 0 replies.
- [GitHub] [spark] oleg-smith opened a new pull request, #28488: [SPARK-29083][CORE] Prefetch elements in rdd.toLocalIterator - posted by "oleg-smith (via GitHub)" <gi...@apache.org> on 2023/08/07 17:26:45 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #28488: [SPARK-29083][CORE] Prefetch elements in rdd.toLocalIterator - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/08/07 17:27:30 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on pull request #41712: [SPARK-44132][SQL] Materialize `Stream` of join column names to avoid codegen failure - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/08/07 17:50:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42381: [SPARK-44707][K8S] Use INFO log in `ExecutorPodsWatcher.onClose` if `SparkContext` is stopped - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/07 18:44:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42381: [SPARK-44707][K8S] Use INFO log in `ExecutorPodsWatcher.onClose` if `SparkContext` is stopped - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/07 18:45:25 UTC, 3 replies.
- [GitHub] [spark] ueshin closed pull request #42310: [SPARK-44561][PYTHON] Fix AssertionError when converting UDTF output to a complex type - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/07 18:48:34 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42371: [SPARK-44694][PYTHON][CONNECT] Refactor active sessions and expose them as an API - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/07 18:49:31 UTC, 0 replies.
- [GitHub] [spark] sergioschena-db opened a new pull request, #42382: [WIP][ML] Remove usage of RDD APIs for loads in spark-ml - posted by "sergioschena-db (via GitHub)" <gi...@apache.org> on 2023/08/07 19:03:40 UTC, 0 replies.
- [GitHub] [spark] agubichev opened a new pull request, #42383: [SPARK-44549] Support window functions in correlated scalar subqueries - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2023/08/07 19:57:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42381: [SPARK-44707][K8S] Use INFO log in `ExecutorPodsWatcher.onClose` if `SparkContext` is stopped - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/07 20:14:27 UTC, 1 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42381: [SPARK-44707][K8S] Use INFO log in `ExecutorPodsWatcher.onClose` if `SparkContext` is stopped - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/07 20:32:57 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42381: [SPARK-44707][K8S] Use INFO log in `ExecutorPodsWatcher.onClose` if `SparkContext` is stopped - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/07 20:56:33 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42353: [SPARK-44005][PYTHON] Improve error messages for regular Python UDTFs that return non-tuple values - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/07 21:16:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42267: [SPARK-43606][PS] Remove `Int64Index` & `Float64Index` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 21:31:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42267: [SPARK-43606][PS] Remove `Int64Index` & `Float64Index` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 21:32:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42266: [SPARK-44575][SQL][CONNECT] Implement basic error translation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 21:33:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42266: [SPARK-44575][SQL][CONNECT] Implement basic error translation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 21:33:59 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #42380: [SPARK-44696][SQL] Support different timestamp precise for `from_json` function - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/08/07 21:56:35 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42371: [SPARK-44694][PYTHON][CONNECT] Refactor active sessions and expose them as an API - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/07 22:04:50 UTC, 4 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42368: [SPARK-44692][CONNECT][SQL] Move Trigger(s) to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/07 22:41:52 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42368: [SPARK-44692][CONNECT][SQL] Move Trigger(s) to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/07 22:42:23 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42355: [SPARK-44709][CONNECT] Run ExecuteGrpcResponseSender in reattachable execute in new thread to fix flow control - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/07 22:50:07 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42355: [SPARK-44709][CONNECT] Run ExecuteGrpcResponseSender in reattachable execute in new thread to fix flow control - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/07 22:52:06 UTC, 1 replies.
- [GitHub] [spark] siying commented on pull request #42354: [SPARK-44683][SS] Logging level isn't passed to RocksDB state store provider correctly - posted by "siying (via GitHub)" <gi...@apache.org> on 2023/08/07 23:08:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42321: [SPARK-44657][CONNECT] Fix incorrect limit handling in ArrowBatchWithSchemaIterator and config parsing of CONNECT_GRPC_ARROW_MAX_BATCH_SIZE - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 23:08:37 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41712: [SPARK-44132][SQL] Materialize `Stream` of join column names to avoid codegen failure - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 23:08:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41712: [SPARK-44132][SQL] Materialize `Stream` of join column names to avoid codegen failure - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 23:09:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42373: [MINOR][UI] Increasing the number of significant digits for Fraction Cached of RDD - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 23:10:00 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42373: [MINOR][UI] Increasing the number of significant digits for Fraction Cached of RDD - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/07 23:10:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42373: [MINOR][UI] Increasing the number of significant digits for Fraction Cached of RDD - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/07 23:15:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42378: [SPARK-44703][CORE] Log eventLog rewrite duration when compact old event log files - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 00:05:21 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40949: [DRAFT][SPARK-23607][CORE] Use HDFS extended attributes to store application summary information in SHS - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/08 00:20:03 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40608: [SPARK-35198][CONNECT][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/08 00:20:06 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #28488: [SPARK-29083][CORE] Prefetch elements in rdd.toLocalIterator - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/08 00:20:07 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42384: [SPARK-44710][CONNECT] Add Dataset.dropDuplicatesWithinWatermark to Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 00:58:58 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42384: [SPARK-44710][CONNECT] Add Dataset.dropDuplicatesWithinWatermark to Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 00:59:11 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on a diff in pull request #42384: [SPARK-44710][CONNECT] Add Dataset.dropDuplicatesWithinWatermark to Spark Connect Scala Client - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/08/08 01:06:30 UTC, 0 replies.
- [GitHub] [spark] utkarsh39 opened a new pull request, #42385: [SPARK-44705] Make PythonRunner single-threaded - posted by "utkarsh39 (via GitHub)" <gi...@apache.org> on 2023/08/08 01:31:08 UTC, 0 replies.
- [GitHub] [spark] anchovYu commented on a diff in pull request #42276: [SPARK-44714] Ease restriction of LCA resolution regarding queries with having - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/08/08 01:35:52 UTC, 3 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42386: [SPARK-44713][CONNECT][SQL] Move shared classes to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 01:54:30 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42386: [SPARK-44713][CONNECT][SQL] Move shared classes to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 01:54:54 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42371: [SPARK-44694][PYTHON][CONNECT] Refactor active sessions and expose them as an API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 02:03:16 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42354: [SPARK-44683][SS] Logging level isn't passed to RocksDB state store provider correctly - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/08 02:11:17 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #42354: [SPARK-44683][SS] Logging level isn't passed to RocksDB state store provider correctly - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/08 02:12:10 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42367: [SPARK-43429][CONNECT] Add Default & Active SparkSession for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 02:15:16 UTC, 0 replies.
- [GitHub] [spark] sunchao closed pull request #42324: [SPARK-44641][SQL] Incorrect result in certain scenarios when SPJ is not triggered - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/08 02:16:47 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on pull request #42386: [SPARK-44713][CONNECT][SQL] Move shared classes to sql/api - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/08/08 02:35:00 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #42336: [SPARK-44669][SQL][HIVE] Parquet/ORC files written using Hive Serde should has file extension - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/08 02:53:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42167: [SPARK-44554][INFRA] Make Python linter related checks pass of branch-3.3/3.4 daily testing - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/08 03:05:05 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42167: [SPARK-44554][INFRA] Make Python linter related checks pass of branch-3.3/3.4 daily testing - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/08 03:05:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42360: [SPARK-44689][CONNECT] Make the exception handling of function `SparkConnectPlanner#unpackScalarScalaUDF` more universal - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/08 03:06:32 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42385: [SPARK-44705] Make PythonRunner single-threaded - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/08 03:08:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42353: [SPARK-44005][PYTHON] Improve error messages for regular Python UDTFs that return non-tuple values - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/08 03:10:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42353: [SPARK-44005][PYTHON] Improve error messages for regular Python UDTFs that return non-tuple values - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/08 03:10:57 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42378: [SPARK-44703][CORE] Log eventLog rewrite duration when compact old event log files - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/08 03:17:13 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42387: [SPARK-44715][CONNECT] Bring back callUdf and udf function. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 03:17:48 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #42378: [SPARK-44703][CORE] Log eventLog rewrite duration when compact old event log files - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/08 03:18:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42369: [SPARK-44695][PYTHON] Improve error message for `DataFrame.toDF` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 03:51:55 UTC, 1 replies.
- [GitHub] [spark] lvyanquan commented on pull request #42380: [SPARK-44696][SQL] Support different timestamp precise for `from_json` function - posted by "lvyanquan (via GitHub)" <gi...@apache.org> on 2023/08/08 03:54:02 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42380: [SPARK-44696][SQL] Support different timestamp precise for `from_json` function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 03:55:11 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42388: [SPARK-43618][SPARK-43658][CONNECT][PS][TESTS] Enabling more tests - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/08 04:40:01 UTC, 0 replies.
- [GitHub] [spark] lvyanquan commented on a diff in pull request #42380: [SPARK-44696][SQL] Support different timestamp precise for `from_json` function - posted by "lvyanquan (via GitHub)" <gi...@apache.org> on 2023/08/08 04:45:14 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42369: [SPARK-44695][PYTHON] Improve error message for `DataFrame.toDF` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/08 04:52:19 UTC, 4 replies.
- [GitHub] [spark] itholic commented on pull request #42388: [SPARK-43618][SPARK-43658][CONNECT][PS][TESTS] Enabling more tests - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/08 04:55:37 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42388: [SPARK-43618][SPARK-43658][CONNECT][PS][TESTS] Enabling more tests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/08 05:01:50 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42378: [SPARK-44703][CORE] Log eventLog rewrite duration when compact old event log files - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/08 05:49:37 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42388: [SPARK-43618][SPARK-43658][CONNECT][PS][TESTS] Enabling more tests - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/08 05:50:38 UTC, 2 replies.
- [GitHub] [spark] shuyouZZ commented on a diff in pull request #42378: [SPARK-44703][CORE] Log eventLog rewrite duration when compact old event log files - posted by "shuyouZZ (via GitHub)" <gi...@apache.org> on 2023/08/08 06:09:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42270: [SPARK-43567][PS] Support `use_na_sentinel` for `factorize` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 07:16:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42270: [SPARK-43567][PS] Support `use_na_sentinel` for `factorize` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 07:16:22 UTC, 0 replies.
- [GitHub] [spark] liangyu-1 closed pull request #42295: [SPARK-44581][YARN] Fix the bug that ShutdownHookManager get wrong hadoop user group information - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/08/08 07:26:12 UTC, 1 replies.
- [GitHub] [spark] liangyu-1 opened a new pull request, #42295: [SPARK-44581][YARN] Fix the bug that ShutdownHookManager get wrong hadoop user group information - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/08/08 07:37:45 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42389: [SPARK-43709][PS] Remove `closed` parameter from `ps.date_range` & enable test. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/08 07:45:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41877: [SPARK-43660][CONNECT][PS] Enable `resample` with Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 07:48:19 UTC, 0 replies.
- [GitHub] [spark] maheshk114 commented on a diff in pull request #41860: [SPARK-44307][SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "maheshk114 (via GitHub)" <gi...@apache.org> on 2023/08/08 08:14:36 UTC, 5 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42365: [SPARK-44680][SQL] Improve the error for parameters in `DEFAULT` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/08 08:21:16 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #42365: [SPARK-44680][SQL] Improve the error for parameters in `DEFAULT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/08 08:26:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42321: [SPARK-44657][CONNECT] Fix incorrect limit handling in ArrowBatchWithSchemaIterator and config parsing of CONNECT_GRPC_ARROW_MAX_BATCH_SIZE - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 08:30:27 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42390: [SPARK-43872][PS] Support `(DataFrame|Series).plot` with pandas 2.0.0 and above. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/08 08:53:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42276: [SPARK-44714] Ease restriction of LCA resolution regarding queries with having - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/08 09:06:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42276: [SPARK-44714] Ease restriction of LCA resolution regarding queries with having - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/08 09:07:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42376: [SPARK-44700][SQL] Rule OptimizeCsvJsonExprs should not be applied to expression like from_json(regexp_replace) - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/08 09:19:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42367: [SPARK-43429][CONNECT] Add Default & Active SparkSession for Scala Client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/08 09:31:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41779: [SPARK-44236][SQL] Disable WholeStageCodegen when set `spark.sql.codegen.factoryMode` to NO_CODEGEN - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/08 09:45:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41779: [SPARK-44236][SQL] Disable WholeStageCodegen when set `spark.sql.codegen.factoryMode` to NO_CODEGEN - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/08 09:46:23 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41779: [SPARK-44236][SQL] Disable WholeStageCodegen when set `spark.sql.codegen.factoryMode` to NO_CODEGEN - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/08 09:49:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41554: [SPARK-43781][SQL] Fix IllegalStateException when cogrouping two datasets derived from the same source - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/08 10:16:08 UTC, 14 replies.
- [GitHub] [spark] juanvisoler commented on pull request #40608: [SPARK-35198][CONNECT][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "juanvisoler (via GitHub)" <gi...@apache.org> on 2023/08/08 10:43:00 UTC, 0 replies.
- [GitHub] [spark] Deependra-Patel opened a new pull request, #42391: [SPARK-44704][CORE] Cleanup files from host node after migration due to graceful decommissioning - posted by "Deependra-Patel (via GitHub)" <gi...@apache.org> on 2023/08/08 10:48:41 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #38388: [SPARK-40909][SQL] Reuse the broadcast exchange for bloom filter - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/08 11:08:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42392: [SPARK-44717][PYTHON][PS] Respect TimestampNTZ in resampling - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 11:17:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42392: [SPARK-44717][PYTHON][PS] Respect TimestampNTZ in resampling - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 11:17:33 UTC, 2 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42393: [WIP][SPARK-43438][SQL] Error on missing input columns in `INSERT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/08 11:21:30 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41554: [SPARK-43781][SQL] Fix IllegalStateException when cogrouping two datasets derived from the same source - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/08 11:35:37 UTC, 13 replies.
- [GitHub] [spark] majdyz opened a new pull request, #42394: Match ColumnVector memory-mode config default to OffHeapMemoryMode config value - posted by "majdyz (via GitHub)" <gi...@apache.org> on 2023/08/08 11:45:02 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42395: [WIP][SPARK-40909][SQL] Reuse the broadcast exchange for bloom filter - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/08 12:33:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42220: [SPARK-44577][SQL] Fix INSERT BY NAME returns nonsensical error message - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/08 12:44:26 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42386: [SPARK-44713][CONNECT][SQL] Move shared classes to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 13:04:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42384: [SPARK-44710][CONNECT] Add Dataset.dropDuplicatesWithinWatermark to Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 13:05:29 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42387: [SPARK-44715][CONNECT] Bring back callUdf and udf function. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 13:41:20 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42387: [SPARK-44715][CONNECT] Bring back callUdf and udf function. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 13:42:18 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42396: [SPARK-44720][CONNECT] Make Dataset use Encoder instead of AgnosticEncoder - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 14:36:30 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42397: [SPARK-44722][CONNECT] ExecutePlanResponseReattachableIterator._call_iter: AttributeError: 'NoneType' object has no attribute 'message' - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/08 14:40:51 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42398: [SPARK-42746][SQL] Add the LIST_AGG() aggregate function - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/08 15:12:42 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42398: [SPARK-42746][SQL] Add the LIST_AGG() aggregate function - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/08 15:13:16 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42397: [SPARK-44722][CONNECT] ExecutePlanResponseReattachableIterator._call_iter: AttributeError: 'NoneType' object has no attribute 'message' - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/08 15:18:05 UTC, 0 replies.
- [GitHub] [spark] dstrodtman-db commented on a diff in pull request #42272: [SPARK-44508][PYTHON][DOCS] Add user guide for Python user-defined table functions - posted by "dstrodtman-db (via GitHub)" <gi...@apache.org> on 2023/08/08 15:44:53 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42364: [SPARK-44690][SPARK-44376][BUILD] Downgrade Scala to 2.13.8 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/08 15:56:51 UTC, 1 replies.
- [GitHub] [spark] cdkrot opened a new pull request, #42399: [SPARK-44721] Revamp Retry Logic - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/08/08 15:58:01 UTC, 0 replies.
- [GitHub] [spark] cdkrot commented on pull request #42399: [SPARK-44721] Revamp Retry Logic - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/08/08 15:58:13 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42355: [SPARK-44709][CONNECT] Run ExecuteGrpcResponseSender in reattachable execute in new thread to fix flow control - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 16:32:02 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42355: [SPARK-44709][CONNECT] Run ExecuteGrpcResponseSender in reattachable execute in new thread to fix flow control - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/08 16:32:36 UTC, 0 replies.
- [GitHub] [spark] mox692 opened a new pull request, #42400: [MINOR][DOC] Fixed deprecated procedure syntax - posted by "mox692 (via GitHub)" <gi...@apache.org> on 2023/08/08 16:45:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42401: [SPARK-44723][BUILD] Upgrade `gcs-connector` to 2.2.16 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/08 17:18:56 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/08 17:43:15 UTC, 1 replies.
- [GitHub] [spark] abhishekukmeesho commented on pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by "abhishekukmeesho (via GitHub)" <gi...@apache.org> on 2023/08/08 19:04:02 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42351: [SPARK-44503][SQL] Project any PARTITION BY expressions not already returned from Python UDTF TABLE arguments - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/08 19:06:19 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #42392: [SPARK-44717][PYTHON][PS] Respect TimestampNTZ in resampling - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/08/08 19:10:34 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42385: [SPARK-44705][PYTHON] Make PythonRunner single-threaded - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/08 20:24:34 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42402: [SPARK-44725][DOCS] Document `spark.network.timeoutInterval` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/08 20:51:04 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #42351: [SPARK-44503][SQL] Project any PARTITION BY expressions not already returned from Python UDTF TABLE arguments - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/08 21:28:33 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42403: [SPARK-44726][CORE] Improve `HeartbeatReceiver config validation error Improve HeartbeatReceiver config validation error message - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/08 21:36:38 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #42404: [SPARK-44727] Improve the error message for dynamic allocation conditions - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/08 22:00:21 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #42404: [SPARK-44727] Improve the error message for dynamic allocation conditions - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/08 22:02:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42404: [SPARK-44727][CORE] Improve the error message for dynamic allocation conditions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/08 22:25:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42402: [SPARK-44725][DOCS] Document `spark.network.timeoutInterval` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/08 22:26:32 UTC, 3 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #42404: [SPARK-44727][CORE] Improve the error message for dynamic allocation conditions - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/08 22:31:16 UTC, 2 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #42402: [SPARK-44725][DOCS] Document `spark.network.timeoutInterval` - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/08/08 22:35:52 UTC, 3 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #42194: [SPARK-41471][SQL] Reduce Spark shuffle when only one side of a join is KeyGroupedPartitioning - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/08 22:46:37 UTC, 9 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42402: [SPARK-44725][DOCS] Document `spark.network.timeoutInterval` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/08 23:03:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #41683: [SPARK-36680][SQL] Supports Dynamic Table Options for Spark SQL - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/08 23:10:39 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42351: [SPARK-44503][SQL] Project any PARTITION BY expressions not already returned from Python UDTF TABLE arguments - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/08 23:21:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42397: [SPARK-44722][CONNECT] ExecutePlanResponseReattachableIterator._call_iter: AttributeError: 'NoneType' object has no attribute 'message' - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 23:58:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42397: [SPARK-44722][CONNECT] ExecutePlanResponseReattachableIterator._call_iter: AttributeError: 'NoneType' object has no attribute 'message' - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 23:58:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42332: [SPARK-44665][PYTHON] Add support for pandas DataFrame assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/08 23:59:40 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40949: [DRAFT][SPARK-23607][CORE] Use HDFS extended attributes to store application summary information in SHS - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/09 00:20:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42399: [SPARK-44721][CONNECT] Revamp Retry Logic - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 00:22:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42401: [SPARK-44723][BUILD] Upgrade `gcs-connector` to 2.2.16 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 00:23:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42401: [SPARK-44723][BUILD] Upgrade `gcs-connector` to 2.2.16 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 00:23:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42385: [SPARK-44705][PYTHON] Make PythonRunner single-threaded - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 00:25:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41832: [SPARK-44265][SQL] Built-in XML data source support - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 00:36:54 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #41877: [SPARK-43660][CONNECT][PS] Enable `resample` with Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/09 00:44:14 UTC, 0 replies.
- [GitHub] [spark] sandip-db commented on a diff in pull request #41832: [SPARK-44732][SQL] Built-in XML data source support - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2023/08/09 00:51:20 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42392: [SPARK-44717][PYTHON][PS] Respect TimestampNTZ in resampling - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 00:56:20 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42401: [SPARK-44723][BUILD] Upgrade `gcs-connector` to 2.2.16 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/09 01:05:03 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42403: [SPARK-44726][CORE] Improve `HeartbeatReceiver` config validation error message - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/09 01:05:42 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42369: [SPARK-44695][PYTHON] Improve error message for `DataFrame.toDF` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 01:07:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42273: [SPARK-43568][SPARK-43633][PS] Support `Categorical` APIs for pandas 2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 01:08:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42369: [SPARK-44695][PYTHON] Improve error message for `DataFrame.toDF` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 01:08:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42273: [SPARK-43568][SPARK-43633][PS] Support `Categorical` APIs for pandas 2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 01:08:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41832: [SPARK-44732][SQL] Built-in XML data source support - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 01:50:53 UTC, 1 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42403: [SPARK-44726][CORE] Improve `HeartbeatReceiver` config validation error message - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/09 01:51:43 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42404: [SPARK-44727][CORE][DOCS] Improve docs and error message for dynamic allocation conditions - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/09 01:58:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42392: [SPARK-44717][PYTHON][PS] Respect TimestampNTZ in resampling - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 02:03:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42400: [MINOR][DOC] Fixed deprecated procedure syntax - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 02:04:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42400: [MINOR][DOC] Fixed deprecated procedure syntax - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 02:04:54 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42398: [SPARK-42746][SQL] Add the LISTAGG() aggregate function - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/09 02:09:55 UTC, 3 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42194: [SPARK-41471][SQL] Reduce Spark shuffle when only one side of a join is KeyGroupedPartitioning - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/09 02:19:35 UTC, 14 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #42404: [SPARK-44727][CORE][DOCS] Improve docs and error message for dynamic allocation conditions - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/09 02:38:11 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/09 02:56:06 UTC, 0 replies.
- [GitHub] [spark] liangyu-1 opened a new pull request, #42405: [SPARK-44581][YARN] Fix the bug that ShutdownHookManager get wrong ha… - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/08/09 03:01:15 UTC, 0 replies.
- [GitHub] [spark] liangyu-1 commented on pull request #42405: [SPARK-44581][YARN] Fix the bug that ShutdownHookManager get wrong ha… - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/08/09 03:09:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42402: [SPARK-44725][DOCS] Document `spark.network.timeoutInterval` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/09 03:32:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42406: [SPARK-43429][CONNECT] Deflake SparkSessionSuite - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/09 03:39:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42398: [SPARK-42746][SQL] Add the LISTAGG() aggregate function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 04:11:07 UTC, 1 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42407: [SPARK-44737][SQL][UI] Should not display json format errors on SQL page for non-SparkThrowables on SQL Tab - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/09 04:17:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42394: [SPARK-44718] Match ColumnVector memory-mode config default to OffHeapMemoryMode config value - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 04:17:17 UTC, 0 replies.
- [GitHub] [spark] otterc commented on a diff in pull request #41489: [SPARK-43987][Shuffle] Separate finalizeShuffleMerge Processing to Dedicated Thread Pools - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/08/09 04:28:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42403: [SPARK-44726][CORE] Improve `HeartbeatReceiver` config validation error message - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/09 04:30:53 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/09 05:00:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42389: [SPARK-43709][PS] Remove `closed` parameter from `ps.date_range` & enable test. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 05:09:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42389: [SPARK-43709][PS] Remove `closed` parameter from `ps.date_range` & enable test. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 05:10:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42408: [SPARK-43979][SQL][FOLLOWUP] `transformUpWithNewOutput` should only be used with new outputs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/09 05:46:34 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42405: [SPARK-44581][YARN] Fix the bug that ShutdownHookManager gets wrong UGI from SecurityManager of ApplicationMaster - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/09 05:47:11 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42405: [SPARK-44581][YARN] Fix the bug that ShutdownHookManager gets wrong UGI from SecurityManager of ApplicationMaster - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/09 05:50:44 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42403: [SPARK-44726][CORE] Improve `HeartbeatReceiver` config validation error message - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/09 05:58:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42405: [SPARK-44581][YARN] Fix the bug that ShutdownHookManager gets wrong UGI from SecurityManager of ApplicationMaster - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/09 06:02:20 UTC, 2 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42403: [SPARK-44726][CORE] Improve `HeartbeatReceiver` config validation error message - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/09 06:32:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42163: [SPARK-44551][SQL] Fix behavior of null IN (empty list) in expression execution - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/09 07:01:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42163: [SPARK-44551][SQL] Fix behavior of null IN (empty list) in expression execution - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/09 07:11:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41554: [SPARK-43781][SQL] Fix IllegalStateException when cogrouping two datasets derived from the same source - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 07:17:43 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #42409: [SPARK-44738] Add missing client metadata to calls - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/09 07:17:55 UTC, 0 replies.
- [GitHub] [spark] majdyz commented on a diff in pull request #42394: [SPARK-44718][SQL] Match ColumnVector memory-mode config default to OffHeapMemoryMode config value - posted by "majdyz (via GitHub)" <gi...@apache.org> on 2023/08/09 08:24:22 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40474: [SPARK-42849] [SQL] Session Variables - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/09 08:51:04 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40474: [SPARK-42849] [SQL] Session Variables - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/09 08:51:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42409: [SPARK-44738][PYTHON][CONNECT] Add missing client metadata to calls - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 09:08:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42409: [SPARK-44738][PYTHON][CONNECT] Add missing client metadata to calls - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 09:08:35 UTC, 0 replies.
- [GitHub] [spark] nija-at commented on a diff in pull request #42399: [SPARK-44721][CONNECT] Revamp Retry Logic - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/08/09 09:30:57 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #42410: [SPARK-43660][CONNECT][PS][FOLLOWUP] Remove JVM dependency for resample - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/09 10:16:49 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #42410: [SPARK-43660][CONNECT][PS][FOLLOWUP] Remove JVM dependency for resample - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/09 10:17:06 UTC, 1 replies.
- [GitHub] [spark] ashangit commented on pull request #41806: [SPARK-44242] Improve Max Heap not set check - posted by "ashangit (via GitHub)" <gi...@apache.org> on 2023/08/09 10:29:41 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42109: [SPARK-44404][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1009,1010,1013,1015,1016,1278] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/09 10:44:11 UTC, 2 replies.
- [GitHub] [spark] advancedxy commented on a diff in pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "advancedxy (via GitHub)" <gi...@apache.org> on 2023/08/09 10:49:46 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42408: [SPARK-43979][SQL][FOLLOWUP] `transformUpWithNewOutput` should only be used with new outputs - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/09 11:16:53 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42408: [SPARK-43979][SQL][FOLLOWUP] `transformUpWithNewOutput` should only be used with new outputs - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/09 11:17:20 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42410: [SPARK-43660][CONNECT][PS][FOLLOWUP] Remove JVM dependency for resample - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 11:32:35 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42410: [SPARK-43660][CONNECT][PS][FOLLOWUP] Remove JVM dependency for resample - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 11:32:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40370: [SPARK-42620][PS] Add `inclusive` parameter for (DataFrame|Series).between_time - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 11:35:02 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40370: [SPARK-42620][PS] Add `inclusive` parameter for (DataFrame|Series).between_time - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 11:35:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42406: [SPARK-43429][CONNECT] Deflake SparkSessionSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 11:42:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42406: [SPARK-43429][CONNECT] Deflake SparkSessionSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 11:42:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42363: [SPARK-44691][SQL][CONNECT] Move Subclasses of AnalysisException to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/09 11:57:34 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42411: [MINOR][INFRA] Skip `deepspeed` in requirements on MacOS - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/09 12:27:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42407: [SPARK-44737][SQL][UI] Should not display json format errors on SQL page for non-SparkThrowables on SQL Tab - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/09 12:30:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42412: [SPARK-44701][PYTHON][TESTS][FOLLOWUP] Keep `torch` in 3.5 daily GA - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/09 13:14:32 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42399: [SPARK-44721][CONNECT] Revamp retry logic and make retries run for 10 minutes - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/09 13:20:19 UTC, 19 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42413: [SPARK-44554][INFRA][FOLLOWUP] Install `IPython` for branch-3.3 Python linter check. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/09 13:21:53 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42413: [SPARK-44554][INFRA][FOLLOWUP] Install `IPython` for the Python linter check in the daily test for branch-3.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/09 13:26:36 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42413: [SPARK-44554][INFRA][FOLLOWUP] Install `IPython` for the Python linter check in the daily test for branch-3.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/09 13:28:55 UTC, 0 replies.
- [GitHub] [spark] cdkrot commented on a diff in pull request #42399: [SPARK-44721][CONNECT] Revamp retry logic and make retries run for 10 minutes - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/08/09 13:37:05 UTC, 14 replies.
- [GitHub] [spark] jinhai-cloud commented on a diff in pull request #29053: [SPARK-32241][SQL] Remove empty children of union - posted by "jinhai-cloud (via GitHub)" <gi...@apache.org> on 2023/08/09 13:40:40 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42399: [SPARK-44721][CONNECT] Revamp retry logic and make retries run for 10 minutes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 13:41:25 UTC, 4 replies.
- [GitHub] [spark] cdkrot commented on pull request #42399: [SPARK-44721][CONNECT] Revamp retry logic and make retries run for 10 minutes - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/08/09 13:41:41 UTC, 5 replies.
- [GitHub] [spark] hvanhovell closed pull request #42396: [SPARK-44720][CONNECT] Make Dataset use Encoder instead of AgnosticEncoder - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/09 13:58:29 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40352: [SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/09 14:01:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40352: [SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/09 14:04:23 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42414: [SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/09 14:15:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42414: [SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/09 14:16:56 UTC, 5 replies.
- [GitHub] [spark] LuciferYang closed pull request #40352: [SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/09 14:17:07 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42376: [SPARK-44700][SQL] Rule OptimizeCsvJsonExprs should not be applied to expression like from_json(regexp_replace) - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/09 14:20:46 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #42415: [SPARK-44740][CONNECT]Support specifying `session_id` in SPARK_REMOTE connection string. - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/09 14:29:33 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42415: [SPARK-44740][CONNECT]Support specifying `session_id` in SPARK_REMOTE connection string. - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/09 14:37:12 UTC, 12 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #42415: [SPARK-44740][CONNECT]Support specifying `session_id` in SPARK_REMOTE connection string. - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/08/09 14:41:18 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42414: [SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/09 14:46:52 UTC, 1 replies.
- [GitHub] [spark] monkeyboy123 closed pull request #42376: [SPARK-44700][SQL] Rule OptimizeCsvJsonExprs should not be applied to expression like from_json(regexp_replace) - posted by "monkeyboy123 (via GitHub)" <gi...@apache.org> on 2023/08/09 15:11:20 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42382: [ML] Remove usage of RDD APIs for load/save in spark-ml - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/09 15:20:13 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #42382: [ML] Remove usage of RDD APIs for load/save in spark-ml - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/09 15:26:25 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #42415: [SPARK-44740][CONNECT]Support specifying `session_id` in SPARK_REMOTE connection string. - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/09 15:33:51 UTC, 1 replies.
- [GitHub] [spark] ramesh-muthusamy opened a new pull request, #42416: [SPARK-44741][Metrics]Spark StatsD metrics reporter to support metrics filter option - posted by "ramesh-muthusamy (via GitHub)" <gi...@apache.org> on 2023/08/09 17:19:04 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42414: [SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/09 17:20:32 UTC, 5 replies.
- [GitHub] [spark] ueshin commented on pull request #42351: [SPARK-44503][SQL] Project any PARTITION BY expressions not already returned from Python UDTF TABLE arguments - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/09 17:25:21 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42351: [SPARK-44503][SQL] Project any PARTITION BY expressions not already returned from Python UDTF TABLE arguments - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/09 17:27:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42414: [SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/09 17:41:23 UTC, 13 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42398: [SPARK-42746][SQL] Add the LISTAGG() aggregate function - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/09 17:42:33 UTC, 0 replies.
- [GitHub] [spark] dbtsai commented on pull request #42416: [SPARK-44741][Metrics]Spark StatsD metrics reporter to support metrics filter option - posted by "dbtsai (via GitHub)" <gi...@apache.org> on 2023/08/09 17:47:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42416: [SPARK-44741][Metrics]Spark StatsD metrics reporter to support metrics filter option - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/09 18:00:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42416: [SPARK-44741][CORE] Spark StatsD metrics reporter to support metrics filter option - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/09 18:41:12 UTC, 3 replies.
- [GitHub] [spark] NarekDW opened a new pull request, #40040: [SPARK-42399] [SQL] Support big numbers for conv function (get rid of overflow) - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/08/09 19:08:49 UTC, 1 replies.
- [GitHub] [spark] holdenk commented on pull request #40040: [SPARK-42399] [SQL] Support big numbers for conv function (get rid of overflow) - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/08/09 19:09:50 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42415: [SPARK-44740][CONNECT]Support specifying `session_id` in SPARK_REMOTE connection string. - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/09 19:53:57 UTC, 0 replies.
- [GitHub] [spark] ramesh-muthusamy commented on a diff in pull request #42416: [SPARK-44741][CORE] Spark StatsD metrics reporter to support metrics filter option - posted by "ramesh-muthusamy (via GitHub)" <gi...@apache.org> on 2023/08/09 19:59:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42417: [SPARK-44745][DOCS][K8S] Document shuffle data recovery from the remounted K8s PVCs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/09 20:06:24 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42272: [SPARK-44508][PYTHON][DOCS] Add user guide for Python user-defined table functions - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/09 20:40:15 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42418: [SPARK-44736][CONNECT] Add Dataset.explode to Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/09 20:54:57 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42418: [SPARK-44736][CONNECT] Add Dataset.explode to Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/09 20:55:38 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42419: [SPARK-44747][CONNECT] Add missing SparkSession.Builder methods. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/09 21:38:14 UTC, 0 replies.
- [GitHub] [spark] ramesh-muthusamy commented on a diff in pull request #42416: [SPARK-44741][CORE] Support regex-based MetricFilter in `StatsdSink` - posted by "ramesh-muthusamy (via GitHub)" <gi...@apache.org> on 2023/08/09 21:51:21 UTC, 1 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42283: [SPARK-44433][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with `removeListener` and improvements - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/09 21:51:52 UTC, 1 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42417: [SPARK-44745][DOCS][K8S] Document shuffle data recovery from the remounted K8s PVCs - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/09 21:58:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42417: [SPARK-44745][DOCS][K8S] Document shuffle data recovery from the remounted K8s PVCs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/09 22:24:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42417: [SPARK-44745][DOCS][K8S] Document shuffle data recovery from the remounted K8s PVCs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/09 22:25:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42417: [SPARK-44745][DOCS][K8S] Document shuffle data recovery from the remounted K8s PVCs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/09 22:25:42 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #42420: [SPARK-44748][SQL] Query execution for the PARTITION BY clause in UDTF TABLE arguments - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/09 22:26:01 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42421: [SPARK-44461] Verify Python Version for spark connect streaming workers - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/09 23:00:53 UTC, 0 replies.
- [GitHub] [spark] szehon-ho commented on a diff in pull request #41683: [SPARK-36680][SQL] Supports Dynamic Table Options for Spark SQL - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/08/09 23:26:03 UTC, 0 replies.
- [GitHub] [spark] szehon-ho commented on pull request #41683: [SPARK-36680][SQL] Supports Dynamic Table Options for Spark SQL - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/08/09 23:26:06 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42415: [SPARK-44740][CONNECT]Support specifying `session_id` in SPARK_REMOTE connection string. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 23:41:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42415: [SPARK-44740][CONNECT]Support specifying `session_id` in SPARK_REMOTE connection string. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 23:41:27 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42419: [SPARK-44747][CONNECT] Add missing SparkSession.Builder methods. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/09 23:51:02 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42422: [SPARK-44749][SQL][PYTHON] Support named arguments in Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/09 23:52:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42418: [SPARK-44736][CONNECT] Add Dataset.explode to Spark Connect Scala Client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/09 23:53:10 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42416: [SPARK-44741][CORE] Support regex-based MetricFilter in `StatsdSink` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/10 00:14:36 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41002: Update ExecutorAllocationManager.scala - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/10 00:20:19 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40974: [CORE] Clear the bitmap for tracking free pages when invoking cleanUp… - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/10 00:20:21 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40040: [SPARK-42399] [SQL] Support big numbers for conv function (get rid of overflow) - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/10 00:20:24 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42422: [SPARK-44749][SQL][PYTHON] Support named arguments in Python UDTF - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/10 00:38:58 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42419: [SPARK-44747][CONNECT] Add missing SparkSession.Builder methods. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 00:49:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42419: [SPARK-44747][CONNECT] Add missing SparkSession.Builder methods. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 00:49:56 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42422: [SPARK-44749][SQL][PYTHON] Support named arguments in Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/10 00:53:44 UTC, 4 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42423: [WIP][SPARK-44625] SparkConnectExecutionManager to track all executions - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/10 01:02:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41918: [DO_NOT_MERGE][INFRA] test dockerfile - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/10 01:58:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40665: [SPARK-42621][PS] Add inclusive parameter for pd.date_range - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 02:05:22 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42407: [SPARK-44737][SQL][UI] Should not display json format errors on SQL page for non-SparkThrowables on SQL Tab - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/10 02:21:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41832: [SPARK-44732][SQL] Built-in XML data source support - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 02:23:15 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41832: [SPARK-44732][SQL] Built-in XML data source support - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 02:23:52 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42399: [SPARK-44721][CONNECT] Revamp retry logic and make retries run for 10 minutes - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/10 02:33:26 UTC, 7 replies.
- [GitHub] [spark] itholic commented on pull request #40665: [SPARK-42621][PS] Add inclusive parameter for pd.date_range - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/10 02:35:45 UTC, 0 replies.
- [GitHub] [spark] zeruibao opened a new pull request, #42424: [SPARK-43380][SQL] Fix Avro data type conversion issues with causing performance regression - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/10 02:58:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42421: [SPARK-44461][SS][PYTHON][CONNECT] Verify Python Version for spark connect streaming workers - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 03:08:02 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42421: [SPARK-44461][SS][PYTHON][CONNECT] Verify Python Version for spark connect streaming workers - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 03:08:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42412: [SPARK-44701][PYTHON][TESTS][FOLLOWUP] Keep `torch` in 3.5 daily GA - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 03:11:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42412: [SPARK-44701][PYTHON][TESTS][FOLLOWUP] Keep `torch` in 3.5 daily GA - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 03:11:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42411: [MINOR][BUILD] Skip `deepspeed` in requirements on MacOS - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 03:12:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42411: [MINOR][BUILD] Skip `deepspeed` in requirements on MacOS - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 03:12:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42407: [SPARK-44737][SQL][UI] Should not display json format errors on SQL page for non-SparkThrowables on SQL Tab - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 03:13:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42419: [SPARK-44747][CONNECT] Add missing SparkSession.Builder methods. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/10 03:30:57 UTC, 0 replies.
- [GitHub] [spark] lvyanquan closed pull request #42380: [SPARK-44696][SQL] Support different timestamp precise for `from_json` function - posted by "lvyanquan (via GitHub)" <gi...@apache.org> on 2023/08/10 04:01:28 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42424: [SPARK-43380][SQL] Fix Avro data type conversion issues without causing performance regression - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/10 04:41:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42413: [SPARK-44554][INFRA][FOLLOWUP] Install `IPython` for the Python linter check in the daily test for branch-3.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/10 04:48:31 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42425: [SPARK-44729][PYTHON][DOCS] Add canonical links to the PySpark docs page - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/10 06:23:26 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42425: [SPARK-44729][PYTHON][DOCS] Add canonical links to the PySpark docs page - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/10 06:26:01 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41683: [SPARK-36680][SQL] Supports Dynamic Table Options for Spark SQL - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/10 06:32:02 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42363: [SPARK-44691][SQL][CONNECT] Move Subclasses of AnalysisException to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/10 06:55:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42363: [SPARK-44691][SQL][CONNECT] Move Subclasses of AnalysisException to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/10 06:57:02 UTC, 0 replies.
- [GitHub] [spark] hdaikoku opened a new pull request, #42426: [WIP][SPARK-44756] Executor hangs when RetryingBlockTransferor fails to initiate retry - posted by "hdaikoku (via GitHub)" <gi...@apache.org> on 2023/08/10 08:04:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42271: [SPARK-43245][SPARK-43705][PS] Type match for `DatetimeIndex`/`TimedeltaIndex` with pandas 2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 08:41:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42271: [SPARK-43245][SPARK-43705][PS] Type match for `DatetimeIndex`/`TimedeltaIndex` with pandas 2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/10 08:42:46 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39977: [SPARK-42323][SQL] Assign name to `_LEGACY_ERROR_TEMP_2332` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/10 09:13:59 UTC, 0 replies.
- [GitHub] [spark] zml1206 opened a new pull request, #42427: [SPARK-44758][K8S] Support configurable memory limits - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/08/10 09:30:10 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42413: [SPARK-44554][INFRA][FOLLOWUP] Install `IPython` for the Python linter check in the daily test for branch-3.3 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/10 09:58:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42425: [SPARK-44729][PYTHON][DOCS] Add canonical links to the PySpark docs page - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/10 10:04:49 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42410: [SPARK-43660][CONNECT][PS][FOLLOWUP] Remove JVM dependency for resample - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/10 10:20:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42410: [SPARK-43660][CONNECT][PS][FOLLOWUP] Remove JVM dependency for resample - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/10 10:21:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42382: [ML] Remove usage of RDD APIs for load/save in spark-ml - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/10 10:34:52 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/10 10:40:06 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42429: [SPARK-44760][INFRA] Fix list index out of range for JIRA resolution in merge_spark_pr - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/10 10:40:59 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42429: [SPARK-44760][INFRA] Fix list index out of range for JIRA resolution in merge_spark_pr - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/10 10:42:15 UTC, 3 replies.
- [GitHub] [spark] panbingkun commented on pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/10 10:42:29 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42429: [SPARK-44760][INFRA] Fix list index out of range for JIRA resolution in merge_spark_pr - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/10 11:20:28 UTC, 0 replies.
- [GitHub] [spark] hdaikoku commented on a diff in pull request #42426: [WIP][SPARK-44756] Executor hangs when RetryingBlockTransferor fails to initiate retry - posted by "hdaikoku (via GitHub)" <gi...@apache.org> on 2023/08/10 11:34:58 UTC, 0 replies.
- [GitHub] [spark] hdaikoku commented on a diff in pull request #42426: [SPARK-44756] Executor hangs when RetryingBlockTransferor fails to initiate retry - posted by "hdaikoku (via GitHub)" <gi...@apache.org> on 2023/08/10 12:18:56 UTC, 0 replies.
- [GitHub] [spark] shuwang21 commented on pull request #42357: [SPARK-44306][YARN] Group FileStatus with few RPC calls within Yarn Client - posted by "shuwang21 (via GitHub)" <gi...@apache.org> on 2023/08/10 12:41:06 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42430: [SPARK-44761][CONNECT] Support DataStreamWriter.foreachBatch(VoidFunction2) - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/10 12:58:31 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42430: [SPARK-44761][CONNECT] Support DataStreamWriter.foreachBatch(VoidFunction2) - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/10 12:58:42 UTC, 1 replies.
- [GitHub] [spark-connect-go] grundprinzip commented on a diff in pull request #14: [SPARK-44681] Fix issues when writing Go application code using Spark Connect Go client library - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/10 13:20:24 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #42376: [SPARK-44700][SQL] Rule OptimizeCsvJsonExprs should not be applied to expression like from_json(regexp_replace) - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/10 13:56:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42416: [SPARK-44741][CORE] Support regex-based MetricFilter in `StatsdSink` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/10 14:07:37 UTC, 0 replies.
- [GitHub] [spark] xiaoa6435 opened a new pull request, #42431: [WIP][SPARK-42905][MLLIB] fix spearman correlation incorrect and inconsistent results when data has huge amount of ties - posted by "xiaoa6435 (via GitHub)" <gi...@apache.org> on 2023/08/10 14:38:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42429: [SPARK-44760][INFRA] Fix list index out of range for JIRA resolution in merge_spark_pr - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/10 15:08:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42429: [SPARK-44760][INFRA] Fix list index out of range for JIRA resolution in merge_spark_pr - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/10 15:09:13 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41554: [SPARK-43781][SQL] Fix IllegalStateException when cogrouping two datasets derived from the same source - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/10 15:17:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41554: [SPARK-43781][SQL] Fix IllegalStateException when cogrouping two datasets derived from the same source - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/10 15:19:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #33675: [SPARK-27997][K8S] Add support for kubernetes OAuth Token refresh - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/10 15:30:44 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42432: [SPARK-27997][K8S] Support OAuth Token Provider - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/10 15:30:54 UTC, 0 replies.
- [GitHub] [spark] andygrove opened a new pull request, #42433: [WIP][SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value in codegen when first value is null - posted by "andygrove (via GitHub)" <gi...@apache.org> on 2023/08/10 15:40:20 UTC, 0 replies.
- [GitHub] [spark] andygrove commented on a diff in pull request #41432: [SPARK-43063][SQL][FOLLOWUP] Add a space between `->` and value - posted by "andygrove (via GitHub)" <gi...@apache.org> on 2023/08/10 15:41:24 UTC, 1 replies.
- [GitHub] [spark] xkrogen commented on a diff in pull request #42357: [SPARK-44306][YARN] Group FileStatus with few RPC calls within Yarn Client - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/08/10 16:21:51 UTC, 1 replies.
- [GitHub] [spark] andygrove closed pull request #42433: [WIP][SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value in codegen when first value is null - posted by "andygrove (via GitHub)" <gi...@apache.org> on 2023/08/10 16:21:53 UTC, 0 replies.
- [GitHub] [spark] andygrove commented on pull request #42433: [WIP][SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value in codegen when first value is null - posted by "andygrove (via GitHub)" <gi...@apache.org> on 2023/08/10 16:21:53 UTC, 1 replies.
- [GitHub] [spark-connect-go] hiboyang commented on a diff in pull request #14: [SPARK-44681] Fix issues when writing Go application code using Spark Connect Go client library - posted by "hiboyang (via GitHub)" <gi...@apache.org> on 2023/08/10 16:32:36 UTC, 1 replies.
- [GitHub] [spark] andygrove opened a new pull request, #42434: [WIP][SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value when first value is null - posted by "andygrove (via GitHub)" <gi...@apache.org> on 2023/08/10 16:50:01 UTC, 1 replies.
- [GitHub] [spark] andygrove commented on a diff in pull request #42434: [WIP][SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value when first value is null - posted by "andygrove (via GitHub)" <gi...@apache.org> on 2023/08/10 16:53:37 UTC, 0 replies.
- [GitHub] [spark] andygrove closed pull request #42434: [WIP][SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value when first value is null - posted by "andygrove (via GitHub)" <gi...@apache.org> on 2023/08/10 17:03:31 UTC, 0 replies.
- [GitHub] [spark] advancedxy commented on a diff in pull request #42255: [SPARK-40178][SQL][COONECT] support coalesce hints with ease for PySpark and R - posted by "advancedxy (via GitHub)" <gi...@apache.org> on 2023/08/10 17:04:48 UTC, 3 replies.
- [GitHub] [spark] advancedxy commented on pull request #42255: [SPARK-40178][SQL][COONECT] support coalesce hints with ease for PySpark and R - posted by "advancedxy (via GitHub)" <gi...@apache.org> on 2023/08/10 17:05:47 UTC, 2 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #42422: [SPARK-44749][SQL][PYTHON] Support named arguments in Python UDTF - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/10 17:29:30 UTC, 3 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42435: [WIP][SQL] Add the alias `TIMEDIFF` for `TIMESTAMPDIFF` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/10 17:57:10 UTC, 0 replies.
- [GitHub] [spark] MrPowers commented on pull request #42425: [SPARK-44729][PYTHON][DOCS] Add canonical links to the PySpark docs page - posted by "MrPowers (via GitHub)" <gi...@apache.org> on 2023/08/10 18:16:45 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42436: [SPARK-44763][SQL] Fix a bug of promoting string as double in binary arithmetic with interval - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/10 18:55:33 UTC, 1 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #42436: [SPARK-44763][SQL] Fix a bug of promoting string as double in binary arithmetic with interval - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/10 19:00:26 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42437: [SPARK-43032][FOLLOWUP][SS][CONNECT] StreamingQueryManager bug fix - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/10 19:53:38 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42437: [SPARK-43032][FOLLOWUP][SS][CONNECT] StreamingQueryManager bug fix - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/10 20:38:10 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42438: [SPARK-44765][CONNECT] Simplify retries of ReleaseExecute - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/10 20:38:56 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #42436: [SPARK-44763][SQL] Fix a bug of promoting string as double in binary arithmetic with interval - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/08/10 20:39:45 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #42436: [SPARK-44763][SQL] Fix a bug of promoting string as double in binary arithmetic with interval - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/10 20:40:56 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42437: [SPARK-43032][FOLLOWUP][SS][CONNECT] StreamingQueryManager bug fix - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/10 20:44:16 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42438: [SPARK-44765][CONNECT] Simplify retries of ReleaseExecute - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/10 20:47:00 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on a diff in pull request #42434: [SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value when first value is null - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2023/08/10 20:56:51 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42423: [SPARK-44625][CONNECT] SparkConnectExecutionManager to track all executions - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/10 21:26:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42427: [SPARK-44758][K8S] Support memory limits configurable - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/10 21:39:08 UTC, 0 replies.
- [GitHub] [spark] andygrove commented on a diff in pull request #42434: [SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value when first value is null - posted by "andygrove (via GitHub)" <gi...@apache.org> on 2023/08/10 21:50:00 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42439: [SPARK-44766][PYTHON] Cache the pandas converter for reuse for Python UDTFs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/10 21:54:13 UTC, 0 replies.
- [GitHub] [spark] rshkv opened a new pull request, #42440: [SPARK-44767] Plugin API for PySpark and SparkR workers - posted by "rshkv (via GitHub)" <gi...@apache.org> on 2023/08/10 22:14:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42421: [SPARK-44461][SS][PYTHON][CONNECT] Verify Python Version for spark connect streaming workers - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/10 22:17:44 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41301: [SPARK-43780][SQL] Support correlated references in join predicates - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/10 22:30:13 UTC, 0 replies.
- [GitHub] [spark] rshkv commented on a diff in pull request #42440: [SPARK-44767] Plugin API for PySpark and SparkR workers - posted by "rshkv (via GitHub)" <gi...@apache.org> on 2023/08/10 22:41:39 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on pull request #42420: [SPARK-44748][SQL] Query execution for the PARTITION BY clause in UDTF TABLE arguments - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/10 22:44:42 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42441: [CONNECT][POC] Have real server and real simple client in tests - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/10 22:52:06 UTC, 0 replies.
- [GitHub] [spark] jasonli-db opened a new pull request, #42442: [SPARK-44770][WEBUI] Add a displayOrder variable to WebUITab to specify the order in which tabs appear - posted by "jasonli-db (via GitHub)" <gi...@apache.org> on 2023/08/10 22:58:31 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42441: [CONNECT][POC] Have real server and real simple client in tests - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/10 23:02:49 UTC, 3 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42441: [CONNECT][POC] Have real server and real simple client in tests - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/10 23:05:35 UTC, 1 replies.
- [GitHub] [spark] agubichev commented on a diff in pull request #41301: [SPARK-43780][SQL] Support correlated references in join predicates for scalar and lateral subqueries - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2023/08/10 23:09:40 UTC, 3 replies.
- [GitHub] [spark] ueshin commented on pull request #42421: [SPARK-44461][SS][PYTHON][CONNECT] Verify Python Version for spark connect streaming workers - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/10 23:16:59 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42421: [SPARK-44461][SS][PYTHON][CONNECT] Verify Python Version for spark connect streaming workers - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/11 00:00:50 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42441: [CONNECT][POC] Have real server and real simple client in tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 00:03:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42434: [SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value when first value is null - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/11 00:13:29 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41002: Update ExecutorAllocationManager.scala - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/11 00:16:11 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40974: [CORE] Clear the bitmap for tracking free pages when invoking cleanUp… - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/11 00:16:12 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/11 00:34:55 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/11 00:36:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 00:57:50 UTC, 3 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42443: [SPARK-44461][3.5][SS][PYTHON][CONNECT] Verify Python Version for spark connect streaming workers - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/11 01:09:06 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42443: [SPARK-44461][3.5][SS][PYTHON][CONNECT] Verify Python Version for spark connect streaming workers - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/11 01:10:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42439: [SPARK-44766][PYTHON] Cache the pandas converter for reuse for Python UDTFs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 01:22:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42439: [SPARK-44766][PYTHON] Cache the pandas converter for reuse for Python UDTFs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 01:23:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42437: [SPARK-43032][FOLLOWUP][SS][CONNECT] StreamingQueryManager bug fix - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 01:23:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42437: [SPARK-43032][FOLLOWUP][SS][CONNECT] StreamingQueryManager bug fix - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 01:24:06 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/11 01:26:19 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42430: [SPARK-44761][CONNECT] Support DataStreamWriter.foreachBatch(VoidFunction2) - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 01:26:59 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42438: [SPARK-44765][CONNECT] Simplify retries of ReleaseExecute - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 01:28:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42438: [SPARK-44765][CONNECT] Simplify retries of ReleaseExecute - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 01:28:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42385: [SPARK-44705][PYTHON] Make PythonRunner single-threaded - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 01:34:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42385: [SPARK-44705][PYTHON] Make PythonRunner single-threaded - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 01:34:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42431: [WIP][SPARK-42905][MLLIB] fix spearman correlation incorrect and inconsistent results when data has huge amount of ties - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/11 01:42:25 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #42431: [WIP][SPARK-42905][MLLIB] fix spearman correlation incorrect and inconsistent results when data has huge amount of ties - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/08/11 02:03:31 UTC, 3 replies.
- [GitHub] [spark] zml1206 commented on pull request #42427: [SPARK-44758][K8S] Support memory limits configurable - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/08/11 02:09:57 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42390: [SPARK-43872][PS] Support `(DataFrame|Series).plot` with pandas 2.0.0 and above. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 02:10:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42390: [SPARK-43872][PS] Support `(DataFrame|Series).plot` with pandas 2.0.0 and above. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 02:10:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42429: [SPARK-44760][INFRA] Fix list index out of range for JIRA resolution in merge_spark_pr - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 02:14:06 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #42436: [SPARK-44763][SQL] Fix a bug of promoting string as double in binary arithmetic with interval - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/11 02:18:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42429: [SPARK-44760][INFRA] Fix list index out of range for JIRA resolution in merge_spark_pr - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/11 02:18:52 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #42382: [ML] Remove usage of RDD APIs for load/save in spark-ml - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/08/11 02:25:32 UTC, 2 replies.
- [GitHub] [spark] xiaoa6435 commented on a diff in pull request #42431: [WIP][SPARK-42905][MLLIB] fix spearman correlation incorrect and inconsistent results when data has huge amount of ties - posted by "xiaoa6435 (via GitHub)" <gi...@apache.org> on 2023/08/11 02:29:42 UTC, 6 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #42444: [SPARK-44771][INFRA] Remove 'sudo' in 'pip install' suggestions in the dev scripts - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/11 02:29:59 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42444: [SPARK-44771][INFRA] Remove 'sudo' in 'pip install' suggestions in the dev scripts - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/11 02:30:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42445: [SPARK-44731][PYTHON][CONNECT] Make TimestampNTZ works with literals in Python Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 02:33:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42445: [SPARK-44731][PYTHON][CONNECT] Make TimestampNTZ works with literals in Python Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 02:33:34 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42429: [SPARK-44760][INFRA] Fix list index out of range for JIRA resolution in merge_spark_pr - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/11 02:46:38 UTC, 0 replies.
- [GitHub] [spark] shuwang21 commented on a diff in pull request #42357: [SPARK-44306][YARN] Group FileStatus with few RPC calls within Yarn Client - posted by "shuwang21 (via GitHub)" <gi...@apache.org> on 2023/08/11 03:01:10 UTC, 12 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42444: [SPARK-44771][INFRA] Remove 'sudo' in 'pip install' suggestions of dev scripts - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/11 03:02:43 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #42444: [SPARK-44771][INFRA] Remove 'sudo' in 'pip install' suggestions of dev scripts - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/11 03:03:08 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42446: [SPARK-44719][SQL] Fix NoClassDefFoundError when using Hive UDF - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/11 03:05:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42432: [SPARK-27997][K8S] Support user-provided OAuth Token Providers - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/11 03:10:30 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42446: [SPARK-44719][SQL] Fix NoClassDefFoundError when using Hive UDF - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/11 03:25:46 UTC, 1 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42447: [SPARK-44719][SQL][3.5] Fix NoClassDefFoundError when using Hive UDF - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/11 03:37:55 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42447: [SPARK-44719][SQL][3.5] Fix NoClassDefFoundError when using Hive UDF - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/11 03:39:07 UTC, 2 replies.
- [GitHub] [spark] nebi-frame commented on pull request #21593: [SPARK-24578][Core] Cap sub-region's size of returned nio buffer - posted by "nebi-frame (via GitHub)" <gi...@apache.org> on 2023/08/11 03:42:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42447: [SPARK-44719][SQL][3.5] Fix NoClassDefFoundError when using Hive UDF - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/11 04:54:31 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42408: [SPARK-43979][SQL][FOLLOWUP] `transformUpWithNewOutput` should only be used with new outputs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/11 05:48:45 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42441: [CONNECT][POC] Have real server and real simple client in tests - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/11 06:04:56 UTC, 3 replies.
- [GitHub] [spark] zeruibao opened a new pull request, #42448: Spark 43380 fix arvo data type without causing regression - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/11 06:46:03 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42429: [SPARK-44760][INFRA] Fix list index out of range for JIRA resolution in merge_spark_pr - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 06:46:29 UTC, 0 replies.
- [GitHub] [spark] zeruibao closed pull request #42448: Spark 43380 fix arvo data type without causing regression - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/11 06:48:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42408: [SPARK-43979][SQL][FOLLOWUP] `transformUpWithNewOutput` should only be used with new outputs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/11 06:56:27 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42407: [SPARK-44737][SQL][UI] Should not display json format errors on SQL page for non-SparkThrowables on SQL Tab - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 06:59:01 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42407: [SPARK-44737][SQL][UI] Should not display json format errors on SQL page for non-SparkThrowables on SQL Tab - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 07:04:06 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42404: [SPARK-44727][CORE][DOCS] Improve docs and error message for dynamic allocation conditions - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 07:09:02 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42404: [SPARK-44727][CORE][DOCS] Improve docs and error message for dynamic allocation conditions - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 07:10:29 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42442: [SPARK-44770][WEBUI] Add a displayOrder variable to WebUITab to specify the order in which tabs appear - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 07:12:48 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42442: [SPARK-44770][WEBUI] Add a displayOrder variable to WebUITab to specify the order in which tabs appear - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 07:13:12 UTC, 0 replies.
- [GitHub] [spark] qusaibb commented on pull request #41567: [SPARK-43663][CONNECT][PS] Enable `SeriesParityTests.test_compare` - posted by "qusaibb (via GitHub)" <gi...@apache.org> on 2023/08/11 07:54:28 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42449: [SPARK-43979][SQL][FOLLOWUP] transformUpWithNewOutput should only be used with new outputs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/11 08:05:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42449: [SPARK-43979][SQL][FOLLOWUP] transformUpWithNewOutput should only be used with new outputs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/11 08:06:20 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42449: [SPARK-43979][SQL][FOLLOWUP] transformUpWithNewOutput should only be used with new outputs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/11 08:06:40 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42424: [SPARK-43380][SQL] Fix Avro data type conversion issues without causing performance regression - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/11 08:17:44 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #42450: [SPARK-44773][SQL] Code-gen CodegenFallback expression in WholeStageCodegen if possible - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/11 09:11:44 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #42376: [SPARK-44700][SQL] Rule OptimizeCsvJsonExprs should not be applied to expression like from_json(regexp_replace) - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/11 09:17:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42445: [SPARK-44731][PYTHON][CONNECT] Make TimestampNTZ works with literals in Python Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/11 09:35:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42445: [SPARK-44731][PYTHON][CONNECT] Make TimestampNTZ works with literals in Python Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/11 09:36:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42388: [SPARK-43618][SPARK-43658][CONNECT][PS][TESTS] Enabling more tests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/11 09:43:20 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42312: [SPARK-43476][SPARK-43477][SPARK-43478][PS] Support `StringMethods` for pandas 2.0.0 and above - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/11 09:46:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42312: [SPARK-43476][SPARK-43477][SPARK-43478][PS] Support `StringMethods` for pandas 2.0.0 and above - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/11 09:46:53 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42429: [SPARK-44760][INFRA] Fix list index out of range for JIRA resolution in merge_spark_pr - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 10:19:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42451: [SPARK-44775][PYTHON][DOCS] Add missing version information in DataFrame APIs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/11 10:20:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42451: [SPARK-44775][PYTHON][DOCS] Add missing version information in DataFrame APIs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/11 10:22:58 UTC, 2 replies.
- [GitHub] [spark] NarekDW commented on pull request #40040: [SPARK-42399] [SQL] Support big numbers for conv function (get rid of overflow) - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/08/11 10:33:21 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #42452: [SPARK-44740][CONNECT][FOLLOW] Fix metadata values in Python client - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/11 10:57:57 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42430: [SPARK-44761][CONNECT] Support DataStreamWriter.foreachBatch(VoidFunction2) - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/11 12:37:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42449: [SPARK-43979][SQL][FOLLOWUP] transformUpWithNewOutput should only be used with new outputs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/11 13:02:28 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42453: [SPARK-43979][SQL][FOLLOWUP] Fix the detection of alias-only project - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/11 13:06:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42453: [SPARK-43979][SQL][FOLLOWUP] Fix the detection of alias-only project - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/11 13:06:31 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42453: [SPARK-43979][SQL][FOLLOWUP] Fix the detection of alias-only project - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 14:33:28 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42451: [SPARK-44775][PYTHON][DOCS] Add missing version information in DataFrame APIs - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 14:36:28 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42449: [SPARK-43979][SQL][FOLLOWUP] transformUpWithNewOutput should only be used with new outputs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/11 15:01:24 UTC, 0 replies.
- [GitHub] [spark] gjxdxh opened a new pull request, #42454: [SPARK-44776] Add ProducedRowCount to SparkListenerConnectOperationFinished - posted by "gjxdxh (via GitHub)" <gi...@apache.org> on 2023/08/11 15:03:15 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #42455: [DRAFT] Fix Spark Connect Behavior for Default Session - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/11 15:49:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42423: [SPARK-44625][CONNECT] SparkConnectExecutionManager to track all executions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/11 16:32:24 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42423: [SPARK-44625][CONNECT] SparkConnectExecutionManager to track all executions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/11 16:32:59 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42453: [SPARK-43979][SQL][FOLLOWUP] Fix the detection of alias-only project - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 16:38:46 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42453: [SPARK-43979][SQL][FOLLOWUP] Fix the detection of alias-only project - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/11 16:39:44 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42456: [SPARK-44422][FOLLOWUP][CONNECT] Fix typo in ProtoUtils - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/11 17:37:45 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42456: [SPARK-44422][FOLLOWUP][CONNECT] Fix typo in ProtoUtils - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/11 17:38:10 UTC, 0 replies.
- [GitHub] [spark] jdesjean commented on a diff in pull request #42454: [SPARK-44776] Add ProducedRowCount to SparkListenerConnectOperationFinished - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/08/11 17:55:31 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42457: [SPARK-44625][CONNECT][FOLLOWUP] Make initialization of SparkConnectExecutionManager lazy - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/11 18:50:23 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42457: [SPARK-44625][CONNECT][FOLLOWUP] Make initialization of SparkConnectExecutionManager lazy - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/11 18:50:37 UTC, 0 replies.
- [GitHub] [spark] zeruibao opened a new pull request, #42458: [SPARK-43380][SQL] Revert `Fix Avro data type conversion issues` - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/11 19:00:59 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42458: [SPARK-43380][SQL] Revert `Fix Avro data type conversion issues` - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/11 19:35:02 UTC, 1 replies.
- [GitHub] [spark] johnayoub commented on pull request #42272: [SPARK-44508][PYTHON][DOCS] Add user guide for Python user-defined table functions - posted by "johnayoub (via GitHub)" <gi...@apache.org> on 2023/08/11 19:59:52 UTC, 0 replies.
- [GitHub] [spark] zeruibao commented on pull request #42458: [SPARK-43380][SQL] Revert `Fix Avro data type conversion issues` - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/11 20:32:36 UTC, 0 replies.
- [GitHub] [spark] zeruibao commented on a diff in pull request #42458: [SPARK-43380][SQL] Revert `Fix Avro data type conversion issues` - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/11 20:32:39 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42442: [SPARK-44770][WEBUI] Add a displayOrder variable to WebUITab to specify the order in which tabs appear - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/11 20:47:54 UTC, 0 replies.
- [GitHub] [spark] jasonli-db opened a new pull request, #42459: [SPARK-44770][WEBUI] Add a displayOrder variable to WebUITab to specify the order in which tabs appear - posted by "jasonli-db (via GitHub)" <gi...@apache.org> on 2023/08/11 20:52:30 UTC, 0 replies.
- [GitHub] [spark] rangadi opened a new pull request, #42460: [SPARK-44433] Terminate foreach batch runner when streaming query terminates - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/11 21:00:26 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #42460: [SPARK-44433] Terminate foreach batch runner when streaming query terminates - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/11 21:00:55 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42461: [CONNECT][POC] Have real server and real simple client in tests - connect-client-jvm-shaded - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/11 21:18:04 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42461: [CONNECT][POC] Have real server and real simple client in tests - connect-client-jvm-shaded - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/11 21:22:26 UTC, 3 replies.
- [GitHub] [spark] sandip-db opened a new pull request, #42462: [SPARK-44751][SQL] XML FileFormat Interface implementation - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2023/08/11 21:51:20 UTC, 0 replies.
- [GitHub] [spark] mingkangli-db opened a new pull request, #42463: [DO NOT MERGE] Avro connector: convert a union of a single type to a StructType instead of a simple type - posted by "mingkangli-db (via GitHub)" <gi...@apache.org> on 2023/08/11 21:51:49 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42420: [SPARK-44748][SQL] Query execution for the PARTITION BY clause in UDTF TABLE arguments - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/11 22:58:26 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42464: [SPARK-43509][PYTHON][CONNECT][FOLLOW-UP] Check SPARK_CONNECT_MODE_ENABLED when creating a session - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 23:19:20 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42461: [CONNECT][POC] Have real server and real simple client in tests - connect-client-jvm-shaded - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/11 23:40:54 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42459: [SPARK-44770][WEBUI] Add a displayOrder variable to WebUITab to specify the order in which tabs appear - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/11 23:42:52 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #42459: [SPARK-44770][WEBUI] Add a displayOrder variable to WebUITab to specify the order in which tabs appear - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/11 23:43:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42461: [CONNECT][POC] Have real server and real simple client in tests - connect-client-jvm-shaded - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 23:47:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42457: [SPARK-44625][CONNECT][FOLLOWUP] Make initialization of SparkConnectExecutionManager lazy - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 23:47:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42457: [SPARK-44625][CONNECT][FOLLOWUP] Make initialization of SparkConnectExecutionManager lazy - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 23:48:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42443: [SPARK-44461][3.5][SS][PYTHON][CONNECT] Verify Python Version for spark connect streaming workers - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 23:48:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42443: [SPARK-44461][3.5][SS][PYTHON][CONNECT] Verify Python Version for spark connect streaming workers - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 23:48:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42456: [SPARK-44422][FOLLOWUP][CONNECT] Fix typo in ProtoUtils - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 23:50:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42456: [SPARK-44422][FOLLOWUP][CONNECT] Fix typo in ProtoUtils - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 23:51:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42452: [SPARK-44740][CONNECT][FOLLOW] Fix metadata values in Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 23:54:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42452: [SPARK-44740][CONNECT][FOLLOW] Fix metadata values in Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 23:54:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42451: [SPARK-44775][PYTHON][DOCS] Add missing version information in DataFrame APIs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/11 23:55:15 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #42420: [SPARK-44748][SQL] Query execution for the PARTITION BY clause in UDTF TABLE arguments - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/12 00:02:16 UTC, 5 replies.
- [GitHub] [spark] mridulm closed pull request #41489: [SPARK-43987][Shuffle] Separate finalizeShuffleMerge Processing to Dedicated Thread Pools - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/12 00:04:42 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #41489: [SPARK-43987][Shuffle] Separate finalizeShuffleMerge Processing to Dedicated Thread Pools - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/12 00:05:09 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #41806: [SPARK-44242] Improve Max Heap not set check - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/12 00:06:50 UTC, 2 replies.
- [GitHub] [spark] mridulm closed pull request #41806: [SPARK-44242] Improve Max Heap not set check - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/12 00:07:20 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42465: [CONNECT][POC] Have real server and real simple client in tests - connect-client-jvm-shaded - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/12 00:23:49 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42465: [CONNECT][POC] Have real server and real simple client in tests - connect-client-jvm-shaded - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/12 00:25:22 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42465: [CONNECT][POC] Have real server and real simple client in tests - connect-client-jvm-shaded - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/12 00:41:38 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #42296: [SPARK-44635][CORE] Handle shuffle fetch failures in decommissions - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/12 01:14:30 UTC, 2 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42466: [SPARK-44242][CORE] Use the `assertThrows` method to fix Java linter issue - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/12 02:25:39 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42446: [SPARK-44719][SQL] Fix NoClassDefFoundError when using Hive UDF - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/12 03:41:59 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42446: [SPARK-44719][SQL] Fix NoClassDefFoundError when using Hive UDF - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/12 03:42:12 UTC, 0 replies.
- [GitHub] [spark] ukby1234 commented on a diff in pull request #42296: [SPARK-44635][CORE] Handle shuffle fetch failures in decommissions - posted by "ukby1234 (via GitHub)" <gi...@apache.org> on 2023/08/12 04:04:03 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42462: [SPARK-44751][SQL] XML FileFormat Interface implementation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/12 04:31:31 UTC, 2 replies.
- [GitHub] [spark] srielau opened a new pull request, #42467: [SPARK-44780][DOC] SQL temporary variables - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/12 05:31:42 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #42467: [SPARK-44780][DOC] SQL temporary variables - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/12 05:36:24 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42468: [WIP][SPARK-44781][SQL] Runtime filter should supports reuse exchange if it can reduce the data size of application side - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/12 05:38:27 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42435: [SPARK-44778][SQL] Add the alias `TIMEDIFF` for `TIMESTAMPDIFF` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/12 06:07:25 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42435: [SPARK-44778][SQL] Add the alias `TIMEDIFF` for `TIMESTAMPDIFF` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/12 06:08:54 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42109: [SPARK-44404][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1009,1010,1013,1015,1016,1278] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/12 07:22:02 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42109: [SPARK-44404][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1009,1010,1013,1015,1016,1278] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/12 07:22:41 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #42464: [SPARK-43509][PYTHON][CONNECT][FOLLOW-UP] Check SPARK_CONNECT_MODE_ENABLED when creating a session - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/12 09:21:22 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #42466: [SPARK-44242][CORE][FOLLOWUP] Use the `assertThrows` method to fix Java linter issue - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/12 09:37:43 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42466: [SPARK-44242][CORE][FOLLOWUP] Use the `assertThrows` method to fix Java linter issue - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/12 09:37:55 UTC, 0 replies.
- [GitHub] [spark] zero323 opened a new pull request, #42469: [SPARK-44782][INFRA] Adjust PR template to Generative Tooling Guidance recommendations - posted by "zero323 (via GitHub)" <gi...@apache.org> on 2023/08/12 10:25:40 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on pull request #42469: [SPARK-44782][INFRA] Adjust PR template to Generative Tooling Guidance recommendations - posted by "zero323 (via GitHub)" <gi...@apache.org> on 2023/08/12 10:27:11 UTC, 2 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42470: [WIP][SPARK-44783][SQL][TESTS] Checks arrays as named and positional parameters - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/12 11:15:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42286: [MINOR][SQL] Rename shouldBroadcast to isDynamicPruning in InSubqueryExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/12 13:39:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42286: [MINOR][SQL] Rename shouldBroadcast to isDynamicPruning in InSubqueryExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/12 13:41:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41806: [SPARK-44242] Improve Max Heap not set check - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/12 13:43:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42461: [CONNECT][POC] Have real server and real simple client in tests - connect-client-jvm-shaded - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/12 14:18:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42465: [CONNECT][POC] Have real server and real simple client in tests - connect-client-jvm-shaded - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/12 14:29:30 UTC, 0 replies.
- [GitHub] [spark] heyihong opened a new pull request, #42471: [SPARK-44785] Convert common alreadyExistsExceptions and noSuchExceptions - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/12 17:20:41 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41301: [SPARK-43780][SQL] Support correlated references in join predicates for scalar and lateral subqueries - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/12 17:41:09 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #42458: [SPARK-43380][SQL] Revert `Fix Avro data type conversion issues` - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/12 18:01:30 UTC, 0 replies.
- [GitHub] [spark] heyihong opened a new pull request, #42472: [SPARK-44786][SQL][CONNECT] Convert common Spark exceptions - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/12 18:32:37 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41033: Update bufbuild plugin references - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/13 00:17:29 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40954: [PYSPARK] [CONNECT] [ML] PySpark UDF supports python package dependencies - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/13 00:17:31 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #42357: [SPARK-44306][YARN] Group FileStatus with few RPC calls within Yarn Client - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/13 02:01:45 UTC, 6 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42473: [SPARK-44791][CONNECT] Make ArrowDeserializer work with REPL generated classes - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/13 02:10:48 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42473: [SPARK-44791][CONNECT] Make ArrowDeserializer work with REPL generated classes - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/13 02:14:36 UTC, 4 replies.
- [GitHub] [spark] wangyum commented on pull request #42447: [SPARK-44719][SQL][3.5] Fix NoClassDefFoundError when using Hive UDF - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/13 02:55:37 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41808: [SPARK-44162][CORE] Support G1GC in spark metrics - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/13 03:38:16 UTC, 2 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42474: [SPARK-44792][BUILD] Upgrade curator to 5.2.0 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/13 04:06:16 UTC, 0 replies.
- [GitHub] [spark] abmodi opened a new pull request, #42475: Fixing pipelineTime metric for WholeStageCodegen - posted by "abmodi (via GitHub)" <gi...@apache.org> on 2023/08/13 05:35:55 UTC, 0 replies.
- [GitHub] [spark] venkata91 commented on a diff in pull request #42357: [SPARK-44306][YARN] Group FileStatus with few RPC calls within Yarn Client - posted by "venkata91 (via GitHub)" <gi...@apache.org> on 2023/08/13 06:15:58 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on pull request #42075: [SPARK-43966][SQL][PYTHON] Support non-deterministic table-valued functions - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/08/13 17:22:18 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42418: [SPARK-44736][CONNECT] Add Dataset.explode to Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/13 18:26:48 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42418: [SPARK-44736][CONNECT] Add Dataset.explode to Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/13 18:27:37 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41033: Update bufbuild plugin references - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/14 00:17:04 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40954: [PYSPARK] [CONNECT] [ML] PySpark UDF supports python package dependencies - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/14 00:17:06 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42473: [SPARK-44791][CONNECT] Make ArrowDeserializer work with REPL generated classes - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/14 00:38:28 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42473: [SPARK-44791][CONNECT] Make ArrowDeserializer work with REPL generated classes - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/14 00:39:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42469: [SPARK-44782][INFRA] Adjust PR template to Generative Tooling Guidance recommendations - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/14 00:50:00 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42462: [SPARK-44751][SQL] XML FileFormat Interface implementation - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/14 02:02:36 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42469: [SPARK-44782][INFRA] Adjust PR template to Generative Tooling Guidance recommendations - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/14 02:08:08 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42476: [SPARK-44794][CONNECT] Make Streaming Queries work with REPL generated classes. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/14 02:39:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42451: [SPARK-44775][PYTHON][DOCS] Add missing version information in DataFrame APIs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/14 02:40:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42451: [SPARK-44775][PYTHON][DOCS] Add missing version information in DataFrame APIs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/14 02:41:12 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42476: [SPARK-44794][CONNECT] Make Streaming Queries work with REPL generated classes. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/14 02:42:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42477: [SPARK-44796][BUILD][CONNECT] Remove `grpc-java` plugin related configuration from the `connect/connect-client-jvm` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 03:29:26 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42478: [SPARK-44795][CONNECT] CodeGenerator Cache should be classloader specific - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/14 04:00:46 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42479: [SPARK-44798][BUILD] Fix Scala 2.13 mima check after SPARK-44705 merged - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 04:03:29 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #42478: [SPARK-44795][CONNECT] CodeGenerator Cache should be classloader specific - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/14 04:57:53 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42385: [SPARK-44705][PYTHON] Make PythonRunner single-threaded - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 05:14:52 UTC, 5 replies.
- [GitHub] [spark] wangyum closed pull request #42474: [SPARK-44792][BUILD] Upgrade curator to 5.2.0 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/14 05:42:48 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42474: [SPARK-44792][BUILD] Upgrade curator to 5.2.0 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/14 05:43:19 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42480: Test Java 17 + Pyspark - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 06:14:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42467: [SPARK-44780][DOC] SQL temporary variables - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/14 06:32:58 UTC, 2 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42481: [SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/14 06:33:05 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42482: [SPARK-43885][SQL][FOLLOWUP] Instruction#dataType should not fail - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/14 06:40:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42482: [SPARK-43885][SQL][FOLLOWUP] Instruction#dataType should not fail - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/14 06:41:10 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42480: Test Java 17 + Pyspark - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 06:53:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42483: [SPARK-44653][SQL][FOLLOWUP] ResolveUnion should not combine Unions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/14 07:10:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42483: [SPARK-44653][SQL][FOLLOWUP] ResolveUnion should not combine Unions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/14 07:10:56 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42481: [SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/14 07:24:17 UTC, 7 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42479: [SPARK-44798][BUILD] Fix Scala 2.13 mima check after SPARK-44705 merged - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 08:01:26 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42481: [SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/14 08:18:15 UTC, 7 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42481: [SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/14 08:18:17 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42481: [SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/14 08:21:31 UTC, 6 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42484: [SPARK-44802][INFRA] Token based ASF JIRA authentication - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/14 08:56:06 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42484: [SPARK-44802][INFRA] Token based ASF JIRA authentication - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/14 08:56:43 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42485: [SPARK-44791][CONNECT][TESTS][FOLLOWUP] Fix test case `Collect REPL generated class` for Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 09:04:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42485: [SPARK-44791][CONNECT][TESTS][FOLLOWUP] Fix test case `Collect REPL generated class` for Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 09:13:52 UTC, 3 replies.
- [GitHub] [spark] LuciferYang closed pull request #42480: Test Java 17 + Pyspark - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 09:15:52 UTC, 0 replies.
- [GitHub] [spark] majdyz commented on pull request #42394: [SPARK-44718][SQL] Match ColumnVector memory-mode config default to OffHeapMemoryMode config value - posted by "majdyz (via GitHub)" <gi...@apache.org> on 2023/08/14 10:15:25 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski closed pull request #42461: [CONNECT][POC] Have real server and real simple client in tests - connect-client-jvm-shaded - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/14 10:15:28 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42470: [SPARK-44783][SQL][TESTS] Checks arrays as named and positional parameters - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/14 10:33:36 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42470: [SPARK-44783][SQL][TESTS] Checks arrays as named and positional parameters - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/14 10:33:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42465: [CONNECT][POC] Have real server and real simple client in tests - classpath order hack - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 10:58:43 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42465: [CONNECT][POC] Have real server and real simple client in tests - classpath order hack - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/14 11:07:35 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42486: [SPARK-44803][BUILD] Replace `publishOrSkip` with `publish` in SparkBuild to eliminate warnings - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/14 11:21:32 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42486: [SPARK-44803][BUILD] Replace `publishOrSkip` with `publish` in SparkBuild to eliminate warnings - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/14 11:27:49 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42486: [SPARK-44803][BUILD] Replace `publishOrSkip` with `publish` in SparkBuild to eliminate warnings - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 13:21:42 UTC, 1 replies.
- [GitHub] [spark] tgravescs commented on pull request #42434: [SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value when first value is null - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2023/08/14 13:28:37 UTC, 0 replies.
- [GitHub] [spark] eejbyfeldt opened a new pull request, #42487: [SPARK-44777][CORE] Allow for eager checkpoint on RDD - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/08/14 13:28:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42481: [SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 13:40:17 UTC, 0 replies.
- [GitHub] [spark] utkarsh39 commented on pull request #42385: [SPARK-44705][PYTHON] Make PythonRunner single-threaded - posted by "utkarsh39 (via GitHub)" <gi...@apache.org> on 2023/08/14 14:17:55 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42483: [SPARK-44653][SQL][FOLLOWUP] ResolveUnion should not combine Unions - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/14 14:19:11 UTC, 1 replies.
- [GitHub] [spark] wankunde opened a new pull request, #42488: [WIP][SPARK-44804][SQL] SortMergeJoin should respect the streamed side ordering - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/14 14:50:14 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #41782: [SPARK-44239][SQL] Free memory allocated by large vectors when vectors are reset - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/14 14:53:20 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42489: [SPARK-44799][CONNECT] Fix outer scopes resolution on the executor side. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/14 14:58:06 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42485: [SPARK-44791][CONNECT][TESTS][FOLLOWUP] Fix test case `Collect REPL generated class` for Scala 2.13 - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/14 15:33:08 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42485: [SPARK-44791][CONNECT][TESTS][FOLLOWUP] Fix test case `Collect REPL generated class` for Scala 2.13 - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/14 15:34:14 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42485: [SPARK-44791][CONNECT][TESTS][FOLLOWUP] Fix test case `Collect REPL generated class` for Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/14 15:38:44 UTC, 3 replies.
- [GitHub] [spark] ueshin commented on pull request #42385: [SPARK-44705][PYTHON] Make PythonRunner single-threaded - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/14 15:43:36 UTC, 2 replies.
- [GitHub] [spark] ueshin commented on pull request #42422: [SPARK-44749][SQL][PYTHON] Support named arguments in Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/14 15:56:35 UTC, 2 replies.
- [GitHub] [spark] ueshin closed pull request #42422: [SPARK-44749][SQL][PYTHON] Support named arguments in Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/14 15:58:05 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski closed pull request #42465: [CONNECT][POC] Have real server and real simple client in tests - classpath order hack - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/14 17:00:30 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #41301: [SPARK-43780][SQL] Support correlated references in join predicates for scalar and lateral subqueries - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/14 17:05:03 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42476: [SPARK-44794][CONNECT] Make Streaming Queries work with Connect's artifact management - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/14 17:30:27 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42476: [SPARK-44794][CONNECT] Make Streaming Queries work with Connect's artifact management - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/14 17:34:26 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42441: [SPARK-44806][CONNECT] Separate `connect-client-jvm-internal` module to be able to test real in-process server with a real RPC client - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/14 18:04:54 UTC, 6 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42441: [SPARK-44806][CONNECT] Separate `connect-client-jvm-internal` module to be able to test real in-process server with a real RPC client - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/14 18:08:18 UTC, 1 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42460: [SPARK-44433] Terminate foreach batch runner when streaming query terminates - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/14 18:31:16 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42490: [SPARK-44749][PYTHON][FOLLOWUP][TESTS] Add more tests for named arguments in Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/14 19:14:57 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42490: [SPARK-44749][PYTHON][FOLLOWUP][TESTS] Add more tests for named arguments in Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/14 19:15:25 UTC, 1 replies.
- [GitHub] [spark] eejbyfeldt commented on a diff in pull request #42487: [SPARK-44777][CORE] Eager checkpointing on RDDs - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/08/14 19:41:41 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on a diff in pull request #42460: [SPARK-44433] Terminate foreach batch runner when streaming query terminates - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/08/14 20:13:58 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42464: [SPARK-43509][PYTHON][CONNECT][FOLLOW-UP] Check SPARK_CONNECT_MODE_ENABLED when creating a session - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/14 20:21:39 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42490: [SPARK-44749][PYTHON][FOLLOWUP][TESTS] Add more tests for named arguments in Python UDTF - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/14 20:30:19 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42490: [SPARK-44749][PYTHON][FOLLOWUP][TESTS] Add more tests for named arguments in Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/14 20:58:20 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42460: [SPARK-44433] Terminate foreach batch runner when streaming query terminates - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/14 21:28:12 UTC, 3 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/14 21:34:03 UTC, 0 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #42491: [SPARK-44809] Removed unused RocksDB custom metrics for pause/writeBatch - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/14 22:35:30 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #42491: [SPARK-44809][SS] Remove unused RocksDB custom metrics for pause/writeBatch - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/14 22:37:21 UTC, 2 replies.
- [GitHub] [spark] jdesjean commented on a diff in pull request #42454: [SPARK-44776][CONNECT] Add ProducedRowCount to SparkListenerConnectOperationFinished - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/08/15 00:13:18 UTC, 5 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40904: [WIP][POC] foreachbatch spark connect - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/15 00:16:37 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #41335: [SPARK-43205][DOCS][FOLLOWUP] IDENTIFIER clause docs - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/15 00:33:04 UTC, 1 replies.
- [GitHub] [spark] szehon-ho commented on a diff in pull request #42194: [SPARK-41471][SQL] Reduce Spark shuffle when only one side of a join is KeyGroupedPartitioning - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/08/15 00:39:35 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42492: [SPARK-44807][CONNECT] Add Dataset.metadataColumn to Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/15 00:42:10 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42492: [SPARK-44807][CONNECT] Add Dataset.metadataColumn to Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/15 00:42:39 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42492: [SPARK-44807][CONNECT] Add Dataset.metadataColumn to Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/15 00:44:11 UTC, 0 replies.
- [GitHub] [spark] linhongliu-db commented on a diff in pull request #41335: [SPARK-43205][DOCS][SQL][FOLLOWUP] IDENTIFIER clause docs + SQL Variable Integration - posted by "linhongliu-db (via GitHub)" <gi...@apache.org> on 2023/08/15 00:55:16 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42478: [SPARK-44795][CONNECT] CodeGenerator Cache should be classloader specific - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/15 01:10:07 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42478: [SPARK-44795][CONNECT] CodeGenerator Cache should be classloader specific - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/15 01:10:25 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42478: [SPARK-44795][CONNECT] CodeGenerator Cache should be classloader specific - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/15 01:10:54 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42490: [SPARK-44749][PYTHON][FOLLOWUP][TESTS] Add more tests for named arguments in Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/15 01:13:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/15 01:21:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/15 01:47:07 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41301: [SPARK-43780][SQL] Support correlated references in join predicates for scalar and lateral subqueries - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 01:51:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41301: [SPARK-43780][SQL] Support correlated references in join predicates for scalar and lateral subqueries - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 01:52:24 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #42493: [SPARK-44811][BUILD] Upgrade Guava to 32+ - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/15 02:07:12 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #42493: [SPARK-44811][BUILD] Upgrade Guava to 32+ - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/15 02:12:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42434: [SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value when first value is null - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 02:12:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42434: [SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value when first value is null - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 02:13:08 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42394: [SPARK-44718][SQL] Match ColumnVector memory-mode config default to OffHeapMemoryMode config value - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 02:17:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41782: [SPARK-44239][SQL] Free memory allocated by large vectors when vectors are reset - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 02:22:51 UTC, 14 replies.
- [GitHub] [spark] ericsun95 commented on a diff in pull request #42462: [SPARK-44751][SQL] XML FileFormat Interface implementation - posted by "ericsun95 (via GitHub)" <gi...@apache.org> on 2023/08/15 02:28:16 UTC, 10 replies.
- [GitHub] [spark] gengliangwang closed pull request #42458: [SPARK-43380][SQL] Revert `Fix Avro data type conversion issues` - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/15 02:36:51 UTC, 0 replies.
- [GitHub] [spark] utkarsh39 opened a new pull request, #42494: [SPARK-44705] Fix Deprecation Version of ContetAwareIterator - posted by "utkarsh39 (via GitHub)" <gi...@apache.org> on 2023/08/15 02:37:25 UTC, 0 replies.
- [GitHub] [spark] utkarsh39 commented on a diff in pull request #42385: [SPARK-44705][PYTHON] Make PythonRunner single-threaded - posted by "utkarsh39 (via GitHub)" <gi...@apache.org> on 2023/08/15 02:38:07 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #42482: [SPARK-43885][SQL][FOLLOWUP] Instruction#dataType should not fail - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 02:56:09 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42477: [SPARK-44796][BUILD][CONNECT] Remove `grpc-java` plugin related configuration from the `connect/connect-client-jvm` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/15 03:42:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42477: [SPARK-44796][BUILD][CONNECT] Remove `grpc-java` plugin related configuration from the `connect/connect-client-jvm` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/15 03:43:24 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42479: [SPARK-44798][BUILD] Fix Scala 2.13 mima check after SPARK-44705 merged - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/15 03:44:37 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42447: [SPARK-44719][SQL][3.5] Fix NoClassDefFoundError when using Hive UDF - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/15 03:45:22 UTC, 1 replies.
- [GitHub] [spark] yaooqinn closed pull request #42447: [SPARK-44719][SQL][3.5] Fix NoClassDefFoundError when using Hive UDF - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/15 03:47:10 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42493: [SPARK-44811][BUILD] Upgrade Guava to 32+ - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/15 03:49:15 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42434: [SPARK-43063][SQL][FOLLOWUP] Add a space between -> and value when first value is null - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/15 03:50:21 UTC, 0 replies.
- [GitHub] [spark] sandip-db commented on a diff in pull request #42462: [SPARK-44751][SQL] XML FileFormat Interface implementation - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2023/08/15 05:24:30 UTC, 17 replies.
- [GitHub] [spark] yaooqinn closed pull request #42484: [SPARK-44802][INFRA] Token based ASF JIRA authentication - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/15 05:39:28 UTC, 0 replies.
- [GitHub] [spark] copperybean opened a new pull request, #42495: [SPARK-44812][SQL] Push filters through join generated from intersect - posted by "copperybean (via GitHub)" <gi...@apache.org> on 2023/08/15 07:22:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42485: [SPARK-44791][CONNECT][TESTS][FOLLOWUP] Fix test case `Collect REPL generated class` for Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/15 08:23:02 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42496: [SPARK-44813][INFRA] The Jira Python misses our assignee when it searches users again - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/15 08:28:00 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42496: [SPARK-44813][INFRA] The Jira Python misses our assignee when it searches users again - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/15 08:30:27 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42441: [SPARK-44806][CONNECT] Separate `connect-client-jvm-internal` module to be able to test real in-process server with a real RPC client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/15 08:32:21 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42454: [SPARK-44776][CONNECT] Add ProducedRowCount to SparkListenerConnectOperationFinished - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/15 08:53:23 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42469: [SPARK-44782][INFRA] Adjust PR template to Generative Tooling Guidance recommendations - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/15 09:20:05 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42465: [CONNECT][POC] Have real server and real simple client in tests - classpath order hack - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/15 09:31:33 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42483: [SPARK-44653][SQL][FOLLOWUP] ResolveUnion should not combine Unions - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/15 09:33:09 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42194: [SPARK-41471][SQL] Reduce Spark shuffle when only one side of a join is KeyGroupedPartitioning - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/15 09:43:32 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42497: [SPARK-43205][SQL][FOLLOWUP] IDENTIFIER clause should accept alias and RuntimeReplaceable - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 10:05:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42497: [SPARK-43205][SQL][FOLLOWUP] IDENTIFIER clause should accept alias and RuntimeReplaceable - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 10:06:24 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42394: [SPARK-44718][SQL] Match ColumnVector memory-mode config default to OffHeapMemoryMode config value - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 10:09:28 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42394: [SPARK-44718][SQL] Match ColumnVector memory-mode config default to OffHeapMemoryMode config value - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 10:10:35 UTC, 0 replies.
- [GitHub] [spark] hdaikoku commented on pull request #42426: [SPARK-44756][CORE] Executor hangs when RetryingBlockTransferor fails to initiate retry - posted by "hdaikoku (via GitHub)" <gi...@apache.org> on 2023/08/15 10:20:30 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on pull request #42493: [SPARK-44811][BUILD] Upgrade Guava to 32+ - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/15 10:32:03 UTC, 4 replies.
- [GitHub] [spark] lissali commented on pull request #40211: [SPARK-42616][SQL] SparkSQLCLIDriver shall only close started hive sessionState - posted by "lissali (via GitHub)" <gi...@apache.org> on 2023/08/15 11:55:21 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #42493: [SPARK-44811][BUILD] Upgrade Guava to 32+ - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/15 12:18:51 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #42498: [SPARK-44814][CONNECT][PYTHON]Test to protect from faulty protobuf versions - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/15 13:04:37 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #42499: [SPARK-44815] Cache df.schema to avoid extra RPC - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/15 13:17:51 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40211: [SPARK-42616][SQL] SparkSQLCLIDriver shall only close started hive sessionState - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/15 13:33:17 UTC, 0 replies.
- [GitHub] [spark] nija-at opened a new pull request, #42500: [SPARK-44816][CONNECT] Improve error message when UDF class is not found - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/08/15 13:54:25 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42488: [SPARK-44804][SQL] SortMergeJoin should respect the streamed side ordering - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/15 14:36:15 UTC, 5 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42501: [SPARK-44806][CONNECT] Move internal client spark-connect-common to be able to test real in-process server with a real RPC client - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/15 15:38:28 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42497: [SPARK-43205][SQL][FOLLOWUP] IDENTIFIER clause should accept alias and RuntimeReplaceable - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/15 15:47:42 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42501: [SPARK-44806][CONNECT] Move internal client spark-connect-common to be able to test real in-process server with a real RPC client - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/15 16:16:16 UTC, 2 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42501: [SPARK-44806][CONNECT] Move internal client spark-connect-common to be able to test real in-process server with a real RPC client - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/15 16:16:22 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski closed pull request #42441: [SPARK-44806][CONNECT] Separate `connect-client-jvm-internal` module to be able to test real in-process server with a real RPC client - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/15 16:18:55 UTC, 1 replies.
- [GitHub] [spark] grundprinzip closed pull request #42499: [SPARK-44815][CONNECT]Cache df.schema to avoid extra RPC - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/15 16:26:38 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42486: [SPARK-44803][BUILD] Replace `publish` with `publishOrSkip` in SparkBuild to eliminate warnings - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/15 16:59:12 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42476: [SPARK-44794][CONNECT] Make Streaming Queries work with Connect's artifact management - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/15 17:04:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42476: [SPARK-44794][CONNECT] Make Streaming Queries work with Connect's artifact management - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/15 17:05:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42484: [SPARK-44802][INFRA] Token based ASF JIRA authentication - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/15 17:10:23 UTC, 1 replies.
- [GitHub] [spark] szehon-ho commented on pull request #42194: [SPARK-41471][SQL] Reduce Spark shuffle when only one side of a join is KeyGroupedPartitioning - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/08/15 17:15:53 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42414: [SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/15 17:17:37 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42441: [SPARK-44806][CONNECT] Separate `connect-client-jvm-internal` module to be able to test real in-process server with a real RPC client - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/15 17:25:46 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42502: [SPARK-44802][INFRA][FOLLOWUP] Fix to consider JIRA_ACCESS_TOKEN in precheck conditions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/15 17:35:38 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42494: [SPARK-44705][FOLLOWUP] Fix Deprecation Version of ContetAwareIterator - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/15 18:10:47 UTC, 0 replies.
- [GitHub] [spark] zeruibao opened a new pull request, #42503: [SPARK-43380][SQL] Fix Avro data type conversion issues without causing performance regression - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/15 18:18:23 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42494: [SPARK-44705] Fix Deprecation Version of ContetAwareIterator - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/15 18:20:43 UTC, 0 replies.
- [GitHub] [spark] zeruibao commented on pull request #42424: [SPARK-43380][SQL] Fix Avro data type conversion issues without causing performance regression - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/15 18:26:13 UTC, 0 replies.
- [GitHub] [spark] zeruibao closed pull request #42424: [SPARK-43380][SQL] Fix Avro data type conversion issues without causing performance regression - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/15 18:38:20 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #42504: [SPARK-44818] Fix race for pending interrupt issued before taskThread is initialized - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/15 18:50:53 UTC, 2 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #42504: [SPARK-44818] Fix race for pending interrupt issued before taskThread is initialized - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/15 18:57:26 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42075: [SPARK-43966][SQL][PYTHON] Support non-deterministic table-valued functions - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/15 19:03:10 UTC, 0 replies.
- [GitHub] [spark] YannisSismanis commented on a diff in pull request #41763: [SPARK-44219][SQL] Adds extra per-rule validations for optimization rewrites. - posted by "YannisSismanis (via GitHub)" <gi...@apache.org> on 2023/08/15 19:10:37 UTC, 12 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42505: [SPARK-44821][BUILD][K8S] Upgrade `kubernetes-client` to 6.8.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/15 21:01:22 UTC, 0 replies.
- [GitHub] [spark-docker] galacticgumshoe commented on pull request #52: Add Support for Scala 2.13 in Spark 3.4.1 - posted by "galacticgumshoe (via GitHub)" <gi...@apache.org> on 2023/08/15 21:02:11 UTC, 0 replies.
- [GitHub] [spark] JackBuggins commented on pull request #41928: [SPARK-44475][SQL][CONNECT] Relocate DataType and Parser to sql/api - posted by "JackBuggins (via GitHub)" <gi...@apache.org> on 2023/08/15 21:35:47 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #41928: [SPARK-44475][SQL][CONNECT] Relocate DataType and Parser to sql/api - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/08/15 21:52:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42447: [SPARK-44719][SQL][3.5] Fix NoClassDefFoundError when using Hive UDF - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/15 23:13:05 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42500: [SPARK-44816][CONNECT] Improve error message when UDF class is not found - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/15 23:36:54 UTC, 0 replies.
- [GitHub] [spark] srielau opened a new pull request, #42506: [SPARK-43205][DOC] 3.5 identifier clause docs - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/15 23:43:17 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #42506: [SPARK-43205][DOC] 3.5 identifier clause docs - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/15 23:53:55 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42491: [SPARK-44809][SS] Remove unused RocksDB custom metrics for pause/writeBatch - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/16 00:07:00 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40904: [WIP][POC] foreachbatch spark connect - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/16 00:16:38 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #41335: [SPARK-43205][DOCS][SQL][FOLLOWUP] IDENTIFIER clause docs - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/16 00:32:33 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #41335: [SPARK-43205][DOCS][SQL][FOLLOWUP] IDENTIFIER clause docs - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/16 00:32:44 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #42493: [SPARK-44811][BUILD] Upgrade Guava to 32+ - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/16 01:40:18 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41928: [SPARK-44475][SQL][CONNECT] Relocate DataType and Parser to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/16 01:47:13 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #42491: [SPARK-44809][SS] Remove unused RocksDB custom metrics for pause/writeBatch - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/16 02:17:18 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42507: [SPARK-44823][PYTHON] Update black to 23.7.0 and fix erroneous check - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/16 02:41:00 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42507: [SPARK-44823][PYTHON] Update black to 23.7.0 and fix erroneous check - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/16 02:54:32 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42471: [SPARK-44785][SQL][CONNECT] Convert common alreadyExistsExceptions and noSuchExceptions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/16 02:55:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42508: [SPARK-44795][CONNECT][TESTS][FOLLOWUP] Fix test case `Collect REPL generated class` for Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 03:29:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42508: [SPARK-44795][CONNECT][TESTS][FOLLOWUP] Fix test case `REPL class in UDF` for Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 03:38:45 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42426: [SPARK-44756][CORE] Executor hangs when RetryingBlockTransferor fails to initiate retry - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/16 03:42:56 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on pull request #42504: [SPARK-44818] Fix race for pending interrupt issued before taskThread is initialized - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/16 03:49:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42509: [SPARK-44824][CONNECT][TESTS] Reset `ammoniteOut` in the `afterEach` method of `ReplE2ESuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 03:52:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42509: [SPARK-44824][CONNECT][TESTS] Reset `ammoniteOut` in the `afterEach` method of `ReplE2ESuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 04:12:01 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42508: [SPARK-44795][CONNECT][TESTS][FOLLOWUP] Fix test case `REPL class in UDF` for Scala 2.13 - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/16 04:26:07 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42510: Test tink 1.10.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 05:58:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42510: Test tink 1.10.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 05:59:39 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42385: [SPARK-44705][PYTHON] Make PythonRunner single-threaded - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/16 06:24:10 UTC, 1 replies.
- [GitHub] [spark-docker] Yikun opened a new pull request, #53: [SPARK-44494] Pin minikube to v1.30.1 to fix spark-docker K8s CI - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/08/16 06:42:41 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on pull request #52: Add Support for Scala 2.13 in Spark 3.4.1 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/08/16 06:43:19 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on a diff in pull request #52: Add Support for Scala 2.13 in Spark 3.4.1 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/08/16 06:51:47 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42502: [SPARK-44802][INFRA][FOLLOWUP] Fix to consider JIRA_ACCESS_TOKEN in precheck conditions - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/16 08:48:45 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42502: [SPARK-44802][INFRA][FOLLOWUP] Fix to consider JIRA_ACCESS_TOKEN in precheck conditions - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/16 08:49:23 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42511: [SPARK-44828][BUILD] Upgrade ORC to 1.9.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/16 09:22:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42502: [SPARK-44802][INFRA][FOLLOWUP] Fix to consider JIRA_ACCESS_TOKEN in precheck conditions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/16 09:34:25 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42502: [SPARK-44802][INFRA][FOLLOWUP] Fix to consider JIRA_ACCESS_TOKEN in precheck conditions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/16 09:37:48 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42509: [SPARK-44824][CONNECT][TESTS] Reset `ammoniteOut` in the `afterEach` method of `ReplE2ESuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 09:48:53 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42512: [SPARK-44824][CONNECT][TESTS][3.5] Reset `ammoniteOut` in the `afterEach` method of `ReplE2ESuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 09:52:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42512: [SPARK-44824][CONNECT][TESTS][3.5] Reset `ammoniteOut` in the `afterEach` method of `ReplE2ESuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 09:56:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42508: [SPARK-44795][CONNECT][TESTS][FOLLOWUP] Fix test case `REPL class in UDF` for Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 09:57:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42508: [SPARK-44795][CONNECT][TESTS][FOLLOWUP] Fix test case `REPL class in UDF` for Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 09:58:40 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #42505: [SPARK-44821][BUILD][K8S] Upgrade `kubernetes-client` to 6.8.1 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/16 10:56:09 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42513: [SPARK-44827][PYTHON][TESTS] Fix test_assert_approx_equal_decimaltype_custom_rtol_pass when SPARK_ANSI_SQL_MODE=true - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/16 11:07:49 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42513: [SPARK-44827][PYTHON][TESTS] Fix test_assert_approx_equal_decimaltype_custom_rtol_pass when SPARK_ANSI_SQL_MODE=true - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/16 11:07:58 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42513: [SPARK-44827][PYTHON][TESTS] Fix test_assert_approx_equal_decimaltype_custom_rtol_pass when SPARK_ANSI_SQL_MODE=true - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 11:11:46 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42513: [SPARK-44827][PYTHON][TESTS] Fix `test_assert_approx_equal_decimaltype_custom_rtol_pass` when ansi mode enabled - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/16 11:33:33 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #42511: [SPARK-44828][BUILD] Upgrade ORC to 1.9.1 - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/16 11:35:20 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #42507: [SPARK-44823][PYTHON] Update black to 23.7.0 and fix erroneous check - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/16 11:38:11 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42514: Upgrade grpc to 1.57.2 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/16 12:01:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42151: [SPARK-44565][PYTHON][DOCS] Refine the docs for `Union`, `UnionAll` and `unionByName` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/16 12:23:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42501: [SPARK-44806][CONNECT] Move internal client spark-connect-common to be able to test real in-process server with a real RPC client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 12:24:55 UTC, 2 replies.
- [GitHub] [spark] nija-at commented on a diff in pull request #42500: [SPARK-44816][CONNECT] Improve error message when UDF class is not found - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/08/16 12:28:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42515: [SPARK-44831][PYTHON][DOCS] Refine DocString of `Union*` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/16 12:29:55 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42489: [SPARK-44799][CONNECT] Fix outer scopes resolution on the executor side. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/16 13:27:49 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42489: [SPARK-44799][CONNECT] Fix outer scopes resolution on the executor side. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/16 13:28:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42514: [SPARK-44830][CONNECT] Upgrade grpc-java to 1.57.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/16 13:40:29 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #42469: [SPARK-44782][INFRA] Adjust PR template to Generative Tooling Guidance recommendations - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/16 13:52:38 UTC, 0 replies.
- [GitHub] [spark] heyihong commented on a diff in pull request #42471: [SPARK-44785][SQL][CONNECT] Convert common alreadyExistsExceptions and noSuchExceptions - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/16 13:54:33 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42514: [SPARK-44830][CONNECT] Upgrade grpc-java to 1.57.2 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/16 13:56:33 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42505: [SPARK-44821][BUILD][K8S] Upgrade `kubernetes-client` to 6.8.1 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/16 14:48:37 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42511: [SPARK-44828][BUILD] Upgrade ORC to 1.9.1 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/16 14:54:40 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42511: [SPARK-44828][BUILD] Upgrade ORC to 1.9.1 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/16 14:56:02 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42502: [SPARK-44802][INFRA][FOLLOWUP] Fix to consider JIRA_ACCESS_TOKEN in precheck conditions - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/16 14:59:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42505: [SPARK-44821][BUILD][K8S] Upgrade `kubernetes-client` to 6.8.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/16 15:32:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42511: [SPARK-44828][BUILD] Upgrade ORC to 1.9.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/16 15:40:18 UTC, 0 replies.
- [GitHub] [spark] zeruibao commented on pull request #42503: [SPARK-43380][SQL] Fix Avro data type conversion issues without causing performance regression - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/16 16:05:30 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42492: [SPARK-44807][CONNECT] Add Dataset.metadataColumn to Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/16 16:14:53 UTC, 0 replies.
- [GitHub] [spark] vsevolodstep-db opened a new pull request, #42516: [SPARK-44829][CONNECT] Expose uploadAllArtifactClasses in ArtifactManager to `sql` package - posted by "vsevolodstep-db (via GitHub)" <gi...@apache.org> on 2023/08/16 16:42:39 UTC, 0 replies.
- [GitHub] [spark] vsevolodstep-db commented on pull request #42516: [SPARK-44829][CONNECT] Expose uploadAllArtifactClasses in ArtifactManager to `sql` package - posted by "vsevolodstep-db (via GitHub)" <gi...@apache.org> on 2023/08/16 16:43:46 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42517: [SPARK-44834][PYTHON][SQL] Add SQL query tests for Python UDTFs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/16 16:54:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42464: [SPARK-43509][PYTHON][CONNECT][FOLLOW-UP] Check SPARK_CONNECT_MODE_ENABLED when creating a session - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/16 17:49:16 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #42493: [SPARK-44811][BUILD] Upgrade Guava to 32+ - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/08/16 17:49:47 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42518: [SPARK-44832][CONNECT] Make transitive dependencies work for Maven build. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/16 18:18:44 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42518: [SPARK-44832][CONNECT] Make transitive dependencies work for Maven build. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/16 18:19:38 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42283: [SPARK-44433][PYTHON][CONNECT][SS][FOLLOWUP] Terminate listener process with `removeListener` and improvements - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/16 18:33:23 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42385: [SPARK-44705][PYTHON] Make PythonRunner single-threaded - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/16 18:37:19 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42512: [SPARK-44824][CONNECT][TESTS][3.5] Reset `ammoniteOut` in the `afterEach` method of `ReplE2ESuite` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/16 19:02:23 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #42512: [SPARK-44824][CONNECT][TESTS][3.5] Reset `ammoniteOut` in the `afterEach` method of `ReplE2ESuite` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/16 19:03:43 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42516: [SPARK-44829][CONNECT] Expose uploadAllArtifactClasses in ArtifactManager to `sql` package - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/16 19:05:24 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42516: [SPARK-44829][CONNECT] Expose uploadAllArtifactClasses in ArtifactManager to `sql` package - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/16 19:05:51 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42519: [SPARK-44822][PYTHON][SQL] Make Python UDTFs by default non-deterministic - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/16 19:44:46 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42519: [SPARK-44822][PYTHON][SQL] Make Python UDTFs by default non-deterministic - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/16 19:44:57 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42517: [SPARK-44834][PYTHON][SQL] Add SQL query tests for Python UDTFs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/16 20:12:23 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #42306: [SPARK-44647][SQL] Support SPJ where join keys are less than cluster keys - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/16 20:26:17 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42520: [SPARK-44836][PYTHON] Refactor Arrow Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/16 21:11:14 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42520: [SPARK-44836][PYTHON] Refactor Arrow Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/16 21:11:36 UTC, 2 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42521: [SPARK-44435][SS][CONNECT][DRAFT] Tests for foreachBatch and Listener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/16 21:26:24 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42521: [SPARK-44435][SS][CONNECT][DRAFT] Tests for foreachBatch and Listener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/16 21:30:21 UTC, 17 replies.
- [GitHub] [spark] heyihong commented on pull request #42377: [SPARK-44622][SQL][CONNECT] Implement error enrichment and setting server-side stacktrace - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/16 21:32:30 UTC, 2 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42522: [SPARK-44836][PYTHON][3.5] Refactor Arrow Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/16 21:43:58 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42520: [SPARK-44836][PYTHON] Refactor Arrow Python UDTF - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/16 21:49:57 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42523: [SPARK-44806][CONNECT] Move internal client spark-connect-common to be able to test real in-process server with a real RPC client - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/16 22:09:16 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42523: [SPARK-44806][CONNECT] Move internal client spark-connect-common to be able to test real in-process server with a real RPC client - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/16 22:19:24 UTC, 3 replies.
- [GitHub] [spark] ueshin commented on pull request #42522: [SPARK-44836][PYTHON][3.5] Refactor Arrow Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/16 22:24:15 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41093: Update build_and_test.yml - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/17 00:16:12 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41090: [SPARK-43406][SQL] enable spark sql to drop multiple partitions in on… - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/17 00:16:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41066: [SPARK-43385][SQL] The Generator's statistics should be ratio times greater than the child nodes - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/17 00:16:16 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40963: [SPARK-43288][SQL] DataSourceV2: CREATE TABLE LIKE - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/17 00:16:17 UTC, 0 replies.
- [GitHub] [spark] michaelzhan-db opened a new pull request, #42524: [SPARK-44837][SQL] Improve ALTER TABLE ALTER PARTITION column error message - posted by "michaelzhan-db (via GitHub)" <gi...@apache.org> on 2023/08/17 00:32:03 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on pull request #53: [SPARK-44494] Pin minikube to v1.30.1 to fix spark-docker K8s CI - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/08/17 00:47:05 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #42525: [SPARK-44841][PS] Support `value_counts` for pandas 2.0.0 and above. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/17 01:30:13 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42515: [SPARK-44831][PYTHON][DOCS] Refine DocString of `Union*` - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/17 01:36:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42388: [SPARK-43618][SPARK-43658][CONNECT][PS][TESTS] Enabling more tests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/17 01:42:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42489: [SPARK-44799][CONNECT] Fix outer scopes resolution on the executor side. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/17 01:43:21 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42514: [SPARK-44830][CONNECT] Upgrade grpc to 1.57.2 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/17 01:44:41 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42522: [SPARK-44836][PYTHON][3.5] Refactor Arrow Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/17 01:48:04 UTC, 0 replies.
- [GitHub] [spark-docker] LuciferYang commented on pull request #53: [SPARK-44494] Pin minikube to v1.30.1 to fix spark-docker K8s CI - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/17 02:15:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42513: [SPARK-44827][PYTHON][TESTS] Fix test when ansi mode enabled - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/17 02:57:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42515: [SPARK-44831][PYTHON][DOCS] Refine DocString of `Union*` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/17 02:59:24 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42520: [SPARK-44836][PYTHON] Refactor Arrow Python UDTF - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/17 03:00:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42518: [SPARK-44832][CONNECT] Make transitive dependencies work properly for Scala Client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/17 03:43:31 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42518: [SPARK-44832][CONNECT] Make transitive dependencies work properly for Scala Client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/17 04:05:54 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #42526: [SPARK-44842][SPARK-43812][PS] Support stat functions for pandas 2.0.0 and enabling tests. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/17 04:17:25 UTC, 0 replies.
- [GitHub] [spark] rangadi closed pull request #41945: [TEMP] Feb impl - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/17 05:01:19 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42519: [SPARK-44822][PYTHON][SQL] Make Python UDTFs by default non-deterministic - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/17 05:08:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42519: [SPARK-44822][PYTHON][SQL] Make Python UDTFs by default non-deterministic - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/17 05:10:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42462: [SPARK-44751][SQL] XML FileFormat Interface implementation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/17 05:30:03 UTC, 12 replies.
- [GitHub] [spark] itholic opened a new pull request, #42527: [SPARK-43462][SPARK-43871][PS][TESTS] Enable `SeriesDateTimeTests` for pandas 2.0.0 and above - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/17 05:58:03 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42528: [SPARK-44844][TESTS] Exclude `python/build/*` path for local `lint-python` testing - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/17 06:07:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42220: [SPARK-44577][SQL] Fix INSERT BY NAME returns nonsensical error message - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/17 07:00:24 UTC, 13 replies.
- [GitHub] [spark] zekai-li opened a new pull request, #42529: [SPARK-44845][YARN][DEPLOY]Fixed file system uri comparison function - posted by "zekai-li (via GitHub)" <gi...@apache.org> on 2023/08/17 07:31:10 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun closed pull request #53: [SPARK-44494] Pin minikube to v1.30.1 to fix spark-docker K8s CI - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/08/17 07:31:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42399: [SPARK-44721][CONNECT] Revamp retry logic and make retries run for 10 minutes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/17 07:56:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42399: [SPARK-44721][CONNECT] Revamp retry logic and make retries run for 10 minutes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/17 07:57:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42523: [SPARK-44806][CONNECT] Move internal client spark-connect-common to be able to test real in-process server with a real RPC client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/17 08:22:20 UTC, 0 replies.
- [GitHub] [spark] zuston commented on pull request #32804: [SPARK-26867][YARN] Spark Support of YARN Placement Constraint - posted by "zuston (via GitHub)" <gi...@apache.org> on 2023/08/17 08:49:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42525: [SPARK-44841][PS] Support `value_counts` for pandas 2.0.0 and above. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/17 08:50:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42525: [SPARK-44841][PS] Support `value_counts` for pandas 2.0.0 and above. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/17 08:51:08 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42530: [SPARK-43875][PS][TESTS] Enabling Categorical tests for Pandas 2.0.0 and above - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/17 09:16:34 UTC, 0 replies.
- [GitHub] [spark] zml1206 opened a new pull request, #42531: [SPARK-44846][SQL] PushFoldableIntoBranches in complex grouping expressions may cause bi… - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/08/17 09:58:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42527: [SPARK-43462][SPARK-43871][PS][TESTS] Enable `SeriesDateTimeTests` for pandas 2.0.0 and above - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/17 10:03:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42527: [SPARK-43462][SPARK-43871][PS][TESTS] Enable `SeriesDateTimeTests` for pandas 2.0.0 and above - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/17 10:03:29 UTC, 0 replies.
- [GitHub] [spark] kylerong-db opened a new pull request, #42532: [WIP][SPARK-] Support view with nested struct in Hive client - posted by "kylerong-db (via GitHub)" <gi...@apache.org> on 2023/08/17 10:04:49 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42533: [SPARK-44289][SPARK-43874][SPARK-43869][SPARK-43607][PS] Support `indexer_between_time` for pandas 2.0.0 & enabling more tests. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/17 12:05:49 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski closed pull request #42501: [SPARK-44806][CONNECT] Move internal client spark-connect-common to be able to test real in-process server with a real RPC client - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/17 12:08:45 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42528: [SPARK-44844][BUILD] Exclude `python/build/*` path for local `lint-python` testing - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/17 12:08:49 UTC, 1 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42534: [WIP][SQL] Use the `DateFormatClass` expression to format a datetime in `to_char` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/17 12:22:33 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42535: [SPARK-44849] Expose SparkConnectExecutionManager.listActiveExecutions - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/17 12:35:49 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42535: [SPARK-44849] Expose SparkConnectExecutionManager.listActiveExecutions - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/17 12:35:57 UTC, 0 replies.
- [GitHub] [spark] Deependra-Patel opened a new pull request, #42536: [SPARK-39024][CORE][YARN] Enable graceful decommissioning of executors on "DECOMMISSIONING" node even in case of external shuffle service enabled - posted by "Deependra-Patel (via GitHub)" <gi...@apache.org> on 2023/08/17 12:43:29 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42531: [SPARK-44846][SQL] PushFoldableIntoBranches in complex grouping expressions may cause bi… - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/17 14:04:02 UTC, 6 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #42537: [SPARK-44740][CONNECT][FOLLOW] Fix metadata values for Artifacts - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/17 14:11:45 UTC, 0 replies.
- [GitHub] [spark] cdkrot opened a new pull request, #42538: [SPARK-44850][CONNECT] Heartbeat in scala's Spark Connect - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/08/17 14:54:50 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42523: [SPARK-44806][CONNECT] Move internal client spark-connect-common to be able to test real in-process server with a real RPC client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/17 15:05:25 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42523: [SPARK-44806][CONNECT] Move internal client spark-connect-common to be able to test real in-process server with a real RPC client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/17 15:06:18 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42538: [SPARK-44850][CONNECT] Heartbeat in scala's Spark Connect - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/17 15:11:52 UTC, 4 replies.
- [GitHub] [spark] cdkrot commented on a diff in pull request #42538: [SPARK-44850][CONNECT] Heartbeat in scala's Spark Connect - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/08/17 15:12:37 UTC, 2 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #42507: [SPARK-44823][PYTHON] Update black to 23.7.0 and fix erroneous check - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/17 15:15:52 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #42496: [SPARK-44813][INFRA] The Jira Python misses our assignee when it searches users again - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/17 15:16:41 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42535: [SPARK-44849] Expose SparkConnectExecutionManager.listActiveExecutions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/17 15:47:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42539: [SPARK-44852][BUILD] Exclude `junit-jupiter-api` from `curator-test` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/17 15:48:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42539: [SPARK-44852][BUILD] Exclude `junit-jupiter-api` from `curator-test` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/17 15:49:29 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #42535: [SPARK-44849] Expose SparkConnectExecutionManager.listActiveExecutions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/17 15:49:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42496: [SPARK-44813][INFRA] The Jira Python misses our assignee when it searches users again - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/17 15:55:55 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42540: [SPARK-44853][PYTHON][DOCS] Refine docstring of DataFrame.columns property - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/17 16:47:29 UTC, 0 replies.
- [GitHub] [spark] zml1206 commented on a diff in pull request #42531: [SPARK-44846][SQL] PushFoldableIntoBranches in complex grouping expressions may cause bi… - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/08/17 16:48:24 UTC, 2 replies.
- [GitHub] [spark] hdaly0 opened a new pull request, #42541: Spark 44854 - posted by "hdaly0 (via GitHub)" <gi...@apache.org> on 2023/08/17 17:16:27 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42517: [SPARK-44834][PYTHON][SQL][TESTS] Add SQL query tests for Python UDTFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/17 17:37:16 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42517: [SPARK-44834][PYTHON][SQL][TESTS] Add SQL query tests for Python UDTFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/17 17:38:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42542: [SPARK-44214][CORE] Add driver log live UI for K8s environment - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/17 17:57:03 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42543: [SPARK-44834][PYTHON][SQL][TESTS][FOLLOW-UP] Update the analyzer results of the udtf tests - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/17 18:26:03 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42543: [SPARK-44834][PYTHON][SQL][TESTS][FOLLOW-UP] Update the analyzer results of the udtf tests - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/17 18:27:35 UTC, 1 replies.
- [GitHub] [spark] viirya opened a new pull request, #42544: [MINOR][SS] Fix incorrect property name in structured streaming doc - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/17 19:52:38 UTC, 0 replies.
- [GitHub] [spark] melihsozdinler commented on a diff in pull request #42296: [SPARK-44635][CORE] Handle shuffle fetch failures in decommissions - posted by "melihsozdinler (via GitHub)" <gi...@apache.org> on 2023/08/17 20:20:09 UTC, 0 replies.
- [GitHub] [spark] hayssams opened a new pull request, #42545: Spark wrongly map the bOolean Type to BIT(1) in Snowflake - posted by "hayssams (via GitHub)" <gi...@apache.org> on 2023/08/17 20:22:23 UTC, 0 replies.
- [GitHub] [spark] melihsozdinler commented on a diff in pull request #42112: [SPARK-44493][SQL] Support for translating catalyst expressions into partial datasource filters - posted by "melihsozdinler (via GitHub)" <gi...@apache.org> on 2023/08/17 20:40:05 UTC, 2 replies.
- [GitHub] [spark] melihsozdinler commented on a diff in pull request #42071: [SPARK-44209] Expose amount of shuffle data available on the node - posted by "melihsozdinler (via GitHub)" <gi...@apache.org> on 2023/08/17 20:56:01 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #42460: [SPARK-44433][3.5x] Terminate foreach batch runner when streaming query terminates - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/17 21:02:07 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42420: [SPARK-44748][SQL] Query execution for the PARTITION BY clause in UDTF TABLE arguments - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/17 21:03:02 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42540: [SPARK-44853][PYTHON][DOCS] Refine docstring of DataFrame.columns property - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/17 21:28:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42546: [SPARK-44857][CORE][UI] Fix `getBaseURI` error in Spark Worker LogPage UI buttons - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/17 21:49:28 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42547: [SPARK-44858][PYTHON][DOCS] Refine dostring of DataFrame.isEmpty - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/17 21:50:40 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42547: [SPARK-44858][PYTHON][DOCS] Refine dostring of DataFrame.isEmpty - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/17 21:50:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42546: [SPARK-44857][CORE][UI] Fix `getBaseURI` error in Spark Worker LogPage UI buttons - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/17 21:52:50 UTC, 3 replies.
- [GitHub] [spark] viirya commented on pull request #42544: [SPARK-44859][SS] Fix incorrect property name in structured streaming doc - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/17 22:22:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42544: [SPARK-44859][SS] Fix incorrect property name in structured streaming doc - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/17 22:52:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42544: [SPARK-44859][SS] Fix incorrect property name in structured streaming doc - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/17 22:53:17 UTC, 0 replies.
- [GitHub] [spark] michaelzhan-db opened a new pull request, #42548: [WIP][SPARK-44750][PySpark][Connect] Apply configuration to sparksession during creation - posted by "michaelzhan-db (via GitHub)" <gi...@apache.org> on 2023/08/17 22:58:27 UTC, 0 replies.
- [GitHub] [spark] vitaliili-db opened a new pull request, #42549: [SPARK-44860] Add SESSION_USER function - posted by "vitaliili-db (via GitHub)" <gi...@apache.org> on 2023/08/17 23:25:13 UTC, 0 replies.
- [GitHub] [spark] jdesjean opened a new pull request, #42550: [SPARK-44861] jsonignore SparkListenerConnectOperationStarted.planRequest - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/08/17 23:58:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42515: [SPARK-44831][PYTHON][DOCS] Refine DocString of `DataFrame.{union, unionAll, unionByName}` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 00:12:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42515: [SPARK-44831][PYTHON][DOCS] Refine DocString of `DataFrame.{union, unionAll, unionByName}` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 00:12:55 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41093: Update build_and_test.yml - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/18 00:16:21 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41090: [SPARK-43406][SQL] enable spark sql to drop multiple partitions in on… - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/18 00:16:22 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41066: [SPARK-43385][SQL] The Generator's statistics should be ratio times greater than the child nodes - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/18 00:16:23 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40963: [SPARK-43288][SQL] DataSourceV2: CREATE TABLE LIKE - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/18 00:16:23 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40713: [SPARK-42551][SQL] Support more subexpression elimination cases - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/18 00:16:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42548: [WIP][SPARK-44750][PYTHON][CONNECT] Apply configuration to sparksession during creation - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 00:32:16 UTC, 3 replies.
- [GitHub] [spark] michaelzhan-db commented on a diff in pull request #42548: [WIP][SPARK-44750][PYTHON][CONNECT] Apply configuration to sparksession during creation - posted by "michaelzhan-db (via GitHub)" <gi...@apache.org> on 2023/08/18 00:37:46 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42530: [SPARK-43875][PS][TESTS] Enabling Categorical tests for Pandas 2.0.0 and above - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 00:39:32 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42530: [SPARK-43875][PS][TESTS] Enabling Categorical tests for Pandas 2.0.0 and above - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 00:40:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42546: [SPARK-44857][CORE][UI] Fix `getBaseURI` error in Spark Worker LogPage UI buttons - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/18 01:07:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42506: [SPARK-43205][DOC] identifier clause docs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/18 01:26:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42506: [SPARK-43205][DOC] identifier clause docs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/18 01:27:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42471: [SPARK-44785][SQL][CONNECT] Convert common alreadyExistsExceptions and noSuchExceptions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/18 01:29:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42255: [SPARK-40178][SQL][COONECT] support coalesce hints with ease for PySpark and R - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/18 01:45:46 UTC, 4 replies.
- [GitHub] [spark] itholic opened a new pull request, #42551: [SPARK-43563][SPARK-43459][SPARK-43451][SPARK-43506] Remove `squeeze` from `read_csv` & enabling more tests. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/18 02:08:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42507: [SPARK-44823][PYTHON] Update black to 23.7.0 and fix erroneous check - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 02:47:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42503: [SPARK-43380][SQL] Fix Avro data type conversion issues without causing performance regression - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/18 03:16:31 UTC, 1 replies.
- [GitHub] [spark] wankunde commented on pull request #42488: [SPARK-44804][SQL] SortMergeJoin should respect the streamed side ordering - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/18 04:06:19 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42545: Spark wrongly map the BOOLEAN Type to BIT(1) in Snowflake - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/18 04:19:19 UTC, 0 replies.
- [GitHub] [spark-connect-go] zhengruifeng closed pull request #14: [SPARK-44681] Fix issues when writing Go application code using Spark Connect Go client library - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 04:20:34 UTC, 0 replies.
- [GitHub] [spark-connect-go] zhengruifeng commented on pull request #14: [SPARK-44681] Fix issues when writing Go application code using Spark Connect Go client library - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 04:20:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42533: [SPARK-44289][SPARK-43874][SPARK-43869][SPARK-43607][PS] Support `indexer_between_time` for pandas 2.0.0 & enabling more tests. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 04:54:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42533: [SPARK-44289][SPARK-43874][SPARK-43869][SPARK-43607][PS] Support `indexer_between_time` for pandas 2.0.0 & enabling more tests. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 04:55:36 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42552: [SPARK-44289][FOLLOWUP] Cleanup doctest - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/18 05:50:48 UTC, 0 replies.
- [GitHub] [spark] leesf opened a new pull request, #42553: [SPARK-44864] Align streaming statistics link format with other page links - posted by "leesf (via GitHub)" <gi...@apache.org> on 2023/08/18 06:30:16 UTC, 0 replies.
- [GitHub] [spark] zeruibao opened a new pull request, #42554: Make StreamingRelationV2 support metadata column - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/18 06:49:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42548: [WIP][SPARK-44750][PYTHON][CONNECT] Apply configuration to sparksession during creation - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 07:03:10 UTC, 1 replies.
- [GitHub] [spark] rangadi opened a new pull request, #42555: [SPARK-44433][3.5X] Terminate foreach batch runner when streaming query terminates - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/18 07:23:39 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #41119: [SPARK-42551][SQL] Support more subexpression elimination cases - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/18 07:32:25 UTC, 0 replies.
- [GitHub] [spark] melihsozdinler commented on a diff in pull request #42555: [SPARK-44433][3.5X] Terminate foreach batch runner when streaming query terminates - posted by "melihsozdinler (via GitHub)" <gi...@apache.org> on 2023/08/18 08:19:24 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #42543: [SPARK-44834][PYTHON][SQL][TESTS][FOLLOW-UP] Update the analyzer results of the udtf tests - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/18 08:32:06 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42543: [SPARK-44834][PYTHON][SQL][TESTS][FOLLOW-UP] Update the analyzer results of the udtf tests - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/18 08:33:20 UTC, 0 replies.
- [GitHub] [spark] hayssams commented on pull request #42545: SPARK-44866: Spark wrongly map the BOOLEAN Type to BIT(1) in Snowflake - posted by "hayssams (via GitHub)" <gi...@apache.org> on 2023/08/18 09:23:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42540: [SPARK-44853][PYTHON][DOCS] Refine docstring of DataFrame.columns property - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 09:31:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42540: [SPARK-44853][PYTHON][DOCS] Refine docstring of DataFrame.columns property - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 09:32:12 UTC, 0 replies.
- [GitHub] [spark] vicennial opened a new pull request, #42556: [SPARK-44867][CONNECT] Refactor Spark Connect Docs to incorporate Scala setup - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/08/18 09:44:04 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42550: [SPARK-44861][CONNECT] jsonignore SparkListenerConnectOperationStarted.planRequest - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/18 10:10:26 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42450: [SPARK-44773][SQL] Code-gen CodegenFallback expression in WholeStageCodegen if possible - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/18 11:07:55 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42450: [SPARK-44773][SQL] Code-gen CodegenFallback expression in WholeStageCodegen if possible - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/18 11:10:10 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #42557: [WIP][SPARK-44870][SQL] Convert HashAggregate to SortAggregate if all grouping expressions are in child output orderings - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/18 12:18:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42537: [SPARK-44740][CONNECT][FOLLOW] Fix metadata values for Artifacts - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 12:31:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42537: [SPARK-44740][CONNECT][FOLLOW] Fix metadata values for Artifacts - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 12:31:56 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42558: [SPARK-44869][Doc] Add doc for insert by name statement - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/18 12:32:51 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42558: [SPARK-44869][Doc] Add doc for insert by name statement - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/18 12:33:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42425: [SPARK-44729][PYTHON][DOCS] Add canonical links to the PySpark docs page - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 12:37:01 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42552: [SPARK-44289][FOLLOWUP] Cleanup doctest - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 12:39:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42552: [SPARK-44289][FOLLOWUP] Cleanup doctest - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/18 12:39:21 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #42559: [SPARK-44871][SQL] Fix percentile_disc behaviour - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/18 13:12:23 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #42559: [SPARK-44871][SQL] Fix percentile_disc behaviour - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/18 13:18:03 UTC, 4 replies.
- [GitHub] [spark] jchen5 commented on a diff in pull request #42559: [SPARK-44871][SQL] Fix percentile_disc behaviour - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/08/18 13:31:19 UTC, 0 replies.
- [GitHub] [spark] igorghi commented on pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by "igorghi (via GitHub)" <gi...@apache.org> on 2023/08/18 14:02:07 UTC, 1 replies.
- [GitHub] [spark] peter-toth commented on pull request #42559: [SPARK-44871][SQL] Fix percentile_disc behaviour - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/18 14:10:07 UTC, 2 replies.
- [GitHub] [spark] srowen closed pull request #42496: [SPARK-44813][INFRA] The Jira Python misses our assignee when it searches users again - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/18 15:03:00 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42555: [SPARK-44433][3.5X] Terminate foreach batch runner when streaming query terminates - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/18 15:11:15 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #42555: [SPARK-44433][3.5X] Terminate foreach batch runner when streaming query terminates - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/18 15:17:50 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42539: [SPARK-44852][BUILD] Exclude `junit-jupiter-api` from `curator-test` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/18 16:05:32 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on a diff in pull request #42383: [SPARK-44549][SQL] Support window functions in correlated scalar subqueries - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/08/18 16:29:29 UTC, 3 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42560: [SPARK-44872][CONNECT] Server testing infra and ReattachableExecuteSuite - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/18 16:53:29 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42558: [SPARK-44869][Doc] Add doc for insert by name statement - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/18 17:14:50 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42558: [SPARK-44869][Doc] Add doc for insert by name statement - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/18 17:17:27 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42560: [SPARK-44872][CONNECT] Server testing infra and ReattachableExecuteSuite - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/18 17:22:47 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #42559: [SPARK-44871][SQL] Fix percentile_disc behaviour - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/08/18 17:34:13 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42555: [SPARK-44433][3.5X] Terminate foreach batch runner when streaming query terminates - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/18 17:40:08 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #42555: [SPARK-44433][3.5X] Terminate foreach batch runner when streaming query terminates - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/18 17:40:09 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42561: [SPARK-44875][INFRA] Fix spelling for commenters to test SPARK-44813 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/18 17:42:13 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42561: [SPARK-44875][INFRA] Fix spelling for commentator to test SPARK-44813 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/18 17:48:51 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #42454: [SPARK-44776][CONNECT] Add ProducedRowCount to SparkListenerConnectOperationFinished - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/18 17:51:35 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42532: [SPARK-44873] Support view with nested struct in Hive client - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/18 17:59:01 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42560: [SPARK-44872][CONNECT] Server testing infra and ReattachableExecuteSuite - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/08/18 17:59:04 UTC, 1 replies.
- [GitHub] [spark] yaooqinn closed pull request #42496: [SPARK-44813][INFRA] The Jira Python misses our assignee when it searches users again - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/18 18:03:41 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42532: [SPARK-44873] Support alter view with nested columns in Hive client - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/18 18:07:00 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #42532: [SPARK-44873] Support alter view with nested columns in Hive client - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/18 18:07:32 UTC, 0 replies.
- [GitHub] [spark] heyihong opened a new pull request, #42562: [SPARK-44874][SQL][CONNECT] Handle unrecognizable exceptions - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/18 18:23:54 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #42549: [SPARK-44860] Add SESSION_USER function - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/18 18:41:18 UTC, 0 replies.
- [GitHub] [spark] bogao007 opened a new pull request, #42563: [SPARK-44877][CONNECT] Support python protobuf functions for Spark Connect - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/08/18 18:44:31 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on a diff in pull request #42563: [SPARK-44877][CONNECT] Support python protobuf functions for Spark Connect - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/08/18 18:45:38 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #42549: [SPARK-44860] Add SESSION_USER function - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/18 18:45:58 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42425: [SPARK-44729][PYTHON][DOCS] Add canonical links to the PySpark docs page - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/18 18:46:03 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42563: [SPARK-44877][CONNECT] Support python protobuf functions for Spark Connect - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/18 19:20:41 UTC, 2 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42564: [WIP][SPARK-44840][SQL] Make array_insert 1-based for negative indexes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/18 19:24:01 UTC, 0 replies.
- [GitHub] [spark] kylerong-db opened a new pull request, #42565: [SPARK-44873][SPARK-39936][3.3] Support alter view with nested columns in Hive client - posted by "kylerong-db (via GitHub)" <gi...@apache.org> on 2023/08/18 19:56:10 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on pull request #42563: [SPARK-44877][CONNECT] Support python protobuf functions for Spark Connect - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/08/18 19:58:21 UTC, 0 replies.
- [GitHub] [spark] kylerong-db opened a new pull request, #42566: [SPARK-44873][SPARK-39936][3.4] Support alter view with nested columns in Hive client - posted by "kylerong-db (via GitHub)" <gi...@apache.org> on 2023/08/18 20:02:03 UTC, 0 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #42567: [SPARK-44878] Disable strict limit for RocksDB write manager to avoid insertion exception on cache full - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/18 20:09:47 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #42567: [SPARK-44878][SS] Disable strict limit for RocksDB write manager to avoid insertion exception on cache full - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/18 20:10:45 UTC, 3 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42568: [SPARK-44876][PYTHON] Enable and fix test_parity_arrow_python_udf. - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/18 20:33:11 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42568: [SPARK-44876][PYTHON] Enable and fix test_parity_arrow_python_udf. - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/18 20:33:32 UTC, 1 replies.
- [GitHub] [spark] bogao007 commented on a diff in pull request #42521: [SPARK-44435][SS][CONNECT][DRAFT] Tests for foreachBatch and Listener - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/08/18 20:37:41 UTC, 1 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42521: [SPARK-44435][SS][CONNECT][DRAFT] Tests for foreachBatch and Listener - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/18 20:53:22 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42569: [SPARK-44879][PYTHON][DOCS] Refine the docstring of spark.createDataFrame - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/18 21:09:05 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42569: [SPARK-44879][PYTHON][DOCS] Refine the docstring of spark.createDataFrame - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/18 21:09:46 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42568: [SPARK-44876][PYTHON] Enable and fix test_parity_arrow_python_udf. - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/18 21:11:38 UTC, 3 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #42568: [SPARK-44876][PYTHON] Enable and fix test_parity_arrow_python_udf. - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/08/18 21:17:24 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #42568: [SPARK-44876][PYTHON] Enable and fix test_parity_arrow_python_udf. - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/08/18 21:20:00 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41120: [SPARK-21195][CORE] Dynamically register metrics from sources as they are reported - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/19 00:15:33 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41100: [SPARK-43420][SQL] Make DisableUnnecessaryBucketedScan smart with table cache - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/19 00:15:35 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40713: [SPARK-42551][SQL] Support more subexpression elimination cases - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/19 00:15:36 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #42504: [SPARK-44818] Fix race for pending task kill issued before taskThread is initialized - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/19 00:20:39 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #42531: [SPARK-44846][SQL] PushFoldableIntoBranches in complex grouping expressions may cause bi… - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/19 00:22:48 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42566: [SPARK-44873][3.4] Support alter view with nested columns in Hive client - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/19 01:32:18 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #42469: [SPARK-44782][INFRA] Adjust PR template to Generative Tooling Guidance recommendations - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/19 02:13:46 UTC, 0 replies.
- [GitHub] [spark] zml1206 commented on pull request #42531: [SPARK-44846][SQL] Pull out complex grouping expressions after remove redundant aggregates - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/08/19 02:16:14 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #42531: [SPARK-44846][SQL] Pull out complex grouping expressions after remove redundant aggregates - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/19 02:30:33 UTC, 0 replies.
- [GitHub] [spark] zhouyifan279 commented on a diff in pull request #41105: [SPARK-43403][UI] Ensure old SparkUI in HistoryServer has been detached before loading new one - posted by "zhouyifan279 (via GitHub)" <gi...@apache.org> on 2023/08/19 02:35:24 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42545: SPARK-44866: Spark wrongly map the BOOLEAN Type to BIT(1) in Snowflake - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/19 02:39:29 UTC, 1 replies.
- [GitHub] [spark] wankunde commented on pull request #42557: [SPARK-44870][SQL] Convert HashAggregate to SortAggregate if all grouping expressions are in child output orderings - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/19 03:21:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42566: [SPARK-44873][3.4] Support alter view with nested columns in Hive client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/19 04:40:01 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42565: [SPARK-44873][SPARK-39936][3.3] Support alter view with nested columns in Hive client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/19 04:41:05 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #41100: [SPARK-43420][SQL] Make DisableUnnecessaryBucketedScan smart with table cache - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/19 04:42:10 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42529: [SPARK-44845][YARN][DEPLOY]Fixed file system uri comparison function - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/19 04:45:32 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42568: [SPARK-44876][PYTHON] Enable and fix test_parity_arrow_python_udf. - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/19 04:54:08 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42529: [SPARK-44845][YARN][DEPLOY]Fixed file system uri comparison function - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/19 05:49:36 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #41100: [SPARK-43420][SQL] Make DisableUnnecessaryBucketedScan smart with table cache - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/08/19 08:19:13 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42536: [SPARK-39024][CORE][YARN] Enable graceful decommissioning of executors on "DECOMMISSIONING" node even in case of external shuffle service enabled - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/19 08:31:08 UTC, 0 replies.
- [GitHub] [spark] Kimahriman opened a new pull request, #42570: [SPARK-22876][YARN] Respect YARN AM failure validity interval - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/08/19 15:03:59 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42571: [SPARK-44880][UI] Remove unnecessary right curly brace at the end of the thread locks info - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/19 17:06:49 UTC, 0 replies.
- [GitHub] [spark] heyihong closed pull request #42562: [SPARK-44874][SQL][CONNECT] Handle unrecognizable exceptions - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/19 21:00:20 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41120: [SPARK-21195][CORE] Dynamically register metrics from sources as they are reported - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/20 00:17:42 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41113: [SPARK-43400][SQL] Add Primary Key syntax support - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/20 00:17:43 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40467: [SPARK-42584][CONNECT] Improve output of `Column.explain` - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/20 00:17:44 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #42571: [SPARK-44880][UI] Remove unnecessary right curly brace at the end of the thread locks info - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/20 01:40:12 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42571: [SPARK-44880][UI] Remove unnecessary right curly brace at the end of the thread locks info - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/20 01:41:51 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42549: [SPARK-44860] Add SESSION_USER function - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/20 03:06:14 UTC, 0 replies.
- [GitHub] [spark] hgs19921112 opened a new pull request, #42572: [SPARK-44881][CORE]Executor stucked on retrying to fetch shuffle data when `java.lang.OutOfMemoryError. unable to create native thread` exception occurred. - posted by "hgs19921112 (via GitHub)" <gi...@apache.org> on 2023/08/20 03:57:59 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42572: [SPARK-44881[COMMON]Executor stucked on retrying to fetch shuffle data when `java.lang.OutOfMemoryError. unable to create native thread` exception occurred. - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/20 05:55:12 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42570: [SPARK-22876][YARN] Respect YARN AM failure validity interval - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/20 05:56:44 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #41808: [SPARK-44162][CORE] Support G1GC in spark metrics - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/20 06:07:43 UTC, 1 replies.
- [GitHub] [spark] hgs19921112 commented on pull request #42572: [SPARK-44881[COMMON]Executor stucked on retrying to fetch shuffle data when `java.lang.OutOfMemoryError. unable to create native thread` exception occurred. - posted by "hgs19921112 (via GitHub)" <gi...@apache.org> on 2023/08/20 06:44:50 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42573: [SPARK-44882][PYTHON][CONNECT] Remove function uuid/random/chr from PySpark - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/20 09:30:05 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 opened a new pull request, #42574: [SPARK-43149] should create metadata first - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/08/20 12:33:12 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #42488: [SPARK-44804][SQL] SortMergeJoin should respect the streamed side ordering - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/20 13:53:45 UTC, 2 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42575: [WIP][SPARK-44863][UI] Add a button to download thread dump as a txt in Spark UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/20 14:01:10 UTC, 0 replies.
- [GitHub] [spark] tindzk opened a new pull request, #42576: [SPARK-44885] NullPointerException is thrown when column with ROWID type contains NULL values - posted by "tindzk (via GitHub)" <gi...@apache.org> on 2023/08/20 19:52:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42573: [SPARK-44882][PYTHON][CONNECT] Remove function uuid/random/chr from PySpark - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/20 23:51:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42573: [SPARK-44882][PYTHON][CONNECT] Remove function uuid/random/chr from PySpark - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 00:16:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42573: [SPARK-44882][PYTHON][CONNECT] Remove function uuid/random/chr from PySpark - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 00:16:27 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41113: [SPARK-43400][SQL] Add Primary Key syntax support - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/21 00:16:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40467: [SPARK-42584][CONNECT] Improve output of `Column.explain` - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/21 00:16:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42568: [SPARK-44876][PYTHON] Fix Arrow-optimized Python UDF on Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 00:20:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42568: [SPARK-44876][PYTHON] Fix Arrow-optimized Python UDF on Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 00:20:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42569: [SPARK-44879][PYTHON][DOCS] Refine the docstring of spark.createDataFrame - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 00:24:31 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42569: [SPARK-44879][PYTHON][DOCS] Refine the docstring of spark.createDataFrame - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 00:24:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42513: [SPARK-44827][PYTHON][TESTS] Fix test when ansi mode enabled - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 00:26:26 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42575: [WIP][SPARK-44863][UI] Add a button to download thread dump as a txt in Spark UI - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 00:27:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42255: [SPARK-40178][SQL][COONECT] Support coalesce hints with ease for PySpark and R - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 00:28:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42255: [SPARK-40178][SQL][COONECT] Support coalesce hints with ease for PySpark and R - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 00:28:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42547: [SPARK-44858][PYTHON][DOCS] Refine dostring of DataFrame.isEmpty - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 00:31:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42547: [SPARK-44858][PYTHON][DOCS] Refine dostring of DataFrame.isEmpty - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 00:32:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42526: [SPARK-44842][SPARK-43812][PS] Support stat functions for pandas 2.0.0 and enabling tests. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 00:36:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42526: [SPARK-44842][SPARK-43812][PS] Support stat functions for pandas 2.0.0 and enabling tests. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 00:37:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 00:40:16 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40085: [SPARK-42492][SQL] Add new function filter_value - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 01:04:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42563: [SPARK-44877][CONNECT][PYTHON] Support python protobuf functions for Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 01:12:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42563: [SPARK-44877][CONNECT][PYTHON] Support python protobuf functions for Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 01:13:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42556: [SPARK-44867][CONNECT][DOCS] Refactor Spark Connect Docs to incorporate Scala setup - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 01:15:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41782: [SPARK-44239][SQL] Free memory allocated by large vectors when vectors are reset - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 01:22:44 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 01:24:55 UTC, 11 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42521: [SPARK-44435][SS][CONNECT][DRAFT] Tests for foreachBatch and Listener - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 01:31:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42548: [WIP][SPARK-44750][PYTHON][CONNECT] Apply configuration to sparksession during creation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 01:34:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42548: [WIP][SPARK-44750][PYTHON][CONNECT] Apply configuration to sparksession during creation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 01:35:35 UTC, 3 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/21 01:36:20 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42575: [WIP][SPARK-44863][UI] Add a button to download thread dump as a txt in Spark UI - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 01:53:32 UTC, 0 replies.
- [GitHub] [spark] zekai-li commented on a diff in pull request #42529: [SPARK-44845][YARN][DEPLOY]Fixed file system uri comparison function - posted by "zekai-li (via GitHub)" <gi...@apache.org> on 2023/08/21 02:03:17 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42566: [SPARK-44873][3.4] Support alter view with nested columns in Hive client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 02:16:10 UTC, 0 replies.
- [GitHub] [spark] dzypersonal commented on pull request #36162: [SPARK-32170][CORE] Improve the speculation through the stage task metrics. - posted by "dzypersonal (via GitHub)" <gi...@apache.org> on 2023/08/21 02:26:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42556: [SPARK-44867][CONNECT][DOCS] Refactor Spark Connect Docs to incorporate Scala setup - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 02:32:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42554: Make StreamingRelationV2 support metadata column - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 02:33:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42553: [SPARK-44864] Align streaming statistics link format with other page links - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 02:33:40 UTC, 0 replies.
- [GitHub] [spark] imback82 opened a new pull request, #42577: [SPARK-XXXXX][SQL] Introduce CLUSTER BY clause for CREATE/REPLACE TABLE - posted by "imback82 (via GitHub)" <gi...@apache.org> on 2023/08/21 02:37:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42551: [SPARK-43563][SPARK-43459][SPARK-43451][SPARK-43506] Remove `squeeze` from `read_csv` & enabling more tests. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 02:41:45 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #42450: [SPARK-44773][SQL] Code-gen CodegenFallback expression in WholeStageCodegen if possible - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/08/21 02:47:00 UTC, 5 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42550: [SPARK-44861][CONNECT] jsonignore SparkListenerConnectOperationStarted.planRequest - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 02:47:05 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #42578: [SPARK-44841][FOLLOWUP] Add migration guide for the behavior change - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/21 02:55:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42541: Spark 44854 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 02:55:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42579: [SPARK-44887][DOCS] Fix wildcard import `from pyspark.sql.functions import *` in `Quick Start` Examples - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 02:58:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42541: Spark 44854 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 02:59:51 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42579: [SPARK-44887][DOCS] Fix wildcard import `from pyspark.sql.functions import *` in `Quick Start` Examples - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 03:01:41 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42534: [SPARK-44868][SQL] Convert datetime to string by `to_char`/`to_varchar` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 03:03:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42534: [SPARK-44868][SQL] Convert datetime to string by `to_char`/`to_varchar` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 03:03:29 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42551: [SPARK-43563][SPARK-43459][SPARK-43451][SPARK-43506] Remove `squeeze` from `read_csv` & enabling more tests. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/21 03:04:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42528: [SPARK-44844][BUILD] Exclude `python/build/*` path for local `lint-python` testing - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 03:05:34 UTC, 0 replies.
- [GitHub] [spark] hdaikoku commented on pull request #42572: [SPARK-44881[COMMON]Executor stucked on retrying to fetch shuffle data when `java.lang.OutOfMemoryError. unable to create native thread` exception occurred. - posted by "hdaikoku (via GitHub)" <gi...@apache.org> on 2023/08/21 03:05:41 UTC, 0 replies.
- [GitHub] [spark] goodwanghan commented on pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by "goodwanghan (via GitHub)" <gi...@apache.org> on 2023/08/21 03:09:31 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42498: [SPARK-44814][CONNECT][PYTHON]Test to protect from faulty protobuf versions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 03:09:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42475: [SPARK-44793][SQL] Fixing pipelineTime metric for WholeStageCodegen - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 03:11:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42534: [SPARK-44868][SQL] Convert datetime to string by `to_char`/`to_varchar` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/21 03:11:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42471: [SPARK-44785][SQL][CONNECT] Convert common alreadyExistsExceptions and noSuchExceptions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 03:13:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42471: [SPARK-44785][SQL][CONNECT] Convert common alreadyExistsExceptions and noSuchExceptions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 03:14:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42467: [SPARK-44780][DOC] SQL temporary variables - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 03:14:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42455: [DRAFT] Fix Spark Connect Behavior for Default Session - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 03:15:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42455: [DRAFT] Fix Spark Connect Behavior for Default Session - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 03:15:34 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42553: [SPARK-44864] Align streaming statistics link format with other page links - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/21 03:20:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40390: [SPARK-42768][SQL] Enable cached plan apply AQE by default - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/21 03:22:42 UTC, 2 replies.
- [GitHub] [spark] cloud-fan closed pull request #41100: [SPARK-43420][SQL] Make DisableUnnecessaryBucketedScan smart with table cache - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/21 03:25:14 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #42575: [WIP][SPARK-44863][UI] Add a button to download thread dump as a txt in Spark UI - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/21 03:31:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41335: [SPARK-43205][DOCS][SQL][FOLLOWUP] IDENTIFIER clause docs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/21 03:37:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41335: [SPARK-43205][DOCS][SQL][FOLLOWUP] IDENTIFIER clause docs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/21 03:44:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42377: [SPARK-44622][SQL][CONNECT] Implement error enrichment and setting server-side stacktrace - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 03:46:24 UTC, 1 replies.
- [GitHub] [spark] itholic closed pull request #42528: [SPARK-44844][BUILD] Exclude `python/build/*` path for local `lint-python` testing - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/21 03:55:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42462: [SPARK-44751][SQL] XML FileFormat Interface implementation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 04:10:25 UTC, 17 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42579: [SPARK-44887][DOCS] Fix wildcard import `from pyspark.sql.functions import *` in `Quick Start` Examples - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 04:24:48 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42579: [SPARK-44887][DOCS] Fix wildcard import `from pyspark.sql.functions import *` in `Quick Start` Examples - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 04:25:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42580: [SPARK-44888][SQL][TESTS] Update the golden files of `SQLQueryTestSuite` for Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/21 04:45:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42580: [SPARK-44888][SQL][TESTS] Re-generate golden files of `SQLQueryTestSuite` for Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/21 05:41:29 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42481: [SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/21 05:42:39 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42575: [WIP][SPARK-44863][UI] Add a button to download thread dump as a txt in Spark UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/21 05:56:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42578: [SPARK-44841][FOLLOWUP] Add migration guide for the behavior change - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 06:01:25 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42578: [SPARK-44841][FOLLOWUP] Add migration guide for the behavior change - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/21 06:02:34 UTC, 0 replies.
- [GitHub] [spark] junyuc25 opened a new pull request, #42581: [WIP] AWS sdk upgrade - posted by "junyuc25 (via GitHub)" <gi...@apache.org> on 2023/08/21 06:08:11 UTC, 0 replies.
- [GitHub] [spark] zeruibao commented on pull request #42554: [SPARK-44865] Make StreamingRelationV2 support metadata column - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/21 06:15:20 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40390: [SPARK-42768][SQL] Enable cached plan apply AQE by default - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/21 06:21:58 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42581: [WIP] AWS sdk upgrade - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/21 06:26:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42529: [SPARK-44845][YARN][DEPLOY]Fixed file system uri comparison function - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/21 06:27:33 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #42581: [WIP] AWS sdk upgrade - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/21 06:53:00 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42559: [SPARK-44871][SQL] Fix percentile_disc behaviour - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/21 06:55:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42582: [SPARK-44889][PYTHON][CONNECT] Fix docstring of `monotonically_increasing_id` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 06:56:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42582: [SPARK-44889][PYTHON][CONNECT] Fix docstring of `monotonically_increasing_id` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 07:03:02 UTC, 1 replies.
- [GitHub] [spark] hgs19921112 closed pull request #42572: [SPARK-44881[COMMON]Executor stucked on retrying to fetch shuffle data when `java.lang.OutOfMemoryError. unable to create native thread` exception occurred. - posted by "hgs19921112 (via GitHub)" <gi...@apache.org> on 2023/08/21 07:47:56 UTC, 1 replies.
- [GitHub] [spark] hgs19921112 opened a new pull request, #42572: [SPARK-44881[COMMON]Executor stucked on retrying to fetch shuffle data when `java.lang.OutOfMemoryError. unable to create native thread` exception occurred. - posted by "hgs19921112 (via GitHub)" <gi...@apache.org> on 2023/08/21 07:54:24 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #42493: [SPARK-44811][BUILD] Upgrade Guava to 32+ - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/08/21 08:19:41 UTC, 0 replies.
- [GitHub] [spark] chenyu-opensource opened a new pull request, #42583: [SPARK-44890][POM]Update miswritten remarks - posted by "chenyu-opensource (via GitHub)" <gi...@apache.org> on 2023/08/21 08:28:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42581: [WIP] AWS sdk upgrade - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/21 08:39:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42581: [WIP] AWS sdk upgrade - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/21 08:56:17 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/08/21 09:01:35 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42575: [SPARK-44863][UI] Add a button to download thread dump as a txt in Spark UI - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/21 09:38:55 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42564: [SPARK-44840][SQL] Make array_insert 1-based for negative indexes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/21 10:02:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42580: [SPARK-44888][SQL][TESTS] Regenerate golden files of `SQLQueryTestSuite` for Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/21 10:02:53 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42580: [SPARK-44888][SQL][TESTS] Regenerate golden files of `SQLQueryTestSuite` for Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/21 10:02:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42582: [SPARK-44889][PYTHON][CONNECT] Fix docstring of `monotonically_increasing_id` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 10:21:52 UTC, 0 replies.
- [GitHub] [spark-docker] wangyum opened a new pull request, #54: Add Apache Spark 3.3.3 Dockerfiles - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/21 10:40:46 UTC, 0 replies.
- [GitHub] [spark-docker] wangyum commented on pull request #54: Add Apache Spark 3.3.3 Dockerfiles - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/21 10:41:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42584: [SPARK-44891][PYTHON][CONNECT] Enable Doctests of `rand`, `randn` and `log` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 10:44:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42584: [SPARK-44891][PYTHON][CONNECT] Enable Doctests of `rand`, `randn` and `log` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 10:48:25 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42575: [SPARK-44863][UI] Add a button to download thread dump as a txt in Spark UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/21 11:12:11 UTC, 1 replies.
- [GitHub] [spark] leesf commented on pull request #42553: [SPARK-44864] Align streaming statistics link format with other page links - posted by "leesf (via GitHub)" <gi...@apache.org> on 2023/08/21 11:21:39 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on a diff in pull request #54: [SPARK-44892] Add official image Dockerfile for Spark 3.3.3 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/08/21 11:31:42 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42585: [MINOR][SQL] Merge RefreshTable case on ResolveSessionCatalog - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/21 11:33:22 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #42564: [SPARK-44840][SQL] Make `array_insert()` 1-based for negative indexes - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/21 11:43:13 UTC, 4 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42586: [SPARK-43288][SQL] Support Create Table Like on DataSourceV2 - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/21 11:43:32 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X closed pull request #40979: [SPARK-43308][SQL] Improve scalar subquery logic plan when result are literal - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/21 11:55:16 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 commented on pull request #41154: [SPARK-43327][CORE][3.3] Trigger `committer.setupJob` before plan execute in `FileFormatWriter#write` - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/08/21 12:33:00 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42564: [SPARK-44840][SQL] Make `array_insert()` 1-based for negative indexes - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/21 12:48:42 UTC, 1 replies.
- [GitHub] [spark] gjxdxh commented on a diff in pull request #42454: [SPARK-44776][CONNECT] Add ProducedRowCount to SparkListenerConnectOperationFinished - posted by "gjxdxh (via GitHub)" <gi...@apache.org> on 2023/08/21 13:23:24 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #42570: [SPARK-22876][YARN] Respect YARN AM failure validity interval - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2023/08/21 13:26:27 UTC, 0 replies.
- [GitHub] [spark] hdaly0 commented on a diff in pull request #42541: [SPARK-44854][PYTHON] Python timedelta to DayTimeIntervalType edge case bug - posted by "hdaly0 (via GitHub)" <gi...@apache.org> on 2023/08/21 13:35:40 UTC, 0 replies.
- [GitHub] [spark] Kimahriman commented on pull request #42570: [SPARK-22876][YARN] Respect YARN AM failure validity interval - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/08/21 13:51:31 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #42564: [SPARK-44840][SQL] Make `array_insert()` 1-based for negative indexes - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/21 14:15:07 UTC, 1 replies.
- [GitHub] [spark] tindzk commented on pull request #42576: [SPARK-44885] NullPointerException is thrown when column with ROWID type contains NULL values - posted by "tindzk (via GitHub)" <gi...@apache.org> on 2023/08/21 15:04:43 UTC, 0 replies.
- [GitHub] [spark] siying commented on pull request #42567: [SPARK-44878][SS] Disable strict limit for RocksDB write manager to avoid insertion exception on cache full - posted by "siying (via GitHub)" <gi...@apache.org> on 2023/08/21 16:50:43 UTC, 1 replies.
- [GitHub] [spark] rednaxelafx commented on pull request #42550: [SPARK-44861][CONNECT] jsonignore SparkListenerConnectOperationStarted.planRequest - posted by "rednaxelafx (via GitHub)" <gi...@apache.org> on 2023/08/21 17:39:14 UTC, 0 replies.
- [GitHub] [spark] agubichev commented on a diff in pull request #42383: [SPARK-44549][SQL] Support window functions in correlated scalar subqueries - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2023/08/21 18:23:20 UTC, 7 replies.
- [GitHub] [spark] sandip-db commented on pull request #42462: [SPARK-44751][SQL] XML FileFormat Interface implementation - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2023/08/21 18:26:59 UTC, 0 replies.
- [GitHub] [spark] ChenMichael opened a new pull request, #42587: [SPARK-44897] - Propagating local properties to subquery broadcast exec - posted by "ChenMichael (via GitHub)" <gi...@apache.org> on 2023/08/21 18:28:07 UTC, 0 replies.
- [GitHub] [spark] agubichev commented on pull request #42383: [SPARK-44549][SQL] Support window functions in correlated scalar subqueries - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2023/08/21 18:34:20 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42588: [SPARK-44898][BUILD] Upgrade `gcs-connector` to 2.2.17 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/21 18:48:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42588: [SPARK-44898][BUILD] Upgrade `gcs-connector` to 2.2.17 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/21 19:19:34 UTC, 4 replies.
- [GitHub] [spark] vitaliili-db closed pull request #42549: [SPARK-44860][SQL] Add SESSION_USER function - posted by "vitaliili-db (via GitHub)" <gi...@apache.org> on 2023/08/21 19:20:50 UTC, 0 replies.
- [GitHub] [spark] vitaliili-db opened a new pull request, #42549: [SPARK-44860][SQL] Add SESSION_USER function - posted by "vitaliili-db (via GitHub)" <gi...@apache.org> on 2023/08/21 19:29:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42481: [SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/21 19:46:59 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42589: Revert "[SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI" - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/21 20:09:22 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42590: [SPARK-44879][PYTHON][DOCS][3.5] Refine the docstring of spark.createDataFrame - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/21 20:22:05 UTC, 0 replies.
- [GitHub] [spark] JoshRosen closed pull request #42504: [SPARK-44818] Fix race for pending task kill issued before taskThread is initialized - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/08/21 20:26:07 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on pull request #42504: [SPARK-44818] Fix race for pending task kill issued before taskThread is initialized - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/08/21 20:26:29 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #42503: [SPARK-43380][SQL] Fix Avro data type conversion issues without causing performance regression - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/21 20:35:33 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42579: [SPARK-44887][DOCS] Fix wildcard import `from pyspark.sql.functions import *` in `Quick Start` Examples - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/21 20:42:04 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42591: [SPARK-44784][CONNECT] Make SBT testing hermetic. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/21 20:50:56 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42592: [SPARK-44899][PYTHON][DOCS] Refine the docstring of DataFrame.collect - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/21 20:51:30 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] opened a new pull request, #42593: Bump org.apache.ivy:ivy from 2.5.1 to 2.5.2 - posted by "dependabot[bot] (via GitHub)" <gi...@apache.org> on 2023/08/21 20:53:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42481: [SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/21 20:53:35 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42591: [SPARK-44784][CONNECT] Make SBT testing hermetic. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/21 20:54:44 UTC, 10 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42589: Revert "[SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI" - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/21 21:16:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42589: Revert "[SPARK-44801][SQL][UI] Capture analyzing failed queries in Listener and UI" - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/21 21:16:57 UTC, 0 replies.
- [GitHub] [spark] michaelzhan-db commented on pull request #42548: [WIP][SPARK-44750][PYTHON][CONNECT] Apply configuration to sparksession during creation - posted by "michaelzhan-db (via GitHub)" <gi...@apache.org> on 2023/08/21 21:32:49 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on pull request #42493: [SPARK-44811][BUILD] Upgrade Guava to 32+ - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/08/21 21:50:17 UTC, 0 replies.
- [GitHub] [spark] jdesjean commented on a diff in pull request #42550: [SPARK-44861][CONNECT] jsonignore SparkListenerConnectOperationStarted.planRequest - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/08/21 21:56:53 UTC, 4 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42567: [SPARK-44878][SS] Disable strict limit for RocksDB write manager to avoid insertion exception on cache full - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/21 22:05:13 UTC, 3 replies.
- [GitHub] [spark] allanf-db commented on a diff in pull request #42556: [SPARK-44867][CONNECT][DOCS] Refactor Spark Connect Docs to incorporate Scala setup - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/08/21 22:06:03 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42588: [SPARK-44898][BUILD] Upgrade `gcs-connector` to 2.2.17 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/21 22:27:37 UTC, 0 replies.
- [GitHub] [spark] vitaliili-db commented on a diff in pull request #42549: [SPARK-44860][SQL] Add SESSION_USER function - posted by "vitaliili-db (via GitHub)" <gi...@apache.org> on 2023/08/21 22:32:35 UTC, 1 replies.
- [GitHub] [spark] vitaliili-db commented on pull request #42549: [SPARK-44860][SQL] Add SESSION_USER function - posted by "vitaliili-db (via GitHub)" <gi...@apache.org> on 2023/08/21 22:33:07 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on pull request #42420: [SPARK-44748][SQL] Query execution for the PARTITION BY clause in UDTF TABLE arguments - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/21 22:38:47 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42496: [SPARK-44813][INFRA] The Jira Python misses our assignee when it searches users again - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/21 22:46:01 UTC, 3 replies.
- [GitHub] [spark] ueshin closed pull request #42420: [SPARK-44748][SQL] Query execution for the PARTITION BY clause in UDTF TABLE arguments - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/21 22:47:13 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42594: [SPARK-44839][SS][CONNECT] Better Error Logging when user tries to serialize spark session - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/21 23:00:26 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42594: [SPARK-44839][SS][CONNECT] Better Error Logging when user tries to serialize spark session - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/21 23:00:42 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42542: [SPARK-44214][CORE] Add driver log live UI - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/21 23:38:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42590: [SPARK-44879][PYTHON][DOCS][3.5] Refine the docstring of spark.createDataFrame - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 23:47:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42590: [SPARK-44879][PYTHON][DOCS][3.5] Refine the docstring of spark.createDataFrame - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 23:47:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42584: [SPARK-44891][PYTHON][CONNECT] Enable Doctests of `rand`, `randn` and `log` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 23:48:09 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42584: [SPARK-44891][PYTHON][CONNECT] Enable Doctests of `rand`, `randn` and `log` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/21 23:48:41 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #42595: [WIP][SPARK-44901][SQL] Add API in Python UDTF 'analyze' method to return partitioning/ordering expressions - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/21 23:52:11 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42542: [SPARK-44214][CORE] Support Spark Driver Live Log UI - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/21 23:56:40 UTC, 4 replies.
- [GitHub] [spark] linhongliu-db commented on pull request #42524: [SPARK-44837][SQL] Improve ALTER TABLE ALTER PARTITION column error message - posted by "linhongliu-db (via GitHub)" <gi...@apache.org> on 2023/08/22 00:00:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42593: Bump org.apache.ivy:ivy from 2.5.1 to 2.5.2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 00:13:47 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] commented on pull request #42593: Bump org.apache.ivy:ivy from 2.5.1 to 2.5.2 - posted by "dependabot[bot] (via GitHub)" <gi...@apache.org> on 2023/08/22 00:13:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42551: [SPARK-43563][SPARK-43459][SPARK-43451][SPARK-43506] Remove `squeeze` from `read_csv` & enabling more tests. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/22 00:20:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42551: [SPARK-43563][SPARK-43459][SPARK-43451][SPARK-43506] Remove `squeeze` from `read_csv` & enabling more tests. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/22 00:21:23 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42496: [SPARK-44813][INFRA] The Jira Python misses our assignee when it searches users again - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/22 00:37:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42542: [SPARK-44214][CORE] Support Spark Driver Live Log UI - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 00:50:44 UTC, 5 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42524: [SPARK-44837][SQL] Improve ALTER TABLE ALTER PARTITION column error message - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/22 00:54:14 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42542: [SPARK-44214][CORE] Support Spark Driver Live Log UI - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 00:58:17 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42454: [SPARK-44776][CONNECT] Add ProducedRowCount to SparkListenerConnectOperationFinished - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 01:06:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42454: [SPARK-44776][CONNECT] Add ProducedRowCount to SparkListenerConnectOperationFinished - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 01:07:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42576: [SPARK-44885] NullPointerException is thrown when column with ROWID type contains NULL values - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 01:19:41 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #40390: [SPARK-42768][SQL] Enable cached plan apply AQE by default - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/08/22 01:21:56 UTC, 1 replies.
- [GitHub] [spark] chenyu-opensource commented on pull request #42583: [SPARK-44890][BUILD]Update miswritten remarks - posted by "chenyu-opensource (via GitHub)" <gi...@apache.org> on 2023/08/22 01:25:47 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42542: [SPARK-44214][CORE] Support Spark Driver Live Log UI - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 02:26:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42462: [SPARK-44751][SQL] XML FileFormat Interface implementation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 02:40:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42583: [SPARK-44890][BUILD]Update miswritten remarks - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/22 02:47:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42596: [SPARK-44831][PYTHON][DOCS] Refine docstring of `approx_count_distinct` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/22 02:52:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42541: [SPARK-44854][PYTHON] Python timedelta to DayTimeIntervalType edge case bug - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 02:54:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42541: [SPARK-44854][PYTHON] Python timedelta to DayTimeIntervalType edge case bug - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 02:54:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42541: [SPARK-44854][PYTHON] Python timedelta to DayTimeIntervalType edge case bug - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/22 02:55:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42541: [SPARK-44854][PYTHON] Python timedelta to DayTimeIntervalType edge case bug - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 02:56:20 UTC, 0 replies.
- [GitHub] [spark] chenyu-opensource closed pull request #42583: [SPARK-44890][BUILD]Update miswritten remarks - posted by "chenyu-opensource (via GitHub)" <gi...@apache.org> on 2023/08/22 03:03:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42596: [SPARK-44903][PYTHON][DOCS] Refine docstring of `approx_count_distinct` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 03:06:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41154: [SPARK-43327][CORE][3.3] Trigger `committer.setupJob` before plan execute in `FileFormatWriter#write` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/22 03:07:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41154: [SPARK-43327][CORE][3.3] Trigger `committer.setupJob` before plan execute in `FileFormatWriter#write` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/22 03:08:42 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42596: [SPARK-44903][PYTHON][DOCS] Refine docstring of `approx_count_distinct` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/22 03:09:21 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42597: [SPARK-44904][PYTHON][DOCS] Correct the ‘versionchanged’ of `sql.functions.approx_percentile` to 3.5.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/22 03:27:49 UTC, 0 replies.
- [GitHub] [spark-docker] wangyum commented on a diff in pull request #54: [SPARK-44892] Add official image Dockerfile for Spark 3.3.3 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/22 03:53:48 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42597: [SPARK-44904][PYTHON][DOCS] Correct the `versionadded` of `sql.functions.approx_percentile` to 3.5.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 04:27:11 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42377: [SPARK-44622][SQL][CONNECT] Implement error enrichment and setting server-side stacktrace - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 04:34:27 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42597: [SPARK-44904][PYTHON][DOCS] Correct the `versionadded` of `sql.functions.approx_percentile` to 3.5.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/22 04:58:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42596: [SPARK-44903][PYTHON][DOCS] Refine docstring of `approx_count_distinct` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/22 05:03:09 UTC, 10 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42596: [SPARK-44903][PYTHON][DOCS] Refine docstring of `approx_count_distinct` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 05:18:00 UTC, 7 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42585: [MINOR][SQL] Merge RefreshTable case on ResolveSessionCatalog - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/22 05:54:25 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42521: [SPARK-44435][SS][CONNECT] Tests for foreachBatch and Listener - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 05:56:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42594: [SPARK-44839][SS][CONNECT] Better Error Logging when user tries to serialize spark session - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 06:02:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42597: [SPARK-44904][PYTHON][DOCS] Correct the `versionadded` of `sql.functions.approx_percentile` to 3.5.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 06:09:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42597: [SPARK-44904][PYTHON][DOCS] Correct the `versionadded` of `sql.functions.approx_percentile` to 3.5.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 06:09:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42597: [SPARK-44904][PYTHON][DOCS] Correct the `versionadded` of `sql.functions.approx_percentile` to 3.5.0 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/22 06:10:01 UTC, 0 replies.
- [GitHub] [spark] chenyu-opensource opened a new pull request, #42598: [SPARK-44890][BUILD]Update miswritten remarks - posted by "chenyu-opensource (via GitHub)" <gi...@apache.org> on 2023/08/22 06:11:26 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #42599: [DO-NOT-MERGE] Remove Guava from shared classes from IsolatedClientLoader - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/22 06:16:00 UTC, 0 replies.
- [GitHub] [spark] zwangsheng opened a new pull request, #42600: [SPARK-44906][K8S] Move Utils. SubstituteAppNExecIds logic into KubernetesConf.annotations - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/08/22 06:16:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42597: [SPARK-44904][PYTHON][DOCS] Correct the `versionadded` of `sql.functions.approx_percentile` to 3.5.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/22 06:18:02 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42601: [SPARK-44905][SQL] stateful lastRegex causes NullPointerException on eval for regexp_replace - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 06:19:29 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42521: [SPARK-44435][SS][CONNECT] Tests for foreachBatch and Listener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/22 06:40:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42602: [MINOR][PYTHON][DOCS] Remove duplicated versionchanged per versionadded - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 06:40:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42602: [MINOR][PYTHON][DOCS] Remove duplicated versionchanged per versionadded - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 06:40:47 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42602: [MINOR][PYTHON][DOCS] Remove duplicated versionchanged per versionadded - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 06:41:37 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42602: [MINOR][PYTHON][DOCS] Remove duplicated versionchanged per versionadded - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/22 06:46:13 UTC, 0 replies.
- [GitHub] [spark] chenyu-opensource commented on pull request #42598: [SPARK-44890][BUILD]Update miswritten remarks - posted by "chenyu-opensource (via GitHub)" <gi...@apache.org> on 2023/08/22 06:46:40 UTC, 2 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42586: [SPARK-43288][SQL] Support Create Table Like on DataSourceV2 - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/22 07:07:12 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42602: [MINOR][PYTHON][DOCS] Remove duplicated versionchanged per versionadded - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 07:09:50 UTC, 1 replies.
- [GitHub] [spark-docker] wangyum closed pull request #54: [SPARK-44892] Add official image Dockerfile for Spark 3.3.3 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/22 07:13:42 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42601: [SPARK-44905][SQL] Stateful lastRegex causes NullPointerException on eval for regexp_replace - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 07:14:27 UTC, 1 replies.
- [GitHub] [spark-docker] wangyum commented on pull request #54: [SPARK-44892] Add official image Dockerfile for Spark 3.3.3 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/22 07:14:32 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42603: [SPARK-44907][PYTHON][CONNECT] `DataFrame.join` should throw IllegalArgumentException for invalid join types - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/22 07:22:09 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42603: [SPARK-44907][PYTHON][CONNECT] `DataFrame.join` should throw IllegalArgumentException for invalid join types - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/22 07:22:37 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42496: [SPARK-44813][INFRA] The Jira Python misses our assignee when it searches users again - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 07:33:11 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42604: [SPARK-44813][INFRA][FOLLOWUP] Make the jira library optional - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 07:37:42 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42600: [SPARK-44906][K8S] Move Utils. SubstituteAppNExecIds logic into KubernetesConf.annotations - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 07:42:12 UTC, 0 replies.
- [GitHub] [spark] zwangsheng commented on a diff in pull request #42600: [SPARK-44906][K8S] Move Utils. SubstituteAppNExecIds logic into KubernetesConf.annotations - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/08/22 07:50:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40390: [SPARK-42768][SQL] Enable cached plan apply AQE by default - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/22 08:01:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40390: [SPARK-42768][SQL] Enable cached plan apply AQE by default - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/22 08:02:54 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42600: [SPARK-44906][K8S] Move Utils. SubstituteAppNExecIds logic into KubernetesConf.annotations - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 08:05:47 UTC, 1 replies.
- [GitHub] [spark] zwangsheng commented on pull request #42600: [SPARK-44906][K8S] Move Utils. SubstituteAppNExecIds logic into KubernetesConf.annotations - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/08/22 08:09:51 UTC, 0 replies.
- [GitHub] [spark] zhouyifan279 commented on a diff in pull request #41105: [WIP][SPARK-43403][UI] Ensure old SparkUI in HistoryServer has been detached before loading new one - posted by "zhouyifan279 (via GitHub)" <gi...@apache.org> on 2023/08/22 08:10:13 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42594: [SPARK-44839][SS][CONNECT] Better Error Logging when user tries to serialize spark session - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/22 08:25:59 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42591: [SPARK-44784][CONNECT] Make SBT testing hermetic. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/22 08:32:05 UTC, 11 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #42605: [SPARK-44908][ML][CONNECT] Fix cross validator foldCol param functionality - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/08/22 08:47:29 UTC, 0 replies.
- [GitHub] [spark] tindzk commented on pull request #42576: [SPARK-44885][SQL] NullPointerException is thrown when column with ROWID type contains NULL values - posted by "tindzk (via GitHub)" <gi...@apache.org> on 2023/08/22 09:33:48 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #42606: [SPARK-44909][ML] Skip starting torch distributor log streaming server when it is not available - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/08/22 09:53:55 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42607: [SPARK-43780][SQL][FOLLOWUP] Fix the config doc `spark.sql.optimizer.decorrelateJoinPredicate.enabled` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/22 10:33:05 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42585: [MINOR][SQL] Merge RefreshTable case on ResolveSessionCatalog - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 10:33:29 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42585: [MINOR][SQL] Merge RefreshTable case on ResolveSessionCatalog - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 10:34:40 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41301: [SPARK-43780][SQL] Support correlated references in join predicates for scalar and lateral subqueries - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/22 10:34:41 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42604: [SPARK-44813][INFRA][FOLLOWUP] Make the jira library optional - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 10:36:39 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42604: [SPARK-44813][INFRA][FOLLOWUP] Make the jira library optional - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 10:37:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42472: [SPARK-44786][SQL][CONNECT] Convert common Spark exceptions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 10:44:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42472: [SPARK-44786][SQL][CONNECT] Convert common Spark exceptions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 10:45:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42576: [SPARK-44885][SQL] NullPointerException is thrown when column with ROWID type contains NULL values - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 10:47:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42576: [SPARK-44885][SQL] NullPointerException is thrown when column with ROWID type contains NULL values - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 10:48:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42602: [MINOR][PYTHON][DOCS] Remove duplicated versionchanged per versionadded - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/22 11:09:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42608: [SPARK-42017][PYTHON][CONNECT][TESTS] Enable `ColumnParityTests. test_access_column ` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/22 11:26:23 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #40932: [SPARK-43266][SQL] Move MergeScalarSubqueries to spark-sql - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/22 12:08:45 UTC, 0 replies.
- [GitHub] [spark] peter-toth closed pull request #40932: [SPARK-43266][SQL] Move MergeScalarSubqueries to spark-sql - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/22 12:09:47 UTC, 0 replies.
- [GitHub] [spark] peter-toth closed pull request #42559: [SPARK-44871][SQL] Fix percentile_disc behaviour - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/22 12:14:07 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42607: [SPARK-43780][SQL][FOLLOWUP] Fix the config doc `spark.sql.optimizer.decorrelateJoinPredicate.enabled` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/22 12:31:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42607: [SPARK-43780][SQL][FOLLOWUP] Fix the config doc `spark.sql.optimizer.decorrelateJoinPredicate.enabled` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/22 12:32:51 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 opened a new pull request, #42609: [SPARK-44911] create hive table with invalid column should return error class - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/08/22 12:42:28 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #42610: [SPARK-44871][SQL][3.4] Fix percentile_disc behaviour - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/22 13:28:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42601: [SPARK-44905][SQL] Stateful lastRegex causes NullPointerException on eval for regexp_replace - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/22 13:50:36 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42542: [SPARK-44214][CORE] Support Spark Driver Live Log UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 13:53:59 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #42611: [SPARK-44871][SQL][3.3] Fix percentile_disc behaviour - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/22 14:02:14 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42601: [SPARK-44905][SQL] Stateful lastRegex causes NullPointerException on eval for regexp_replace - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 14:03:57 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #41119: [SPARK-42551][SQL] Support more subexpression elimination cases - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/22 14:56:16 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42609: [SPARK-44911] create hive table with invalid column should return error class - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/22 14:59:09 UTC, 0 replies.
- [GitHub] [spark] ConeyLiu opened a new pull request, #42612: [SPARK-44913][SQL] DS V2 supports push down V2 UDF that has magic method - posted by "ConeyLiu (via GitHub)" <gi...@apache.org> on 2023/08/22 15:02:08 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #37630: [SPARK-40193][SQL] Merge subquery plans with different filters - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/22 15:03:22 UTC, 0 replies.
- [GitHub] [spark] ConeyLiu commented on pull request #42612: [SPARK-44913][SQL] DS V2 supports push down V2 UDF that has magic method - posted by "ConeyLiu (via GitHub)" <gi...@apache.org> on 2023/08/22 15:03:49 UTC, 1 replies.
- [GitHub] [spark] zzzzming95 commented on pull request #42609: [SPARK-44911][SQL] Create hive table with invalid column should return error class - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/08/22 15:04:32 UTC, 0 replies.
- [GitHub] [spark] heyihong commented on a diff in pull request #42377: [SPARK-44622][SQL][CONNECT] Implement error enrichment and setting server-side stacktrace - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/08/22 15:48:20 UTC, 22 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #42613: [SPARK][BUILD] Upgrade `Apache ivy` from 2.5.1 to 2.5.2 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/22 16:12:27 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42610: [SPARK-44871][SQL][3.4] Fix percentile_disc behaviour - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/22 16:26:39 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42610: [SPARK-44871][SQL][3.4] Fix percentile_disc behaviour - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/22 16:28:36 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42614: [MINOR][INFRA] Disable InternalParquetRecordWriter logs for tests - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/22 16:47:09 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42596: [SPARK-44903][PYTHON][DOCS] Refine docstring of `approx_count_distinct` - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/22 16:49:55 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42594: [SPARK-44839][SS][CONNECT] Better Error Logging when user tries to serialize spark session - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/22 16:57:21 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42383: [SPARK-44549][SQL] Support window functions in correlated scalar subqueries - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/22 17:10:27 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #37630: [SPARK-40193][SQL] Merge subquery plans with different filters - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/22 17:30:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42615: [SPARK-44916][DOCS][TESTS] Document Spark Driver Live Log UI - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 17:59:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42564: [SPARK-44840][SQL] Make `array_insert()` 1-based for negative indexes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/22 18:02:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42564: [SPARK-44840][SQL] Make `array_insert()` 1-based for negative indexes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/22 18:03:57 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42564: [SPARK-44840][SQL] Make `array_insert()` 1-based for negative indexes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/22 18:04:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42616: [SPARK-44840][SQL][3.5] Make `array_insert()` 1-based for negative indexes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/22 18:09:48 UTC, 0 replies.
- [GitHub] [spark] paymog commented on pull request #41199: [SPARK-43536][CORE] Fixing statsd sink reporter - posted by "paymog (via GitHub)" <gi...@apache.org> on 2023/08/22 18:13:23 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42594: [SPARK-44839][SS][CONNECT] Better Error Logging when user tries to serialize spark session - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/22 18:46:46 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42615: [SPARK-44916][DOCS][TESTS] Document Spark Driver Live Log UI - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 18:57:36 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42616: [SPARK-44840][SQL][3.5] Make `array_insert()` 1-based for negative indexes - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 19:50:29 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42600: [SPARK-44906][K8S] Move Utils. SubstituteAppNExecIds logic into KubernetesConf.annotations - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 19:56:45 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42600: [SPARK-44906][K8S] Move Utils. SubstituteAppNExecIds logic into KubernetesConf.annotations - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 19:59:25 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #42578: [SPARK-44841][FOLLOWUP] Add migration guide for the behavior change - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/22 20:06:00 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42600: [SPARK-44906][K8S] Make `Kubernetes[Driver|Executor]Conf.annotations` substitute annotations instead of feature steps - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 20:07:15 UTC, 2 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #42613: [SPARK-44914][BUILD] Upgrade `Apache ivy` from 2.5.1 to 2.5.2 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/22 20:08:17 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on a diff in pull request #41785: [SPARK-44241][Core] Mistakenly set io.connectionTimeout/connectionCreationTimeout to zero or negative will cause incessant executor cons/destructions - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/08/22 20:08:50 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42613: [SPARK-44914][BUILD] Upgrade `Apache ivy` from 2.5.1 to 2.5.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 20:10:36 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #41785: [SPARK-44241][Core] Mistakenly set io.connectionTimeout/connectionCreationTimeout to zero or negative will cause incessant executor cons/destructions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 20:18:51 UTC, 1 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #42556: [SPARK-44867][CONNECT][DOCS] Refactor Spark Connect Docs to incorporate Scala setup - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/08/22 20:23:16 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42616: [SPARK-44840][SQL][3.5] Make `array_insert()` 1-based for negative indexes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/22 20:24:01 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40390: [SPARK-42768][SQL] Enable cached plan apply AQE by default - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 20:42:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42615: [SPARK-44916][DOCS][TESTS] Document Spark Driver Live Log UI - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 20:51:24 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42615: [SPARK-44916][DOCS][TESTS] Document Spark Driver Live Log UI - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/22 21:04:30 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42617: [SPARK-44918][SQL][PYTHON] Support named arguments in scalar Python/Pandas UDFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/22 21:44:50 UTC, 0 replies.
- [GitHub] [spark] zeruibao commented on a diff in pull request #42503: [SPARK-43380][SQL] Fix Avro data type conversion issues without causing performance regression - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/08/22 22:33:36 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on pull request #42595: [SPARK-44901][SQL] Add API in Python UDTF 'analyze' method to return partitioning/ordering expressions - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/22 22:55:13 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42617: [SPARK-44918][SQL][PYTHON] Support named arguments in scalar Python/Pandas UDFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/22 22:56:59 UTC, 1 replies.
- [GitHub] [spark] JoshRosen commented on a diff in pull request #42599: [DO-NOT-MERGE] Remove Guava from shared classes from IsolatedClientLoader - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/08/22 23:25:51 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42603: [SPARK-44907][PYTHON][CONNECT] `DataFrame.join` should throw IllegalArgumentException for invalid join types - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/23 00:01:25 UTC, 0 replies.
- [GitHub] [spark] tianhanhu opened a new pull request, #42618: [SPARK-44919] Avro connector: convert a union of a single primitive type to a StructType - posted by "tianhanhu (via GitHub)" <gi...@apache.org> on 2023/08/23 00:02:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42614: [MINOR][INFRA] Disable o.a.p.h.InternalParquetRecordWriter logs for tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/23 00:04:47 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #42617: [SPARK-44918][SQL][PYTHON] Support named arguments in scalar Python/Pandas UDFs - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/23 00:35:11 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42578: [SPARK-44841][FOLLOWUP] Add migration guide for the behavior change - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/23 00:45:29 UTC, 0 replies.
- [GitHub] [spark] JoshRosen opened a new pull request, #42619: [SPARK-44920][CORE] Use await() instead of awaitUninterruptibly() in TransportClientFactory.createClient() - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/08/23 00:49:04 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #42620: [SPARK-44921][SQL] Remove SqlBaseLexer.tokens from codebase - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/08/23 00:51:17 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #42620: [SPARK-44921][SQL] Remove SqlBaseLexer.tokens from codebase - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/08/23 00:56:35 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42513: [SPARK-44827][PYTHON][TESTS] Fix test when ansi mode enabled - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/23 01:01:40 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #42599: [DO-NOT-MERGE] Remove Guava from shared classes from IsolatedClientLoader - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/23 01:43:55 UTC, 4 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #42618: [SPARK-44919] Avro connector: convert a union of a single primitive type to a StructType - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/08/23 01:59:20 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on pull request #40932: [SPARK-43266][SQL] Move MergeScalarSubqueries to spark-sql - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/23 02:12:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42608: [SPARK-42017][PYTHON][CONNECT][TESTS] Enable `ColumnParityTests.test_access_column` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/23 02:12:11 UTC, 2 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42617: [SPARK-44918][SQL][PYTHON] Support named arguments in scalar Python/Pandas UDFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/23 02:23:42 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42621: [SPARK-43567][FOLLOWUP] Missing backtick from migration guide - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/23 02:31:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/23 02:31:33 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42614: [SPARK-44922][TESTS] Disable o.a.p.h.InternalParquetRecordWriter logs for tests - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 02:32:36 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42621: [SPARK-43567][FOLLOWUP] Missing backtick from migration guide - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/23 02:33:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42621: [SPARK-43567][FOLLOWUP] Missing backtick from migration guide - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/23 02:33:44 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42622: [SPARK-44923][PYTHON][DOCS] Some directories should be cleared when regenerating files - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/23 02:34:45 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42601: [SPARK-44905][SQL] Stateful lastRegex causes NullPointerException on eval for regexp_replace - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 02:36:51 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42622: [SPARK-44923][PYTHON][DOCS] Some directories should be cleared when regenerating files - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/23 02:37:28 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42622: [SPARK-44923][PYTHON][DOCS] Some directories should be cleared when regenerating files - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/23 02:45:56 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #41785: [SPARK-44241][Core] Mistakenly set io.connectionTimeout/connectionCreationTimeout to zero or negative will cause incessant executor cons/destructions - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 02:56:50 UTC, 0 replies.
- [GitHub] [spark] ragnarok56 opened a new pull request, #42623: [SPARK-44924][SS] Add config for FileStreamSource cached files - posted by "ragnarok56 (via GitHub)" <gi...@apache.org> on 2023/08/23 03:00:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42624: [SPARK-44925][K8S] K8s default service token file should not be materialized into token - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 03:09:49 UTC, 0 replies.
- [GitHub] [spark] zwangsheng commented on pull request #42600: [SPARK-44906][K8S] Make `Kubernetes[Driver|Executor]Conf.annotations` substitute annotations instead of feature steps - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/08/23 03:17:36 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42612: [SPARK-44913][SQL] DS V2 supports push down V2 UDF that has magic method - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/23 03:28:37 UTC, 2 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42549: [SPARK-44860][SQL] Add SESSION_USER function - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 03:30:17 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42624: [SPARK-44925][K8S] K8s default service token file should not be materialized into token - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 03:33:57 UTC, 1 replies.
- [GitHub] [spark] ConeyLiu commented on a diff in pull request #42612: [SPARK-44913][SQL] DS V2 supports push down V2 UDF that has magic method - posted by "ConeyLiu (via GitHub)" <gi...@apache.org> on 2023/08/23 03:42:53 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42608: [SPARK-42017][PYTHON][CONNECT][TESTS] Enable `ColumnParityTests.test_access_column` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/23 03:45:38 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #42567: [SPARK-44878][SS] Disable strict limit for RocksDB write manager to avoid insertion exception on cache full - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/23 04:13:29 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42625: [SPARK-44802][INFRA][FOLLOWUP] Eagerly check if the token is valid to align with the behavior of username/password authn - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 04:33:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42531: [SPARK-44846][SQL] Pull out complex grouping expressions after remove redundant aggregates - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/23 04:37:32 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42600: [SPARK-44906][K8S] Make `Kubernetes[Driver|Executor]Conf.annotations` substitute annotations instead of feature steps - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 04:40:58 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42619: [SPARK-44920][CORE] Use await() instead of awaitUninterruptibly() in TransportClientFactory.createClient() - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 04:42:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42616: [SPARK-44840][SQL][3.5] Make `array_insert()` 1-based for negative indexes - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 04:43:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42616: [SPARK-44840][SQL][3.5] Make `array_insert()` 1-based for negative indexes - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 04:43:46 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42614: [SPARK-44922][TESTS] Disable o.a.p.h.InternalParquetRecordWriter logs for tests - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 05:41:38 UTC, 0 replies.
- [GitHub] [spark] zwangsheng commented on a diff in pull request #42600: [SPARK-44906][K8S] Make `Kubernetes[Driver|Executor]Conf.annotations` substitute annotations instead of feature steps - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/08/23 05:44:32 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42624: [SPARK-44925][K8S] K8s default service token file should not be materialized into token - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 05:46:02 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42619: [SPARK-44920][CORE] Use await() instead of awaitUninterruptibly() in TransportClientFactory.createClient() - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 05:54:33 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42619: [SPARK-44920][CORE] Use await() instead of awaitUninterruptibly() in TransportClientFactory.createClient() - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 05:56:43 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42620: [SPARK-44921][SQL] Remove SqlBaseLexer.tokens from codebase - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 05:58:59 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42620: [SPARK-44921][SQL] Remove SqlBaseLexer.tokens from codebase - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 05:59:41 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42622: [SPARK-44923][PYTHON][BUILD] Some directories should be cleared when regenerating files - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/23 06:00:23 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #42622: [SPARK-44923][PYTHON][BUILD] Some directories should be cleared when regenerating files - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/23 06:01:26 UTC, 0 replies.
- [GitHub] [spark] melihsozdinler commented on a diff in pull request #42618: [SPARK-44919] Avro connector: convert a union of a single primitive type to a StructType - posted by "melihsozdinler (via GitHub)" <gi...@apache.org> on 2023/08/23 06:02:58 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42592: [SPARK-44899][PYTHON][DOCS] Refine the docstring of DataFrame.collect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/23 06:16:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42592: [SPARK-44899][PYTHON][DOCS] Refine the docstring of DataFrame.collect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/23 06:16:58 UTC, 0 replies.
- [GitHub] [spark] aimtsou commented on pull request #40960: [SPARK-43180][PYTHON-INFRA]: Upgrade mypy and pytest-mypypplugins packages - posted by "aimtsou (via GitHub)" <gi...@apache.org> on 2023/08/23 06:46:55 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42608: [SPARK-42017][PYTHON][CONNECT][TESTS] Make `df['col_name']` validate the column name - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/23 07:31:11 UTC, 3 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #42606: [SPARK-44909][ML] Skip starting torch distributor log streaming server when it is not available - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/08/23 07:31:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42608: [SPARK-42017][PYTHON][CONNECT][TESTS] Make `df['col_name']` validate the column name - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/23 07:34:11 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42611: [SPARK-44871][SQL][3.3] Fix percentile_disc behaviour - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/23 07:48:09 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42611: [SPARK-44871][SQL][3.3] Fix percentile_disc behaviour - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/23 07:49:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42626: [MINOR][PYTHON] Code cleanup: remove resolved todo items - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/23 08:05:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42591: [SPARK-44784][CONNECT] Make SBT testing hermetic. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/23 08:23:45 UTC, 1 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42627: [SPARK-44929][TESTS] Truncate log output for console appender in tests - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 08:27:42 UTC, 0 replies.
- [GitHub] [spark] zml1206 commented on a diff in pull request #42531: [SPARK-44846][SQL] Pull out complex grouping expressions after remove redundant aggregates - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/08/23 08:46:32 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42575: [SPARK-44863][UI] Add a button to download thread dump as a txt in Spark UI - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/23 08:53:26 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42628: [SPARK-44928][PYTHON][DOCS] Replace the module alias 'sf' instead of 'F' in pyspark.sql import functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/23 08:54:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42628: [SPARK-44928][PYTHON][DOCS] Replace the module alias 'sf' instead of 'F' in pyspark.sql import functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/23 08:54:18 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41763: [SPARK-44219][SQL] Adds extra per-rule validations for optimization rewrites. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/23 08:59:47 UTC, 5 replies.
- [GitHub] [spark] ConeyLiu opened a new pull request, #42629: [SPARK-44930][SQL] Deterministic ApplyFunctionExpression should be foldable - posted by "ConeyLiu (via GitHub)" <gi...@apache.org> on 2023/08/23 09:02:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42620: [SPARK-44921][SQL] Remove SqlBaseLexer.tokens from codebase - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/23 09:02:45 UTC, 0 replies.
- [GitHub] [spark] ConeyLiu commented on pull request #42629: [SPARK-44930][SQL] Deterministic ApplyFunctionExpression should be foldable - posted by "ConeyLiu (via GitHub)" <gi...@apache.org> on 2023/08/23 09:03:21 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42628: [SPARK-44928][PYTHON][DOCS] Replace the module alias 'sf' instead of 'F' in pyspark.sql import functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/23 09:07:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42628: [SPARK-44928][PYTHON][DOCS] Replace the module alias 'sf' instead of 'F' in pyspark.sql import functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/23 09:09:03 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42622: [SPARK-44923][PYTHON][BUILD] Some directories should be cleared when regenerating files - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/23 09:12:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42622: [SPARK-44923][PYTHON][BUILD] Some directories should be cleared when regenerating files - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/23 09:12:45 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #42605: [SPARK-44908][ML][CONNECT] Fix cross validator foldCol param functionality - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/08/23 10:19:58 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #42605: [SPARK-44908][ML][CONNECT] Fix cross validator foldCol param functionality - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/08/23 10:20:35 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #42606: [SPARK-44909][ML] Skip starting torch distributor log streaming server when it is not available - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/08/23 10:21:04 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #42630: [SPARK-44931] Fix JSON Serialization of Spark Connect protos for Event Listener - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/23 10:59:16 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42629: [SPARK-44930][SQL] Deterministic ApplyFunctionExpression should be foldable - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/23 11:02:53 UTC, 1 replies.
- [GitHub] [spark] ConeyLiu commented on a diff in pull request #42629: [SPARK-44930][SQL] Deterministic ApplyFunctionExpression should be foldable - posted by "ConeyLiu (via GitHub)" <gi...@apache.org> on 2023/08/23 11:37:46 UTC, 6 replies.
- [GitHub] [spark] wangyum commented on pull request #42609: [SPARK-44911][SQL] Create hive table with invalid column should return error class - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/23 12:58:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42613: [SPARK-44914][BUILD] Upgrade `Apache ivy` from 2.5.1 to 2.5.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/23 12:58:25 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42613: [SPARK-44914][BUILD] Upgrade `Apache ivy` from 2.5.1 to 2.5.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/23 12:59:05 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #42631: [MINOR] Fix typos in `pyspark_upgrade.rst` - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/23 13:10:23 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #42631: [MINOR] Fix typos in `pyspark_upgrade.rst` - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/23 13:11:24 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42609: [SPARK-44911][SQL] Create hive table with invalid column should return error class - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/23 13:15:58 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42630: [SPARK-44931] Fix JSON Serialization of Spark Connect protos for Event Listener - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/23 13:50:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42630: [SPARK-44931] Fix JSON Serialization of Spark Connect protos for Event Listener - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/23 13:53:28 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #42630: [SPARK-44931] Fix JSON Serialization of Spark Connect protos for Event Listener - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/23 14:06:32 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42632: [WIP][SQL] Convert binary to string by `to_char` for the formats: `hex`, `base64`, `utf-8` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/23 14:19:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42383: [SPARK-44549][SQL] Support window functions in correlated scalar subqueries - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/23 14:44:59 UTC, 0 replies.
- [GitHub] [spark] zml1206 opened a new pull request, #42633: [SPARK-44846][SQL] Convert the distinct-like Aggregate to Project - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/08/23 14:47:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42383: [SPARK-44549][SQL] Support window functions in correlated scalar subqueries - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/23 14:50:39 UTC, 0 replies.
- [GitHub] [spark] jdesjean commented on pull request #42630: [SPARK-44931] Fix JSON Serialization of Spark Connect protos for Event Listener - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/08/23 14:58:02 UTC, 0 replies.
- [GitHub] [spark] gbloisi-openaire opened a new pull request, #42634: [SPARK-44910][SQL] Encoders.bean does not support superclasses with generic type arguments - posted by "gbloisi-openaire (via GitHub)" <gi...@apache.org> on 2023/08/23 15:07:17 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #42629: [SPARK-44930][SQL] Deterministic ApplyFunctionExpression should be foldable - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/23 15:27:15 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #42630: [SPARK-44931] Fix JSON Serialization of Spark Connect protos for Event Listener - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/23 15:50:44 UTC, 0 replies.
- [GitHub] [spark] grundprinzip closed pull request #42630: [SPARK-44931] Fix JSON Serialization of Spark Connect protos for Event Listener - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/23 15:51:46 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42626: [MINOR][PYTHON] Code cleanup: remove resolved todo items - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 16:01:41 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42626: [MINOR][PYTHON] Code cleanup: remove resolved todo items - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 16:01:42 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42600: [SPARK-44906][K8S] Make `Kubernetes[Driver|Executor]Conf.annotations` substitute annotations instead of feature steps - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/23 16:03:38 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42181: [SPARK-44247][BUILD] Upgrade Arrow to 13.0.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/23 16:22:12 UTC, 2 replies.
- [GitHub] [spark] bqiang-stackadapt commented on pull request #24537: [SPARK-23887][SS] continuous query progress reporting - posted by "bqiang-stackadapt (via GitHub)" <gi...@apache.org> on 2023/08/23 16:30:27 UTC, 0 replies.
- [GitHub] [spark] venkateshbalaji99 commented on pull request #41199: [SPARK-43536][CORE] Fixing statsd sink reporter - posted by "venkateshbalaji99 (via GitHub)" <gi...@apache.org> on 2023/08/23 16:37:16 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42550: [SPARK-44861][CONNECT] jsonignore SparkListenerConnectOperationStarted.planRequest - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/23 16:37:56 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42550: [SPARK-44861][CONNECT] jsonignore SparkListenerConnectOperationStarted.planRequest - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/23 16:40:00 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42500: [SPARK-44816][CONNECT] Improve error message when UDF class is not found - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/23 16:42:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42500: [SPARK-44816][CONNECT] Improve error message when UDF class is not found - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/23 16:42:26 UTC, 0 replies.
- [GitHub] [spark] gowa commented on pull request #37206: [SPARK-39696][CORE] Ensure Concurrent r/w `TaskMetrics` not throw Exception - posted by "gowa (via GitHub)" <gi...@apache.org> on 2023/08/23 17:00:36 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42600: [SPARK-44906][K8S] Make `Kubernetes[Driver|Executor]Conf.annotations` substitute annotations instead of feature steps - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 17:42:57 UTC, 0 replies.
- [GitHub] [spark] thejdeep commented on pull request #42093: [SPARK-44497][WEBUI] Show task partition id in Task table - posted by "thejdeep (via GitHub)" <gi...@apache.org> on 2023/08/23 17:44:28 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #42631: [MINOR][DOCS] Fix typos in `pyspark_upgrade.rst` - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/23 17:48:00 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42631: [MINOR][DOCS] Fix typos in `pyspark_upgrade.rst` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 18:10:10 UTC, 0 replies.
- [GitHub] [spark] wenyuen-db opened a new pull request, #42635: [SPARK-44934][SQL] Use outputSet instead of output to determine if column pruning occurred in PushdownPredicateAndPruneColumnsForCTEDef - posted by "wenyuen-db (via GitHub)" <gi...@apache.org> on 2023/08/23 21:01:31 UTC, 1 replies.
- [GitHub] [spark] wenyuen-db closed pull request #42635: [SPARK-44934][SQL] Use outputSet instead of output to determine if column pruning occurred in PushdownPredicateAndPruneColumnsForCTEDef - posted by "wenyuen-db (via GitHub)" <gi...@apache.org> on 2023/08/23 21:07:33 UTC, 0 replies.
- [GitHub] [spark] sigmod commented on pull request #42635: [SPARK-44934][SQL] Use outputSet instead of output to determine if column pruning occurred in PushdownPredicateAndPruneColumnsForCTEDef - posted by "sigmod (via GitHub)" <gi...@apache.org> on 2023/08/23 21:21:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42636: [SPARK-44935][K8S] Fix `RELEASE` file to have the correct information in Docker images - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 21:24:00 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42636: [SPARK-44935][K8S] Fix `RELEASE` file to have the correct information in Docker images if exists - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 21:52:14 UTC, 2 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42636: [SPARK-44935][K8S] Fix `RELEASE` file to have the correct information in Docker images if exists - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/23 22:40:13 UTC, 1 replies.
- [GitHub] [spark] michaelzhan-db opened a new pull request, #42637: Add examples to approxQuantile docstring - posted by "michaelzhan-db (via GitHub)" <gi...@apache.org> on 2023/08/23 22:40:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42636: [SPARK-44935][K8S] Fix `RELEASE` file to have the correct information in Docker images if exists - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 22:52:39 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42638: [SPARK-44936][CORE] Simplify the log when Spark HybridStore hits the memory limit - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 23:00:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42636: [SPARK-44935][K8S] Fix `RELEASE` file to have the correct information in Docker images if exists - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/23 23:01:05 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40707: [SPARK-43033][SQL] Avoid task retries due to AssertNotNull checks - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/24 00:16:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42548: [SPARK-44750][PYTHON][CONNECT] Apply configuration to sparksession during creation - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 00:36:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42548: [SPARK-44750][PYTHON][CONNECT] Apply configuration to sparksession during creation - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 00:37:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42548: [SPARK-44750][PYTHON][CONNECT] Apply configuration to sparksession during creation - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 00:41:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42608: [SPARK-42017][PYTHON][CONNECT] `df['col_name']` should validate the column name - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 00:47:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42608: [SPARK-42017][PYTHON][CONNECT] `df['col_name']` should validate the column name - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 00:48:17 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42617: [SPARK-44918][SQL][PYTHON] Support named arguments in scalar Python/Pandas UDFs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/24 01:01:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42596: [SPARK-44903][PYTHON][DOCS] Refine docstring of `approx_count_distinct` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 01:06:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42596: [SPARK-44903][PYTHON][DOCS] Refine docstring of `approx_count_distinct` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 01:06:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42549: [SPARK-44860][SQL] Add SESSION_USER function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 01:22:23 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42627: [SPARK-44929][TESTS] Truncate log output for console appender in tests - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 01:59:40 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #42639: [SPARK-44938][SQL] Change default value of `spark.sql.maxSinglePartitionBytes` to 128m - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/24 02:03:22 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #42639: [SPARK-44938][SQL] Change default value of `spark.sql.maxSinglePartitionBytes` to 128m - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/24 02:04:21 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42600: [SPARK-44906][K8S] Make `Kubernetes[Driver|Executor]Conf.annotations` substitute annotations instead of feature steps - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 02:18:08 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #42639: [SPARK-44938][SQL] Change default value of `spark.sql.maxSinglePartitionBytes` to 128m - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/24 02:23:04 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42638: [SPARK-44936][CORE] Simplify the log when Spark HybridStore hits the memory limit - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 02:26:49 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42631: [MINOR][DOCS] Fix typos in `pyspark_upgrade.rst` - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 02:37:08 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42631: [MINOR][DOCS] Fix typos in `pyspark_upgrade.rst` - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 02:37:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42638: [SPARK-44936][CORE] Simplify the log when Spark HybridStore hits the memory limit - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 02:42:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42638: [SPARK-44936][CORE] Simplify the log when Spark HybridStore hits the memory limit - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 02:43:58 UTC, 6 replies.
- [GitHub] [spark] itholic commented on pull request #42631: [MINOR][DOCS] Fix typos in `pyspark_upgrade.rst` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/24 03:06:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42638: [SPARK-44936][CORE] Simplify the log when Spark HybridStore hits the memory limit - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 03:13:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42627: [SPARK-44929][TESTS] Truncate log output for console appender in tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 03:20:25 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42638: [SPARK-44936][CORE] Simplify the log when Spark HybridStore hits the memory limit - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 03:25:28 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42627: [SPARK-44929][TESTS] Truncate log output for console appender in tests - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 03:31:41 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42628: [SPARK-44928][PYTHON][DOCS] Replace the module alias 'sf' instead of 'F' in pyspark.sql import functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 03:35:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42628: [SPARK-44928][PYTHON][DOCS] Replace the module alias 'sf' instead of 'F' in pyspark.sql import functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 03:36:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42640: [SPARK-44928][PYTHON][DOCS][3.5] Replace the module alias 'sf' instead of 'F' in pyspark.sql import functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 03:42:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42575: [SPARK-44863][UI] Add a button to download thread dump as a txt in Spark UI - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 03:44:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42641: [SPARK-44097][SPARK-44229][SQL][TESTS] Reenable PandasUDF and o.a.s.sql.execution.arrow tests in Java 21 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 03:45:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42642: [SPARK-43943][PYTHON] Correct a function alias - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 03:48:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42643: [SPARK-44121][SQL][TESTS] Renable Arrow-based connect tests in Java 21 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 04:29:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42643: [SPARK-44121][CONNECT][TESTS] Renable Arrow-based connect tests in Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/24 05:05:19 UTC, 4 replies.
- [GitHub] [spark] LuciferYang closed pull request #42030: [SPARK-44452][CONNECT][TESTS] Move `test` function from `RemoteSparkSession` to `ConnectFunSuite` and ignore `ArrowEncoderSuite` for Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/24 05:08:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42644: [SPARK-44127][R] Reenable test_sparkSQL_arrow.R in Java 21 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 05:08:53 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42643: [SPARK-44121][CONNECT][TESTS] Renable Arrow-based connect tests in Java 21 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 05:11:05 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42645: [SPARK-44939][R] Support Java 21 in SparkR SystemRequirements - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 05:20:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42645: [SPARK-44939][R] Support Java 21 in SparkR SystemRequirements - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 05:33:24 UTC, 2 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42627: [SPARK-44929][TESTS] Standardize log output for console appender in tests - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 05:43:51 UTC, 1 replies.
- [GitHub] [spark] yaooqinn closed pull request #42627: [SPARK-44929][TESTS] Standardize log output for console appender in tests - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 05:51:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42645: [SPARK-44939][R] Support Java 21 in SparkR SystemRequirements - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 05:57:25 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42646: [SPARK-44302][BUILD] Reenable PySpark test on the daily test of Java 21 after the new arrow version release - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/24 06:13:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42646: [SPARK-44302][BUILD] Reenable PySpark test on the daily test of Java 21 after the new arrow version release - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 06:37:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42642: [SPARK-43943][FOLLOWUP] Correct a function alias - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 06:56:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42642: [SPARK-43943][FOLLOWUP] Correct a function alias - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 06:56:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42640: [SPARK-44928][PYTHON][DOCS][3.5] Replace the module alias 'sf' instead of 'F' in pyspark.sql import functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 06:58:32 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42640: [SPARK-44928][PYTHON][DOCS][3.5] Replace the module alias 'sf' instead of 'F' in pyspark.sql import functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 06:58:40 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42647: [SPARK-44941][SQL] Turn off hive.conf.validation in tests - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 07:07:46 UTC, 0 replies.
- [GitHub] [spark] zheniantoushipashi opened a new pull request, #42648: Kyspark 3.2.x 4.x qa merge - posted by "zheniantoushipashi (via GitHub)" <gi...@apache.org> on 2023/08/24 07:14:24 UTC, 0 replies.
- [GitHub] [spark] zheniantoushipashi closed pull request #42648: Kyspark 3.2.x 4.x qa merge - posted by "zheniantoushipashi (via GitHub)" <gi...@apache.org> on 2023/08/24 07:14:38 UTC, 0 replies.
- [GitHub] [spark] zheniantoushipashi opened a new pull request, #42649: Kyspark 3.2.x 4.x qa merge - posted by "zheniantoushipashi (via GitHub)" <gi...@apache.org> on 2023/08/24 07:14:50 UTC, 0 replies.
- [GitHub] [spark] zheniantoushipashi closed pull request #42649: Kyspark 3.2.x 4.x qa merge - posted by "zheniantoushipashi (via GitHub)" <gi...@apache.org> on 2023/08/24 07:14:55 UTC, 0 replies.
- [GitHub] [spark] zheniantoushipashi opened a new pull request, #42650: Kyspark 3.2.x 4.x qa merge - posted by "zheniantoushipashi (via GitHub)" <gi...@apache.org> on 2023/08/24 07:15:19 UTC, 0 replies.
- [GitHub] [spark] zheniantoushipashi closed pull request #42650: Kyspark 3.2.x 4.x qa merge - posted by "zheniantoushipashi (via GitHub)" <gi...@apache.org> on 2023/08/24 07:15:37 UTC, 0 replies.
- [GitHub] [spark] zheniantoushipashi closed pull request #42651: Kyspark 3.2.x 4.x qa backup - posted by "zheniantoushipashi (via GitHub)" <gi...@apache.org> on 2023/08/24 07:17:36 UTC, 0 replies.
- [GitHub] [spark] zheniantoushipashi opened a new pull request, #42651: Kyspark 3.2.x 4.x qa backup - posted by "zheniantoushipashi (via GitHub)" <gi...@apache.org> on 2023/08/24 07:18:10 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42181: [SPARK-44247][BUILD] Upgrade Arrow to 13.0.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 07:21:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42644: [SPARK-44127][R][TESTS] Reenable test_sparkSQL_arrow.R in Java 21 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 07:23:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42644: [SPARK-44127][R][TESTS] Reenable test_sparkSQL_arrow.R in Java 21 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 07:23:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42641: [SPARK-44097][SPARK-44229][SQL][TESTS] Reenable PandasUDF and o.a.s.sql.execution.arrow tests in Java 21 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 07:25:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42641: [SPARK-44097][SPARK-44229][SQL][TESTS] Reenable PandasUDF and o.a.s.sql.execution.arrow tests in Java 21 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 07:27:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42643: [SPARK-44121][CONNECT][TESTS] Renable Arrow-based connect tests in Java 21 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 07:34:24 UTC, 2 replies.
- [GitHub] [spark] eejbyfeldt commented on pull request #37206: [SPARK-39696][CORE] Ensure Concurrent r/w `TaskMetrics` not throw Exception - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/08/24 07:37:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42646: [SPARK-44302][BUILD] Reenable PySpark test on the daily test of Java 21 after the new arrow version release - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 07:39:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42639: [SPARK-44938][SQL] Change default value of `spark.sql.maxSinglePartitionBytes` to 128m - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 07:47:22 UTC, 6 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42633: [SPARK-44846][SQL] Convert the distinct-like Aggregate to Project - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/24 07:51:12 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42643: [SPARK-44121][CONNECT][TESTS] Renable Arrow-based connect tests in Java 21 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 07:53:53 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42647: [SPARK-44941][SQL][TESTS] Turn off hive.conf.validation in tests - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 08:08:37 UTC, 2 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #42639: [SPARK-44938][SQL] Change default value of `spark.sql.maxSinglePartitionBytes` to 128m - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/24 08:30:46 UTC, 7 replies.
- [GitHub] [spark] pan3793 commented on pull request #42647: [SPARK-44941][SQL][TESTS] Turn off hive.conf.validation in tests - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/08/24 08:56:47 UTC, 0 replies.
- [GitHub] [spark] gerashegalov opened a new pull request, #42652: [SPARK-44943][SQL] Fix overflow detection logic in conv - posted by "gerashegalov (via GitHub)" <gi...@apache.org> on 2023/08/24 09:17:50 UTC, 0 replies.
- [GitHub] [spark] zml1206 commented on a diff in pull request #42633: [SPARK-44846][SQL] Convert the distinct-like Aggregate to Project - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/08/24 09:19:54 UTC, 4 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42653: [SPARK-44944][INFRA] Auto grant contributor role to first-time contributors - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 09:37:30 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42575: [SPARK-44863][UI] Add a button to download thread dump as a txt in Spark UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 09:39:16 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42654: [SPARK-44863][UI][3.5] Add a button to download thread dump as a txt in Spark UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 09:49:14 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42575: [SPARK-44863][UI] Add a button to download thread dump as a txt in Spark UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 09:49:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42521: [SPARK-44435][SS][CONNECT] Tests for foreachBatch and Listener - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/24 10:18:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42521: [SPARK-44435][SS][CONNECT] Tests for foreachBatch and Listener - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/24 10:18:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42594: [SPARK-44839][SS][CONNECT] Better Error Logging when user tries to serialize spark session - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/24 10:25:50 UTC, 1 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42655: [SPARK-44840][SQL][3.4] Make `array_insert()` 1-based for negative indexes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/24 10:35:11 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #42655: [SPARK-44840][SQL][3.4] Make `array_insert()` 1-based for negative indexes - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/24 10:43:51 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42656: Test Java 21-ea - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/24 11:15:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42656: Test Java 21-ea - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/24 11:16:47 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42657: [SPARK-44820][DOCS] Switch languages consistently across docs for all code snippets - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/24 11:46:49 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42657: [SPARK-44820][DOCS] Switch languages consistently across docs for all code snippets - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/24 11:48:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42657: [SPARK-44820][DOCS] Switch languages consistently across docs for all code snippets - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/24 11:53:08 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42521: [SPARK-44435][SS][CONNECT] Tests for foreachBatch and Listener - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/24 12:02:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42655: [SPARK-44840][SQL][3.4] Make `array_insert()` 1-based for negative indexes - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/24 12:10:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42629: [SPARK-44930][SQL] Deterministic ApplyFunctionExpression should be foldable - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/24 13:17:44 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #42658: [SPARK-44945][DOCS][PYTHON] Automate PySpark error class documentation - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/24 13:20:52 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42658: [SPARK-44945][DOCS][PYTHON] Automate PySpark error class documentation - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/24 13:29:40 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42655: [SPARK-44840][SQL][3.4] Make `array_insert()` 1-based for negative indexes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/24 13:42:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42659: Test set shadeTestJar to false in core module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/24 14:00:32 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42660: [WIP][SQL] Reuse `ArrayInsert` in `ArrayAppend` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/24 14:07:19 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42614: [SPARK-44922][TESTS] Disable o.a.p.h.InternalParquetRecordWriter logs for tests - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/24 14:10:52 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42661: [SPARK-44743][SQL] Fix `reflect` method behavior match with hive - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/24 14:25:46 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42661: [SPARK-44743][SQL] Fix `reflect` method behavior match with hive - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/24 14:27:39 UTC, 1 replies.
- [GitHub] [spark] wenyuen-db commented on pull request #42635: [SPARK-44934][SQL] Use outputSet instead of output to check if column pruning occurred in PushdownPredicateAndPruneColumnsForCTEDef - posted by "wenyuen-db (via GitHub)" <gi...@apache.org> on 2023/08/24 14:43:13 UTC, 2 replies.
- [GitHub] [spark] itholic opened a new pull request, #42662: [SPARK-44948][DOCS][TESTS][PYTHON] Update document & test related to `Int64Index` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/24 14:48:04 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42661: [SPARK-44743][SQL] Fix `reflect` method behavior match with hive - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/24 14:49:40 UTC, 0 replies.
- [GitHub] [spark] peter-toth closed pull request #42635: [SPARK-44934][SQL] Use outputSet instead of output to check if column pruning occurred in PushdownPredicateAndPruneColumnsForCTEDef - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/24 15:00:52 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #42635: [SPARK-44934][SQL] Use outputSet instead of output to check if column pruning occurred in PushdownPredicateAndPruneColumnsForCTEDef - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/24 15:08:49 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42647: [SPARK-44941][SQL][TESTS] Turn off hive.conf.validation in tests - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/24 15:23:15 UTC, 0 replies.
- [GitHub] [spark] sunchao closed pull request #42194: [SPARK-41471][SQL] Reduce Spark shuffle when only one side of a join is KeyGroupedPartitioning - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/24 15:35:28 UTC, 0 replies.
- [GitHub] [spark] ChenMichael commented on pull request #42587: [SPARK-44897] - Propagating local properties to subquery broadcast exec - posted by "ChenMichael (via GitHub)" <gi...@apache.org> on 2023/08/24 16:09:34 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 commented on a diff in pull request #42609: [SPARK-44911][SQL] Create hive table with invalid column should return error class - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/08/24 16:10:52 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #42661: [SPARK-44743][SQL] Fix `reflect` method behavior match with hive - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/24 16:23:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42566: [SPARK-44873][3.4] Support alter view with nested columns in Hive client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 16:23:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42565: [SPARK-44873][SPARK-39936][3.3] Support alter view with nested columns in Hive client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 16:24:20 UTC, 0 replies.
- [GitHub] [spark] arturobernalg commented on pull request #40608: [SPARK-35198][CONNECT][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "arturobernalg (via GitHub)" <gi...@apache.org> on 2023/08/24 16:49:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42639: [SPARK-44938][SQL] Change default value of `spark.sql.maxSinglePartitionBytes` to 128m - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 16:53:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42639: [SPARK-44938][SQL] Change default value of `spark.sql.maxSinglePartitionBytes` to 128m - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 16:53:15 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #42658: [SPARK-44945][DOCS][PYTHON] Automate PySpark error class documentation - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/24 16:57:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42655: [SPARK-44840][SQL][3.4] Make `array_insert()` 1-based for negative indexes - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 16:57:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42655: [SPARK-44840][SQL][3.4] Make `array_insert()` 1-based for negative indexes - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 16:57:41 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42521: [SPARK-44435][SS][CONNECT] Tests for foreachBatch and Listener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/24 17:12:03 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42521: [SPARK-44435][SS][CONNECT] Tests for foreachBatch and Listener - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 17:35:06 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42654: [SPARK-44863][UI][3.5] Add a button to download thread dump as a txt in Spark UI - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 17:43:46 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42635: [SPARK-44934][SQL] Use outputSet instead of output to check if column pruning occurred in PushdownPredicateAndPruneColumnsForCTEDef - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 17:53:59 UTC, 2 replies.
- [GitHub] [spark] ueshin closed pull request #42617: [SPARK-44918][SQL][PYTHON] Support named arguments in scalar Python/Pandas UDFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/24 17:56:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42661: [SPARK-44743][SQL] Fix `reflect` method behavior match with hive - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 18:33:45 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42663: [SPARK-44952][SQL][PYTHON] Support named arguments in aggregate Pandas UDFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/24 19:08:52 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42663: [SPARK-44952][SQL][PYTHON] Support named arguments in aggregate Pandas UDFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/24 19:19:25 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42663: [SPARK-44952][SQL][PYTHON] Support named arguments in aggregate Pandas UDFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/24 19:20:32 UTC, 1 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42664: [SPARK-44435][SPARK-44484][3.5][SS][CONNECT] Tests for foreachBatch and Listener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/24 20:44:43 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42155: [SPARK-44547][CORE] Ignore fallback storage for cached RDD migration - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 22:15:16 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42664: [SPARK-44435][SPARK-44484][3.5][SS][CONNECT] Tests for foreachBatch and Listener - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 22:27:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42656: Test Java 21-ea - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/24 22:31:43 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #42663: [SPARK-44952][SQL][PYTHON] Support named arguments in aggregate Pandas UDFs - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/24 22:36:48 UTC, 2 replies.
- [GitHub] [spark] szehon-ho commented on a diff in pull request #42306: [SPARK-44647][SQL] Support SPJ where join keys are less than cluster keys - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/08/24 22:41:13 UTC, 5 replies.
- [GitHub] [spark] ukby1234 commented on a diff in pull request #42155: [SPARK-44547][CORE] Ignore fallback storage for cached RDD migration - posted by "ukby1234 (via GitHub)" <gi...@apache.org> on 2023/08/24 22:48:34 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42665: [SPARK-44822][PYTHON][FOLLOW-UP] Make Python UDTFs by default non-deterministic - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/24 23:02:01 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42665: [SPARK-44822][PYTHON][FOLLOW-UP] Make Python UDTFs by default non-deterministic - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/24 23:02:27 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42664: [SPARK-44435][SPARK-44484][3.5][SS][CONNECT] Tests for foreachBatch and Listener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/24 23:13:41 UTC, 0 replies.
- [GitHub] [spark] WweiL closed pull request #42664: [SPARK-44435][SPARK-44484][3.5][SS][CONNECT] Tests for foreachBatch and Listener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/24 23:13:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42513: [SPARK-44827][PYTHON][TESTS] Fix test when ansi mode enabled - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 00:09:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42513: [SPARK-44827][PYTHON][TESTS] Fix test when ansi mode enabled - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 00:09:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42657: [SPARK-44820][DOCS] Switch languages consistently across docs for all code snippets - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 00:10:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42662: [SPARK-44948][DOCS][TESTS][PYTHON] Update document & test related to `Int64Index` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 00:14:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42662: [SPARK-44948][DOCS][TESTS][PYTHON] Update document & test related to `Int64Index` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 00:15:19 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40707: [SPARK-43033][SQL] Avoid task retries due to AssertNotNull checks - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/25 00:16:50 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42657: [SPARK-44820][DOCS] Switch languages consistently across docs for all code snippets - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/25 00:48:02 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #42657: [SPARK-44820][DOCS] Switch languages consistently across docs for all code snippets - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/08/25 00:48:40 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42658: [SPARK-44945][DOCS][PYTHON] Automate PySpark error class documentation - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/25 00:53:57 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42654: [SPARK-44863][UI][3.5] Add a button to download thread dump as a txt in Spark UI - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 01:06:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42658: [SPARK-44945][DOCS][PYTHON] Automate PySpark error class documentation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 01:12:02 UTC, 2 replies.
- [GitHub] [spark] itholic commented on pull request #40420: [SPARK-42617][PS] Support `isocalendar` from the pandas 2.0.0 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/25 01:42:13 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42635: [SPARK-44934][SQL] Use outputSet instead of output to check if column pruning occurred in PushdownPredicateAndPruneColumnsForCTEDef - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/25 01:44:28 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42654: [SPARK-44863][UI][3.5] Add a button to download thread dump as a txt in Spark UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/25 02:00:26 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42654: [SPARK-44863][UI][3.5] Add a button to download thread dump as a txt in Spark UI - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/25 02:00:27 UTC, 0 replies.
- [GitHub] [spark] vivostar commented on pull request #23640: [SPARK-26682][SQL] Use taskAttemptID instead of attemptNumber for Had… - posted by "vivostar (via GitHub)" <gi...@apache.org> on 2023/08/25 02:02:03 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42666: [SPARK-44863][UI][FOLLOWUP] Move Mima rules to v40excludes - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/25 02:10:08 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42666: [SPARK-44863][UI][FOLLOWUP] Move Mima rules to v40excludes - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/25 02:10:43 UTC, 1 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #42667: [SPARK-44940][SQL] Improve performance of JSON parsing when "spark.sql.json.enablePartialResults" is enabled - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/08/25 02:26:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42664: [SPARK-44435][SPARK-44484][3.5][SS][CONNECT] Tests for foreachBatch and Listener - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 02:26:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42155: [SPARK-44547][CORE] Ignore fallback storage for cached RDD migration - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 02:29:08 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42665: [SPARK-44822][PYTHON][FOLLOW-UP] Make Python UDTFs by default non-deterministic - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 02:31:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42665: [SPARK-44822][PYTHON][FOLLOW-UP] Make Python UDTFs by default non-deterministic - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 02:32:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42668: Test Java 17 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/25 02:38:24 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42668: Test Java 17 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/25 02:41:28 UTC, 6 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42668: Test Java 17 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 02:47:57 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42646: [SPARK-44302][BUILD] Reenable PySpark test on the daily test of Java 21 after the new arrow version release - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 02:51:37 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42194: [SPARK-41471][SQL] Reduce Spark shuffle when only one side of a join is KeyGroupedPartitioning - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/25 02:53:26 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42669: [SPARK-44956][BUILD] Upgrade Jekyll to 4.3.2 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/25 03:27:20 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42661: [SPARK-44743][SQL] Add `try_reflect` function - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/25 03:41:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42670: [SPARK-44957][PYTHON][SQL][TESTS] Make PySpark (pyspark-sql module) tests passing without any dependency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 04:47:01 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42671: [SPARK-44958][PYTHON][CONNECT][TESTS] Add a test to validate the parity of functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 04:47:11 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42671: [SPARK-44958][PYTHON][CONNECT][TESTS] Add a test to validate the parity of functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 04:47:52 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42670: [SPARK-44957][PYTHON][SQL][TESTS] Make PySpark (pyspark-sql module) tests passing without any dependency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 04:48:12 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42671: [SPARK-44958][PYTHON][CONNECT][TESTS] Add a test to validate the parity of functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 04:55:35 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42653: [SPARK-44944][INFRA] Auto grant contributor role to first-time contributors - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 05:12:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42653: [SPARK-44944][INFRA] Auto grant contributor role to first-time contributors - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 05:12:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42666: [SPARK-44863][UI][FOLLOWUP] Move Mima rules to v40excludes - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 05:18:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42666: [SPARK-44863][UI][FOLLOWUP] Move Mima rules to v40excludes - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 05:18:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42155: [SPARK-44547][CORE] Ignore fallback storage for cached RDD migration - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 05:19:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42672: [SPARK-42017][PYTHON][CONNECT][FOLLOWUP] Avoid double validation in `__getattr__ ` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 05:45:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42672: [SPARK-42017][PYTHON][CONNECT][FOLLOWUP] Avoid double validation in `__getattr__ ` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 05:47:51 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42554: [SPARK-44865][SS] Make StreamingRelationV2 support metadata column - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/25 05:51:06 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42653: [SPARK-44944][INFRA] Auto grant contributor role to first-time contributors - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/25 05:53:54 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42673: [SPARK-44959][BUILD] Upgrade sbt to 1.9.4 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/25 06:01:54 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41860: [SPARK-44307][SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/25 06:18:39 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #41711: [SPARK-44155] Adding a dev utility to improve error messages based on LLM - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/25 06:24:06 UTC, 0 replies.
- [GitHub] [spark] itholic closed pull request #41711: [SPARK-44155] Adding a dev utility to improve error messages based on LLM - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/25 06:24:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42670: [SPARK-44957][PYTHON][SQL][TESTS] Make PySpark (pyspark-sql module) tests passing without any dependency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 06:39:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42670: [SPARK-44957][PYTHON][SQL][TESTS] Make PySpark (pyspark-sql module) tests passing without any dependency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 06:40:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42667: [SPARK-44940][SQL] Improve performance of JSON parsing when "spark.sql.json.enablePartialResults" is enabled - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/25 06:55:46 UTC, 9 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42674: [SPARK-44960][UI] Unescape and consist error summary across UI pages - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/25 06:57:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42675: [SPARK-42944][PYTHON][FOLLOW-UP] Rename tests from foreachBatch to foreach_batch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 06:58:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42675: [SPARK-42944][PYTHON][FOLLOW-UP] Rename tests from foreachBatch to foreach_batch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 06:58:32 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42587: [SPARK-44897][SQL] Propagating local properties to subquery broadcast exec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/25 06:59:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42629: [SPARK-44930][SQL] Deterministic ApplyFunctionExpression should be foldable - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/25 07:01:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42629: [SPARK-44930][SQL] Deterministic ApplyFunctionExpression should be foldable - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/25 07:01:48 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42488: [SPARK-44804][SQL] SortMergeJoin should respect the streamed side ordering - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/25 07:11:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42676: [SPARK-44961][PYTHON][TESTS] Make PySpark (pyspark-connect module) tests passing without any dependency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 07:13:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42677: [SPARK-44962][INFRA] Add Java 21 build to build_and_test.yml - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 07:14:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42676: [SPARK-44961][PYTHON][TESTS] Make PySpark (pyspark-connect module) tests passing without any dependency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 07:14:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42677: [SPARK-44962][INFRA] Add Java 21 build to build_and_test.yml - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 07:18:17 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42672: [SPARK-42017][PYTHON][CONNECT][FOLLOWUP] Avoid double validation in `__getattr__ ` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 07:44:01 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42672: [SPARK-42017][PYTHON][CONNECT][FOLLOWUP] Avoid double validation in `__getattr__ ` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 07:44:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42678: [SPARK-44963][PYTHON][ML][TESTS] Make PySpark (pyspark-ml module) tests passing without any dependency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 07:59:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42678: [SPARK-44963][PYTHON][ML][TESTS] Make PySpark (pyspark-ml module) tests passing without any dependency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 07:59:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42677: [SPARK-44962][INFRA] Add Java 21 `Maven` build to `build_and_test.yml` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 08:12:38 UTC, 2 replies.
- [GitHub] [spark] dzhigimont commented on pull request #40420: [SPARK-42617][PS] Support `isocalendar` from the pandas 2.0.0 - posted by "dzhigimont (via GitHub)" <gi...@apache.org> on 2023/08/25 08:17:59 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42677: [SPARK-44962][INFRA] Add Java 21 `Maven` build to `build_and_test.yml` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 08:29:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42679: [SPARK-44964][ML][CONNECT] Clean up pyspark.ml.connect.functions doctest - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 08:31:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42679: [SPARK-44964][ML][CONNECT] Clean up pyspark.ml.connect.functions doctest - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 08:33:53 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42680: [SPARK-44965][PYTHON] Hide internal functions/variables from `pyspark.sql.functions` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 08:34:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42680: [SPARK-44965][PYTHON] Hide internal functions/variables from `pyspark.sql.functions` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 08:37:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42680: [SPARK-44965][PYTHON] Hide internal functions/variables from `pyspark.sql.functions` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 08:45:40 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42673: [SPARK-44959][BUILD] Upgrade sbt to 1.9.4 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/25 08:49:08 UTC, 4 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42681: [SPARK-44840][SQL][FOLLOWUP] Change the version from 3.5.0 to 3.4.2 for `spark.sql.legacy.negativeIndexInArrayInsert` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/25 08:50:27 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42681: [SPARK-44840][SQL][FOLLOWUP] Change the version from 3.5.0 to 3.4.2 for `spark.sql.legacy.negativeIndexInArrayInsert` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/25 08:51:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42673: [SPARK-44959][BUILD] Upgrade sbt to 1.9.4 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/25 08:54:57 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #42656: Test Java 21-ea - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/25 08:58:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42681: [SPARK-44840][SQL][FOLLOWUP] Change the version from 3.5.0 to 3.4.2 for `spark.sql.legacy.negativeIndexInArrayInsert` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 08:59:38 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42681: [SPARK-44840][SQL][FOLLOWUP] Change the version from 3.5.0 to 3.4.2 for `spark.sql.legacy.negativeIndexInArrayInsert` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 08:59:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42594: [SPARK-44839][SS][CONNECT] Better Error Logging when user tries to serialize spark session - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/25 09:04:42 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42393: [WIP][SPARK-43438][SQL] Error on missing input columns in `INSERT` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/25 09:15:31 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42682: [SPARK-44966][CORE] Change the never changed `var` to `val` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/25 09:20:09 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42673: [SPARK-44959][BUILD] Upgrade sbt to 1.9.4 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/25 09:27:36 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #42668: Test Java 17 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/08/25 10:17:54 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42671: [SPARK-44958][PYTHON][CONNECT][TESTS] Add a test to validate the parity of functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 10:43:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42679: [SPARK-44964][ML][CONNECT][TESTS] Clean up pyspark.ml.connect.functions doctest - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/25 11:01:29 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42683: [SPARK-44967][SQL][CONNECT] Unit should be considered first before using Boolean for TreeNodeTag - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/25 11:34:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42661: [SPARK-44743][SQL] Add `try_reflect` function - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/25 13:48:36 UTC, 9 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42554: [SPARK-44865][SS] Make StreamingRelationV2 support metadata column - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/25 14:08:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42554: [SPARK-44865][SS] Make StreamingRelationV2 support metadata column - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/25 14:09:12 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #42661: [SPARK-44743][SQL] Add `try_reflect` function - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/08/25 14:09:14 UTC, 0 replies.
- [GitHub] [spark] robert3005 opened a new pull request, #42684: [SPARK-21195][CORE] Dynamically register metrics from sources as they are reported - posted by "robert3005 (via GitHub)" <gi...@apache.org> on 2023/08/25 14:43:23 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42661: [SPARK-44743][SQL] Add `try_reflect` function - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/25 14:46:11 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42674: [SPARK-44960][UI] Unescape and consist error summary across UI pages - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/25 14:47:37 UTC, 7 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42591: [SPARK-44784][CONNECT] Make SBT testing hermetic. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/25 15:08:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42668: [SPARK-44968][BUILD] Downgrade ivy from 2.5.2 to 2.5.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 15:26:36 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42668: [SPARK-44968][BUILD] Downgrade ivy from 2.5.2 to 2.5.1 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/25 15:29:32 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42682: [SPARK-44966][CORE][CONNECT] Change the never changed `var` to `val` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 15:32:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42682: [SPARK-44966][CORE][CONNECT] Change the never changed `var` to `val` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/25 15:32:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42674: [SPARK-44960][UI] Unescape and consist error summary across UI pages - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 15:39:12 UTC, 3 replies.
- [GitHub] [spark] hasnain-db opened a new pull request, #42685: [WIP][SPARK-44937][CORE] Add SSL/TLS support for RPC and Shuffle communications - posted by "hasnain-db (via GitHub)" <gi...@apache.org> on 2023/08/25 16:57:23 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42664: [SPARK-44435][SPARK-44484][3.5][SS][CONNECT] Tests for foreachBatch and Listener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/25 17:18:59 UTC, 0 replies.
- [GitHub] [spark] ChenMichael commented on a diff in pull request #42587: [SPARK-44897][SQL] Propagating local properties to subquery broadcast exec - posted by "ChenMichael (via GitHub)" <gi...@apache.org> on 2023/08/25 18:02:02 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42676: [SPARK-44961][PYTHON][CONNECT][TESTS] Make PySpark (pyspark-connect module) tests passing without any dependency - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 18:24:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42676: [SPARK-44961][PYTHON][CONNECT][TESTS] Make PySpark (pyspark-connect module) tests passing without any dependency - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 18:25:45 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42678: [SPARK-44963][PYTHON][ML][TESTS] Make PySpark (pyspark-ml module) tests passing without any dependency - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 18:27:28 UTC, 0 replies.
- [GitHub] [spark] hasnain-db commented on pull request #42685: [WIP][SPARK-44937][CORE] Add SSL/TLS support for RPC and Shuffle communications - posted by "hasnain-db (via GitHub)" <gi...@apache.org> on 2023/08/25 18:33:08 UTC, 1 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42686: [SPARK-44971][BUG Fix][Python] StreamingQueryProgress event fromJson bug fix - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/25 18:44:03 UTC, 0 replies.
- [GitHub] [spark] turp1twin commented on pull request #42685: [WIP][SPARK-44937][CORE] Add SSL/TLS support for RPC and Shuffle communications - posted by "turp1twin (via GitHub)" <gi...@apache.org> on 2023/08/25 18:49:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42686: [SPARK-44971][BUG Fix][Python] StreamingQueryProgress event fromJson bug fix - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 18:53:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42686: [SPARK-44971][Python] StreamingQueryProgress event fromJson bug fix - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 18:55:57 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42686: [SPARK-44971][Python] StreamingQueryProgress event fromJson bug fix - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/25 19:04:33 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42678: [SPARK-44963][PYTHON][ML][TESTS] Make PySpark (pyspark-ml module) tests passing without any optional dependency - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 19:57:19 UTC, 0 replies.
- [GitHub] [spark] hasnain-db commented on a diff in pull request #42685: [WIP][SPARK-44937][CORE] Add SSL/TLS support for RPC and Shuffle communications - posted by "hasnain-db (via GitHub)" <gi...@apache.org> on 2023/08/25 20:51:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42673: [SPARK-44959][BUILD] Upgrade sbt to 1.9.4 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/25 21:14:29 UTC, 3 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42687: [SPARK-44433][FOLLOWUP] Clean up Running python StreamingQueryLIstener processes when session expires - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/25 23:11:20 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42687: [SPARK-44433][FOLLOWUP] Clean up Running python StreamingQueryLIstener processes when session expires - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/25 23:11:38 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42646: [SPARK-44302][BUILD] Reenable PySpark test on the daily test of Java 21 after the new arrow version release - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/26 00:17:44 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42688: [SPARK-44974][CONNECT] Null out SparkSession/Dataset/KeyValueGroupedDatset on serialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/26 00:52:30 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42689: [SPARK-44975][SQL] Remove BinaryArithmetic useless override resolved - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/26 02:44:32 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42661: [SPARK-44743][SQL] Add `try_reflect` function - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/26 03:27:12 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42679: [SPARK-44964][ML][CONNECT][TESTS] Clean up pyspark.ml.connect.functions doctest - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/26 04:13:11 UTC, 0 replies.
- [GitHub] [spark] eubnara opened a new pull request, #42690: [SPARK-44976] Utils.getCurrentUserName should return the full principal name - posted by "eubnara (via GitHub)" <gi...@apache.org> on 2023/08/26 04:53:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42660: [SPARK-44969][SQL] Reuse `ArrayInsert` in `ArrayAppend` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/26 05:51:33 UTC, 0 replies.
- [GitHub] [spark] eubnara commented on pull request #42690: [SPARK-44976] Utils.getCurrentUserName should return the full principal name - posted by "eubnara (via GitHub)" <gi...@apache.org> on 2023/08/26 07:39:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42668: [SPARK-44968][BUILD] Downgrade ivy from 2.5.2 to 2.5.1 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/26 09:31:24 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42668: [SPARK-44968][BUILD] Downgrade ivy from 2.5.2 to 2.5.1 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/26 09:32:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42668: [SPARK-44968][BUILD] Downgrade ivy from 2.5.2 to 2.5.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/26 09:48:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42675: [SPARK-42944][PYTHON][FOLLOW-UP] Rename tests from foreachBatch to foreach_batch - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/26 09:55:12 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42689: [SPARK-44975][SQL] Remove BinaryArithmetic useless override resolved - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/26 18:09:56 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42689: [SPARK-44975][SQL] Remove BinaryArithmetic useless override resolved - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/26 18:11:32 UTC, 0 replies.
- [GitHub] [spark] sarutak commented on a diff in pull request #42674: [SPARK-44960][UI] Unescape and consist error summary across UI pages - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/08/26 20:44:14 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41196: [SPARK-43505][K8S] support env variables substitution and executor library path - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/27 00:18:08 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41189: [DO NOT MERGE] [POC] run foreachBatch() on client - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/27 00:18:10 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41139: [SPARK-40887][K8S] Make `SPARK_DRIVER_LOG_URL_` and `SPARK_DRIVER_ATTRIBUTE_` work for Spark on K8S - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/27 00:18:12 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41083: [SPARK-43399][CORE] Add config to control threshold of unregister map ouput when fetch failed - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/27 00:18:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41071: [SPARK-43391][CORE] Idle connection should be kept when closeIdleConnection is disabled - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/27 00:18:16 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #39819: [SPARK-42252][CORE] Add `spark.shuffle.localDisk.file.output.buffer` and deprecate `spark.shuffle.unsafe.file.output.buffer` - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/27 00:18:17 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42689: [SPARK-44975][SQL] Remove BinaryArithmetic useless override resolved - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/27 01:37:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42587: [SPARK-44897][SQL] Propagating local properties to subquery broadcast exec - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/27 02:23:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42679: [SPARK-44964][ML][CONNECT][TESTS] Clean up pyspark.ml.connect.functions doctest - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/27 03:26:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42591: [SPARK-44784][CONNECT] Make SBT testing hermetic. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/27 03:29:10 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42675: [SPARK-42944][PYTHON][FOLLOW-UP] Rename tests from foreachBatch to foreach_batch - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/27 04:52:13 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42093: [SPARK-44497][WEBUI] Show task partition id in Task table - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/27 05:29:07 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42093: [SPARK-44497][WEBUI] Show task partition id in Task table - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/27 05:30:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42679: [SPARK-44964][ML][CONNECT][TESTS] Clean up pyspark.ml.connect.functions doctest - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/27 05:37:31 UTC, 0 replies.
- [GitHub] [spark] jinhai-cloud commented on pull request #38609: [SPARK-40593][BUILD][CONNECT] Support user configurable `protoc` and `protoc-gen-grpc-java` executables when building Spark Connect. - posted by "jinhai-cloud (via GitHub)" <gi...@apache.org> on 2023/08/27 05:40:31 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42625: [SPARK-44972][INFRA] Eagerly check if the token is valid to align with the behavior of username/password auth - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/27 05:47:40 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42674: [SPARK-44960][UI] Unescape and consist error summary across UI pages - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/27 05:56:22 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42691: [SPARK-44978][SQL][TEST] Fix SQLQueryTestSuite unable create table normally - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/27 06:32:05 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42691: [SPARK-44978][SQL][TEST] Fix SQLQueryTestSuite unable create table normally - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/27 06:33:17 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38609: [SPARK-40593][BUILD][CONNECT] Support user configurable `protoc` and `protoc-gen-grpc-java` executables when building Spark Connect. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/27 07:00:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42673: [SPARK-44959][BUILD] Upgrade sbt to 1.9.4 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/27 07:14:14 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42660: [SPARK-44969][SQL] Reuse `ArrayInsert` in `ArrayAppend` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/27 07:36:48 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #42529: [SPARK-44845][YARN][DEPLOY]Fixed file system uri comparison function - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/27 17:56:12 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #42529: [SPARK-44845][YARN][DEPLOY]Fixed file system uri comparison function - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/27 17:59:08 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42674: [SPARK-44960][UI] Unescape and consist error summary across UI pages - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/08/27 21:09:16 UTC, 4 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #42667: [SPARK-44940][SQL] Improve performance of JSON parsing when "spark.sql.json.enablePartialResults" is enabled - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/08/27 23:08:26 UTC, 8 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41196: [SPARK-43505][K8S] support env variables substitution and executor library path - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/28 00:17:33 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41189: [DO NOT MERGE] [POC] run foreachBatch() on client - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/28 00:17:34 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41139: [SPARK-40887][K8S] Make `SPARK_DRIVER_LOG_URL_` and `SPARK_DRIVER_ATTRIBUTE_` work for Spark on K8S - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/28 00:17:35 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41071: [SPARK-43391][CORE] Idle connection should be kept when closeIdleConnection is disabled - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/28 00:17:36 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #39819: [SPARK-42252][CORE] Add `spark.shuffle.localDisk.file.output.buffer` and deprecate `spark.shuffle.unsafe.file.output.buffer` - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/28 00:17:37 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42692: [SPARK-42944][PYTHON][FOLLOW-UP][3.5] Rename tests from foreachBatch to foreach_batch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 00:48:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42691: [SPARK-44978][SQL][TEST] Fix SQLQueryTestSuite unable create table normally - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 00:53:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42686: [SPARK-44971][PYTHON] StreamingQueryProgress event fromJson bug fix - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 00:59:48 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42691: [SPARK-44978][SQL][TEST] Fix SQLQueryTestSuite unable create table normally - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/28 01:14:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42691: [SPARK-44978][SQL][TEST] Fix SQLQueryTestSuite unable create table normally - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 01:22:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42691: [SPARK-44978][SQL][TEST] Fix SQLQueryTestSuite unable create table normally - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 01:22:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42680: [SPARK-44965][PYTHON] Hide internal functions/variables from `pyspark.sql.functions` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/28 01:46:57 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #42683: [SPARK-44967][SQL][CONNECT] Unit should be considered first before using Boolean for TreeNodeTag - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/08/28 02:18:20 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42693: [SPARK-44980][PYTHON][CONNECT] Fix inherited namedtuples to work in createDataFrame - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 02:22:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42694: [SPARK-44981][PYTHON][CONNECT] Filter out static configurations used in local mode - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 02:45:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42678: [SPARK-44963][PYTHON][ML][TESTS] Make PySpark (pyspark-ml module) tests passing without any optional dependency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 02:54:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42678: [SPARK-44963][PYTHON][ML][TESTS] Make PySpark (pyspark-ml module) tests passing without any optional dependency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 02:55:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42587: [SPARK-44897][SQL] Propagating local properties to subquery broadcast exec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/28 02:56:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42587: [SPARK-44897][SQL] Propagating local properties to subquery broadcast exec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/28 02:57:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42695: [SPARK-44982][CONNECT] Mark Spark Connect server configurations as static - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 03:06:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42693: [SPARK-44980][PYTHON][CONNECT] Fix inherited namedtuples to work in createDataFrame - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 03:14:06 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42694: [SPARK-44981][PYTHON][CONNECT] Filter out static configurations used in local mode - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 03:14:19 UTC, 3 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42696: Test Yarn module with `-Dtest.exclude.tags=org.apache.spark.tags.ExtendedLevelDBTest` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/28 03:21:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41673: [SPARK-44091][YARN][TESTS] Introduce `withResourceTypes` to `ResourceRequestTestHelper` to restore `resourceTypes` as default value after testing - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/28 03:31:20 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42694: [SPARK-44981][PYTHON][CONNECT] Filter out static configurations used in local mode - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/28 03:41:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42658: [SPARK-44945][DOCS][PYTHON] Automate PySpark error class documentation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 04:14:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42692: [SPARK-42944][PYTHON][FOLLOW-UP][3.5] Rename tests from foreachBatch to foreach_batch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 04:32:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42692: [SPARK-42944][PYTHON][FOLLOW-UP][3.5] Rename tests from foreachBatch to foreach_batch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 04:33:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #41673: [SPARK-44091][YARN][TESTS] Introduce `withResourceTypes` to `ResourceRequestTestHelper` to restore `resourceTypes` as default value after testing - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 04:35:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #41673: [SPARK-44091][YARN][TESTS] Introduce `withResourceTypes` to `ResourceRequestTestHelper` to restore `resourceTypes` as default value after testing - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 04:39:29 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #42696: Test Yarn module with `-Dtest.exclude.tags=org.apache.spark.tags.ExtendedLevelDBTest` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/28 04:57:17 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42674: [SPARK-44960][UI] Unescape and consist error summary across UI pages - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/28 05:52:15 UTC, 0 replies.
- [GitHub] [spark] wForget opened a new pull request, #42697: [MINOR][SQL][DOC] Fix incorrect link in sql menu and typo - posted by "wForget (via GitHub)" <gi...@apache.org> on 2023/08/28 06:07:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42693: [SPARK-44980][PYTHON][CONNECT] Fix inherited namedtuples to work in createDataFrame - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 06:47:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42698: [SPARK-44984][PYTHON][CONNECT] Remove `_get_alias` from DataFrame - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/28 07:50:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42698: [SPARK-44984][PYTHON][CONNECT] Remove `_get_alias` from DataFrame - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/28 07:53:52 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42632: [SPARK-44983][SQL] Convert binary to string by `to_char` for the formats: `hex`, `base64`, `utf-8` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/28 07:55:48 UTC, 4 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42699: [SPARK-44985][CORE] Use toString instead of stacktrace for task reaper threadDump - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/28 08:04:28 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42699: [SPARK-44985][CORE] Use toString instead of stacktrace for task reaper threadDump - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/28 08:05:38 UTC, 1 replies.
- [GitHub] [spark] yaooqinn closed pull request #42697: [MINOR][SQL][DOC] Fix incorrect link in sql menu and typo - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/28 08:09:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42700: [DO-NOT-MERGE] Avoid setting duration and stream size that might slow down the tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 08:10:17 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42697: [MINOR][SQL][DOC] Fix incorrect link in sql menu and typo - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/28 08:11:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42694: [SPARK-44981][PYTHON][CONNECT] Filter out static configurations used in local mode - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 08:34:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42695: [SPARK-44982][CONNECT] Mark Spark Connect server configurations as static - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 08:36:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42695: [SPARK-44982][CONNECT] Mark Spark Connect server configurations as static - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/28 08:36:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42701: Refactor `ExternalSorter#getPartition` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/28 08:44:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42632: [SPARK-44983][SQL] Convert binary to string by `to_char` for the formats: `hex`, `base64`, `utf-8` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/28 08:45:01 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #42701: Test Refactor `ExternalSorter#getPartition` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/28 08:54:11 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42702: [SPARK-44986][DOCS] There should be a gap at the bottom of the HTML - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/28 08:54:47 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42702: [SPARK-44986][DOCS] There should be a gap at the bottom of the HTML - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/28 08:55:42 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42702: [SPARK-44986][DOCS] There should be a gap at the bottom of the HTML - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/28 11:43:54 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42698: [SPARK-44984][PYTHON][CONNECT] Remove `_get_alias` from DataFrame - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/28 11:45:25 UTC, 0 replies.
- [GitHub] [spark] eubnara commented on pull request #42690: [SPARK-44976] Preserve full principal user name on executor side - posted by "eubnara (via GitHub)" <gi...@apache.org> on 2023/08/28 12:56:10 UTC, 0 replies.
- [GitHub] [spark] eubnara commented on a diff in pull request #42690: [SPARK-44976] Preserve full principal user name on executor side - posted by "eubnara (via GitHub)" <gi...@apache.org> on 2023/08/28 12:57:19 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42688: [SPARK-44974][CONNECT] Null out SparkSession/Dataset/KeyValueGroupedDatset on serialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/28 13:04:51 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42688: [SPARK-44974][CONNECT] Null out SparkSession/Dataset/KeyValueGroupedDatset on serialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/28 13:05:29 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42518: [SPARK-44832][CONNECT] Make transitive dependencies work properly for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/28 13:08:30 UTC, 2 replies.
- [GitHub] [spark] sthagedorn commented on pull request #39291: [SPARK-41629][CONNECT] Support for Protocol Extensions in Relation and Expression - posted by "sthagedorn (via GitHub)" <gi...@apache.org> on 2023/08/28 13:38:28 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #39291: [SPARK-41629][CONNECT] Support for Protocol Extensions in Relation and Expression - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/28 13:43:42 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42632: [SPARK-44983][SQL] Convert binary to string by `to_char` for the formats: `hex`, `base64`, `utf-8` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/28 13:54:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42632: [SPARK-44983][SQL] Convert binary to string by `to_char` for the formats: `hex`, `base64`, `utf-8` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/28 13:56:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42703: [SPARK-44868][SQL][FOLLOWUP] Invoke the `to_varchar` function in Scala API - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/28 14:13:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42524: [SPARK-44837][SQL] Improve ALTER TABLE ALTER PARTITION column error message - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/28 14:26:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42556: [SPARK-44867][CONNECT][DOCS] Refactor Spark Connect Docs to incorporate Scala setup - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/28 14:39:11 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42556: [SPARK-44867][CONNECT][DOCS] Refactor Spark Connect Docs to incorporate Scala setup - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/28 14:39:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42625: [SPARK-44972][INFRA] Eagerly check if the token is valid to align with the behavior of username/password auth - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 16:05:29 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on pull request #42556: [SPARK-44867][CONNECT][DOCS] Refactor Spark Connect Docs to incorporate Scala setup - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/08/28 16:05:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42625: [SPARK-44972][INFRA] Eagerly check if the token is valid to align with the behavior of username/password auth - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 16:06:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42699: [SPARK-44985][CORE] Use toString instead of stacktrace for task reaper threadDump - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 16:07:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42699: [SPARK-44985][CORE] Use toString instead of stacktrace for task reaper threadDump - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 16:07:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42695: [SPARK-44982][CONNECT] Mark Spark Connect server configurations as static - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 16:08:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42694: [SPARK-44981][PYTHON][CONNECT] Filter out static configurations used in local mode - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 16:08:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42704: [SPARK-44989][INFRA] Add a directional message to promote JIRA_ACCESS_TOKEN - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 16:32:38 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42704: [SPARK-44989][INFRA] Add a directional message to promote JIRA_ACCESS_TOKEN - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 16:38:30 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42704: [SPARK-44989][INFRA] Add a directional message to promote JIRA_ACCESS_TOKEN - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/28 16:57:22 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42700: [DO-NOT-MERGE] Avoid setting duration and stream size that might slow down the tests - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/28 17:11:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42704: [SPARK-44989][INFRA] Add a directional message to promote JIRA_ACCESS_TOKEN - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 17:12:45 UTC, 3 replies.
- [GitHub] [spark] michaelzhan-db commented on a diff in pull request #42524: [SPARK-44837][SQL] Improve ALTER TABLE ALTER PARTITION column error message - posted by "michaelzhan-db (via GitHub)" <gi...@apache.org> on 2023/08/28 17:25:05 UTC, 3 replies.
- [GitHub] [spark] wayneguow opened a new pull request, #39819: [SPARK-42252][CORE] Add `spark.shuffle.localDisk.file.output.buffer` and deprecate `spark.shuffle.unsafe.file.output.buffer` - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/08/28 17:36:03 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39819: [SPARK-42252][CORE] Add `spark.shuffle.localDisk.file.output.buffer` and deprecate `spark.shuffle.unsafe.file.output.buffer` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 17:36:40 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #42518: [SPARK-44832][CONNECT] Make transitive dependencies work properly for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/28 17:53:50 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42703: [SPARK-44868][SQL][FOLLOWUP] Invoke the `to_varchar` function in Scala API - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/28 17:57:00 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42703: [SPARK-44868][SQL][FOLLOWUP] Invoke the `to_varchar` function in Scala API - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/28 17:57:38 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #41808: [SPARK-44162][CORE] Support G1GC in spark metrics - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/28 18:03:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42306: [SPARK-44647][SQL] Support SPJ where join keys are less than cluster keys - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 18:12:38 UTC, 1 replies.
- [GitHub] [spark] szehon-ho commented on pull request #42306: [SPARK-44647][SQL] Support SPJ where join keys are less than cluster keys - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/08/28 18:21:29 UTC, 0 replies.
- [GitHub] [spark] gcoffmanQ commented on pull request #41791: [SPARK-44285] MSK IAM Support - posted by "gcoffmanQ (via GitHub)" <gi...@apache.org> on 2023/08/28 18:38:35 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #41683: [SPARK-36680][SQL] Supports Dynamic Table Options for Spark SQL - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 18:44:54 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42704: [SPARK-44989][INFRA] Add a directional message to promote JIRA_ACCESS_TOKEN - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 18:48:19 UTC, 0 replies.
- [GitHub] [spark] agubichev opened a new pull request, #42705: [SPARK-36191][SQL] Handle limit and order by in correlated scalar (lateral) subqueries - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2023/08/28 19:26:36 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #41791: [SPARK-44285] MSK IAM Support - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/08/28 20:27:26 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42702: [SPARK-44986][DOCS] There should be a gap at the bottom of the HTML - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/28 21:00:49 UTC, 0 replies.
- [GitHub] [spark] valentinp17 opened a new pull request, #42706: [SPARK-42304][SQL] Assign name to _LEGACY_ERROR_TEMP_2189 - posted by "valentinp17 (via GitHub)" <gi...@apache.org> on 2023/08/28 21:25:06 UTC, 0 replies.
- [GitHub] [spark] valentinp17 commented on pull request #42706: [SPARK-42304][SQL] Assign name to _LEGACY_ERROR_TEMP_2189 - posted by "valentinp17 (via GitHub)" <gi...@apache.org> on 2023/08/28 21:35:32 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42707: [SPARK-44993][CORE] Move `compareChecksums` from `ShuffleChecksumTestHelpe` to `ShuffleChecksumUtils` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 21:43:26 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42708: [SPARK-44994][PYTHON][DOCS] Refine docstring of DataFrame.filter - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/28 22:14:47 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42595: [SPARK-44901][SQL] Add API in Python UDTF 'analyze' method to return partitioning/ordering expressions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/28 22:51:04 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42709: [SPARK-44995][K8S] Promote SparkKubernetesClientFactory to DeveloperApi - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 23:00:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42707: [SPARK-44993][CORE] Add `ShuffleChecksumUtils.compareChecksums` by reusing `ShuffleChecksumTestHelp.compareChecksums` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 23:29:10 UTC, 2 replies.
- [GitHub] [spark] tianhanhu-db commented on a diff in pull request #42618: [SPARK-44919][AVRO] Avro connector: convert a union of a single primitive type to a StructType - posted by "tianhanhu-db (via GitHub)" <gi...@apache.org> on 2023/08/28 23:39:43 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42709: [SPARK-44995][K8S] Promote `SparkKubernetesClientFactory` to `DeveloperApi` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/28 23:50:49 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41808: [SPARK-44162][CORE] Support G1GC in spark metrics - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/29 00:05:40 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42707: [SPARK-44993][CORE] Add `ShuffleChecksumUtils.compareChecksums` by reusing `ShuffleChecksumTestHelp.compareChecksums` - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/08/29 00:08:50 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42710: [SPARK-44996][K8S] `VolcanoFeatureStep` should not create `DefaultVolcanoClient` if not needed - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 00:16:44 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #41244: [WIP] fix trim bug - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/29 00:17:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41083: [SPARK-43399][CORE] Add config to control threshold of unregister map ouput when fetch failed - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/29 00:17:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42707: [SPARK-44993][CORE] Add `ShuffleChecksumUtils.compareChecksums` by reusing `ShuffleChecksumTestHelp.compareChecksums` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 00:23:31 UTC, 5 replies.
- [GitHub] [spark] itholic commented on pull request #42706: [SPARK-42304][SQL] Assign name to _LEGACY_ERROR_TEMP_2189 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/29 00:26:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42707: [SPARK-44993][CORE] Add `ShuffleChecksumUtils.compareChecksums` by reusing `ShuffleChecksumTestHelp.compareChecksums` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 00:29:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42709: [SPARK-44995][K8S] Promote `SparkKubernetesClientFactory` to `DeveloperApi` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 00:30:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42710: [SPARK-44996][K8S] `VolcanoFeatureStep` should not create `DefaultVolcanoClient` if not needed - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 00:32:12 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42706: [SPARK-42304][SQL] Assign name to _LEGACY_ERROR_TEMP_2189 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/29 00:33:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42683: [SPARK-44967][SQL][CONNECT] Unit should be considered first before using Boolean for TreeNodeTag - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/29 00:46:23 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42710: [SPARK-44996][K8S] Use `lazy val` for `DefaultVolcanoClient` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/29 00:48:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42488: [SPARK-44804][SQL] SortMergeJoin should respect the streamed side ordering - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/29 01:10:55 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42680: [SPARK-44965][PYTHON] Hide internal functions/variables from `pyspark.sql.functions` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/29 01:29:43 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #42618: [SPARK-44919][AVRO] Avro connector: convert a union of a single primitive type to a StructType - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/08/29 01:32:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42708: [SPARK-44994][PYTHON][DOCS] Refine docstring of DataFrame.filter - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/29 01:32:47 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42637: [SPARK-44728][PYTHON][DOCS]Add examples to approxQuantile docstring - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/29 01:45:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42637: [SPARK-44728][PYTHON][DOCS]Add examples to approxQuantile docstring - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/29 01:46:40 UTC, 3 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42549: [SPARK-44860][SQL] Add SESSION_USER function - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/29 02:41:46 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42549: [SPARK-44860][SQL] Add SESSION_USER function - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/29 02:42:54 UTC, 0 replies.
- [GitHub] [spark] wForget opened a new pull request, #42711: [SPARK-44998] Do not retry when FileNotFoundException occurs - posted by "wForget (via GitHub)" <gi...@apache.org> on 2023/08/29 03:03:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/29 03:15:35 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42712: [SPARK-44997][DOCS] Align example order (Python -> Scala/Java -> R) in all Spark Doc Content - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/29 03:17:29 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42712: [SPARK-44997][DOCS] Align example order (Python -> Scala/Java -> R) in all Spark Doc Content - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/29 03:22:29 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #42510: [BUILD] Upgrade tink to 1.10.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/29 03:23:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42712: [SPARK-44997][DOCS] Align example order (Python -> Scala/Java -> R) in all Spark Doc Content - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/29 03:26:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42713: External sorter partitioner - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/29 03:35:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42714: [SPARK-45000][PYTHON][CONNECT] Implement DataFrame.foreach - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/29 04:10:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42715: [SPARK-45001][PYTHON][CONNECT] Implement DataFrame.foreachPartition - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/29 04:36:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42715: [SPARK-45001][PYTHON][CONNECT] Implement DataFrame.foreachPartition - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/29 04:41:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42715: [SPARK-45001][PYTHON][CONNECT] Implement DataFrame.foreachPartition - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/29 04:59:29 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42710: [SPARK-44996][K8S] Use `lazy val` for `DefaultVolcanoClient` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 05:13:10 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42710: [SPARK-44996][K8S] Use `lazy val` for `DefaultVolcanoClient` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 05:13:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42710: [SPARK-44996][K8S] Use `lazy val` for `DefaultVolcanoClient` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 05:13:48 UTC, 1 replies.
- [GitHub] [spark] xuanyuanking commented on pull request #42428: [SPARK-44742][PYTHON][DOCS] Add Spark version drop down to the PySpark doc site - posted by "xuanyuanking (via GitHub)" <gi...@apache.org> on 2023/08/29 05:47:51 UTC, 0 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #42716: [SPARK-45002] Avoid uncaught exception from state store maintenance task thread on error to prevent executor being killed - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/29 05:54:47 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #42716: [SPARK-45002][SS] Avoid uncaught exception from state store maintenance task thread on error to prevent executor being killed - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/29 05:55:49 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42717: [SPARK-45003][PYTHON][DOCS] Refine docstring of `asc/desc` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/29 06:11:38 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #42716: [SPARK-45002][SS] Avoid uncaught exception from state store maintenance task thread on error to prevent executor being killed - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/29 06:13:34 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42717: [SPARK-45003][PYTHON][DOCS] Refine docstring of `asc/desc` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/29 06:15:40 UTC, 1 replies.
- [GitHub] [spark] sadikovi commented on pull request #42667: [SPARK-44940][SQL] Improve performance of JSON parsing when "spark.sql.json.enablePartialResults" is enabled - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/08/29 06:16:58 UTC, 1 replies.
- [GitHub] [spark] anishshri-db commented on a diff in pull request #42716: [SPARK-45002][SS] Avoid uncaught exception from state store maintenance task thread on error to prevent executor being killed - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/29 06:21:40 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42393: [SPARK-43438][SQL] Error on missing input columns in `INSERT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/29 07:28:59 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42716: [SPARK-45002][SS] Avoid uncaught exception from state store maintenance task thread on error to prevent executor being killed - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 07:40:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42718: [SPARK-44981][PYTHON][CONNECT][FOLLOW-UP] Explicitly pass runtime configurations only - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/29 07:58:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42718: [SPARK-44981][PYTHON][CONNECT][FOLLOW-UP] Explicitly pass runtime configurations only - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/29 07:58:41 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42718: [SPARK-44981][PYTHON][CONNECT][FOLLOW-UP] Explicitly pass runtime configurations only - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/29 08:02:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42393: [SPARK-43438][SQL] Error on missing input columns in `INSERT` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/29 08:06:46 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42714: [SPARK-45000][PYTHON][CONNECT] Implement DataFrame.foreach - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 09:07:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42714: [SPARK-45000][PYTHON][CONNECT] Implement DataFrame.foreach - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 09:08:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42715: [SPARK-45001][PYTHON][CONNECT] Implement DataFrame.foreachPartition - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 09:10:37 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #42719: [SPARK-45005][CONNECT][PS][TESTS] Reducing the CI time by splitting the slow tests - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/29 09:24:57 UTC, 0 replies.
- [GitHub] [spark] valentinp17 commented on a diff in pull request #42706: [SPARK-42304][SQL] Assign name to _LEGACY_ERROR_TEMP_2189 - posted by "valentinp17 (via GitHub)" <gi...@apache.org> on 2023/08/29 09:45:18 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42720: [SPARK-45006][UI] Use the same date format of other UI date elements for the x-axis of timelines - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/29 10:01:07 UTC, 0 replies.
- [GitHub] [spark] tisonkun opened a new pull request, #42721: Interrupt receiveLoop on MessageLoop shutdown - posted by "tisonkun (via GitHub)" <gi...@apache.org> on 2023/08/29 10:28:25 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42722: [SPARK-45007][INFRA] Fix merged pull requests resolution - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/29 10:41:39 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42722: [SPARK-45007][INFRA] Fix merged pull requests resolution - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/29 10:45:18 UTC, 1 replies.
- [GitHub] [spark] zzzzming95 commented on pull request #42574: [SPARK-43149][SQL] `CreateDataSourceTableCommand` should create metadata first - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/08/29 11:35:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42683: [SPARK-44967][SQL][CONNECT] Unit should be considered first before using Boolean for TreeNodeTag - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/29 11:38:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42683: [SPARK-44967][SQL][CONNECT] Unit should be considered first before using Boolean for TreeNodeTag - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/29 11:39:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41630: [SPARK-44080][SQL] Support overriding SQL configurations for new connections - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/29 11:42:51 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42574: [SPARK-43149][SQL] `CreateDataSourceTableCommand` should create metadata first - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/29 11:47:17 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41630: [SPARK-44080][SQL] Support overriding SQL configurations for new connections - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/29 13:14:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42718: [SPARK-44981][PYTHON][CONNECT][FOLLOW-UP] Explicitly pass runtime configurations only - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/29 13:45:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42715: [SPARK-45001][PYTHON][CONNECT] Implement DataFrame.foreachPartition - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/29 13:46:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42715: [SPARK-45001][PYTHON][CONNECT] Implement DataFrame.foreachPartition - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/29 13:46:51 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42720: [SPARK-45006][UI] Use the same date format of other UI date elements for the x-axis of timelines - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/29 14:14:25 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #42377: [SPARK-44622][SQL][CONNECT] Implement error enrichment and setting server-side stacktrace - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/29 14:29:08 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42722: [SPARK-45007][INFRA] Fix merged pull requests resolution - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 15:09:41 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42722: [SPARK-45007][INFRA] Fix merged pull requests resolution - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/29 15:14:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42720: [SPARK-45006][UI] Use the same date format of other UI date elements for the x-axis of timelines - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 15:19:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42720: [SPARK-45006][UI] Use the same date format of other UI date elements for the x-axis of timelines - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 15:19:19 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42722: [SPARK-45007][INFRA] Fix merged pull requests resolution - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 15:22:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42722: [SPARK-45007][INFRA] Fix merged pull requests resolution - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 15:22:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42713: [SPARK-44999][CORE] Refactor `ExternalSorter` to reduce checks on `shouldPartition` when calling `getPartition` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 15:30:16 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on a diff in pull request #42705: [SPARK-36191][SQL] Handle limit and order by in correlated scalar (lateral) subqueries - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/08/29 15:30:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42717: [SPARK-45003][PYTHON][DOCS] Refine docstring of `asc/desc` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 15:32:43 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42717: [SPARK-45003][PYTHON][DOCS] Refine docstring of `asc/desc` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 15:33:34 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42712: [SPARK-44997][DOCS] Align example order (Python -> Scala/Java -> R) in all Spark Doc Content - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 15:36:43 UTC, 1 replies.
- [GitHub] [spark] jchen5 commented on pull request #42705: [SPARK-36191][SQL] Handle limit and order by in correlated scalar (lateral) subqueries - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/08/29 15:38:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42713: [SPARK-44999][CORE] Refactor `ExternalSorter` to reduce checks on `shouldPartition` when calling `getPartition` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/29 15:53:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42717: [SPARK-45003][PYTHON][DOCS] Refine docstring of `asc/desc` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/29 15:54:07 UTC, 0 replies.
- [GitHub] [spark] michaelzhan-db commented on a diff in pull request #42637: [SPARK-44728][PYTHON][DOCS]Add examples to approxQuantile docstring - posted by "michaelzhan-db (via GitHub)" <gi...@apache.org> on 2023/08/29 16:20:16 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42723: [SPARK-45008][INFRA] Improve branch suggestion for backporting - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/29 16:22:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42724: [SPARK-44915][CORE] Validate checksum of remounted PVC's shuffle data before recovery - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 17:20:05 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42708: [SPARK-44994][PYTHON][DOCS] Refine docstring of DataFrame.filter - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/29 18:43:30 UTC, 0 replies.
- [GitHub] [spark] andylam-db opened a new pull request, #42725: Decorrelate predicate subqueries in join condition - posted by "andylam-db (via GitHub)" <gi...@apache.org> on 2023/08/29 19:12:07 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42393: [SPARK-43438][SQL] Error on missing input columns in `INSERT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/29 20:04:57 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #42595: [SPARK-44901][SQL] Add API in Python UDTF 'analyze' method to return partitioning/ordering expressions - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/08/29 20:07:36 UTC, 3 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42726: [SPARK-44640][PYTHON][FOLLOW-UP] Update INVALID_ARROW_UDTF_RETURN_TYPE error message - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/29 20:15:54 UTC, 0 replies.
- [GitHub] [spark] agubichev commented on a diff in pull request #42705: [SPARK-36191][SQL] Handle limit and order by in correlated scalar (lateral) subqueries - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2023/08/29 20:29:38 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42726: [SPARK-44640][PYTHON][FOLLOW-UP] Update INVALID_ARROW_UDTF_RETURN_TYPE error message - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/08/29 20:39:39 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42393: [SPARK-43438][SQL] Error on missing input columns in `INSERT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/29 20:39:51 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42726: [SPARK-44640][PYTHON][FOLLOW-UP] Update INVALID_ARROW_UDTF_RETURN_TYPE error message - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/29 20:50:11 UTC, 0 replies.
- [GitHub] [spark] agubichev commented on pull request #42705: [SPARK-36191][SQL] Handle limit and order by in correlated scalar (lateral) subqueries - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2023/08/29 20:51:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42727: [SPARK-45010][INFRA] Limit GHA job execution time to up to 5 hours in `build_and_test.yml` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 20:55:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42727: [SPARK-45010][INFRA] Limit GHA job execution time to up to 5 hours in `build_and_test.yml` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/29 20:58:37 UTC, 4 replies.
- [GitHub] [spark] jchen5 commented on a diff in pull request #42725: [SPARK-45009][SQL] Decorrelate predicate subqueries in join condition - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/08/29 20:59:00 UTC, 1 replies.
- [GitHub] [spark] andylam-db commented on a diff in pull request #42725: [SPARK-45009][SQL] Decorrelate predicate subqueries in join condition - posted by "andylam-db (via GitHub)" <gi...@apache.org> on 2023/08/29 21:27:03 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42717: [SPARK-45003][PYTHON][DOCS] Refine docstring of `asc/desc` - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/29 21:39:09 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42728: [SPARK-45011][PYTHON][DOCS] Refine docstring of Column.between - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/29 21:50:39 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42726: [SPARK-44640][PYTHON][FOLLOW-UP] Update INVALID_ARROW_UDTF_RETURN_TYPE error message - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/29 22:28:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #41244: [WIP] fix trim bug - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/08/30 00:17:04 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42716: [SPARK-45002][SS] Avoid uncaught exception from state store maintenance task thread on error to prevent executor being killed - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/30 01:22:14 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #42716: [SPARK-45002][SS] Avoid uncaught exception from state store maintenance task thread on error to prevent executor being killed - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/30 01:23:02 UTC, 0 replies.
- [GitHub] [spark] wForget commented on pull request #42711: [SPARK-44998] Do not retry when FileNotFoundException occurs - posted by "wForget (via GitHub)" <gi...@apache.org> on 2023/08/30 01:37:14 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #42729: [SPARK-45012][SQL] CheckAnalysis should throw inlined plan in AnalysisException - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/08/30 01:39:13 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42727: [SPARK-45010][INFRA] Limit GHA job execution time to up to 5 hours in `build_and_test.yml` - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/30 01:44:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42637: [SPARK-44728][PYTHON][DOCS]Add examples to approxQuantile docstring - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/30 02:45:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42728: [SPARK-45011][PYTHON][DOCS] Refine docstring of Column.between - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/30 02:48:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42728: [SPARK-45011][PYTHON][DOCS] Refine docstring of Column.between - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/30 02:48:37 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42377: [SPARK-44622][SQL][CONNECT] Implement error enrichment and setting server-side stacktrace - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/30 02:49:51 UTC, 10 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42717: [SPARK-45003][PYTHON][DOCS] Refine docstring of `asc/desc` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/30 02:55:51 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42719: [WIP][SPARK-45005][CONNECT][PS][TESTS] Reducing the CI time by splitting the slow tests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/30 03:05:47 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42719: [WIP][SPARK-45005][CONNECT][PS][TESTS] Reducing the CI time by splitting the slow tests - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/30 03:27:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42727: [SPARK-45010][INFRA] Limit GHA job execution time to up to 5 hours in `build_and_test.yml` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/30 03:32:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42712: [SPARK-44997][DOCS] Align example order (Python -> Scala/Java -> R) in all Spark Doc Content - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/30 03:36:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42700: [DO-NOT-MERGE] Avoid setting duration and stream size that might slow down the tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/30 03:42:32 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42730: [SPARK-44742][PYTHON][DOCS][FOLLOWUP] Add Spark version drop down to the PySpark doc site - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/30 03:43:23 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #42729: [SPARK-45012][SQL] CheckAnalysis should throw inlined plan in AnalysisException - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/08/30 04:09:24 UTC, 2 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42723: [SPARK-45008][INFRA] Improve branch suggestion for backporting - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/30 04:12:02 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42727: [SPARK-45010][INFRA] Limit GHA job execution time to up to 5 hours in `build_and_test.yml` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 04:13:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42723: [SPARK-45008][INFRA] Improve branch suggestion for backporting - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 04:35:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42706: [SPARK-42304][SQL] Rename `_LEGACY_ERROR_TEMP_2189` to `GET_TABLES_BY_TYPE_UNSUPPORTED_BY_HIVE_VERSION` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 04:39:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42706: [SPARK-42304][SQL] Rename `_LEGACY_ERROR_TEMP_2189` to `GET_TABLES_BY_TYPE_UNSUPPORTED_BY_HIVE_VERSION` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 04:40:39 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42712: [SPARK-44997][DOCS] Align example order (Python -> Scala/Java -> R) in all Spark Doc Content - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 04:45:39 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42729: [SPARK-45012][SQL] CheckAnalysis should throw inlined plan in AnalysisException - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/30 04:48:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42729: [SPARK-45012][SQL] CheckAnalysis should throw inlined plan in AnalysisException - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 05:09:15 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42729: [SPARK-45012][SQL] CheckAnalysis should throw inlined plan in AnalysisException - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 05:11:46 UTC, 2 replies.
- [GitHub] [spark] wangyum commented on pull request #41630: [SPARK-44080][SQL] Support overriding SQL configurations for new connections - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/08/30 05:27:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42719: [WIP][SPARK-45005][CONNECT][PS][TESTS] Reducing the CI time by splitting the slow tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/30 05:28:26 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42731: [SPARK-45014][CONNECT] Clean up fileserver when cleaning up files, jars and archives in SparkContext - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/30 05:47:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42731: [SPARK-45014][CONNECT] Clean up fileserver when cleaning up files, jars and archives in SparkContext - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/30 05:48:19 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40419: [SPARK-42789][SQL] Rewrite multiple GetJsonObjects to a JsonTuple if their json expressions are the same - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/30 05:48:28 UTC, 3 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42712: [SPARK-44997][DOCS] Align example order (Python -> Scala/Java -> R) in all Spark Doc Content - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/30 06:53:36 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #42732: [SPARK-43923][FOLLOW] Propagate extra tags to SparkListenerConnectOperationFinished - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/08/30 06:58:23 UTC, 0 replies.
- [GitHub] [spark] valentinp17 commented on pull request #42706: [SPARK-42304][SQL] Rename `_LEGACY_ERROR_TEMP_2189` to `GET_TABLES_BY_TYPE_UNSUPPORTED_BY_HIVE_VERSION` - posted by "valentinp17 (via GitHub)" <gi...@apache.org> on 2023/08/30 07:11:26 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42733: [WIP] test scala213 run on container - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/30 08:25:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42734: [SPARK-45016][PYTHON][CONNECT] Add missing `try_remote_functions` annotations - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/30 09:04:32 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42735: [SPARK-45015][PYTHON][DOCS] Refine DocStrings of `try_{add, subtract, multiply, divide, avg, sum}` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/30 09:18:58 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #42731: [SPARK-45014][CONNECT] Clean up fileserver when cleaning up files, jars and archives in SparkContext - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/08/30 09:57:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42736: [SPARK-45017][PYTHON] Add `CalendarIntervalType` to PySpark - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/30 10:30:36 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42733: [WIP] [SPARK-45019][BUILD] Make workflow scala213 on container & clean env - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/30 11:01:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42737: [SPARK-44987][SQL] Assign a name to the error class `_LEGACY_ERROR_TEMP_1100` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/30 11:08:16 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42737: [SPARK-44987][SQL] Assign a name to the error class `_LEGACY_ERROR_TEMP_1100` - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/30 11:31:32 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #42737: [SPARK-44987][SQL] Assign a name to the error class `_LEGACY_ERROR_TEMP_1100` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/30 11:38:58 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42738: [SPARK-44990][SQL] Reduce the frequency of get `spark.sql.legacy.nullValueWrittenAsQuotedEmptyStringCsv` - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/30 12:43:06 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42738: [SPARK-44990][SQL] Reduce the frequency of get `spark.sql.legacy.nullValueWrittenAsQuotedEmptyStringCsv` - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/30 12:44:51 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42739: [SPARK-45021][BUILD] Remove `antlr4-maven-plugin` configuration from `sql/catalyst/pom.xml` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/30 13:24:56 UTC, 0 replies.
- [GitHub] [spark] dzhigimont commented on pull request #40436: [SPARK-42619][PS] Add `show_counts` parameter for DataFrame.info - posted by "dzhigimont (via GitHub)" <gi...@apache.org> on 2023/08/30 13:26:08 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42711: [SPARK-44998] Do not retry when FileNotFoundException occurs - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/08/30 14:19:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41782: [SPARK-44239][SQL] Free memory allocated by large vectors when vectors are reset - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/30 14:36:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41782: [SPARK-44239][SQL] Free memory allocated by large vectors when vectors are reset - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/30 14:37:14 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #42740: [SPARK-][SQL] Provide context for dataset API errors - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/30 15:25:18 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42732: [SPARK-43923][CONNECT][FOLLOW-UP] Propagate extra tags to SparkListenerConnectOperationFinished - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/30 15:35:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42739: [SPARK-45021][BUILD] Remove `antlr4-maven-plugin` configuration from `sql/catalyst/pom.xml` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 15:40:58 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42686: [SPARK-44971][PYTHON] StreamingQueryProgress event fromJson bug fix - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/08/30 16:48:28 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42736: [SPARK-45017][PYTHON] Add `CalendarIntervalType` to PySpark - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 17:20:46 UTC, 3 replies.
- [GitHub] [spark] itholic commented on pull request #42719: [SPARK-45005][CONNECT][PS][TESTS] Reducing the CI time by splitting the slow tests - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/30 17:36:43 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42738: [SPARK-44990][SQL] Reduce the frequency of get `spark.sql.legacy.nullValueWrittenAsQuotedEmptyStringCsv` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 17:55:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42719: [SPARK-45005][CONNECT][PS][TESTS] Reducing the CI time by splitting the slow tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 18:08:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42738: [SPARK-44990][SQL] Reduce the frequency of get `spark.sql.legacy.nullValueWrittenAsQuotedEmptyStringCsv` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 18:10:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42734: [SPARK-45016][PYTHON][CONNECT] Add missing `try_remote_functions` annotations - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 18:20:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42734: [SPARK-45016][PYTHON][CONNECT] Add missing `try_remote_functions` annotations - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 18:21:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42730: [SPARK-44742][PYTHON][DOCS][FOLLOWUP] Upgrade `pydata_sphinx_theme` to 0.8.0 in `spark-rm` Dockerfile - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 18:26:18 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40420: [SPARK-42617][PS] Support `isocalendar` from the pandas 2.0.0 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/30 18:27:51 UTC, 2 replies.
- [GitHub] [spark] itholic commented on pull request #42706: [SPARK-42304][SQL] Rename `_LEGACY_ERROR_TEMP_2189` to `GET_TABLES_BY_TYPE_UNSUPPORTED_BY_HIVE_VERSION` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/30 18:31:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42736: [SPARK-45017][PYTHON] Add `CalendarIntervalType` to PySpark - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 18:34:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42730: [SPARK-44742][PYTHON][DOCS][FOLLOWUP] Upgrade `pydata_sphinx_theme` to 0.8.0 in `spark-rm` Dockerfile - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 18:42:56 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42595: [SPARK-44901][SQL] Add API in Python UDTF 'analyze' method to return partitioning/ordering expressions - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/30 18:44:17 UTC, 2 replies.
- [GitHub] [spark] xuanyuanking commented on pull request #42730: [SPARK-44742][PYTHON][DOCS][FOLLOWUP] Upgrade `pydata_sphinx_theme` to 0.8.0 in `spark-rm` Dockerfile - posted by "xuanyuanking (via GitHub)" <gi...@apache.org> on 2023/08/30 19:08:25 UTC, 0 replies.
- [GitHub] [spark] dzhigimont commented on a diff in pull request #40436: [SPARK-42619][PS] Add `show_counts` parameter for DataFrame.info - posted by "dzhigimont (via GitHub)" <gi...@apache.org> on 2023/08/30 19:19:20 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42634: [SPARK-44910][SQL] Encoders.bean does not support superclasses with generic type arguments - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/08/30 19:42:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42712: [SPARK-44997][DOCS] Align example order (Python -> Scala/Java -> R) in all Spark Doc Content - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/30 19:44:07 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40436: [SPARK-42619][PS] Add `show_counts` parameter for DataFrame.info - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/08/30 19:52:28 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42734: [SPARK-45016][PYTHON][CONNECT] Add missing `try_remote_functions` annotations - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 00:22:25 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42736: [SPARK-45017][PYTHON] Add `CalendarIntervalType` to PySpark - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 00:26:10 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42736: [SPARK-45017][PYTHON] Add `CalendarIntervalType` to PySpark - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 00:26:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42686: [SPARK-44971][PYTHON] StreamingQueryProgress event fromJson bug fix - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/31 00:27:55 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42686: [SPARK-44971][PYTHON] StreamingQueryProgress event fromJson bug fix - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/31 00:30:19 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42733: [SPARK-45019][BUILD] Make workflow scala213 on container & clean env - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/31 00:53:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42735: [SPARK-45015][PYTHON][DOCS] Refine DocStrings of `try_{add, subtract, multiply, divide, avg, sum}` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 00:55:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42735: [SPARK-45015][PYTHON][DOCS] Refine DocStrings of `try_{add, subtract, multiply, divide, avg, sum}` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 00:55:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42741: [SPARK-45024][PYTHON][CONNECT] Filter out some configurations in Session Creation - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 01:46:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42739: [SPARK-45021][BUILD] Remove `antlr4-maven-plugin` configuration from `sql/catalyst/pom.xml` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/31 02:13:20 UTC, 0 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #42742: [SPARK-45025] Allow block manager memory store iterator to handle thread interrupt and perform task completion gracefully - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/31 02:36:42 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #42742: [SPARK-45025] Allow block manager memory store iterator to handle thread interrupt and perform task completion gracefully - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/08/31 02:41:25 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42729: [SPARK-45012][SQL] CheckAnalysis should throw inlined plan in AnalysisException - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/31 02:51:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42729: [SPARK-45012][SQL] CheckAnalysis should throw inlined plan in AnalysisException - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/31 02:51:58 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42738: [SPARK-44990][SQL] Reduce the frequency of get `spark.sql.legacy.nullValueWrittenAsQuotedEmptyStringCsv` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/31 02:54:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42731: [SPARK-45014][CONNECT] Clean up fileserver when cleaning up files, jars and archives in SparkContext - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/31 02:54:56 UTC, 2 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42738: [SPARK-44990][SQL] Reduce the frequency of get `spark.sql.legacy.nullValueWrittenAsQuotedEmptyStringCsv` - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/31 02:56:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42743: [SPARK-45018][PYTHON][CONNECT] Add CalendarIntervalType to Python Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 03:13:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42743: [SPARK-45018][PYTHON][CONNECT] Add CalendarIntervalType to Python Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 03:19:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42741: [SPARK-45024][PYTHON][CONNECT] Filter out some configurations in Session Creation - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 03:22:03 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42743: [SPARK-45018][PYTHON][CONNECT] Add CalendarIntervalType to Python Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 03:41:19 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42744: [SPARK-44990][SQL][FOLLOWUP] Add benchmark for write null value to csv - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/31 04:26:15 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42744: [SPARK-44990][SQL][FOLLOWUP] Add benchmark for write null value to csv - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/31 04:26:44 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42744: [SPARK-44990][SQL][FOLLOWUP] Add benchmark for write null value to csv - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/31 04:34:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42741: [SPARK-45024][PYTHON][CONNECT] Filter out some configurations in Session Creation - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 04:36:57 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42744: [SPARK-44990][SQL][FOLLOWUP] Add benchmark for write null value to csv - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/31 05:45:31 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42745: [SPARK-45027][PYTHON] Hide internal functions/variables in `pyspark.sql.functions` from auto-completion - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 06:01:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42731: [SPARK-45014][CONNECT] Clean up fileserver when cleaning up files, jars and archives in SparkContext - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/08/31 06:21:33 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 commented on a diff in pull request #42296: [SPARK-44635][CORE] Handle shuffle fetch failures in decommissions - posted by "bozhang2820 (via GitHub)" <gi...@apache.org> on 2023/08/31 06:54:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42236: [SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/31 07:17:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42743: [SPARK-45018][PYTHON][CONNECT] Add CalendarIntervalType to Python Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 07:28:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42746: Revert "[SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module" - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/31 07:28:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42744: [SPARK-44990][SQL][FOLLOWUP] Add benchmark for write null value to csv - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/31 07:33:04 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42747: [SPARK-44750][PYTHON][CONNECT][TESTS][FOLLOW-UP] Avoid creating session twice in `SparkConnectSessionWithOptionsTest` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 07:34:34 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42746: Revert "[SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module" - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/31 07:36:51 UTC, 2 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42748: [SPARK-45028][PYTHON][DOCS] Refine docstring of `DataFrame.drop` - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/31 08:44:15 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42742: [SPARK-45025] Allow block manager memory store iterator to handle thread interrupt and perform task completion gracefully - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/08/31 08:46:08 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42744: [SPARK-44990][SQL][FOLLOWUP] Remove lazy of `nullAsQuotedEmptyString` - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/31 09:09:05 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42287: [SPARK-44632][CORE] DiskBlockManager should check and be able to handle stale directories - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/31 09:19:10 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42287: [SPARK-44632][CORE] DiskBlockManager should check and be able to handle stale directories - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/31 09:19:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42748: [SPARK-45028][PYTHON][DOCS] Refine docstring of `DataFrame.drop` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 09:21:06 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42749: [SPARK-45031][INFRA] Choose the right merge code path and merge hash for reopened PRs - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/31 09:50:48 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42750: [SPARK-44942][INFRA] Use Jira notification options to sync with Github - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/31 10:27:41 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42748: [SPARK-45028][PYTHON][DOCS] Refine docstring of `DataFrame.drop` - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/08/31 11:16:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42746: Revert "[SPARK-43646][CONNECT][TESTS] Make both SBT and Maven use `spark-proto` uber jar to test the `connect` module" - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/31 11:43:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42751: [SPARK-45029][CONNECT][TESTS] Ignore `from_protobuf messageClassName/from_protobuf messageClassName options` in `PlanGenerationTestSuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/31 11:53:25 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42752: [WIP][SQL] Support maps constructed from arrays in parameterized `sql()` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/31 12:05:57 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42751: [SPARK-45029][CONNECT][TESTS] Ignore `from_protobuf messageClassName/from_protobuf messageClassName options` in `PlanGenerationTestSuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/31 12:33:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42753: [SPARK-45032[CONNECT] Fix compilation warnings related to `Top-level wildcard is not allowed and will error under -Xsource:3` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/08/31 12:51:44 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42754: [SPARK-45026][CONNECT] `spark.sql` should support datatypes not compatible with arrow - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 13:30:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42754: [SPARK-45026][CONNECT] `spark.sql` should support datatypes not compatible with arrow - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/08/31 13:31:20 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42740: [WIP][SPARK-45022][SQL] Provide context for dataset API errors - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/08/31 13:54:18 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #39937: [SPARK-42309][SQL] Introduce `INCOMPATIBLE_DATA_TO_TABLE` and sub classes. - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/08/31 14:24:20 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #42755: [SPARK-45034][SQL] Support deterministic mode function - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/31 16:21:30 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42756: [SPARK-45037][BUILD] Upload unit tests log files for timeouted cancel - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/08/31 17:19:29 UTC, 0 replies.
- [GitHub] [spark] sunchao opened a new pull request, #42757: [SPARK-45036][SQL] SPJ: Simplify the logic to handle partially clustered distribution - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/08/31 17:22:32 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #42737: [SPARK-44987][SQL] Assign a name to the error class `_LEGACY_ERROR_TEMP_1100` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/31 18:20:03 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42737: [SPARK-44987][SQL] Assign a name to the error class `_LEGACY_ERROR_TEMP_1100` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/31 18:22:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42737: [SPARK-44987][SQL] Assign a name to the error class `_LEGACY_ERROR_TEMP_1100` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/08/31 18:23:11 UTC, 1 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #42740: [WIP][SPARK-45022][SQL] Provide context for dataset API errors - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/08/31 19:04:08 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #42737: [SPARK-44987][SQL] Assign a name to the error class `_LEGACY_ERROR_TEMP_1100` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/08/31 19:50:55 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on pull request #42742: [SPARK-45025] Allow block manager memory store iterator to handle thread interrupt and perform task completion gracefully - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/08/31 20:14:51 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42708: [SPARK-44994][PYTHON][DOCS] Refine docstring of DataFrame.filter - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/31 21:58:54 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42758: [SPARK-45038][PYTHON][DOCS] Refine docstring of `max` - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/08/31 22:28:53 UTC, 0 replies.
- [GitHub] [spark] planga82 opened a new pull request, #42759: [SPARK-45039][UI] Include full identifier in Storage tab - posted by "planga82 (via GitHub)" <gi...@apache.org> on 2023/08/31 23:16:39 UTC, 0 replies.