You are viewing a plain text version of this content. The canonical link for it is here.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40217: [WIP][SPARK-42559][CONNECT] Implement DataFrameNaFunctions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 00:08:22 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40184: [SPARK-42569][CONNECT] Throw exceptions for unsupported session API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 00:19:55 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40184: [SPARK-42569][CONNECT] Throw exceptions for unsupported session API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 00:21:01 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36441: [SPARK-39091][SQL] Updating specific SQL Expression traits that don't compose when multiple are extended due to nodePatterns being final. - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/01 00:23:59 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40147: [SPARK-42543][CONNECT] Specify protocol for UDF artifact transfer in JVM/Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 00:24:42 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40147: [SPARK-42543][CONNECT] Specify protocol for UDF artifact transfer in JVM/Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 00:25:05 UTC, 0 replies.
- [GitHub] [spark] ritikam2 commented on pull request #40116: [SPARK-41391][SQL] The output column name of groupBy.agg(count_distinct) is incorrect - posted by "ritikam2 (via GitHub)" <gi...@apache.org> on 2023/03/01 00:31:56 UTC, 10 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40222: [SPARK-42615][CONNECT][FOLLOW-UP] Implement correct version API in SparkSession for Scala client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 00:36:32 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40227: [SPARK-41870][CONNECT][PYTHON] Fix createDataFrame to handle duplicated column names - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/01 00:40:29 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40116: [SPARK-41391][SQL] The output column name of groupBy.agg(count_distinct) is incorrect - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/01 00:43:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40223: [SPARK-42624][PYTHON][TESTS] Reorganize imports in test_functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/01 00:43:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40223: [SPARK-42624][PYTHON][TESTS] Reorganize imports in test_functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/01 00:43:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40220: [PYTHON] Change alias for numpy deprecated and removed types - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/01 00:44:49 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40210: [SPARK-42615][CONNECT][PYTHON] Refactor the AnalyzePlan RPC and add `session.version` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 00:47:30 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40228: [SPARK-41874][CONNECT] Support SameSemantics in Spark Connect - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/01 00:55:12 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40228: [SPARK-41874][CONNECT] Support SameSemantics in Spark Connect - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/01 00:55:33 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #40229: [SPARK-42521][SQL] Add NULLs for INSERTs with user-specified lists of fewer columns than the target table - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/01 01:16:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40225: [SPARK-42625][BUILD] Upgrade `zstd-jni` to 1.5.4-2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/01 01:17:05 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40225: [SPARK-42625][BUILD] Upgrade `zstd-jni` to 1.5.4-2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/01 01:17:27 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #40229: [SPARK-42521][SQL] Add NULLs for INSERTs with user-specified lists of fewer columns than the target table - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/01 01:20:05 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THREADS to task cpus instead of executor cores by default - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/03/01 01:23:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40116: [SPARK-41391][SQL] The output column name of groupBy.agg(count_distinct) is incorrect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 01:31:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40226: [SPARK-41868][CONNECT][PYTHON] Fix createDataFrame to support durations - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 01:42:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40226: [SPARK-41868][CONNECT][PYTHON] Fix createDataFrame to support durations - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 01:43:02 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40219: [SPARK-42622][CORE] Disable substitution in values - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/01 01:44:08 UTC, 4 replies.
- [GitHub] [spark] yaooqinn closed pull request #40211: [SPARK-42616][SQL] SparkSQLCLIDriver shall only close started hive sessionState - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/01 01:47:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40219: [SPARK-42622][CORE] Disable substitution in values - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/01 01:48:03 UTC, 3 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40211: [SPARK-42616][SQL] SparkSQLCLIDriver shall only close started hive sessionState - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/01 01:48:49 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40093: [SPARK-42500][SQL] ConstantPropagation supports more cases - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/01 01:59:22 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40228: [SPARK-41874][CONNECT] Support SameSemantics in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/01 02:10:56 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40219: [SPARK-42622][CORE] Disable substitution in values - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/01 02:13:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THREADS to task cpus instead of executor cores by default - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/01 02:14:17 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THREADS to task cpus instead of executor cores by default - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/01 02:15:28 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40228: [SPARK-41874][CONNECT] Support SameSemantics in Spark Connect - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/01 02:17:07 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on a diff in pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THREADS to task cpus instead of executor cores by default - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/03/01 02:20:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40228: [SPARK-41874][CONNECT][PYTHON] Support SameSemantics in Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 02:21:57 UTC, 2 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/01 02:22:42 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40228: [SPARK-41874][CONNECT][PYTHON] Support SameSemantics in Spark Connect - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 02:24:58 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40214: [SPARK-42491][BUILD] Upgrade jetty to 9.4.51.v20230217 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/01 02:40:40 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #40214: [SPARK-42491][BUILD] Upgrade jetty to 9.4.51.v20230217 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/01 02:44:37 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40214: [SPARK-42491][BUILD] Upgrade jetty to 9.4.51.v20230217 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/01 02:44:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40160: [SPARK-41725][CONNECT] Eager Execution of DF.sql() - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 02:51:42 UTC, 1 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/03/01 03:13:23 UTC, 2 replies.
- [GitHub] [spark] wankunde commented on pull request #40157: [SPARK-42551][SQL] Support subexpression elimination in FilterExec - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/03/01 03:21:50 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #40160: [SPARK-41725][CONNECT] Eager Execution of DF.sql() - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/01 04:38:12 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40198: [SPARK-42337][SQL][FOLLOWUP] Update the error message for INVALID_TEMP_OBJ_REFERENCE - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/01 04:54:17 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40198: [SPARK-42337][SQL][FOLLOWUP] Update the error message for INVALID_TEMP_OBJ_REFERENCE - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/01 04:55:43 UTC, 0 replies.
- [GitHub] [spark] navinvishy commented on a diff in pull request #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "navinvishy (via GitHub)" <gi...@apache.org> on 2023/03/01 05:10:04 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 commented on pull request #40196: [SPARK-42603][SQL] Set spark.sql.legacy.createHiveTableByDefault to false. - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/03/01 05:25:22 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #40229: [SPARK-42521][SQL] Add NULLs for INSERTs with user-specified lists of fewer columns than the target table - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/01 05:36:31 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40229: [SPARK-42521][SQL] Add NULLs for INSERTs with user-specified lists of fewer columns than the target table - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/01 05:55:13 UTC, 1 replies.
- [GitHub] [spark] shrprasa commented on pull request #40128: [SPARK-42466][K8S]: Cleanup k8s upload directory when job terminates - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/03/01 06:08:33 UTC, 10 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40218: [SPARK-42579][CONNECT] Part-1: `function.lit` support `Array[_]` dataType - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/01 07:22:07 UTC, 20 replies.
- [GitHub] [spark] jelmerk commented on a diff in pull request #40219: [SPARK-42622][CORE] Disable substitution in values - posted by "jelmerk (via GitHub)" <gi...@apache.org> on 2023/03/01 07:31:03 UTC, 0 replies.
- [GitHub] [spark] jelmerk commented on pull request #40219: [SPARK-42622][CORE] Disable substitution in values - posted by "jelmerk (via GitHub)" <gi...@apache.org> on 2023/03/01 07:33:24 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/01 07:49:46 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/01 07:50:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/01 07:51:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40219: [SPARK-42622][CORE] Disable substitution in values - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/01 07:55:26 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40093: [SPARK-42500][SQL] ConstantPropagation supports more cases - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/01 07:59:26 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40190: [SPARK-42597][SQL] UnwrapCastInBinaryComparison support unwrap timestamp type - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/01 08:01:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40230: [MINOR][BUILD] Delete a useless TODO from `dev/test-dependencies.sh` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/01 08:13:11 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #40219: [SPARK-42622][CORE] Disable substitution in values - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/01 08:19:33 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #40218: [SPARK-42579][CONNECT] Part-1: `function.lit` support `Array[_]` dataType - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/01 08:30:58 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 08:46:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40160: [SPARK-41725][CONNECT] Eager Execution of DF.sql() - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 08:56:42 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40116: [SPARK-41391][SQL] The output column name of groupBy.agg(count_distinct) is incorrect - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/01 08:59:00 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40227: [SPARK-41870][CONNECT][PYTHON] Fix createDataFrame to handle duplicated column names - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 08:59:55 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40227: [SPARK-41870][CONNECT][PYTHON] Fix createDataFrame to handle duplicated column names - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 09:00:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40231: [SPARK-42628][SQL][DOCS] Add a migration note for bloom filter join - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/01 09:01:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40231: [SPARK-42628][SQL][DOCS] Add a migration note for bloom filter join - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/01 09:01:33 UTC, 1 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #40093: [SPARK-42500][SQL] ConstantPropagation supports more cases - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/01 09:02:46 UTC, 9 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40231: [SPARK-42628][SQL][DOCS] Add a migration note for bloom filter join - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/01 09:29:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40216: [SPARK-42593][PS] Deprecate & remove the APIs that will be removed in pandas 2.0. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/01 09:31:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40216: [SPARK-42593][PS] Deprecate & remove the APIs that will be removed in pandas 2.0. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/01 09:31:44 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40195: [SPARK-42553][SQL] ensure at least one time unit after "interval" - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/01 09:40:14 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40126: [SPARK-40822][SQL] Stable derived column aliases - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/01 09:48:24 UTC, 11 replies.
- [GitHub] [spark] aimtsou commented on pull request #40220: [PYTHON] Change alias for numpy deprecated and removed types - posted by "aimtsou (via GitHub)" <gi...@apache.org> on 2023/03/01 09:52:41 UTC, 1 replies.
- [GitHub] [spark] huangxiaopingRD commented on pull request #40196: [SPARK-42603][SQL] Set spark.sql.legacy.createHiveTableByDefault to false. - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2023/03/01 12:24:12 UTC, 0 replies.
- [GitHub] [spark] huangxiaopingRD opened a new pull request, #40232: [SPARK-42629][DOCS] Update the description of default data source in the document - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2023/03/01 12:26:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39624: [SPARK-42101][SQL] Introduce Materializable and MaterializableQueryStage for AQE framework - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/01 12:33:59 UTC, 13 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40233: [SPARK-42630][CONNECT][PYTHON] Make `parse_data_type` use new proto message `DDLParse` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 12:52:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40233: [SPARK-42630][CONNECT][PYTHON] Make `parse_data_type` use new proto message `DDLParse` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/01 12:54:11 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40230: [MINOR][BUILD] Delete a invalid TODO from `dev/test-dependencies.sh` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/01 13:31:29 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40230: [MINOR][BUILD] Delete a invalid TODO from `dev/test-dependencies.sh` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/01 13:31:33 UTC, 0 replies.
- [GitHub] [spark] jiang13021 commented on a diff in pull request #40195: [SPARK-42553][SQL] ensure at least one time unit after "interval" - posted by "jiang13021 (via GitHub)" <gi...@apache.org> on 2023/03/01 13:54:01 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #39990: [SPARK-42415][SQL] The built-in dialects support OFFSET and paging query. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/01 13:55:04 UTC, 1 replies.
- [GitHub] [spark] aimtsou commented on pull request #40220: [WIP][PYTHON] Change alias for numpy deprecated and removed types - posted by "aimtsou (via GitHub)" <gi...@apache.org> on 2023/03/01 14:31:11 UTC, 3 replies.
- [GitHub] [spark] srowen commented on pull request #40220: [WIP][PYTHON] Change alias for numpy deprecated and removed types - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/01 14:35:10 UTC, 3 replies.
- [GitHub] [spark] wangyum opened a new pull request, #36766: [SPARK-32184][SQL] Remove inferred predicate if it has InOrCorrelatedExistsSubquery - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/01 15:15:54 UTC, 0 replies.
- [GitHub] [spark] tomvanbussel opened a new pull request, #40234: [SPARK-34827][CONNECT] Support custom extensions in Scala client - posted by "tomvanbussel (via GitHub)" <gi...@apache.org> on 2023/03/01 15:28:17 UTC, 0 replies.
- [GitHub] [spark] yabola commented on a diff in pull request #39950: [SPARK-42388][SQL] Avoid parquet footer reads twice when no filters in vectorized reader - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/03/01 16:08:29 UTC, 14 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40191: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 16:15:29 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 16:19:35 UTC, 7 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 16:22:19 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40191: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 16:23:36 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40235: [SPARK-42632][CONNECT] Fix scala paths in integration tests - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 17:49:06 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40235: [SPARK-42632][CONNECT] Fix scala paths in integration tests - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 17:49:19 UTC, 0 replies.
- [GitHub] [spark] tomvanbussel commented on pull request #40234: [SPARK-42631][CONNECT] Support custom extensions in Scala client - posted by "tomvanbussel (via GitHub)" <gi...@apache.org> on 2023/03/01 18:04:01 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40234: [SPARK-42631][CONNECT] Support custom extensions in Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 18:25:57 UTC, 5 replies.
- [GitHub] [spark] the8thC opened a new pull request, #40236: [SPARK-38735][SQL][Tests] Add tests for the error class: INTERNAL_ERROR - posted by "the8thC (via GitHub)" <gi...@apache.org> on 2023/03/01 18:40:58 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #40190: [SPARK-42597][SQL] UnwrapCastInBinaryComparison support unwrap timestamp type - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/03/01 18:54:50 UTC, 0 replies.
- [GitHub] [spark] tomvanbussel commented on a diff in pull request #40234: [SPARK-42631][CONNECT] Support custom extensions in Scala client - posted by "tomvanbussel (via GitHub)" <gi...@apache.org> on 2023/03/01 18:57:40 UTC, 1 replies.
- [GitHub] [spark] chenhao-db opened a new pull request, #40237: [SPARK-42635][SQL] Fix the TimestampAdd expression. - posted by "chenhao-db (via GitHub)" <gi...@apache.org> on 2023/03/01 19:02:57 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/03/01 19:08:17 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40238: [SPARK-42633][CONNECT] Make LocalRelation take an actual schema - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 19:09:09 UTC, 0 replies.
- [GitHub] [spark] chenhao-db commented on pull request #40237: [SPARK-42635][SQL] Fix the TimestampAdd expression. - posted by "chenhao-db (via GitHub)" <gi...@apache.org> on 2023/03/01 19:10:25 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40239: [SPARK-42637][CONNECT] Add SparkSession.stop() - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/01 19:27:58 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40238: [SPARK-42633][CONNECT] Make LocalRelation take an actual schema - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/01 19:50:46 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/01 21:42:31 UTC, 5 replies.
- [GitHub] [spark] amaliujia commented on pull request #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/01 21:43:23 UTC, 2 replies.
- [GitHub] [spark] sunchao closed pull request #40224: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/03/01 21:45:55 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #40224: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/03/01 21:46:21 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40240: [SPARK-42458][CONNECT][PYTHON] Fixes createDataFrame to support DDL string as schema - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/01 22:05:03 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/01 22:19:49 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #36698: [SPARK-39316][SQL] Merge PromotePrecision and CheckOverflow into decimal binary arithmetic - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/01 22:38:43 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #40229: [SPARK-42521][SQL] Add NULLs for INSERTs with user-specified lists of fewer columns than the target table - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/01 22:51:47 UTC, 1 replies.
- [GitHub] [spark] xkrogen commented on pull request #40147: [SPARK-42543][CONNECT] Specify protocol for UDF artifact transfer in JVM/Scala client - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/03/01 23:16:13 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/01 23:34:57 UTC, 11 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40224: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 00:09:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40240: [SPARK-42458][CONNECT][PYTHON] Fixes createDataFrame to support DDL string as schema - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 00:13:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40240: [SPARK-42458][CONNECT][PYTHON] Fixes createDataFrame to support DDL string as schema - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 00:13:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40239: [SPARK-42637][CONNECT] Add SparkSession.stop() - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 00:14:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40239: [SPARK-42637][CONNECT] Add SparkSession.stop() - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 00:14:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40235: [SPARK-42632][CONNECT] Fix scala paths in integration tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 00:15:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40235: [SPARK-42632][CONNECT] Fix scala paths in integration tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 00:16:19 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40191: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 00:17:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #40191: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 00:17:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40233: [SPARK-42630][CONNECT][PYTHON] Make `parse_data_type` use new proto message `DDLParse` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 00:17:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THREADS to task cpus instead of executor cores by default - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 00:18:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40087: [SPARK-42493][DOCS][PYTHON] Make Python the first tab for code examples - Spark SQL, DataFrames and Datasets Guide - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 00:18:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 00:19:06 UTC, 20 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40087: [SPARK-42493][DOCS][PYTHON] Make Python the first tab for code examples - Spark SQL, DataFrames and Datasets Guide - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 00:19:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40220: [WIP][PYTHON] Change alias for numpy deprecated and removed types - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 00:21:48 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36766: [SPARK-32184][SQL] Remove inferred predicate if it has InOrCorrelatedExistsSubquery - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/02 00:22:50 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36441: [SPARK-39091][SQL] Updating specific SQL Expression traits that don't compose when multiple are extended due to nodePatterns being final. - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/02 00:22:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40087: [SPARK-42493][DOCS][PYTHON] Make Python the first tab for code examples - Spark SQL, DataFrames and Datasets Guide - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 00:25:08 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40241: [SPARK-42640][CONNECT] Remove stale entries from the excluding rules for CompatibilitySuite - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/02 00:25:32 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40241: [SPARK-42640][CONNECT] Remove stale entries from the excluding rules for CompatibilitySuite - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/02 00:25:55 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40242: [SPARK-42639][CONNECT] Add createDataFrame/createDataset methods - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/02 00:26:04 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40234: [SPARK-42631][CONNECT] Support custom extensions in Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/02 00:36:53 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40234: [SPARK-42631][CONNECT] Support custom extensions in Scala client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 00:37:51 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 00:39:37 UTC, 2 replies.
- [GitHub] [spark] hvanhovell closed pull request #40234: [SPARK-42631][CONNECT] Support custom extensions in Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/02 00:42:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40240: [SPARK-42458][CONNECT][PYTHON] Fixes createDataFrame to support DDL string as schema - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/02 00:46:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40243: [WIP][CONNECT][BUILD] Upgrade buf from 1.14.0 to 1.15.0 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/02 01:00:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40238: [SPARK-42633][CONNECT] Make LocalRelation take an actual schema - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/02 01:07:05 UTC, 4 replies.
- [GitHub] [spark] ueshin commented on pull request #40240: [SPARK-42458][CONNECT][PYTHON] Fixes createDataFrame to support DDL string as schema - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/02 01:30:28 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #36698: [SPARK-39316][SQL] Merge PromotePrecision and CheckOverflow into decimal binary arithmetic - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/02 01:56:23 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40233: [SPARK-42630][CONNECT][PYTHON] Make `parse_data_type` use new proto message `DDLParse` - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/02 01:57:45 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39624: [SPARK-42101][SQL] Introduce Materializable and MaterializableQueryStage for AQE framework - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/02 02:36:09 UTC, 3 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40244: Implement spark.udf.registerJavaFunction - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/02 02:55:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40245: [SPARK-41823][CONNECT][FOLLOW-UP][TESTS] Disable ANSI mode in ProtoToParsedPlanTestSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 02:58:22 UTC, 0 replies.
- [GitHub] [spark] ivoson commented on pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by "ivoson (via GitHub)" <gi...@apache.org> on 2023/03/02 03:15:27 UTC, 2 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39624: [SPARK-42101][SQL] Introduce Materializable and MaterializableQueryStage for AQE framework - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/02 03:26:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40238: [SPARK-42633][CONNECT] Make LocalRelation take an actual schema - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/02 03:38:20 UTC, 2 replies.
- [GitHub] [spark] gengliangwang closed pull request #40229: [SPARK-42521][SQL] Add NULLs for INSERTs with user-specified lists of fewer columns than the target table - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/02 04:34:18 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/02 04:39:41 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40160: [SPARK-41725][CONNECT] Eager Execution of DF.sql() - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 04:46:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40245: [SPARK-41823][CONNECT][FOLLOW-UP][TESTS] Disable ANSI mode in ProtoToParsedPlanTestSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 04:46:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40245: [SPARK-41823][CONNECT][FOLLOW-UP][TESTS] Disable ANSI mode in ProtoToParsedPlanTestSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 04:47:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40246: [SPARK-42644][INFRA] Add `hive` dependency to `connect` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 04:50:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40160: [SPARK-41725][CONNECT] Eager Execution of DF.sql() - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 04:52:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40246: [SPARK-42644][INFRA] Add `hive` dependency to `connect` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 04:55:24 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40246: [SPARK-42644][INFRA] Add `hive` dependency to `connect` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 05:16:04 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40247: [SPARK-42646][BUILD] Upgrade cyclonedx from 2.7.3 to 2.7.5 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/02 05:22:43 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #40247: [SPARK-42646][BUILD] Upgrade cyclonedx from 2.7.3 to 2.7.5 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/02 05:23:51 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40247: [SPARK-42646][BUILD] Upgrade cyclonedx from 2.7.3 to 2.7.5 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 05:24:46 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40247: [SPARK-42646][BUILD] Upgrade cyclonedx from 2.7.3 to 2.7.5 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 05:24:52 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40247: [SPARK-42646][BUILD] Upgrade cyclonedx from 2.7.3 to 2.7.5 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 05:25:07 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40218: [SPARK-42579][CONNECT] Part-1: `function.lit` support `Array[_]` dataType - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 05:29:49 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40246: [SPARK-42644][INFRA] Add `hive` dependency to `connect` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 05:57:07 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40195: [SPARK-42553][SQL] Ensure at least one time unit after "interval" - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/02 06:36:39 UTC, 3 replies.
- [GitHub] [spark] MaxGekk closed pull request #40195: [SPARK-42553][SQL] Ensure at least one time unit after "interval" - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/02 06:37:40 UTC, 0 replies.
- [GitHub] [spark] the8thC commented on pull request #40236: [SPARK-38735][SQL][Tests] Add tests for the error class: INTERNAL_ERROR - posted by "the8thC (via GitHub)" <gi...@apache.org> on 2023/03/02 06:45:46 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40246: [SPARK-42644][INFRA] Add `hive` dependency to `connect` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 06:46:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40246: [SPARK-42644][INFRA] Add `hive` dependency to `connect` module - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/02 06:49:23 UTC, 0 replies.
- [GitHub] [spark] aimtsou commented on pull request #40220: [WIP][SPARK-42647][PYTHON] Change alias for numpy deprecated and removed types - posted by "aimtsou (via GitHub)" <gi...@apache.org> on 2023/03/02 07:07:29 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40242: [SPARK-42639][CONNECT] Add createDataFrame/createDataset methods - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 07:13:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40242: [SPARK-42639][CONNECT] Add createDataFrame/createDataset methods - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 07:14:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40220: [WIP][SPARK-42647][PYTHON] Change alias for numpy deprecated and removed types - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 07:23:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40116: [SPARK-41391][SQL] The output column name of groupBy.agg(count_distinct) is incorrect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 07:24:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40248: [SPARK-42648][BUILD] Upgrade `versions-maven-plugin` to 2.15.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 08:12:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40249: [SPARK-42649][CORE] Remove the standard Apache License header from the top of third-party source files - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 08:13:36 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40249: [SPARK-42649][CORE] Remove the standard Apache License header from the top of third-party source files - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 08:20:14 UTC, 1 replies.
- [GitHub] [spark] allanf-db opened a new pull request, #40250: [WIP][SPARK-42642][DOCS][PYTHON] Updating remaining Spark documentation code examples to show Python by default - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/03/02 08:27:35 UTC, 0 replies.
- [GitHub] [spark] ivoson commented on a diff in pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by "ivoson (via GitHub)" <gi...@apache.org> on 2023/03/02 08:33:37 UTC, 2 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40190: [SPARK-42597][SQL] UnwrapCastInBinaryComparison support unwrap timestamp type - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/02 08:41:23 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40249: [SPARK-42649][CORE] Remove the standard Apache License header from the top of third-party source files - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/02 09:04:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40250: [SPARK-42642][DOCS][PYTHON] Updating remaining Spark documentation code examples to show Python by default - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 09:28:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40250: [SPARK-42642][DOCS][PYTHON] Updating remaining Spark documentation code examples to show Python by default - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 09:29:22 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40237: [SPARK-42635][SQL] Fix the TimestampAdd expression. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/02 09:35:22 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40243: [SPARK-42641][CONNECT][BUILD] Upgrade buf from 1.14.0 to 1.15.0 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/02 09:47:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40243: [SPARK-42641][CONNECT][BUILD] Upgrade buf from 1.14.0 to 1.15.0 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/02 09:48:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40251: [SPARK-41725][PYTHON][TESTS][FOLLOW-UP] Remove collect in SQL command execution in tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 10:18:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40251: [SPARK-41725][PYTHON][TESTS][FOLLOW-UP] Remove collect for SQL command execution in tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 11:37:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40251: [SPARK-41725][PYTHON][TESTS][FOLLOW-UP] Remove collect for SQL command execution in tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/02 11:38:01 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #40217: [SPARK-42559][CONNECT] Implement DataFrameNaFunctions - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/02 12:01:25 UTC, 3 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40252: [SPARK-42555][CONNECT] Add JDBC to DataFrameReader - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/02 12:14:06 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40252: [SPARK-42555][CONNECT] Add JDBC to DataFrameReader - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/02 12:15:31 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40252: [SPARK-42555][CONNECT] Add JDBC to DataFrameReader - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/02 12:37:35 UTC, 0 replies.
- [GitHub] [spark] jiang13021 opened a new pull request, #40253: [SPARK-42553][SQL] Ensure at least one time unit after "interval" - posted by "jiang13021 (via GitHub)" <gi...@apache.org> on 2023/03/02 12:43:24 UTC, 0 replies.
- [GitHub] [spark] jiang13021 commented on pull request #40195: [SPARK-42553][SQL] Ensure at least one time unit after "interval" - posted by "jiang13021 (via GitHub)" <gi...@apache.org> on 2023/03/02 12:51:23 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40253: [SPARK-42553][SQL] Ensure at least one time unit after "interval" - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/02 13:00:33 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40220: [WIP][SPARK-42647][PYTHON] Change alias for numpy deprecated and removed types - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/02 13:18:30 UTC, 1 replies.
- [GitHub] [spark] jiang13021 commented on pull request #40253: [SPARK-42553][SQL][3.3] Ensure at least one time unit after "interval" - posted by "jiang13021 (via GitHub)" <gi...@apache.org> on 2023/03/02 13:23:09 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38223: [SPARK-40770][PYTHON] Improved error messages for applyInPandas for schema mismatch - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/03/02 13:45:46 UTC, 0 replies.
- [GitHub] [spark] yabola commented on pull request #39950: [SPARK-42388][SQL] Avoid parquet footer reads twice when no filters in vectorized reader - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/03/02 14:03:07 UTC, 2 replies.
- [GitHub] [spark] srowen closed pull request #40219: [SPARK-42622][CORE] Disable substitution in values - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/02 14:43:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40253: [SPARK-42553][SQL][3.3] Ensure at least one time unit after "interval" - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/02 15:23:53 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40253: [SPARK-42553][SQL][3.3] Ensure at least one time unit after "interval" - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/02 15:26:10 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40252: [SPARK-42555][CONNECT] Add JDBC to DataFrameReader - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 15:53:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40254: [SPARK-42654][BUILD] WIP - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 16:11:47 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40255: [SPARK-42558][CONNECT] DataFrameStatFunctions WIP - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 16:13:05 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40255: [SPARK-42558][CONNECT] DataFrameStatFunctions WIP - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/02 16:14:22 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40126: [SPARK-40822][SQL] Stable derived column aliases - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/02 16:44:00 UTC, 3 replies.
- [GitHub] [spark] vicennial opened a new pull request, #40256: [SPARK-42653][CONNECT] Artifact transfer from Scala/JVM client to Server - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/03/02 16:49:47 UTC, 0 replies.
- [GitHub] [spark] steveloughran commented on pull request #39124: [DON'T MERGE] Test build and test with hadoop 3.3.5-RC2 - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2023/03/02 17:44:54 UTC, 2 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40238: [SPARK-42633][CONNECT] Make LocalRelation take an actual schema - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/02 18:22:34 UTC, 4 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40252: [SPARK-42555][CONNECT] Add JDBC to DataFrameReader - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/02 18:33:30 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40252: [SPARK-42555][CONNECT] Add JDBC to DataFrameReader - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/02 18:35:11 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/02 18:36:33 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40256: [SPARK-42653][CONNECT] Artifact transfer from Scala/JVM client to Server - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/02 18:52:37 UTC, 6 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40238: [SPARK-42633][CONNECT] Make LocalRelation take an actual schema - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/02 18:55:27 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40257: [SPARK-42656][CONNECT] Adding SCALA REPL shell script for JVM client - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/02 19:39:21 UTC, 1 replies.
- [GitHub] [spark] shrprasa opened a new pull request, #40258: [WIP][SPARK-42655]:Incorrect ambiguous column reference error - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/03/02 20:18:33 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40257: [SPARK-42656][CONNECT] Adding SCALA REPL shell script for JVM client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/02 20:21:49 UTC, 11 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40241: [SPARK-42640][CONNECT] Remove stale entries from the excluding rules for CompatibilitySuite - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/02 20:54:22 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40241: [SPARK-42640][CONNECT] Remove stale entries from the excluding rules for CompatibilitySuite - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/02 20:55:10 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40259: [SPARK-42609][CONNECT] Add tests for grouping() and grouping_id() functions - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/02 22:14:25 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40259: [SPARK-42609][CONNECT] Add tests for grouping() and grouping_id() functions - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/02 22:15:00 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40257: [SPARK-42656][CONNECT] Adding SCALA REPL shell script for JVM client - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/02 22:50:02 UTC, 4 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #40256: [SPARK-42653][CONNECT] Artifact transfer from Scala/JVM client to Server - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/03/02 22:52:17 UTC, 5 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40260: [SPARK-42630][CONNECT][PYTHON] Delay parsing DDL string until SparkConnectClient is available - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/02 23:31:54 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40210: [SPARK-42615][CONNECT][PYTHON] Refactor the AnalyzePlan RPC and add `session.version` - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/03 00:07:44 UTC, 1 replies.
- [GitHub] [spark] mridulm closed pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/03 00:11:13 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40261: [SPARK-42615][CONNECT][FOLLOWUP] Fix SparkConnectAnalyzeHandler to use withActive - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/03 00:15:28 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40220: [SPARK-42647][PYTHON] Change alias for numpy deprecated and removed types - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/03 00:50:30 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40220: [SPARK-42647][PYTHON] Change alias for numpy deprecated and removed types - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/03 00:51:32 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40257: [SPARK-42656][CONNECT] Adding SCALA REPL shell script for JVM client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 02:00:08 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40257: [SPARK-42656][CONNECT] Adding SCALA REPL shell script for JVM client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 02:00:47 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40256: [SPARK-42653][CONNECT] Artifact transfer from Scala/JVM client to Server - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/03 02:26:49 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40261: [SPARK-42615][CONNECT][FOLLOWUP] Fix SparkConnectAnalyzeHandler to use withActive - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/03 02:29:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40261: [SPARK-42615][CONNECT][FOLLOWUP] Fix SparkConnectAnalyzeHandler to use withActive - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/03 02:30:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40260: [SPARK-42630][CONNECT][PYTHON] Delay parsing DDL string until SparkConnectClient is available - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/03 02:45:11 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #40260: [SPARK-42630][CONNECT][PYTHON] Delay parsing DDL string until SparkConnectClient is available - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/03 02:46:46 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40257: [SPARK-42656][CONNECT] Adding SCALA REPL shell script for JVM client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/03 02:47:09 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/03 02:55:32 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40260: [SPARK-42630][CONNECT][PYTHON] Delay parsing DDL string until SparkConnectClient is available - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/03 02:58:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40257: [SPARK-42656][CONNECT] Adding SCALA REPL shell script for JVM client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/03 03:17:54 UTC, 6 replies.
- [GitHub] [spark] amaliujia commented on pull request #40256: [SPARK-42653][CONNECT] Artifact transfer from Scala/JVM client to Server - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/03 03:19:04 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40262: [SPARK-42651][SQL] Optimize global sort to driver sort - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/03 03:35:54 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/03 05:03:08 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40263: [SPARK-42659][ML] Reimplement `FPGrowthModel.transform` with dataframe operations - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/03 05:49:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40238: [SPARK-42633][CONNECT] Make LocalRelation take an actual schema - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/03 05:54:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40257: [SPARK-42656][CONNECT] Adding SCALA REPL shell script for JVM client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/03 05:55:20 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40237: [SPARK-42635][SQL] Fix the TimestampAdd expression. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/03 06:37:18 UTC, 2 replies.
- [GitHub] [spark] MaxGekk closed pull request #40237: [SPARK-42635][SQL] Fix the TimestampAdd expression. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/03 06:38:57 UTC, 0 replies.
- [GitHub] [spark] chenhao-db opened a new pull request, #40264: [SPARK-42635][SQL][3.3] Fix the TimestampAdd expression - posted by "chenhao-db (via GitHub)" <gi...@apache.org> on 2023/03/03 06:51:02 UTC, 0 replies.
- [GitHub] [spark] chenhao-db commented on pull request #40264: [SPARK-42635][SQL][3.3] Fix the TimestampAdd expression - posted by "chenhao-db (via GitHub)" <gi...@apache.org> on 2023/03/03 06:53:36 UTC, 2 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40265: [SPARK-42556][CONNECT] Dataset.colregex should link a plan_id when it only matches a single column. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/03 07:04:40 UTC, 0 replies.
- [GitHub] [spark] mskapilks opened a new pull request, #40266: [SPARK-42660] Infer filters for Join produced by IN and EXISTS clause (RewritePredicateSubquery rule) - posted by "mskapilks (via GitHub)" <gi...@apache.org> on 2023/03/03 07:16:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40259: [SPARK-42609][CONNECT][TESTS] Add tests for grouping() and grouping_id() functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/03 07:17:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40259: [SPARK-42609][CONNECT][TESTS] Add tests for grouping() and grouping_id() functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/03 07:18:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40262: [SPARK-42651][SQL] Optimize global sort to driver sort - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/03 07:25:20 UTC, 0 replies.
- [GitHub] [spark] mskapilks commented on pull request #40266: [SPARK-42660] Infer filters for Join produced by IN and EXISTS clause (RewritePredicateSubquery rule) - posted by "mskapilks (via GitHub)" <gi...@apache.org> on 2023/03/03 07:31:44 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40262: [SPARK-42651][SQL] Optimize global sort to driver sort - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/03 07:41:05 UTC, 1 replies.
- [GitHub] [spark] mskapilks commented on pull request #40266: [SPARK-42660][SQL] Infer filters for Join produced by IN and EXISTS clause (RewritePredicateSubquery rule) - posted by "mskapilks (via GitHub)" <gi...@apache.org> on 2023/03/03 08:22:31 UTC, 4 replies.
- [GitHub] [spark] zsxwing commented on a diff in pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "zsxwing (via GitHub)" <gi...@apache.org> on 2023/03/03 09:09:14 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40264: [SPARK-42635][SQL][3.3] Fix the TimestampAdd expression - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/03 09:22:56 UTC, 2 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40263: [SPARK-42659][ML] Reimplement `FPGrowthModel.transform` with dataframe operations - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/03 10:05:59 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40265: [SPARK-42556][CONNECT] Dataset.colregex should link a plan_id when it only matches a single column. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 11:48:55 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40256: [SPARK-42653][CONNECT] Artifact transfer from Scala/JVM client to Server - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 11:53:44 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40265: [SPARK-42556][CONNECT] Dataset.colregex should link a plan_id when it only matches a single column. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 12:12:37 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40252: [SPARK-42555][CONNECT] Add JDBC to DataFrameReader - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/03 12:28:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40267: [SPARK-42653][CONNECT][FOLLOW-UP] Fix Scala 2.13 build failure by explicit Seq conversion - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/03 12:31:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40267: [SPARK-42653][CONNECT][FOLLOW-UP] Fix Scala 2.13 build failure by explicit Seq conversion - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/03 12:31:16 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40256: [SPARK-42653][CONNECT] Artifact transfer from Scala/JVM client to Server - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/03 12:32:04 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40267: [SPARK-42653][CONNECT][FOLLOW-UP] Fix Scala 2.13 build failure by explicit Seq conversion - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 12:33:24 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #40157: [SPARK-42551][SQL] Support subexpression elimination in FilterExec and JoinExec - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/03/03 13:14:06 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40255: [SPARK-42558][CONNECT] Partial implement `DataFrameStatFunctions` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 13:25:14 UTC, 10 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40267: [SPARK-42653][CONNECT][FOLLOW-UP] Fix Scala 2.13 build failure by explicit Seq conversion - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/03 13:39:55 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #40263: [SPARK-42659][ML] Reimplement `FPGrowthModel.transform` with dataframe operations - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/03 14:09:28 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #40248: [SPARK-42648][BUILD] Upgrade `versions-maven-plugin` to 2.15.0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/03 14:10:21 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40248: [SPARK-42648][BUILD] Upgrade `versions-maven-plugin` to 2.15.0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/03 14:10:25 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #40126: [SPARK-40822][SQL] Stable derived column aliases - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/03/03 14:32:31 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #40268: [WIP][SPARK-42500][SQL] ConstantPropagation support more cases - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/03 14:39:36 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40217: [SPARK-42559][CONNECT] Implement DataFrameNaFunctions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 15:17:35 UTC, 5 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #40269: [WIP][DOC] Updating the Style for the Spark Docs based on the Webpage - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/03 16:08:14 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40270: [SPARK-42497][CONNECT][PYTHON][PS] Support `withSequenceColumn` as PySpark DataFrame internal function. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/03 16:09:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40255: [SPARK-42558][CONNECT] Partial implement `DataFrameStatFunctions` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/03 16:19:01 UTC, 10 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40270: [SPARK-42497][CONNECT][PYTHON][PS] Support `withSequenceColumn` as PySpark DataFrame internal function. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/03 16:21:41 UTC, 9 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40254: [SPARK-42654][BUILD] Upgrade dropwizard metrics 4.2.17 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/03 17:03:41 UTC, 6 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40255: [SPARK-42558][CONNECT] Partial implement `DataFrameStatFunctions` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 17:17:18 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40270: [SPARK-42497][CONNECT][PYTHON][PS] Support `withSequenceColumn` as PySpark DataFrame internal function. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 17:19:21 UTC, 0 replies.
- [GitHub] [spark] FurcyPin opened a new pull request, #40271: [WIP][SPARK-42258][PYTHON] pyspark.sql.functions should not expose typing.cast - posted by "FurcyPin (via GitHub)" <gi...@apache.org> on 2023/03/03 17:31:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40255: [SPARK-42558][CONNECT] Partial implement `DataFrameStatFunctions` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/03 17:37:36 UTC, 1 replies.
- [GitHub] [spark] shrprasa commented on pull request #40258: [WIP][SPARK-42655]:Incorrect ambiguous column reference error - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/03/03 18:14:03 UTC, 0 replies.
- [GitHub] [spark] shrprasa commented on pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/03/03 18:37:59 UTC, 3 replies.
- [GitHub] [spark] holdenk commented on a diff in pull request #40128: [SPARK-42466][K8S]: Cleanup k8s upload directory when job terminates - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/03/03 18:46:45 UTC, 1 replies.
- [GitHub] [spark] shrprasa commented on a diff in pull request #40128: [SPARK-42466][K8S]: Cleanup k8s upload directory when job terminates - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/03/03 19:02:11 UTC, 8 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40255: [SPARK-42558][CONNECT] Partial implement `DataFrameStatFunctions` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/03 19:16:58 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40271: [WIP][SPARK-42258][PYTHON] pyspark.sql.functions should not expose typing.cast - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/03 19:23:38 UTC, 2 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40236: [SPARK-38735][SQL][Tests] Add tests for the error class: INTERNAL_ERROR - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/03 19:38:40 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40236: [SPARK-38735][SQL][Tests] Add tests for the error class: INTERNAL_ERROR - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/03 19:42:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40064: [SPARK-42478] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/03 20:19:10 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40272: [SPARK-42667][CONNECT] Spark Connect: newSession API - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/03 22:20:57 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40272: [SPARK-42667][CONNECT] Spark Connect: newSession API - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/03 23:10:46 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40272: [SPARK-42667][CONNECT] Spark Connect: newSession API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 23:20:07 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40228: [SPARK-41874][CONNECT][PYTHON] Support SameSemantics in Spark Connect - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/03 23:22:19 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40272: [SPARK-42667][CONNECT] Spark Connect: newSession API - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/03 23:36:03 UTC, 0 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #40273: [SPARK-42668][SS] Catch exception while trying to close compressed stream in HDFSStateStoreProvider abort - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/03/04 00:36:49 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #40273: [SPARK-42668][SS] Catch exception while trying to close compressed stream in HDFSStateStoreProvider abort - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/03/04 00:37:48 UTC, 1 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40274: [SPARK-42215][CONNECT] Single command to run Scala Client IT tests - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/04 01:06:06 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #40274: [SPARK-42215][CONNECT] Single command to run Scala Client IT tests - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/04 01:06:37 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #40228: [SPARK-41874][CONNECT][PYTHON] Support SameSemantics in Spark Connect - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/04 01:07:36 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #40272: [SPARK-42667][CONNECT] Spark Connect: newSession API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/04 01:45:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40274: [SPARK-42215][CONNECT] Simplify Scala Client IT tests - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/04 02:01:43 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40274: [SPARK-42215][CONNECT] Simplify Scala Client IT tests - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/04 02:08:56 UTC, 4 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40228: [SPARK-41874][CONNECT][PYTHON] Support SameSemantics in Spark Connect - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/04 02:09:19 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40218: [SPARK-42579][CONNECT] Part-1: `function.lit` support `Array[_]` dataType - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/04 02:28:56 UTC, 10 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40275: [SPARK-42557][CONNECT] Add Broadcast to functions - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/04 02:34:46 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40275: [SPARK-42557][CONNECT] Add Broadcast to functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/04 02:53:19 UTC, 3 replies.
- [GitHub] [spark] shrprasa commented on pull request #40258: [SPARK-42655][SQL]:Incorrect ambiguous column reference error - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/03/04 02:53:45 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #40252: [SPARK-42555][CONNECT] Add JDBC to DataFrameReader - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/04 02:59:47 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40276: [SPARK-42630][CONNECT][PYTHON] Implement data type string parser - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/04 03:01:55 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40269: [WIP][DOC] Updating the Style for the Spark Docs based on the Webpage - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/04 03:07:03 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/04 03:49:29 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40265: [SPARK-42556][CONNECT] Dataset.colregex should link a plan_id when it only matches a single column. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/04 05:43:39 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40265: [SPARK-42556][CONNECT] Dataset.colregex should link a plan_id when it only matches a single column. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/04 05:46:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40255: [SPARK-42558][CONNECT] Implement `DataFrameStatFunctions` except `bloomFilter` functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/04 06:14:58 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40265: [SPARK-42556][CONNECT] Dataset.colregex should link a plan_id when it only matches a single column. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/04 06:21:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40263: [SPARK-42659][ML] Reimplement `FPGrowthModel.transform` with dataframe operations - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/04 06:28:24 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40270: [SPARK-42662][CONNECT][PYTHON][PS] Support `withSequenceColumn` as PySpark DataFrame internal function. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/04 06:37:19 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40277: [SPARK-42555][CONNECT][FOLLOWUP] Add the new proto msg to support the remaining jdbc API - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/04 06:54:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40277: [SPARK-42555][CONNECT][FOLLOWUP] Add the new proto msg to support the remaining jdbc API - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/04 06:57:32 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40276: [SPARK-42630][CONNECT][PYTHON] Implement data type string parser - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/04 07:05:58 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #40268: [SPARK-42500][SQL] ConstantPropagation support more cases - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/04 08:54:04 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40276: [SPARK-42630][CONNECT][PYTHON] Implement data type string parser - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/04 13:00:09 UTC, 4 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40255: [SPARK-42558][CONNECT] Implement `DataFrameStatFunctions` except `bloomFilter` functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/04 13:56:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40270: [SPARK-42662][CONNECT][PYTHON][PS] Support `withSequenceColumn` as PySpark DataFrame internal function. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/04 14:01:00 UTC, 3 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40276: [SPARK-42630][CONNECT][PYTHON] Implement data type string parser - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/04 14:08:46 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40278: [SPARK-42670][BUILD] Upgrade maven-surefire-plugin to 3.0.0-M9 & eliminate build warnings - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/04 14:56:04 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40279: [MINOR][CONNECT] Remove unused imports proto file - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/04 15:08:01 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40264: [SPARK-42635][SQL][3.3] Fix the TimestampAdd expression - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/04 19:16:58 UTC, 0 replies.
- [GitHub] [spark] goodwanghan commented on pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by "goodwanghan (via GitHub)" <gi...@apache.org> on 2023/03/05 00:16:22 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36265: [SPARK-38951][SQL] Aggregate aliases override field names in ResolveAggregateFunctions - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/05 00:23:48 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40280: [SPARK-42671][CONNECT] Fix bug for createDataFrame from complex type schema - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/05 02:29:25 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #40280: [SPARK-42671][CONNECT] Fix bug for createDataFrame from complex type schema - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/05 02:30:02 UTC, 3 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40280: [SPARK-42671][CONNECT] Fix bug for createDataFrame from complex type schema - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/05 02:44:45 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #40280: [SPARK-42671][CONNECT] Fix bug for createDataFrame from complex type schema - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/05 03:16:18 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40255: [SPARK-42558][CONNECT] Implement `DataFrameStatFunctions` except `bloomFilter` functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/05 03:34:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40275: [SPARK-42557][CONNECT] Add Broadcast to functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/05 03:38:10 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40270: [SPARK-42662][CONNECT][PYTHON][PS] Support `withSequenceColumn` as PySpark DataFrame internal function. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/05 05:00:33 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40255: [SPARK-42558][CONNECT] Implement `DataFrameStatFunctions` except `bloomFilter` functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/05 05:26:45 UTC, 1 replies.
- [GitHub] [spark] ivoson opened a new pull request, #40281: [SPARK-41497][CORE][Follow UP]Modify config `spark.rdd.cache.visibilityTracking.enabled` support version to 3.5.0 - posted by "ivoson (via GitHub)" <gi...@apache.org> on 2023/03/05 06:10:54 UTC, 0 replies.
- [GitHub] [spark] ivoson commented on pull request #40281: [SPARK-41497][CORE][Follow UP]Modify config `spark.rdd.cache.visibilityTracking.enabled` support version to 3.5.0 - posted by "ivoson (via GitHub)" <gi...@apache.org> on 2023/03/05 06:11:53 UTC, 1 replies.
- [GitHub] [spark] khalidmammadov commented on a diff in pull request #40015: [SPARK-42437][PySpark][Connect] PySpark catalog.cacheTable will allow to specify storage level - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/03/05 07:54:10 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40282: [SPARK-42672][PYTHON][DOCS] Document error class list - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/05 08:02:41 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40282: [SPARK-42672][PYTHON][DOCS] Document error class list - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/05 08:04:51 UTC, 10 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40273: [SPARK-42668][SS] Catch exception while trying to close compressed stream in HDFSStateStoreProvider abort - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/05 08:20:24 UTC, 3 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40283: [SPARK-42673][BUILD] Ban 3.9.x for Spark Maven build - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/05 08:23:51 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40284: [SPARK-42674][BUILD] Upgrade scalafmt from 3.7.1 to 3.7.2 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/05 08:32:18 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40283: [SPARK-42673][BUILD] Ban Maven 3.9.x for Spark build - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/05 08:45:06 UTC, 3 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40275: [SPARK-42557][CONNECT] Add Broadcast to functions - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/05 11:56:23 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on pull request #40275: [SPARK-42557][CONNECT] Add Broadcast to functions - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/05 12:01:24 UTC, 3 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/05 12:16:59 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40278: [SPARK-42670][BUILD] Upgrade maven-surefire-plugin to 3.0.0-M9 & eliminate build warnings - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/05 12:17:24 UTC, 3 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #40278: [SPARK-42670][BUILD] Upgrade maven-surefire-plugin to 3.0.0-M9 & eliminate build warnings - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/05 12:41:37 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40285: [SPARK-42675][CONNECT][TESTS] Drop temp view after test `test temp view` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/05 13:11:59 UTC, 0 replies.
- [GitHub] [spark] the8thC commented on a diff in pull request #40236: [SPARK-38735][SQL][TESTS] Add tests for the error class: INTERNAL_ERROR - posted by "the8thC (via GitHub)" <gi...@apache.org> on 2023/03/05 13:49:53 UTC, 0 replies.
- [GitHub] [spark] ivoson opened a new pull request, #40286: [SPARK-42577][CORE] Add max attempts limitation for stages to avoid potential infinite retry - posted by "ivoson (via GitHub)" <gi...@apache.org> on 2023/03/05 14:04:04 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40236: [SPARK-38735][SQL][TESTS] Add tests for the error class: INTERNAL_ERROR - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/05 15:07:15 UTC, 0 replies.
- [GitHub] [spark] FurcyPin commented on a diff in pull request #40271: [WIP][SPARK-42258][PYTHON] pyspark.sql.functions should not expose typing.cast - posted by "FurcyPin (via GitHub)" <gi...@apache.org> on 2023/03/05 16:28:07 UTC, 3 replies.
- [GitHub] [spark] wangyum closed pull request #40285: [SPARK-42675][CONNECT][TESTS] Drop temp view after test `test temp view` - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/06 00:10:52 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40285: [SPARK-42675][CONNECT][TESTS] Drop temp view after test `test temp view` - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/06 00:11:31 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38736: [SPARK-41214][SQL] - SQL Metrics are missing from Spark UI when AQE for Cached DataFrame is enabled - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/06 00:21:13 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36265: [SPARK-38951][SQL] Aggregate aliases override field names in ResolveAggregateFunctions - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/06 00:21:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40284: [SPARK-42674][BUILD] Upgrade scalafmt from 3.7.1 to 3.7.2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/06 00:41:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40284: [SPARK-42674][BUILD] Upgrade scalafmt from 3.7.1 to 3.7.2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/06 00:41:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40282: [SPARK-42672][PYTHON][DOCS] Document error class list - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/06 00:42:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40281: [SPARK-41497][CORE][Follow UP]Modify config `spark.rdd.cache.visibilityTracking.enabled` support version to 3.5.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/06 00:43:46 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40271: [WIP][SPARK-42258][PYTHON] pyspark.sql.functions should not expose typing.cast - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/06 00:56:02 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 01:03:27 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #40277: [SPARK-42555][CONNECT][FOLLOWUP] Add the new proto msg to support the remaining jdbc API - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 01:05:47 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #40286: [SPARK-42577][CORE] Add max attempts limitation for stages to avoid potential infinite retry - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/06 01:54:57 UTC, 8 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40287: [SPARK-42562][CONNECT] UnresolvedNamedLambdaVariable in python do not need unique names - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 01:59:27 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40280: [SPARK-42671][CONNECT] Fix bug for createDataFrame from complex type schema - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 02:08:00 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40279: [MINOR][CONNECT] Remove unused protobuf imports to eliminate build warnings - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 02:09:57 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40275: [SPARK-42557][CONNECT] Add Broadcast to functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 02:10:41 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40275: [SPARK-42557][CONNECT] Add Broadcast to functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 02:11:01 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40255: [SPARK-42558][CONNECT] Implement `DataFrameStatFunctions` except `bloomFilter` functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 02:12:20 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40255: [SPARK-42558][CONNECT] Implement `DataFrameStatFunctions` except `bloomFilter` functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 02:13:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40285: [SPARK-42675][CONNECT][TESTS] Drop temp view after test `test temp view` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/06 02:14:51 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 02:17:56 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 02:18:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40064: [SPARK-42478] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/06 02:27:20 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40254: [SPARK-42654][BUILD] Upgrade dropwizard metrics 4.2.17 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/06 02:32:54 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40254: [SPARK-42654][BUILD] Upgrade dropwizard metrics 4.2.17 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/06 02:32:59 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40270: [SPARK-42662][CONNECT][PYTHON][PS] Support `withSequenceColumn` as PySpark DataFrame internal function. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 02:34:59 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #40288: [SPARK-42496][CONNECT][DOCS] Introduction Spark Connect at main page. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/06 02:35:51 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40217: [SPARK-42559][CONNECT] Implement DataFrameNaFunctions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 02:44:22 UTC, 3 replies.
- [GitHub] [spark] itholic commented on pull request #40288: [SPARK-42496][CONNECT][DOCS] Introduction Spark Connect at main page. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/06 02:44:35 UTC, 2 replies.
- [GitHub] [spark] Yikf commented on pull request #40064: [SPARK-42478] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/03/06 03:05:04 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40287: [SPARK-42562][CONNECT] UnresolvedNamedLambdaVariable in python do not need unique names - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 03:07:03 UTC, 3 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40277: [SPARK-42555][CONNECT][FOLLOWUP] Add the new proto msg to support the remaining jdbc API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 03:08:11 UTC, 4 replies.
- [GitHub] [spark] wangyum commented on pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/06 03:11:52 UTC, 0 replies.
- [GitHub] [spark] Yikf opened a new pull request, #40289: [SPARK-42478][SQL][3.2] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/03/06 03:16:33 UTC, 0 replies.
- [GitHub] [spark] Yikf opened a new pull request, #40290: [SPARK-42478][SQL][3.3] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/03/06 03:17:41 UTC, 0 replies.
- [GitHub] [spark] Yikf commented on pull request #40289: [SPARK-42478][SQL][3.2] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/03/06 03:19:07 UTC, 0 replies.
- [GitHub] [spark] Yikf commented on pull request #40290: [SPARK-42478][SQL][3.3] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/03/06 03:19:15 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40291: [WIP][SPARK-42578][CONNECT] Add JDBC to DataFrameWriter - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 03:24:44 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40287: [SPARK-42562][CONNECT] UnresolvedNamedLambdaVariable in python do not need unique names - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 03:25:25 UTC, 7 replies.
- [GitHub] [spark] beliefer commented on pull request #40291: [WIP][SPARK-42578][CONNECT] Add JDBC to DataFrameWriter - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 03:26:29 UTC, 2 replies.
- [GitHub] [spark] huangxiaopingRD closed pull request #40196: [SPARK-42603][SQL] Set spark.sql.legacy.createHiveTableByDefault to false. - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2023/03/06 03:34:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40287: [SPARK-42562][CONNECT] UnresolvedNamedLambdaVariable in python do not need unique names - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/06 03:34:52 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40277: [SPARK-42555][CONNECT][FOLLOWUP] Add the new proto msg to support the remaining jdbc API - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 03:35:12 UTC, 3 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #40292: [SPARK-42676] Write temp checkpoints for streaming queries to local filesystem even if default FS is set differently - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/03/06 03:50:36 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #40292: [SPARK-42676] Write temp checkpoints for streaming queries to local filesystem even if default FS is set differently - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/03/06 03:51:10 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40293: [SPARK-42677][SQL] Fix the invalid tests for broadcast hint - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 04:10:12 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40291: [WIP][SPARK-42578][CONNECT] Add JDBC to DataFrameWriter - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 04:22:52 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40228: [SPARK-41874][CONNECT][PYTHON] Support SameSemantics in Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/06 05:13:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40228: [SPARK-41874][CONNECT][PYTHON] Support SameSemantics in Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/06 05:14:35 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #40283: [SPARK-42673][BUILD] Ban Maven 3.9.x for Spark build - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/06 05:23:04 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40283: [SPARK-42673][BUILD] Ban Maven 3.9.x for Spark build - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/06 05:27:05 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #40244: [WIP][SPARK-42643][CONNECT][PYTHON] Implement `spark.udf.registerJavaFunction` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/06 05:41:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40283: [SPARK-42673][BUILD] Make `build/mvn` build Spark only with the verified maven version - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/06 05:54:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40283: [SPARK-42673][BUILD] Make `build/mvn` build Spark only with the verified maven version - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/06 05:55:01 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40291: [WIP][SPARK-42578][CONNECT] Add JDBC to DataFrameWriter - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/06 06:04:27 UTC, 1 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40294: [SPARK-40610][SQL] Support unwrap date type to string type - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/06 06:47:12 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40292: [SPARK-42676][SS] Write temp checkpoints for streaming queries to local filesystem even if default FS is set differently - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/06 06:54:26 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40292: [SPARK-42676][SS] Write temp checkpoints for streaming queries to local filesystem even if default FS is set differently - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/06 06:55:38 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40280: [SPARK-42671][CONNECT] Fix bug for createDataFrame from complex type schema - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/06 07:09:26 UTC, 3 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/03/06 07:43:36 UTC, 0 replies.
- [GitHub] [spark] hboutemy commented on pull request #40283: [SPARK-42673][BUILD] Make `build/mvn` build Spark only with the verified maven version - posted by "hboutemy (via GitHub)" <gi...@apache.org> on 2023/03/06 07:50:58 UTC, 2 replies.
- [GitHub] [spark] gnodet commented on pull request #40283: [SPARK-42673][BUILD] Make `build/mvn` build Spark only with the verified maven version - posted by "gnodet (via GitHub)" <gi...@apache.org> on 2023/03/06 08:16:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40293: [SPARK-42677][SQL][TESTS] Fix the invalid tests for broadcast hint - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/06 08:17:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40293: [SPARK-42677][SQL][TESTS] Fix the invalid tests for broadcast hint - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/06 08:17:57 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40294: [SPARK-40610][SQL] Support unwrap date type to string type - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/06 08:32:29 UTC, 1 replies.
- [GitHub] [spark] vitaliili-db opened a new pull request, #40295: [SPARK-42681] Relax ordering constraint for ALTER TABLE ADD|REPLACE c… - posted by "vitaliili-db (via GitHub)" <gi...@apache.org> on 2023/03/06 08:42:20 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40293: [SPARK-42677][SQL][TESTS] Fix the invalid tests for broadcast hint - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 08:49:02 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40291: [WIP][SPARK-42578][CONNECT] Add JDBC to DataFrameWriter - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 09:02:09 UTC, 2 replies.
- [GitHub] [spark] panbingkun commented on pull request #40217: [SPARK-42559][CONNECT] Implement DataFrameNaFunctions - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/06 09:05:09 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40296: [WIP][SPARK-42680][CONNECT] Create the helper function withSQLConf for connect test framework - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 09:14:07 UTC, 0 replies.
- [GitHub] [spark] micaelcapitao commented on pull request #23735: [SPARK-26801][SQL] Read avro types other than record - posted by "micaelcapitao (via GitHub)" <gi...@apache.org> on 2023/03/06 10:38:19 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40268: [SPARK-42500][SQL] ConstantPropagation support more cases - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/06 10:59:52 UTC, 7 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40244: [SPARK-42643][CONNECT][PYTHON] Implement `spark.udf.registerJavaFunction` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/06 11:03:40 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #40244: [SPARK-42643][CONNECT][PYTHON] Implement `spark.udf.registerJavaFunction` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/06 11:05:52 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40273: [SPARK-42668][SS] Catch exception while trying to close compressed stream in HDFSStateStoreProvider abort - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/06 11:23:29 UTC, 0 replies.
- [GitHub] [spark] dolfinus commented on pull request #30572: [SPARK-33628][SQL] Use the Hive.getPartitionsByNames method instead of Hive.getPartitions in the HiveClientImpl - posted by "dolfinus (via GitHub)" <gi...@apache.org> on 2023/03/06 12:00:13 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40296: [SPARK-42680][CONNECT] Create the helper function withSQLConf for connect test framework - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/06 12:07:43 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #40268: [SPARK-42500][SQL] ConstantPropagation support more cases - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/06 12:29:42 UTC, 8 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40291: [WIP][SPARK-42578][CONNECT] Add JDBC to DataFrameWriter - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 12:37:32 UTC, 3 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40244: [SPARK-42643][CONNECT][PYTHON] Register Java (aggregate) user-defined functions - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/06 12:50:13 UTC, 1 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #40297: [SPARK-42412][WIP] Initial PR of Spark connect ML - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/06 13:33:05 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #40097: [WIP][SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/06 14:07:09 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/03/06 14:24:51 UTC, 0 replies.
- [GitHub] [spark] haoyanzhang opened a new pull request, #40298: [SPARK-42595][SQL] Support query inserted partitions after insert data into table when hive.exec.dynamic.partition=true - posted by "haoyanzhang (via GitHub)" <gi...@apache.org> on 2023/03/06 15:08:27 UTC, 0 replies.
- [GitHub] [spark] ivoson commented on a diff in pull request #40286: [SPARK-42577][CORE] Add max attempts limitation for stages to avoid potential infinite retry - posted by "ivoson (via GitHub)" <gi...@apache.org> on 2023/03/06 15:36:19 UTC, 7 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #40299: [SPARK-42684][SQL] v2 catalog should not allow column default value by default - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/06 15:38:18 UTC, 0 replies.
- [GitHub] [spark] ryan-johnson-databricks opened a new pull request, #40300: [SPARK-42683] Automatically rename conflicting metadata columns - posted by "ryan-johnson-databricks (via GitHub)" <gi...@apache.org> on 2023/03/06 16:05:31 UTC, 0 replies.
- [GitHub] [spark] alkis opened a new pull request, #40301: [SPARK-42685] optimize Utils.byteToString routines - posted by "alkis (via GitHub)" <gi...@apache.org> on 2023/03/06 16:09:19 UTC, 0 replies.
- [GitHub] [spark] alkis opened a new pull request, #40302: [SPARK-42686] defer formatting for debug messages in TaskMemoryManager - posted by "alkis (via GitHub)" <gi...@apache.org> on 2023/03/06 16:20:26 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40303: [SPARK-42656][CONNECT][Followup] Improve the script to start spark-connect server - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/06 16:26:37 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #40274: [SPARK-42215][CONNECT] Simplify Scala Client IT tests - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/06 16:29:20 UTC, 3 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40304: [SPARK-42665] Mute udf test - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/06 16:49:06 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #40282: [SPARK-42672][PYTHON][DOCS] Document error class list - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/03/06 18:07:14 UTC, 1 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40305: [WIP] Spark Connect Shell - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/06 18:37:44 UTC, 0 replies.
- [GitHub] [spark] huanliwang-db opened a new pull request, #40306: [SPARK-42687][SS] Better error message for the unsupport `pivot` operation in Streaming - posted by "huanliwang-db (via GitHub)" <gi...@apache.org> on 2023/03/06 18:45:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40283: [SPARK-42673][BUILD] Make `build/mvn` build Spark only with the verified maven version - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/06 19:06:03 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40217: [SPARK-42559][CONNECT] Implement DataFrameNaFunctions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 19:17:16 UTC, 0 replies.
- [GitHub] [spark] mridulm opened a new pull request, #40307: Draft: SPARK-42689: Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/06 19:30:38 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #40307: [DRAFT][CORE][SHUFFLE]: SPARK-42689: Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/06 19:36:19 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40307: [DRAFT][SPARK-42689][CORE][SHUFFLE]: Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/06 19:39:13 UTC, 5 replies.
- [GitHub] [spark] amaliujia commented on pull request #40303: [SPARK-42656][CONNECT][Followup] Improve the script to start spark-connect server - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/06 20:08:07 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40276: [SPARK-42630][CONNECT][PYTHON] Implement data type string parser - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/06 21:21:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40307: [DRAFT][SPARK-42689][CORE][SHUFFLE]: Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/06 21:23:48 UTC, 0 replies.
- [GitHub] [spark] otterc commented on a diff in pull request #40307: [DRAFT][SPARK-42689][CORE][SHUFFLE]: Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/03/06 21:26:06 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40304: [SPARK-42665][CONNECT][Test] Mute Scala Client UDF test - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/06 21:50:18 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #40308: [SPARK-42151][SQL] Align UPDATE assignments with table attributes - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/03/06 21:51:19 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #40307: [DRAFT][SPARK-42689][CORE][SHUFFLE]: Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/06 21:56:48 UTC, 2 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #40308: [SPARK-42151][SQL] Align UPDATE assignments with table attributes - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/03/06 21:59:34 UTC, 33 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40290: [SPARK-42478][SQL][3.3] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/06 22:06:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40290: [SPARK-42478][SQL][3.3] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/06 22:06:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40289: [SPARK-42478][SQL][3.2] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/06 22:08:25 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40309: [SPARK-42688][CONNECT] Rename Connect proto Request client_id to session_id - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/06 22:22:25 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40309: [SPARK-42688][CONNECT] Rename Connect proto Request client_id to session_id - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/06 22:22:56 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #40303: [SPARK-42656][CONNECT][Followup] Improve the script to start spark-connect server - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/06 22:49:40 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #40305: [SPARK-42656][CONNECT][Followup] Spark Connect Shell - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/06 22:50:27 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40306: [SPARK-42687][SS] Better error message for the unsupport `pivot` operation in Streaming - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/06 23:13:07 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40306: [SPARK-42687][SS] Better error message for the unsupport `pivot` operation in Streaming - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/06 23:14:10 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40218: [SPARK-42579][CONNECT] Part-1: `function.lit` support `Array[_]` dataType - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 23:26:14 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40218: [SPARK-42579][CONNECT] Part-1: `function.lit` support `Array[_]` dataType - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/06 23:26:49 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40310: [SPARK-42022][CONNECT][PYTHON] Fix createDataFrame to autogenerate missing column names - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/07 00:20:33 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38736: [SPARK-41214][SQL] - SQL Metrics are missing from Spark UI when AQE for Cached DataFrame is enabled - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/07 00:21:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40296: [SPARK-42680][CONNECT][TESTS] Create the helper function withSQLConf for connect test framework - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/07 00:29:51 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40303: [SPARK-42656][CONNECT][Followup] Improve the script to start spark-connect server - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 02:04:47 UTC, 0 replies.
- [GitHub] [spark] vitaliili-db commented on pull request #40295: [SPARK-42681] Relax ordering constraint for ALTER TABLE ADD|REPLACE column options - posted by "vitaliili-db (via GitHub)" <gi...@apache.org> on 2023/03/07 02:05:17 UTC, 2 replies.
- [GitHub] [spark] hvanhovell closed pull request #40303: [SPARK-42656][CONNECT][Followup] Improve the script to start spark-connect server - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 02:05:26 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40309: [SPARK-42688][CONNECT] Rename Connect proto Request client_id to session_id - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 02:11:04 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40309: [SPARK-42688][CONNECT] Rename Connect proto Request client_id to session_id - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 02:11:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40296: [SPARK-42680][CONNECT][TESTS] Create the helper function withSQLConf for connect test framework - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/07 02:14:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40296: [SPARK-42680][CONNECT][TESTS] Create the helper function withSQLConf for connect test framework - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/07 02:14:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40244: [SPARK-42643][CONNECT][PYTHON] Register Java (aggregate) user-defined functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/07 02:18:54 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40297: [SPARK-42412][WIP] Initial PR of Spark connect ML - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/07 02:39:36 UTC, 17 replies.
- [GitHub] [spark] beliefer commented on pull request #40296: [SPARK-42680][CONNECT][TESTS] Create the helper function withSQLConf for connect test framework - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/07 02:48:29 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40297: [SPARK-42412][WIP] Initial PR of Spark connect ML - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/07 02:55:42 UTC, 42 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40300: [SPARK-42683] Automatically rename conflicting metadata columns - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/07 03:37:49 UTC, 11 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40300: [SPARK-42683] Automatically rename conflicting metadata columns - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/07 04:16:18 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/07 04:42:38 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/07 04:44:55 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #40308: [SPARK-42151][SQL] Align UPDATE assignments with table attributes - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/03/07 05:01:54 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40288: [WIP][SPARK-42496][CONNECT][DOCS] Introduction Spark Connect at main page. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/07 05:30:46 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40310: [SPARK-42022][CONNECT][PYTHON] Fix createDataFrame to autogenerate missing column names - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/07 05:48:47 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40310: [SPARK-42022][CONNECT][PYTHON] Fix createDataFrame to autogenerate missing column names - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/07 05:50:32 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40215: [SPARK-42591][SS][DOCS] Add examples of unblocked workloads after SPARK-42376 - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/07 05:55:22 UTC, 2 replies.
- [GitHub] [spark] jerqi commented on pull request #40307: [DRAFT][SPARK-42689][CORE][SHUFFLE]: Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "jerqi (via GitHub)" <gi...@apache.org> on 2023/03/07 06:17:29 UTC, 6 replies.
- [GitHub] [spark] pan3793 commented on pull request #40307: [DRAFT][SPARK-42689][CORE][SHUFFLE]: Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/07 06:32:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40311: [SPARK-42559][CONNECT][TESTS][FOLLOW-UP] Disable ANSI in several tests at DataFrameNaFunctionSuite.scala - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/07 06:58:34 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40215: [SPARK-42591][SS][DOCS] Add examples of unblocked workloads after SPARK-42376 - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/07 07:00:42 UTC, 1 replies.
- [GitHub] [spark] olaky commented on a diff in pull request #40300: [SPARK-42683] Automatically rename conflicting metadata columns - posted by "olaky (via GitHub)" <gi...@apache.org> on 2023/03/07 07:19:33 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40271: SPARK-42258][PYTHON] pyspark.sql.functions should not expose typing.cast - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/07 07:23:08 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40304: [SPARK-42665][CONNECT][Test] Mute Scala Client UDF test - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/07 07:25:13 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #40311: [SPARK-42559][CONNECT][TESTS][FOLLOW-UP] Disable ANSI in several tests at DataFrameNaFunctionSuite.scala - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/07 07:28:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40304: [SPARK-42665][CONNECT][Test] Mute Scala Client UDF test - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/07 07:36:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40304: [SPARK-42665][CONNECT][Test] Mute Scala Client UDF test - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/07 07:36:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40311: [SPARK-42559][CONNECT][TESTS][FOLLOW-UP] Disable ANSI in several tests at DataFrameNaFunctionSuite.scala - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/07 08:09:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40311: [SPARK-42559][CONNECT][TESTS][FOLLOW-UP] Disable ANSI in several tests at DataFrameNaFunctionSuite.scala - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/07 08:10:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40097: [SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/07 08:12:00 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40097: [SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/07 08:14:57 UTC, 4 replies.
- [GitHub] [spark] xingchaozh opened a new pull request, #40312: [SPARK-42695][SQL] Skew join handling in stream side of broadcast hash join - posted by "xingchaozh (via GitHub)" <gi...@apache.org> on 2023/03/07 08:55:20 UTC, 0 replies.
- [GitHub] [spark] FurcyPin commented on a diff in pull request #40271: SPARK-42258][PYTHON] pyspark.sql.functions should not expose typing.cast - posted by "FurcyPin (via GitHub)" <gi...@apache.org> on 2023/03/07 09:08:47 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40313: [SPARK-42697][WEBUI] Fix /api/v1/applications to return total uptime instead of 0 for the duration field - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/07 10:18:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40302: [SPARK-42686][CORE] Defer formatting for debug messages in TaskMemoryManager - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/07 10:31:48 UTC, 1 replies.
- [GitHub] [spark] AngersZhuuuu opened a new pull request, #40314: [SPARK-42698][CORE] SparkSubmit should pass exitCode to AM side - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2023/03/07 10:59:08 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #40314: [SPARK-42698][CORE] SparkSubmit should pass exitCode to AM side - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2023/03/07 10:59:57 UTC, 1 replies.
- [GitHub] [spark] AngersZhuuuu opened a new pull request, #40315: [SPARK-42699][CONNECTOR] SparkConnectServer should make client and AM same exit code - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2023/03/07 11:11:33 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #40315: [SPARK-42699][CONNECTOR] SparkConnectServer should make client and AM same exit code - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2023/03/07 11:11:42 UTC, 0 replies.
- [GitHub] [spark] alkis commented on pull request #40302: [SPARK-42686][CORE] Defer formatting for debug messages in TaskMemoryManager - posted by "alkis (via GitHub)" <gi...@apache.org> on 2023/03/07 11:12:20 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40316: [WIP][SPARK-42679][CONNECT] createDataFrame doesn't work with non-nullable schema - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/07 12:22:13 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #40215: [SPARK-42591][SS][DOCS] Add examples of unblocked workloads after SPARK-42376 - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/07 12:25:31 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #40305: [SPARK-42656][CONNECT][Followup] Spark Connect Shell - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 12:34:51 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40305: [SPARK-42656][CONNECT][Followup] Spark Connect Shell - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 12:35:01 UTC, 0 replies.
- [GitHub] [spark] waitinfuture commented on pull request #40307: [DRAFT][SPARK-42689][CORE][SHUFFLE]: Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "waitinfuture (via GitHub)" <gi...@apache.org> on 2023/03/07 12:42:48 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40317: [SPARK-42700][BUILD] Add `h2` as test dependency of connect-server module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/07 12:55:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40317: [SPARK-42700][BUILD] Add `h2` as test dependency of connect-server module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/07 12:55:44 UTC, 3 replies.
- [GitHub] [spark] srowen commented on pull request #40258: [SPARK-42655][SQL] Incorrect ambiguous column reference error - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/07 13:06:15 UTC, 5 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40318: [SPARK-42656][SPARK SHELL][CONNECT][FOLLOWUP] Add same `ClassNotFoundException` catch to `repl.Main` for Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/07 13:06:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40305: [SPARK-42656][CONNECT][Followup] Spark Connect Shell - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/07 13:07:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40318: [SPARK-42656][SPARK SHELL][CONNECT][FOLLOWUP] Add same `ClassNotFoundException` catch to `repl.Main` for Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/07 13:09:04 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40317: [SPARK-42700][BUILD] Add `h2` as test dependency of connect-server module - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 13:14:12 UTC, 0 replies.
- [GitHub] [spark] shrprasa commented on pull request #40258: [SPARK-42655][SQL] Incorrect ambiguous column reference error - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/03/07 13:19:09 UTC, 20 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40315: [SPARK-42699][CONNECTOR] SparkConnectServer should make client and AM same exit code - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 13:23:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40308: [SPARK-42151][SQL] Align UPDATE assignments with table attributes - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/07 14:02:58 UTC, 8 replies.
- [GitHub] [spark] panbingkun commented on pull request #40316: [WIP][SPARK-42679][CONNECT] createDataFrame doesn't work with non-nullable schema - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/07 14:10:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40276: [SPARK-42630][CONNECT][PYTHON] Implement data type string parser - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/07 14:12:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/07 14:16:36 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40276: [SPARK-42630][CONNECT][PYTHON] Implement data type string parser - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 14:31:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40190: [SPARK-42597][SQL] Support unwrap date type to timestamp type - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/07 14:31:25 UTC, 8 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40294: [SPARK-40610][SQL] Support unwrap date type to string type - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/07 14:34:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40319: [SPARK-42692][CONNECT] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/07 14:34:51 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40319: [SPARK-42692][CONNECT] Implement `Dataset.toJSON` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 15:07:32 UTC, 2 replies.
- [GitHub] [spark] hvanhovell closed pull request #40318: [SPARK-42656][SPARK SHELL][CONNECT][FOLLOWUP] Add same `ClassNotFoundException` catch to `repl.Main` for Scala 2.13 - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 15:13:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40319: [SPARK-42692][CONNECT] Implement `Dataset.toJSON` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/07 15:23:23 UTC, 2 replies.
- [GitHub] [spark] justaparth opened a new pull request, #40320: Update code example formatting for protobuf parsing readme - posted by "justaparth (via GitHub)" <gi...@apache.org> on 2023/03/07 15:30:09 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40319: [SPARK-42692][CONNECT] Implement `Dataset.toJSON` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/07 16:33:06 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #40190: [SPARK-42597][SQL] Support unwrap date type to timestamp type - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/03/07 17:13:41 UTC, 0 replies.
- [GitHub] [spark] ryan-johnson-databricks opened a new pull request, #40321: [SPARK-42704] SubqueryAlias propagates metadata columns that child outputs - posted by "ryan-johnson-databricks (via GitHub)" <gi...@apache.org> on 2023/03/07 18:06:58 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #40313: [SPARK-42697][WEBUI] Fix /api/v1/applications to return total uptime instead of 0 for the duration field - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/07 18:07:09 UTC, 4 replies.
- [GitHub] [spark] dbtsai commented on pull request #40320: Update code example formatting for protobuf parsing readme - posted by "dbtsai (via GitHub)" <gi...@apache.org> on 2023/03/07 18:41:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40317: [SPARK-42700][BUILD] Add `h2` as test dependency of connect-server module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/07 19:39:58 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40319: [SPARK-42692][CONNECT] Implement `Dataset.toJSON` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/07 19:46:45 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #40295: [SPARK-42681] Relax ordering constraint for ALTER TABLE ADD|REPLACE column options - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/07 19:55:01 UTC, 2 replies.
- [GitHub] [spark] srowen commented on pull request #40317: [SPARK-42700][BUILD] Add `h2` as test dependency of connect-server module - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/07 19:57:46 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40295: [SPARK-42681] Relax ordering constraint for ALTER TABLE ADD|REPLACE column options - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/07 20:04:50 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40215: [SPARK-42591][SS][DOCS] Add examples of unblocked workloads after SPARK-42376 - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/07 20:24:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40314: [SPARK-42698][CORE] SparkSubmit should pass exitCode to AM side - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/07 22:41:31 UTC, 0 replies.
- [GitHub] [spark] eric-maynard commented on pull request #39519: [SPARK-41995][SQL] Accept non-foldable expressions in schema_of_json - posted by "eric-maynard (via GitHub)" <gi...@apache.org> on 2023/03/07 22:49:10 UTC, 0 replies.
- [GitHub] [spark] eric-maynard closed pull request #39519: [SPARK-41995][SQL] Accept non-foldable expressions in schema_of_json - posted by "eric-maynard (via GitHub)" <gi...@apache.org> on 2023/03/07 22:49:14 UTC, 0 replies.
- [GitHub] [spark] ryan-johnson-databricks commented on pull request #40321: [SPARK-42704] SubqueryAlias propagates metadata columns that child outputs - posted by "ryan-johnson-databricks (via GitHub)" <gi...@apache.org> on 2023/03/07 23:50:29 UTC, 0 replies.
- [GitHub] [spark] itholic closed pull request #40288: [SPARK-42496][CONNECT][DOCS] Introduction Spark Connect at main page. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/07 23:53:08 UTC, 0 replies.
- [GitHub] [spark] rithwik-db opened a new pull request, #40322: Added small fix - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/03/08 00:13:23 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40323: [SPARK-42705][CONNECT] Fix spark.sql to return values from the command - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/08 00:16:31 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38740: [SQL] Add product encoders for local classes - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/08 00:21:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40310: [SPARK-42022][CONNECT][PYTHON] Fix createDataFrame to autogenerate missing column names - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 00:35:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40310: [SPARK-42022][CONNECT][PYTHON] Fix createDataFrame to autogenerate missing column names - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 00:35:59 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40282: [SPARK-42672][PYTHON][DOCS] Document error class list - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/08 00:50:40 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40315: [SPARK-42699][CONNECT] SparkConnectServer should make client and AM same exit code - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 01:07:43 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40323: [SPARK-42705][CONNECT] Fix spark.sql to return values from the command - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/08 02:10:48 UTC, 2 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40313: [SPARK-42697][WEBUI] Fix /api/v1/applications to return total uptime instead of 0 for the duration field - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/08 02:22:58 UTC, 2 replies.
- [GitHub] [spark] AngersZhuuuu commented on a diff in pull request #40315: [SPARK-42699][CONNECT] SparkConnectServer should make client and AM same exit code - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2023/03/08 02:26:33 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #40316: [SPARK-42679][CONNECT] createDataFrame doesn't work with non-nullable schema - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/08 02:26:56 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #40323: [SPARK-42705][CONNECT] Fix spark.sql to return values from the command - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/08 02:28:01 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on a diff in pull request #40314: [SPARK-42698][CORE] SparkSubmit should pass exitCode to AM side - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2023/03/08 02:37:28 UTC, 1 replies.
- [GitHub] [spark] allanf-db opened a new pull request, #40324: [WIP][SPARK-42496][CONNECT][DOCS] Adding Spark Connect to the Spark 3.4 documentation - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/03/08 02:44:27 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40316: [SPARK-42679][CONNECT] createDataFrame doesn't work with non-nullable schema - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/08 02:45:59 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40323: [SPARK-42705][CONNECT] Fix spark.sql to return values from the command - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/08 03:06:12 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40323: [SPARK-42705][CONNECT] Fix spark.sql to return values from the command - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/08 03:06:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40325: [SPARK-42707][CONNECT][DOCS] Update developer documentation about API stability warning - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 03:21:16 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/08 03:33:51 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/08 03:36:37 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40325: [SPARK-42707][CONNECT][DOCS] Update developer documentation about API stability warning - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/08 03:37:58 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40317: [SPARK-42700][BUILD] Add `h2` as test dependency of connect-server module - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/08 03:58:17 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40295: [SPARK-42681][SQL] Relax ordering constraint for ALTER TABLE ADD|REPLACE column descriptor - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/08 04:04:30 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40295: [SPARK-42681][SQL] Relax ordering constraint for ALTER TABLE ADD|REPLACE column descriptor - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/08 04:04:57 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #40326: [SPARK-42708] [Docs] Improve doc about protobuf java file can't be indexed. - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/03/08 04:09:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40299: [SPARK-42684][SQL] v2 catalog should not allow column default value by default - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/08 04:26:13 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40327: [SPARK-42266][PYTHON] Remove the parent directory in shell.py execution when IPython is used - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 04:41:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40327: [SPARK-42266][PYTHON] Remove the parent directory in shell.py execution when IPython is used - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 04:42:26 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40325: [SPARK-42707][CONNECT][DOCS] Update developer documentation about API stability warning - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 04:43:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40317: [SPARK-42700][BUILD] Add `h2` as test dependency of connect-server module - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 04:46:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40317: [SPARK-42700][BUILD] Add `h2` as test dependency of connect-server module - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 04:46:42 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40325: [SPARK-42707][CONNECT][DOCS] Update developer documentation about API stability warning - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/08 04:46:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40325: [SPARK-42707][CONNECT][DOCS] Update developer documentation about API stability warning - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/08 04:47:26 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #40299: [SPARK-42684][SQL] v2 catalog should not allow column default value by default - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/08 05:05:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40233: [WIP][SPARK-42630][CONNECT][PYTHON] Make `parse_data_type` use new proto message `DDLParse` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/08 05:07:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40233: [WIP][SPARK-42630][CONNECT][PYTHON] Make `parse_data_type` use new proto message `DDLParse` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/08 05:08:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40328: [SPARK-42709][PYTHON] Remove the assumption of `__file__` being available - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 05:17:22 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40323: [SPARK-42705][CONNECT] Fix spark.sql to return values from the command - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/08 06:02:39 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40329: Rename FrameMap proto to MapPartitions - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/08 06:22:50 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #40244: [SPARK-42643][CONNECT][PYTHON] Register Java (aggregate) user-defined functions - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/08 06:23:34 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40323: [SPARK-42705][CONNECT] Fix spark.sql to return values from the command - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/08 06:25:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40322: [SPARK-41775][PYTHON][FOLLOW-UP] Updating error message for training using PyTorch functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/08 06:47:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40322: [SPARK-41775][PYTHON][FOLLOW-UP] Updating error message for training using PyTorch functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/08 06:47:46 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40330: [SPARK-42712][PYTHON][DOC] Improve docstring of mapInPandas and mapInArrow - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/08 07:01:00 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40330: [SPARK-42712][PYTHON][DOC] Improve docstring of mapInPandas and mapInArrow - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/08 07:01:41 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40329: [SPARK-42710][CONNECT][PYTHON] Rename FrameMap proto to MapPartitions - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/08 07:02:16 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40331: [SPARK-42713][PYTHON][DOCS] Add '__getattr__' and '__getitem__' of DataFrame and Column to API reference - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/08 07:05:28 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40258: [SPARK-42655][SQL] Incorrect ambiguous column reference error - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/08 07:16:11 UTC, 3 replies.
- [GitHub] [spark] panbingkun commented on pull request #40316: [SPARK-42679][CONNECT][PYTHON] createDataFrame doesn't work with non-nullable schema - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/08 08:28:14 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #40316: [SPARK-42679][CONNECT][PYTHON] createDataFrame doesn't work with non-nullable schema - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/08 08:33:39 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40319: [SPARK-42692][CONNECT] Implement `Dataset.toJSON` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/08 08:49:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40332: [SPARK-42690][CONNECT] Implement CSV/JSON parsing functions for Scala client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/08 08:50:18 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40307: [SPARK-42689][CORE][SHUFFLE]: Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/08 09:06:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #40333: [SPARK-42702][SPARK-42623][SQL] Support parameterized query in subquery and CTE - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/08 09:11:14 UTC, 1 replies.
- [GitHub] [spark] DonnyZone commented on pull request #39865: [SPARK-42052][SQL] Codegen Support for HiveSimpleUDF - posted by "DonnyZone (via GitHub)" <gi...@apache.org> on 2023/03/08 09:15:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40314: [SPARK-42698][CORE] SparkSubmit should pass exitCode to AM side for yarn mode - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/08 09:20:50 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40332: [SPARK-42690][CONNECT] Implement CSV/JSON parsing functions for Scala client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/08 09:29:22 UTC, 18 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40116: [SPARK-41391][SQL] The output column name of groupBy.agg(count_distinct) is incorrect - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/08 09:30:13 UTC, 5 replies.
- [GitHub] [spark] cloud-fan closed pull request #39999: WIP - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/08 09:35:38 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #39691: [SPARK-31561][SQL] Add QUALIFY clause - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/03/08 10:01:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40332: [SPARK-42690][CONNECT] Implement CSV/JSON parsing functions for Scala client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/08 10:29:59 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40330: [SPARK-42712][PYTHON][DOC] Improve docstring of mapInPandas and mapInArrow - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 10:36:55 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40330: [SPARK-42712][PYTHON][DOC] Improve docstring of mapInPandas and mapInArrow - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 10:37:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40330: [SPARK-42712][PYTHON][DOC] Improve docstring of mapInPandas and mapInArrow - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 10:37:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40327: [SPARK-42266][PYTHON] Remove the parent directory in shell.py execution when IPython is used - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 10:38:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40329: [SPARK-42710][CONNECT][PYTHON] Rename FrameMap proto to MapPartitions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 10:43:55 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40332: [SPARK-42690][CONNECT] Implement CSV/JSON parsing functions for Scala client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/08 11:07:30 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40331: [SPARK-42713][PYTHON][DOCS] Add '__getattr__' and '__getitem__' of DataFrame and Column to API reference - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 11:13:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40331: [SPARK-42713][PYTHON][DOCS] Add '__getattr__' and '__getitem__' of DataFrame and Column to API reference - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/08 11:14:06 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #40334: Support key-grouped partitioning without HasPartitionKey - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/03/08 11:20:39 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40335: [SPARK-42717][BUILD] Upgrade mysql-connector-java from 8.0.31 to 8.0.32 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/08 11:25:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40299: [SPARK-42684][SQL] v2 catalog should not allow column default value by default - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/08 12:05:58 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40336: [SPARK-42706][SQL][DOCS] List the error class to user-facing documentation. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/08 12:12:33 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40336: [SPARK-42706][SQL][DOCS] Document the Spark SQL error classes in user-facing documentation. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/08 12:15:12 UTC, 3 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40316: [SPARK-42679][CONNECT][PYTHON] createDataFrame doesn't work with non-nullable schema - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/08 12:41:08 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40337: [SPARK-42718][BUILD] Upgrade rocksdbjni to 7.10.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/08 12:44:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39949: [SPARK-42386][SQL] Rewrite HiveGenericUDF with Invoke - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/08 12:45:08 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40126: [SPARK-40822][SQL] Stable derived column aliases - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/08 12:47:49 UTC, 10 replies.
- [GitHub] [spark] MaicoTimmerman opened a new pull request, #40338: [MINOR][PYTHON] Change TypeVar to private symbols - posted by "MaicoTimmerman (via GitHub)" <gi...@apache.org> on 2023/03/08 12:53:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40332: [SPARK-42690][CONNECT] Implement CSV/JSON parsing functions for Scala client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/08 13:02:35 UTC, 3 replies.
- [GitHub] [spark] jerqi opened a new pull request, #40339: [SPARK-42719][CORE] `MapOutputTracker#getMapLocation` should respect `spark.shuffle.reduceLocality`.enabled` - posted by "jerqi (via GitHub)" <gi...@apache.org> on 2023/03/08 13:12:02 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40190: [SPARK-42597][SQL] Support unwrap date type to timestamp type - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/08 13:30:01 UTC, 3 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40324: [WIP][SPARK-42496][CONNECT][DOCS] Adding Spark Connect to the Spark 3.4 documentation - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/08 14:08:53 UTC, 4 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #40340: [SPARK-42701][SQL] Add the `try_aes_decrypt()` function - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/08 14:14:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40336: [SPARK-42706][SQL][DOCS] Document the Spark SQL error classes in user-facing documentation. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/08 14:24:55 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40329: [SPARK-42710][CONNECT][PYTHON] Rename FrameMap proto to MapPartitions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/08 14:45:26 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40277: [SPARK-42555][CONNECT][FOLLOWUP] Add the new proto msg to support the remaining jdbc API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/08 14:50:28 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #40333: [SPARK-42702][SPARK-42623][SQL] Support parameterized query in subquery and CTE - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/03/08 14:53:36 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40332: [SPARK-42690][CONNECT] Implement CSV/JSON parsing functions for Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/08 14:54:56 UTC, 3 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #40336: [SPARK-42706][SQL][DOCS] Document the Spark SQL error classes in user-facing documentation. - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/03/08 15:12:18 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #40126: [SPARK-40822][SQL] Stable derived column aliases - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/03/08 15:17:54 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40336: [SPARK-42706][SQL][DOCS] Document the Spark SQL error classes in user-facing documentation. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/08 15:56:16 UTC, 0 replies.
- [GitHub] [spark] chong0929 opened a new pull request, #40341: [SPARK-42715][SQL] Tips for Optimizing NegativeArraySizeException - posted by "chong0929 (via GitHub)" <gi...@apache.org> on 2023/03/08 16:28:02 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #40307: [SPARK-42689][CORE][SHUFFLE]: Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/08 17:07:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40307: [SPARK-42689][CORE][SHUFFLE] Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/08 17:16:16 UTC, 7 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #40307: [SPARK-42689][CORE][SHUFFLE] Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/08 17:25:41 UTC, 5 replies.
- [GitHub] [spark] ryan-johnson-databricks commented on a diff in pull request #40300: [SPARK-42683] Automatically rename conflicting metadata columns - posted by "ryan-johnson-databricks (via GitHub)" <gi...@apache.org> on 2023/03/08 17:46:03 UTC, 11 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40328: [SPARK-42709][PYTHON] Remove the assumption of `__file__` being available - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/08 18:00:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40328: [SPARK-42709][PYTHON] Remove the assumption of `__file__` being available - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/08 18:00:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40333: [SPARK-42702][SPARK-42623][SQL] Support parameterized query in subquery and CTE - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/08 18:01:59 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40339: [SPARK-42719][CORE] `MapOutputTracker#getMapLocation` should respect `spark.shuffle.reduceLocality.enabled` - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/08 18:10:01 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40314: [SPARK-42698][CORE] SparkSubmit should pass exitCode to AM side for yarn mode - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/08 18:25:16 UTC, 0 replies.
- [GitHub] [spark] rangadi opened a new pull request, #40342: [SPARK-42721][CONNECT] RPC logging interceptor - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/03/08 19:20:43 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40343: [SPARK-42722][CONNECT][PYTHON] Python Connect def schema() should not cache the schema - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/08 20:24:11 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40343: [SPARK-42722][CONNECT][PYTHON] Python Connect def schema() should not cache the schema - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/08 20:24:16 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40344: [SPARK-42656][CONNECT][Followup] Fix the spark-connect script - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/08 21:20:28 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #40302: [SPARK-42686][CORE] Defer formatting for debug messages in TaskMemoryManager - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/08 21:29:09 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #40344: [SPARK-42656][CONNECT][Followup] Fix the spark-connect script - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/08 21:32:26 UTC, 3 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40344: [SPARK-42656][CONNECT][Followup] Fix the spark-connect script - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/08 21:43:57 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on pull request #40344: [SPARK-42656][CONNECT][Followup] Fix the spark-connect script - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/08 21:44:03 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40345: [SPARK-42723][SQL] Support parser data type json "timestamp_ltz" as TimestampType - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/08 22:10:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/08 22:20:12 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40346: [SPARK-42667][CONNECT][FOLLOW-UP] SparkSession created by newSession should not share the channel - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/08 23:57:58 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40346: [SPARK-42667][CONNECT][FOLLOW-UP] SparkSession created by newSession should not share the channel - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/08 23:59:54 UTC, 3 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38740: [SQL] Add product encoders for local classes - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/09 00:20:50 UTC, 0 replies.
- [GitHub] [spark] sunchao closed pull request #40069: [SPARK-42480][SQL] Improve the performance of drop partitions - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/03/09 00:30:20 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #40069: [SPARK-42480][SQL] Improve the performance of drop partitions - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/03/09 00:30:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40069: [SPARK-42480][SQL] Improve the performance of drop partitions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/09 00:32:46 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40069: [SPARK-42480][SQL] Improve the performance of drop partitions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/09 00:33:27 UTC, 2 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #40069: [SPARK-42480][SQL] Improve the performance of drop partitions - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/03/09 00:36:58 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40346: [SPARK-42667][CONNECT][FOLLOW-UP] SparkSession created by newSession should not share the channel - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/09 00:37:08 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40343: [SPARK-42722][CONNECT][PYTHON] Python Connect def schema() should not cache the schema - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/09 00:39:46 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40346: [SPARK-42667][CONNECT][FOLLOW-UP] SparkSession created by newSession should not share the channel - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/09 00:40:23 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40343: [SPARK-42722][CONNECT][PYTHON] Python Connect def schema() should not cache the schema - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/09 00:41:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40329: [SPARK-42710][CONNECT][PYTHON] Rename FrameMap proto to MapPartitions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/09 00:50:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40344: [SPARK-42656][CONNECT][Followup] Fix the spark-connect script - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/09 01:29:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40344: [SPARK-42656][CONNECT][Followup] Fix the spark-connect script - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/09 01:29:56 UTC, 0 replies.
- [GitHub] [spark] allanf-db commented on a diff in pull request #40324: [WIP][SPARK-42496][CONNECT][DOCS] Adding Spark Connect to the Spark 3.4 documentation - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/03/09 01:35:22 UTC, 6 replies.
- [GitHub] [spark] jerqi commented on pull request #40339: [SPARK-42719][CORE] `MapOutputTracker#getMapLocation` should respect `spark.shuffle.reduceLocality.enabled` - posted by "jerqi (via GitHub)" <gi...@apache.org> on 2023/03/09 01:54:22 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #40307: [SPARK-42689][CORE][SHUFFLE] Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/09 02:11:54 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40307: [SPARK-42689][CORE][SHUFFLE] Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/09 02:12:26 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on pull request #40302: [SPARK-42686][CORE] Defer formatting for debug messages in TaskMemoryManager - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/09 02:23:27 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40313: [SPARK-42697][WEBUI] Fix /api/v1/applications to return total uptime instead of 0 for the duration field - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/09 02:24:46 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #40339: [SPARK-42719][CORE] `MapOutputTracker#getMapLocation` should respect `spark.shuffle.reduceLocality.enabled` - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/09 02:26:50 UTC, 2 replies.
- [GitHub] [spark] jerqi commented on a diff in pull request #40339: [SPARK-42719][CORE] `MapOutputTracker#getMapLocation` should respect `spark.shuffle.reduceLocality.enabled` - posted by "jerqi (via GitHub)" <gi...@apache.org> on 2023/03/09 02:29:00 UTC, 2 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #40313: [SPARK-42697][WEBUI] Fix /api/v1/applications to return total uptime instead of 0 for the duration field - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/09 02:50:06 UTC, 0 replies.
- [GitHub] [spark] liang3zy22 opened a new pull request, #40347: [SPARK-42711][BUILD]Update usage info for sbt tool - posted by "liang3zy22 (via GitHub)" <gi...@apache.org> on 2023/03/09 03:13:22 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40348: [SPARK-42724][CONNECT][BUILD] Upgrade buf to v1.15.1 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/09 03:13:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40238: [SPARK-42633][CONNECT] Make LocalRelation take an actual schema - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/09 03:13:28 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40345: [SPARK-42723][SQL] Support parser data type json "timestamp_ltz" as TimestampType - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/09 03:29:53 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40345: [SPARK-42723][SQL] Support parser data type json "timestamp_ltz" as TimestampType - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/09 03:30:25 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #40314: [SPARK-42698][CORE] SparkSubmit should pass exitCode to AM side for yarn mode - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2023/03/09 04:12:32 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #40313: [SPARK-42697][WEBUI] Fix /api/v1/applications to return total uptime instead of 0 for the duration field - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/09 05:33:56 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #40335: [SPARK-42717][BUILD] Upgrade mysql-connector-java from 8.0.31 to 8.0.32 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/09 05:38:09 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40335: [SPARK-42717][BUILD] Upgrade mysql-connector-java from 8.0.31 to 8.0.32 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/09 05:38:38 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40347: [SPARK-42711][BUILD]Update usage info for sbt tool - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/09 05:46:16 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40300: [SPARK-42683] Automatically rename conflicting metadata columns - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/09 05:56:18 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40340: [SPARK-42701][SQL] Add the `try_aes_decrypt()` function - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/09 06:12:28 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40340: [SPARK-42701][SQL] Add the `try_aes_decrypt()` function - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/09 06:13:17 UTC, 0 replies.
- [GitHub] [spark] liang3zy22 commented on pull request #40347: [SPARK-42711][BUILD]Update usage info for sbt tool - posted by "liang3zy22 (via GitHub)" <gi...@apache.org> on 2023/03/09 06:24:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40349: [SPARK-42725][CONNECT][PYTHON] Make LiteralExpression support array - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/09 06:56:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40332: [SPARK-42690][CONNECT] Implement CSV/JSON parsing functions for Scala client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/09 06:59:47 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40342: [SPARK-42721][CONNECT] RPC logging interceptor - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/09 07:05:20 UTC, 1 replies.
- [GitHub] [spark] mridulm closed pull request #40307: [SPARK-42689][CORE][SHUFFLE] Allow ShuffleDriverComponent to declare if shuffle data is reliably stored - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/09 07:10:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40339: [SPARK-42719][CORE] `MapOutputTracker#getMapLocation` should respect `spark.shuffle.reduceLocality.enabled` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/09 07:11:28 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #40342: [SPARK-42721][CONNECT] RPC logging interceptor - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/03/09 07:15:49 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40326: [SPARK-42708] [Docs] Improve doc about protobuf java file can't be indexed. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/09 07:17:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40341: [SPARK-42715][SQL] Tips for Optimizing NegativeArraySizeException - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/09 07:19:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40339: [SPARK-42719][CORE] `MapOutputTracker#getMapLocation` should respect `spark.shuffle.reduceLocality.enabled` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/09 07:34:25 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40348: [SPARK-42724][CONNECT][BUILD] Upgrade buf to v1.15.1 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/09 07:46:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40348: [SPARK-42724][CONNECT][BUILD] Upgrade buf to v1.15.1 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/09 07:47:18 UTC, 0 replies.
- [GitHub] [spark] sigmod commented on a diff in pull request #40300: [SPARK-42683] Automatically rename conflicting metadata columns - posted by "sigmod (via GitHub)" <gi...@apache.org> on 2023/03/09 07:49:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40342: [SPARK-42721][CONNECT] RPC logging interceptor - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/09 07:54:26 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40349: [SPARK-42725][CONNECT][PYTHON] Make LiteralExpression support array - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/09 08:46:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40329: [SPARK-42710][CONNECT][PYTHON] Rename FrameMap proto to MapPartitions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/09 08:49:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40329: [SPARK-42710][CONNECT][PYTHON] Rename FrameMap proto to MapPartitions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/09 08:50:21 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #40315: [SPARK-42699][CONNECT] SparkConnectServer should make client and AM same exit code - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2023/03/09 08:52:22 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40350: [SPARK-42710][CONNECT][PYTHON] Implement `DataFrame.mapInArrow` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/09 09:04:42 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40301: [SPARK-42685][CORE] Optimize Utils.bytesToString routines - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/09 09:15:51 UTC, 0 replies.
- [GitHub] [spark] huangxiaopingRD opened a new pull request, #40351: [SPARK-42727][CORE] Support executing spark commands in the root directory when local mode is specified - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2023/03/09 09:18:03 UTC, 0 replies.
- [GitHub] [spark] alkis commented on a diff in pull request #40301: [SPARK-42685][CORE] Optimize Utils.bytesToString routines - posted by "alkis (via GitHub)" <gi...@apache.org> on 2023/03/09 10:38:56 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #39949: [SPARK-42386][SQL] Rewrite HiveGenericUDF with Invoke - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/09 10:47:52 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40349: [SPARK-42725][CONNECT][PYTHON] Make LiteralExpression support array params - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/09 10:59:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40260: [SPARK-42630][CONNECT][PYTHON] Introduce UnparsedDataType and delay parsing DDL string until SparkConnectClient is available - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/09 11:14:11 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40349: [SPARK-42725][CONNECT][PYTHON] Make LiteralExpression support array params - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/09 11:14:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40260: [SPARK-42630][CONNECT][PYTHON] Introduce UnparsedDataType and delay parsing DDL string until SparkConnectClient is available - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/09 11:14:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40349: [SPARK-42725][CONNECT][PYTHON] Make LiteralExpression support array params - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/09 11:20:43 UTC, 3 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #39691: [SPARK-31561][SQL] Add QUALIFY clause - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/09 11:35:29 UTC, 3 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40352: [SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/09 11:41:25 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #40353: [SPARK-42732][PYSPARK][CONNECT] Support spark connect session getActiveSession method - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/09 11:58:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39624: [SPARK-42101][SQL] Make AQE support InMemoryTableScanExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/09 11:59:35 UTC, 24 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #40353: [SPARK-42732][PYSPARK][CONNECT] Support spark connect session getActiveSession method - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/09 12:08:19 UTC, 3 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #40326: [SPARK-42708] [Docs] Improve doc about protobuf java file can't be indexed. - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/03/09 12:14:06 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39624: [SPARK-42101][SQL] Make AQE support InMemoryTableScanExec - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/09 12:20:06 UTC, 15 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39990: [SPARK-42415][SQL] The built-in dialects support OFFSET and paging query. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/09 12:26:07 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40353: [SPARK-42732][PYSPARK][CONNECT] Support spark connect session getActiveSession method - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/09 12:33:07 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40353: [SPARK-42732][PYSPARK][CONNECT] Support spark connect session getActiveSession method - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/09 12:41:47 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40352: [SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/09 13:19:40 UTC, 3 replies.
- [GitHub] [spark] peter-toth commented on pull request #40266: [SPARK-42660][SQL] Infer filters for Join produced by IN and EXISTS clause (RewritePredicateSubquery rule) - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/09 14:19:46 UTC, 3 replies.
- [GitHub] [spark] ebarault commented on pull request #37153: [SPARK-26052] Add type comments to exposed Prometheus metrics - posted by "ebarault (via GitHub)" <gi...@apache.org> on 2023/03/09 14:31:15 UTC, 0 replies.
- [GitHub] [spark] yeachan153 commented on pull request #36434: [SPARK-38969][K8S] Fix Decom reporting - posted by "yeachan153 (via GitHub)" <gi...@apache.org> on 2023/03/09 14:43:24 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #40336: [SPARK-42706][SQL][DOCS] Document the Spark SQL error classes in user-facing documentation. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/09 14:54:46 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40236: [SPARK-38735][SQL][TESTS] Add tests for the error class: INTERNAL_ERROR - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/09 15:11:45 UTC, 3 replies.
- [GitHub] [spark] tomvanbussel opened a new pull request, #40354: [SPARK-42735][CONNECT][SCALA] Allow passing extra confs to RemoteSparkSession - posted by "tomvanbussel (via GitHub)" <gi...@apache.org> on 2023/03/09 15:13:30 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40355: [SPARK-42604][CONNECT] Implement functions.typedlit - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/09 15:19:10 UTC, 0 replies.
- [GitHub] [spark] navinvishy commented on pull request #38947: [SPARK-41233][SQL][PYTHON] Add `array_prepend` function - posted by "navinvishy (via GitHub)" <gi...@apache.org> on 2023/03/09 15:50:20 UTC, 2 replies.
- [GitHub] [spark] zzzzming95 commented on a diff in pull request #40341: [SPARK-42715][SQL] Tips for Optimizing NegativeArraySizeException - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/03/09 16:00:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40355: [SPARK-42604][CONNECT] Implement functions.typedlit - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/09 16:31:58 UTC, 26 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40355: [SPARK-42604][CONNECT] Implement functions.typedlit - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/09 16:37:49 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #40342: [SPARK-42721][CONNECT] RPC logging interceptor - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/03/09 16:40:14 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40308: [SPARK-42151][SQL] Align UPDATE assignments with table attributes - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/09 17:08:31 UTC, 0 replies.
- [GitHub] [spark] the8thC commented on pull request #40236: [SPARK-38735][SQL][TESTS] Add tests for the error class: INTERNAL_ERROR - posted by "the8thC (via GitHub)" <gi...@apache.org> on 2023/03/09 17:30:16 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #40339: [SPARK-42719][CORE] `MapOutputTracker#getMapLocation` should respect `spark.shuffle.reduceLocality.enabled` - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/09 17:50:15 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #36434: [SPARK-38969][K8S] Fix Decom reporting - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/03/09 17:51:29 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40354: [SPARK-42735][CONNECT][SCALA] Allow passing extra confs to RemoteSparkSession - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/09 18:34:53 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40352: [WIP][SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/09 18:36:53 UTC, 4 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40353: [SPARK-42732][PYSPARK][CONNECT] Support spark connect session getActiveSession method - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/09 18:54:00 UTC, 0 replies.
- [GitHub] [spark] gpiotti commented on pull request #28946: [SPARK-32123][PYSPARK] Setting `spark.sql.session.timeZone` only partially respected - posted by "gpiotti (via GitHub)" <gi...@apache.org> on 2023/03/09 19:11:01 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40236: [SPARK-38735][SQL][TESTS] Add tests for the error class: INTERNAL_ERROR - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/09 19:35:27 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40356: [SPARK-42733][CONNECT][PYTHON] Fix DataFrameWriter.save to work without path parameter - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/09 20:35:04 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40356: [SPARK-42733][CONNECT][PYTHON] Fix DataFrameWriter.save to work without path parameter - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/09 22:17:35 UTC, 2 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40356: [SPARK-42733][CONNECT][PYTHON] Fix DataFrameWriter.save to work without path parameter - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/09 22:29:11 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40354: [SPARK-42735][CONNECT][SCALA] Allow passing extra confs to RemoteSparkSession - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/09 23:22:47 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #40308: [SPARK-42151][SQL] Align UPDATE assignments with table attributes - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/03/09 23:49:05 UTC, 0 replies.
- [GitHub] [spark] ryan-johnson-databricks commented on pull request #40300: [SPARK-42683] Automatically rename conflicting metadata columns - posted by "ryan-johnson-databricks (via GitHub)" <gi...@apache.org> on 2023/03/09 23:52:10 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40315: [SPARK-42699][CONNECT] SparkConnectServer should make client and AM same exit code - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/10 00:05:51 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38739: [SPARK-41207][SQL] Fix BinaryArithmetic with negative scale - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/10 00:20:51 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40357: [SPARK-42739][BUILD] Ensure release tag to be pushed to release branch - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/10 00:41:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40356: [SPARK-42733][CONNECT][PYTHON] Fix DataFrameWriter.save to work without path parameter - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/10 00:51:44 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40356: [SPARK-42733][CONNECT][PYTHON] Fix DataFrameWriter.save to work without path parameter - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/10 00:52:16 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #40357: [WIP][SPARK-42739][BUILD] Ensure release tag to be pushed to release branch - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/10 01:01:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40350: [SPARK-42726][CONNECT][PYTHON] Implement `DataFrame.mapInArrow` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/10 01:08:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40350: [SPARK-42726][CONNECT][PYTHON] Implement `DataFrame.mapInArrow` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/10 01:08:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40302: [SPARK-42686][CORE] Defer formatting for debug messages in TaskMemoryManager - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/10 01:13:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40324: [WIP][SPARK-42496][CONNECT][DOCS] Adding Spark Connect to the Spark 3.4 documentation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/10 01:23:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40333: [SPARK-42702][SPARK-42623][SQL] Support parameterized query in subquery and CTE - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/10 01:30:50 UTC, 3 replies.
- [GitHub] [spark] cloud-fan closed pull request #40333: [SPARK-42702][SPARK-42623][SQL] Support parameterized query in subquery and CTE - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/10 01:31:34 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40333: [SPARK-42702][SPARK-42623][SQL] Support parameterized query in subquery and CTE - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/10 01:48:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40321: [SPARK-42704] SubqueryAlias propagates metadata columns that child outputs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/10 02:01:36 UTC, 3 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40357: [WIP][SPARK-42739][BUILD] Ensure release tag to be pushed to release branch - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/10 02:12:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40349: [SPARK-42725][CONNECT][PYTHON] Make LiteralExpression support array params - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/10 02:18:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40349: [SPARK-42725][CONNECT][PYTHON] Make LiteralExpression support array params - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/10 02:19:29 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40355: [SPARK-42604][CONNECT] Implement functions.typedlit - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/10 02:37:24 UTC, 14 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40352: [WIP][SPARK-42664][CONNECT] Support `bloomFilter` function for `DataFrameStatFunctions` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/10 02:43:05 UTC, 8 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40357: [SPARK-42739][BUILD] Ensure release tag to be pushed to release branch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/10 02:45:05 UTC, 0 replies.
- [GitHub] [spark] chong0929 commented on a diff in pull request #40341: [SPARK-42715][SQL] Tips for Optimizing NegativeArraySizeException - posted by "chong0929 (via GitHub)" <gi...@apache.org> on 2023/03/10 02:51:51 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #40357: [SPARK-42739][BUILD] Ensure release tag to be pushed to release branch - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/10 02:59:52 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #40357: [SPARK-42739][BUILD] Ensure release tag to be pushed to release branch - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/10 03:04:53 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40357: [SPARK-42739][BUILD] Ensure release tag to be pushed to release branch - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/10 03:05:58 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40350: [SPARK-42726][CONNECT][PYTHON] Implement `DataFrame.mapInArrow` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/10 03:16:06 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40266: [SPARK-42660][SQL] Infer filters for Join produced by IN and EXISTS clause (RewritePredicateSubquery rule) - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/10 03:21:27 UTC, 0 replies.
- [GitHub] [spark] ryan-johnson-databricks commented on a diff in pull request #40321: [SPARK-42704] SubqueryAlias propagates metadata columns that child outputs - posted by "ryan-johnson-databricks (via GitHub)" <gi...@apache.org> on 2023/03/10 04:20:49 UTC, 3 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40358: [SPARK-42733][CONNECT][Followup] Write without path or table - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/10 04:23:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40357: [SPARK-42739][BUILD] Ensure release tag to be pushed to release branch - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/10 04:48:17 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #40276: [SPARK-42630][CONNECT][PYTHON] Implement data type string parser - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/10 05:13:49 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #40276: [SPARK-42630][CONNECT][PYTHON] Implement data type string parser - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/10 05:13:50 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40358: [SPARK-42733][CONNECT][Followup] Write without path or table - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/10 05:39:32 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40359: [SPARK-42740][SQL] Fix the bug that pushdown offset or paging is invalid for some built-in dialect - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/10 05:55:31 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40360: [SPARK-42741][SQL] Do not unwrap casts in binary comparison when literal is null - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/10 06:20:13 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #40314: [SPARK-42698][CORE] SparkSubmit should also stop SparkContext when exit program in yarn mode and pass exitCode to AM side - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2023/03/10 06:33:58 UTC, 3 replies.
- [GitHub] [spark] beliefer commented on pull request #40359: [SPARK-42740][SQL] Fix the bug that pushdown offset or paging is invalid for some built-in dialect - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/10 06:34:07 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40360: [SPARK-42741][SQL] Do not unwrap casts in binary comparison when literal is null - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/10 06:42:03 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40359: [SPARK-42740][SQL] Fix the bug that pushdown offset or paging is invalid for some built-in dialect - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/10 06:47:29 UTC, 7 replies.
- [GitHub] [spark] thousandhu opened a new pull request, #40361: [SPARK_42742]access apiserver by pod env - posted by "thousandhu (via GitHub)" <gi...@apache.org> on 2023/03/10 06:56:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40314: [SPARK-42698][CORE] SparkSubmit should also stop SparkContext when exit program in yarn mode and pass exitCode to AM side - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/10 07:19:44 UTC, 2 replies.
- [GitHub] [spark] thousandhu commented on pull request #40361: [SPARK_42742]access apiserver by pod env - posted by "thousandhu (via GitHub)" <gi...@apache.org> on 2023/03/10 07:21:29 UTC, 1 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40362: [PARK-42743][SQL] Support analyze TimestampNTZ columns - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/10 08:08:06 UTC, 0 replies.
- [GitHub] [spark] thousandhu opened a new pull request, #40363: [SPARK_42744] delete uploaded file when job finish for k8s - posted by "thousandhu (via GitHub)" <gi...@apache.org> on 2023/03/10 09:19:10 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40359: [SPARK-42740][SQL] Fix the bug that pushdown offset or paging is invalid for some built-in dialect - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/10 09:43:22 UTC, 5 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #40364: [SPARK-42745][SQL] Improved AliasAwareOutputExpression works with DSv2 - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/10 09:53:54 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #40364: [SPARK-42745][SQL] Improved AliasAwareOutputExpression works with DSv2 - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/10 10:00:18 UTC, 2 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #40364: [SPARK-42745][SQL] Improved AliasAwareOutputExpression works with DSv2 - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/10 10:13:44 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40364: [SPARK-42745][SQL] Improved AliasAwareOutputExpression works with DSv2 - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/10 10:24:28 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40362: [SPARK-42743][SQL] Support analyze TimestampNTZ columns - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/10 11:33:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40362: [SPARK-42743][SQL] Support analyze TimestampNTZ columns - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/10 11:34:21 UTC, 0 replies.
- [GitHub] [spark] kenny-ddd opened a new pull request, #40365: [MINOR] Fix typo of LimitPushDownThroughWindow - posted by "kenny-ddd (via GitHub)" <gi...@apache.org> on 2023/03/10 12:07:48 UTC, 1 replies.
- [GitHub] [spark] kenny-ddd commented on pull request #40365: [MINOR] Fix typo of LimitPushDownThroughWindow - posted by "kenny-ddd (via GitHub)" <gi...@apache.org> on 2023/03/10 12:10:18 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40364: [SPARK-42745][SQL] Improved AliasAwareOutputExpression works with DSv2 - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/10 12:27:06 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40365: [MINOR] Fix typo of LimitPushDownThroughWindow - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/10 12:40:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40364: [SPARK-42745][SQL] Improved AliasAwareOutputExpression works with DSv2 - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/10 12:58:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40364: [SPARK-42745][SQL] Improved AliasAwareOutputExpression works with DSv2 - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/10 12:58:53 UTC, 0 replies.
- [GitHub] [spark] kenny-ddd closed pull request #40365: [MINOR] Fix typo of LimitPushDownThroughWindow - posted by "kenny-ddd (via GitHub)" <gi...@apache.org> on 2023/03/10 13:03:28 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40366: [SPARK-42691][CONNECT] Implement Dataset.semanticHash - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/10 13:06:28 UTC, 0 replies.
- [GitHub] [spark] kenny-ddd commented on pull request #40365: [MINOR][SQL] Fix typo of LimitPushDownThroughWindow - posted by "kenny-ddd (via GitHub)" <gi...@apache.org> on 2023/03/10 13:13:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40367: [SPARK-42747][ML] Fix incorrect internal status of LoR and AFT - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/10 13:28:42 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40367: [SPARK-42747][ML] Fix incorrect internal status of LoR and AFT - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/10 13:35:36 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40367: [SPARK-42747][ML] Fix incorrect internal status of LoR and AFT - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/10 13:36:40 UTC, 0 replies.
- [GitHub] [spark] LucaCanali closed pull request #39757: [SPARK-41585][YARN][DOC] Improve doc of the excludeNodes configuration by clarifying the dependency with dynamic allocation - posted by "LucaCanali (via GitHub)" <gi...@apache.org> on 2023/03/10 13:55:28 UTC, 0 replies.
- [GitHub] [spark] LucaCanali commented on pull request #39757: [SPARK-41585][YARN][DOC] Improve doc of the excludeNodes configuration by clarifying the dependency with dynamic allocation - posted by "LucaCanali (via GitHub)" <gi...@apache.org> on 2023/03/10 13:55:28 UTC, 0 replies.
- [GitHub] [spark] vicennial opened a new pull request, #40368: [SPARK-42748] Server-side Artifact Management - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/03/10 14:35:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #40369: [SPARK-42398][SQL][FOLLOWUP] DelegatingCatalogExtension should override the new createTable method - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/10 15:18:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40369: [SPARK-42398][SQL][FOLLOWUP] DelegatingCatalogExtension should override the new createTable method - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/10 15:18:59 UTC, 0 replies.
- [GitHub] [spark] dzhigimont opened a new pull request, #40370: [SPARK-42620][PS] Add `inclusive` parameter for (DataFrame|Series).between_time - posted by "dzhigimont (via GitHub)" <gi...@apache.org> on 2023/03/10 15:44:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #40371: Revert "[SPARK-41498] Propagate metadata through Union" - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/10 16:08:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40371: Revert "[SPARK-41498] Propagate metadata through Union" - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/10 16:09:41 UTC, 1 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #40278: [SPARK-42670][BUILD] Upgrade maven-surefire-plugin to 3.0.0-M9 & eliminate build warnings - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/10 17:09:57 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40301: [SPARK-42685][CORE] Optimize Utils.bytesToString routines - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/10 17:27:51 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40301: [SPARK-42685][CORE] Optimize Utils.bytesToString routines - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/10 17:27:57 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #40347: [SPARK-42711][BUILD]Update usage info for sbt tool - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/10 17:38:24 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40362: [SPARK-42743][SQL] Support analyze TimestampNTZ columns - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/10 17:42:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40314: [SPARK-42698][CORE] SparkSubmit should also stop SparkContext when exit program in yarn mode and pass exitCode to AM side - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/10 17:45:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40371: Revert "[SPARK-41498] Propagate metadata through Union" - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/10 18:30:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40371: Revert "[SPARK-41498] Propagate metadata through Union" - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/10 18:30:59 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40358: [SPARK-42733][CONNECT][Followup] Write without path or table - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/10 18:38:09 UTC, 1 replies.
- [GitHub] [spark] gerashegalov opened a new pull request, #40372: [SPARK-42752][PYSPARK][SQL] Make PySpark exceptions printable during initialization - posted by "gerashegalov (via GitHub)" <gi...@apache.org> on 2023/03/10 20:39:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40369: [SPARK-42398][SQL][FOLLOWUP] DelegatingCatalogExtension should override the new createTable method - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/10 21:00:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40369: [SPARK-42398][SQL][FOLLOWUP] DelegatingCatalogExtension should override the new createTable method - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/10 21:00:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40337: [SPARK-42718][BUILD] Upgrade rocksdbjni to 7.10.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/10 21:02:36 UTC, 0 replies.
- [GitHub] [spark] rangadi opened a new pull request, #40373: [Draft] Streaming Spark Connect POC - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/03/10 21:40:13 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #40373: [Draft] Streaming Spark Connect POC - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/03/10 22:00:40 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40346: [SPARK-42667][CONNECT][FOLLOW-UP] SparkSession created by newSession should not share the channel - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/10 23:37:50 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40346: [SPARK-42667][CONNECT][FOLLOW-UP] SparkSession created by newSession should not share the channel - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/10 23:39:03 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40342: [SPARK-42721][CONNECT] RPC logging interceptor - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/10 23:52:05 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38739: [SPARK-41207][SQL] Fix BinaryArithmetic with negative scale - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/11 00:18:06 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38728: [SPARK-41204] [CONNECT] Migrate custom exceptions to use Spark exceptions - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/11 00:18:07 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38574: [SPARK-41060][K8S] Fix generating driver and executor Config Maps - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/11 00:18:09 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #40316: [SPARK-42679][CONNECT][PYTHON] createDataFrame doesn't work with non-nullable schema - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/11 02:05:36 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40374: [SPARK-42721][CONNECT][FOLLOWUP] Apply scalafmt to LoggingInterceptor - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/11 03:21:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40342: [SPARK-42721][CONNECT] RPC logging interceptor - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/11 03:23:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40374: [SPARK-42721][CONNECT][FOLLOWUP] Apply scalafmt to LoggingInterceptor - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/11 03:24:21 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40360: [SPARK-42741][SQL] Do not unwrap casts in binary comparison when literal is null - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/11 03:29:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40363: [SPARK_42744] delete uploaded file when job finish for k8s - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/11 03:30:58 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #40374: [SPARK-42721][CONNECT][FOLLOWUP] Apply scalafmt to LoggingInterceptor - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/11 03:37:34 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40374: [SPARK-42721][CONNECT][FOLLOWUP] Apply scalafmt to LoggingInterceptor - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/11 03:43:04 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40355: [SPARK-42604][CONNECT] Implement functions.typedlit - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/11 04:22:48 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40360: [SPARK-42741][SQL] Do not unwrap casts in binary comparison when literal is null - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/11 06:20:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40375: [SPARK-42755][CONNECT] Factor literal value conversion out to `connect-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/11 07:49:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40376: [SPARK-42756][CONNECT][PYTHON] Helper function to convert proto literal to value in Python Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/11 08:58:47 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40376: [SPARK-42756][CONNECT][PYTHON] Helper function to convert proto literal to value in Python Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/11 09:00:16 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40366: [SPARK-42691][CONNECT][PYTHON] Implement Dataset.semanticHash - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/11 09:04:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40366: [SPARK-42691][CONNECT][PYTHON] Implement Dataset.semanticHash - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/11 09:04:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40375: [SPARK-42755][CONNECT] Factor literal value conversion out to `connect-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/11 09:09:25 UTC, 1 replies.
- [GitHub] [spark] ivoson commented on pull request #40286: [SPARK-42577][CORE] Add max attempts limitation for stages to avoid potential infinite retry - posted by "ivoson (via GitHub)" <gi...@apache.org> on 2023/03/11 10:18:28 UTC, 3 replies.
- [GitHub] [spark] dzhigimont commented on pull request #40370: [SPARK-42620][PS] Add `inclusive` parameter for (DataFrame|Series).between_time - posted by "dzhigimont (via GitHub)" <gi...@apache.org> on 2023/03/11 10:24:02 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #40269: [WIP][DOC] Updating the Style for the Spark Docs based on the Webpage - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/11 10:42:19 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40269: [WIP][DOC] Updating the Style for the Spark Docs based on the Webpage - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/11 10:45:31 UTC, 3 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40377: [SPARK-42757][CONNECT] Implement textFile for DataFrameReader - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/11 11:14:46 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40349: [SPARK-42725][CONNECT][PYTHON] Make LiteralExpression support array params - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/11 11:57:32 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #40377: [SPARK-42757][CONNECT] Implement textFile for DataFrameReader - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/11 11:59:37 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40366: [SPARK-42691][CONNECT][PYTHON] Implement Dataset.semanticHash - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/11 12:00:05 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40375: [SPARK-42755][CONNECT] Factor literal value conversion out to `connect-common` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/11 12:12:42 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40375: [SPARK-42755][CONNECT] Factor literal value conversion out to `connect-common` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/11 12:18:54 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40378: [WIP][SPARK-42758][BUILD][MLLIB] Remove dependency on breeze - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/11 13:13:40 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40278: [SPARK-42670][BUILD] Upgrade maven-surefire-plugin to 3.0.0-M9 & eliminate build warnings - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/11 13:48:39 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40278: [SPARK-42670][BUILD] Upgrade maven-surefire-plugin to 3.0.0-M9 & eliminate build warnings - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/11 13:48:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40379: [SPARK-42759][BUILD] Avoid repeated downloads of maven tar ball when the target directory already exists - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/11 14:42:45 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40367: [SPARK-42747][ML] Fix incorrect internal status of LoR and AFT - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/11 14:46:03 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40367: [SPARK-42747][ML] Fix incorrect internal status of LoR and AFT - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/11 14:46:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40379: [SPARK-42759][BUILD] Avoid repeated install `build/apache-maven` when target already exists - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/11 15:14:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #40379: [SPARK-42759][BUILD] Avoid repeated install `build/apache-maven` when target already exists - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/11 15:15:00 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40114: [SPARK-42513][SQL] Push down topK through join - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/11 15:51:51 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #40374: [SPARK-42721][CONNECT][FOLLOWUP] Apply scalafmt to LoggingInterceptor - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/03/11 16:58:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40379: [SPARK-42759][BUILD] Avoid duplicated `build/apache-maven` install when target already exists - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/11 17:43:21 UTC, 0 replies.
- [GitHub] [spark] 1511351836 opened a new pull request, #40380: [SPARK-42760][DOCS][PYTHON] provide one format for writing to kafka - posted by "1511351836 (via GitHub)" <gi...@apache.org> on 2023/03/11 19:39:56 UTC, 0 replies.
- [GitHub] [spark] 1511351836 closed pull request #40380: [SPARK-42760][DOCS][PYTHON] provide one format for writing to kafka - posted by "1511351836 (via GitHub)" <gi...@apache.org> on 2023/03/11 19:43:57 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #40381: [SPARK-42761] Upgrade `fabric8:kubernetes-client` to 6.5.0 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/11 19:50:07 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40381: [SPARK-42761] Upgrade `fabric8:kubernetes-client` to 6.5.0 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/11 19:53:27 UTC, 1 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40382: [SPARK-42679][CONNECT][PYTHON] createDataFrame doesn't work with non-nullable schema - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/11 20:16:07 UTC, 0 replies.
- [GitHub] [spark] holdenk opened a new pull request, #40383: [SPARK-42762][K8S][MINOR] Improve logging in K8s on disconnect when using statefulsets. - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/03/11 21:13:41 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38728: [SPARK-41204] [CONNECT] Migrate custom exceptions to use Spark exceptions - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/12 00:21:06 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38674: [SPARK-41160][YARN] Fix error when submitting a task to the yarn that enabled the timeline service - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/12 00:21:07 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38574: [SPARK-41060][K8S] Fix generating driver and executor Config Maps - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/12 00:21:09 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40378: [SPARK-42758][BUILD][MLLIB] Remove dependency on breeze - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/12 00:35:36 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40378: [SPARK-42758][BUILD][MLLIB] Remove dependency on breeze - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/12 00:35:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40381: [SPARK-42761][BUILD] Upgrade `fabric8:kubernetes-client` to 6.5.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 01:03:24 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40383: [SPARK-42762][K8S][MINOR] Improve logging in K8s on disconnect when using statefulsets. - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 01:06:31 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40383: [SPARK-42762][K8S] Improve logging in K8s on disconnect when using StatefulSets - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 01:11:57 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #40383: [SPARK-42762][K8S] Improve logging in K8s on disconnect when using StatefulSets - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/03/12 01:21:52 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40383: [SPARK-42762][K8S] Improve logging in K8s on disconnect when using StatefulSets - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 01:36:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40383: [SPARK-42762][K8S] Improve logging in K8s on disconnect when using StatefulSets - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 01:43:09 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40384: [SPARK-42763][BUILD] Upgrade ZooKeeper from 3.6.3 to 3.6.4 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/12 01:46:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40384: [SPARK-42763][BUILD] Upgrade ZooKeeper from 3.6.3 to 3.6.4 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 01:49:57 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40384: [SPARK-42763][BUILD] Upgrade ZooKeeper from 3.6.3 to 3.6.4 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 01:49:58 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #40384: [SPARK-42763][BUILD] Upgrade ZooKeeper from 3.6.3 to 3.6.4 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/12 01:50:33 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40382: [SPARK-42679][CONNECT][PYTHON] createDataFrame doesn't work with non-nullable schema - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/12 03:01:58 UTC, 2 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40382: [SPARK-42679][CONNECT][PYTHON] createDataFrame doesn't work with non-nullable schema - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/12 03:43:10 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40381: [SPARK-42761][BUILD] Upgrade `fabric8:kubernetes-client` to 6.5.0 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/12 09:45:02 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39239: [SPARK-41730][PYTHON] Set tz to UTC while converting of timestamps to python's datetime - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/12 10:11:37 UTC, 1 replies.
- [GitHub] [spark] wangyum closed pull request #40365: [MINOR][SQL] Fix incorrect comment in LimitPushDownThroughWindow - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/12 14:08:25 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40365: [MINOR][SQL] Fix incorrect comment in LimitPushDownThroughWindow - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/12 14:13:36 UTC, 1 replies.
- [GitHub] [spark] StevenChenDatabricks opened a new pull request, #40385: [SPARK-42753] ReusedExchange refers to non-existent nodes - posted by "StevenChenDatabricks (via GitHub)" <gi...@apache.org> on 2023/03/12 19:27:00 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40365: [MINOR][SQL] Fix incorrect comment in LimitPushDownThroughWindow - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 20:20:32 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40386: [MINOR][SQL][FOLLOWUP] Fix scalastyle in LimitPushDownThroughWindow - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 20:23:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40386: [MINOR][SQL][FOLLOWUP] Fix scalastyle in LimitPushDownThroughWindow - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 20:24:35 UTC, 7 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40365: [MINOR][SQL] Fix incorrect comment in LimitPushDownThroughWindow - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 20:38:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40381: [SPARK-42761][BUILD][K8S] Upgrade `kubernetes-client` to 6.5.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 20:58:54 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40380: [SPARK-42760][DOCS][PYTHON] provide one format for writing to kafka - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/12 21:17:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40386: [MINOR][SQL][FOLLOWUP] Fix scalastyle in LimitPushDownThroughWindow - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/12 21:21:21 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40386: [MINOR][SQL][FOLLOWUP] Fix scalastyle in LimitPushDownThroughWindow - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/12 21:28:02 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on pull request #40286: [SPARK-42577][CORE] Add max attempts limitation for stages to avoid potential infinite retry - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/12 23:21:13 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38887: [SPARK-41368][SQL] Reorder the window partition expressions by expression stats - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/13 00:19:55 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38674: [SPARK-41160][YARN] Fix error when submitting a task to the yarn that enabled the timeline service - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/13 00:19:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40375: [SPARK-42755][CONNECT] Factor literal value conversion out to `connect-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/13 00:49:35 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40387: [SPARK-42764][K8S] Parameterize the max number of attempts for driver props fetcher in KubernetesExecutorBackend - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/13 00:54:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40387: [SPARK-42764][K8S] Parameterize the max number of attempts for driver props fetcher in KubernetesExecutorBackend - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/13 01:04:17 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40387: [SPARK-42764][K8S] Parameterize the max number of attempts for driver props fetcher in KubernetesExecutorBackend - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/13 01:05:58 UTC, 1 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40308: [SPARK-42151][SQL] Align UPDATE assignments with table attributes - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/13 01:30:35 UTC, 1 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40387: [SPARK-42764][K8S] Parameterize the max number of attempts for driver props fetcher in KubernetesExecutorBackend - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/13 01:33:10 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40376: [SPARK-42756][CONNECT][PYTHON] Helper function to convert proto literal to value in Python Client - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/13 01:34:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40376: [SPARK-42756][CONNECT][PYTHON] Helper function to convert proto literal to value in Python Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/13 01:43:05 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38947: [SPARK-41233][SQL][PYTHON] Add `array_prepend` function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/13 01:49:46 UTC, 3 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40388: [SPARK-42765][CONNECT][PYTHON] Regulate the import path of `pandas_udf` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/13 02:19:59 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40376: [SPARK-42756][CONNECT][PYTHON] Helper function to convert proto literal to value in Python Client - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/13 02:36:27 UTC, 0 replies.
- [GitHub] [spark] liang3zy22 commented on a diff in pull request #40347: [SPARK-42711][BUILD]Update usage info for sbt tool - posted by "liang3zy22 (via GitHub)" <gi...@apache.org> on 2023/03/13 02:43:25 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39624: [SPARK-42101][SQL] Make AQE support InMemoryTableScanExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/13 03:09:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39624: [SPARK-42101][SQL] Make AQE support InMemoryTableScanExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/13 03:09:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40382: [SPARK-42679][CONNECT][PYTHON] createDataFrame doesn't work with non-nullable schema - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/13 03:11:19 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40359: [SPARK-42740][SQL] Fix the bug that pushdown offset or paging is invalid for some built-in dialect - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/13 03:12:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40382: [SPARK-42679][CONNECT][PYTHON] createDataFrame doesn't work with non-nullable schema - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/13 03:12:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40359: [SPARK-42740][SQL] Fix the bug that pushdown offset or paging is invalid for some built-in dialect - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/13 03:12:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40388: [SPARK-42765][CONNECT][PYTHON] Regulate the import path of `pandas_udf` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/13 03:33:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40372: [SPARK-42752][PYSPARK][SQL] Make PySpark exceptions printable during initialization - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/13 03:34:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40389: [SPARK-42767][CONNECT][TESTS] Add check condition to start connect server fallback with `in-memory` and auto ignored some tests strongly depend on hive - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/13 04:08:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40389: [SPARK-42767][CONNECT][TESTS] Add check condition to start connect server fallback with `in-memory` and auto ignored some tests strongly depend on hive - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/13 04:09:47 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40390: [SPARK-42768][SQL] Enable cached plan apply AQE by default - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/13 04:30:42 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #39950: [SPARK-42388][SQL] Avoid parquet footer reads twice when no filters in vectorized reader - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/03/13 04:56:27 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39037: [SPARK-41214][SQL] Fix AQE cache does not update plan and metrics - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/13 05:00:54 UTC, 0 replies.
- [GitHub] [spark] ulysses-you closed pull request #39037: [SPARK-41214][SQL] Fix AQE cache does not update plan and metrics - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/13 05:00:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40262: [SPARK-42651][SQL] Optimize global sort to driver sort - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/13 05:10:49 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38947: [SPARK-41233][SQL][PYTHON] Add `array_prepend` function - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/13 05:12:09 UTC, 4 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40390: [SPARK-42768][SQL] Enable cached plan apply AQE by default - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/13 05:24:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40388: [SPARK-42765][CONNECT][PYTHON] Regulate the import path of `pandas_udf` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/13 05:27:10 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40390: [SPARK-42768][SQL] Enable cached plan apply AQE by default - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/13 05:29:21 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #40262: [SPARK-42651][SQL] Optimize global sort to driver sort - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/13 05:41:01 UTC, 1 replies.
- [GitHub] [spark] wangshengjie123 opened a new pull request, #40391: [WIP][SPARK-42766][YARN] YarnAllocator filter excluded nodes when launching containers - posted by "wangshengjie123 (via GitHub)" <gi...@apache.org> on 2023/03/13 06:19:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40387: [SPARK-42764][K8S] Parameterize the max number of attempts for driver props fetcher in KubernetesExecutorBackend - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/13 06:19:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40392: [SPARK-42769][K8S] Add `ENV_DRIVER_POD_IP` env variable to executor pods - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/13 06:23:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40375: [SPARK-42755][CONNECT] Factor literal value conversion out to `connect-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/13 06:29:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40376: [SPARK-42756][CONNECT][PYTHON] Helper function to convert proto literal to value in Python Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/13 06:31:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40386: [MINOR][SQL][FOLLOWUP] Fix scalastyle in LimitPushDownThroughWindow - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/13 06:32:19 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #33403: [SPARK-36193][CORE] Recover SparkSubmit.runMain not to stop SparkContext in non-K8s env - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2023/03/13 06:40:10 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #40286: [SPARK-42577][CORE] Add max attempts limitation for stages to avoid potential infinite retry - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/13 07:23:01 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40372: [SPARK-42752][PYSPARK][SQL] Make PySpark exceptions printable during initialization - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/13 07:46:51 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40263: [SPARK-42659][ML] Reimplement `FPGrowthModel.transform` with dataframe operations - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/13 07:54:39 UTC, 4 replies.
- [GitHub] [spark] wangshengjie123 commented on pull request #40391: [WIP][SPARK-42766][YARN] YarnAllocator filter excluded nodes when launching containers - posted by "wangshengjie123 (via GitHub)" <gi...@apache.org> on 2023/03/13 08:08:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40355: [SPARK-42604][CONNECT] Implement functions.typedlit - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/13 08:12:44 UTC, 2 replies.
- [GitHub] [spark] Stove-hust opened a new pull request, #40393: []SPARK-40082] - posted by "Stove-hust (via GitHub)" <gi...@apache.org> on 2023/03/13 08:25:23 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40394: [SPARK-42771][SQL] Refactor HiveGenericUDF - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/13 08:42:36 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #40394: [SPARK-42771][SQL] Refactor HiveGenericUDF - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/13 08:47:37 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39952: [SPARK-40770][PYTHON][FOLLOW-UP] Improved error messages for mapInPandas for schema mismatch - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/03/13 08:56:15 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40392: [SPARK-42769][K8S] Add `ENV_DRIVER_POD_IP` env variable to executor pods - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/13 08:57:34 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #40388: [SPARK-42765][CONNECT][PYTHON] Regulate the import path of `pandas_udf` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/13 09:21:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40395: [SPARK-42770] WIP - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/13 09:37:33 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #40392: [SPARK-42769][K8S] Add `SPARK_DRIVER_POD_IP` env variable to executor pods - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/13 09:43:54 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40395: [SPARK-42770][CONNECT] Add `truncatedTo(ChronoUnit.MICROS)` to make `SQLImplicitsTestSuite` test pass on Linux - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/13 10:13:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40392: [SPARK-42769][K8S] Add `SPARK_DRIVER_POD_IP` env variable to executor pods - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/13 10:20:12 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40395: [SPARK-42770][CONNECT] Add `truncatedTo(ChronoUnit.MICROS)` to make `SQLImplicitsTestSuite` in Java 17 daily test GA task pass - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/13 10:51:47 UTC, 4 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40396: [SPARK-42772][SQL] Change the default value of JDBC options about push down to true - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/13 10:57:13 UTC, 0 replies.
- [GitHub] [spark] beliefer closed pull request #39990: [SPARK-42415][SQL] The built-in dialects support OFFSET and paging query. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/13 10:58:17 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40396: [SPARK-42772][SQL] Change the default value of JDBC options about push down to true - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/13 10:58:36 UTC, 2 replies.
- [GitHub] [spark] pan3793 commented on pull request #39160: [SPARK-41667][K8S] Expose env var SPARK_DRIVER_POD_NAME in Driver Pod - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/13 11:29:04 UTC, 2 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40397: [SPARK-42052][SQL] Codegen Support for HiveSimpleUDF - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/13 11:58:58 UTC, 0 replies.
- [GitHub] [spark] jdferreira opened a new pull request, #40398: Update `translate` docblock - posted by "jdferreira (via GitHub)" <gi...@apache.org> on 2023/03/13 12:12:05 UTC, 0 replies.
- [GitHub] [spark] zwangsheng commented on pull request #38202: [SPARK-40763][K8S] Should expose driver service name to config for user features - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/03/13 12:22:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #40399: [SPARK-42101][SQL][FOLLOWUP] Make QueryStageExec more type safe - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/13 12:34:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40399: [SPARK-42101][SQL][FOLLOWUP] Make QueryStageExec more type safe - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/13 12:35:04 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40399: [SPARK-42101][SQL][FOLLOWUP] Make QueryStageExec more type safe - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/13 12:46:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39124: [DON'T MERGE] Test build and test with hadoop 3.3.5-RC2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/13 13:34:31 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40395: [SPARK-42770][CONNECT] Add `truncatedTo(ChronoUnit.MICROS)` to make `SQLImplicitsTestSuite` in Java 17 daily test GA task pass - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/13 13:55:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40394: [SPARK-42771][SQL] Refactor HiveGenericUDF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/13 14:21:23 UTC, 13 replies.
- [GitHub] [spark] srowen commented on pull request #40263: [SPARK-42659][ML] Reimplement `FPGrowthModel.transform` with dataframe operations - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/13 14:45:52 UTC, 2 replies.
- [GitHub] [spark] srowen commented on pull request #40347: [SPARK-42711][BUILD]Update usage info and shellcheck warn/error fix for build/sbt tool - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/13 14:47:27 UTC, 0 replies.
- [GitHub] [spark] johanl-db commented on a diff in pull request #40308: [SPARK-42151][SQL] Align UPDATE assignments with table attributes - posted by "johanl-db (via GitHub)" <gi...@apache.org> on 2023/03/13 15:27:34 UTC, 0 replies.
- [GitHub] [spark] ClownXC opened a new pull request, #40400: [SPARK-41359][SQL] Use `PhysicalDataType` instead of DataType in UnsafeRow - posted by "ClownXC (via GitHub)" <gi...@apache.org> on 2023/03/13 15:39:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40399: [SPARK-42101][SQL][FOLLOWUP] Make QueryStageExec more type safe - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/13 15:59:55 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40392: [SPARK-42769][K8S] Add `SPARK_DRIVER_POD_IP` env variable to executor pods - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/13 16:25:12 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40392: [SPARK-42769][K8S] Add `SPARK_DRIVER_POD_IP` env variable to executor pods - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/13 17:00:55 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40355: [SPARK-42604][CONNECT] Implement functions.typedlit - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/13 17:06:31 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40392: [SPARK-42769][K8S] Add `SPARK_DRIVER_POD_IP` env variable to executor pods - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/13 17:07:43 UTC, 0 replies.
- [GitHub] [spark] allanf-db opened a new pull request, #40401: [SPARK-42773][DOCS][PYTHON] Minor update to 3.4.0 version change message for Spark Connect - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/03/13 17:52:39 UTC, 0 replies.
- [GitHub] [spark] santosh-d3vpl3x commented on pull request #40122: [SPARK-42349][PYTHON] Support pandas cogroup with multiple df - posted by "santosh-d3vpl3x (via GitHub)" <gi...@apache.org> on 2023/03/13 18:56:26 UTC, 0 replies.
- [GitHub] [spark] olaky commented on a diff in pull request #40321: [SPARK-42704] SubqueryAlias propagates metadata columns that child outputs - posted by "olaky (via GitHub)" <gi...@apache.org> on 2023/03/13 19:16:28 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40402: [SPARK-42020][CONNECT][PYTHON] Support UserDefinedType in Spark Connect - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/13 19:58:51 UTC, 0 replies.
- [GitHub] [spark] linhongliu-db opened a new pull request, #40403: [SPARK-42754][SQL][UI] Fix backward compatibility issue in nested SQL execution - posted by "linhongliu-db (via GitHub)" <gi...@apache.org> on 2023/03/13 21:49:02 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40262: [SPARK-42651][SQL] Optimize global sort to driver sort - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/13 21:57:47 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40393: []SPARK-40082] - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/13 22:00:40 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40404: [SPARK-42777][SQL] Support converting TimestampNTZ catalog stats to plan stats - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/13 22:47:31 UTC, 0 replies.
- [GitHub] [spark] atronchi commented on pull request #18990: [SPARK-21782][Core] Repartition creates skews when numPartitions is a power of 2 - posted by "atronchi (via GitHub)" <gi...@apache.org> on 2023/03/13 23:12:43 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on pull request #40388: [SPARK-42765][CONNECT][PYTHON] Regulate the import path of `pandas_udf` - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/13 23:33:00 UTC, 3 replies.
- [GitHub] [spark] itholic commented on pull request #40401: [SPARK-42773][DOCS][PYTHON] Minor update to 3.4.0 version change message for Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/13 23:41:47 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #18990: [SPARK-21782][Core] Repartition creates skews when numPartitions is a power of 2 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/13 23:50:53 UTC, 1 replies.
- [GitHub] [spark] liang3zy22 closed pull request #40347: [SPARK-42711][BUILD]Update usage info and shellcheck warn/error fix for build/sbt tool - posted by "liang3zy22 (via GitHub)" <gi...@apache.org> on 2023/03/13 23:54:44 UTC, 0 replies.
- [GitHub] [spark] liang3zy22 commented on pull request #40347: [SPARK-42711][BUILD]Update usage info and shellcheck warn/error fix for build/sbt tool - posted by "liang3zy22 (via GitHub)" <gi...@apache.org> on 2023/03/13 23:54:44 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #40190: [SPARK-42597][SQL] Support unwrap date type to timestamp type - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/14 00:00:52 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40190: [SPARK-42597][SQL] Support unwrap date type to timestamp type - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/14 00:02:01 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38887: [SPARK-41368][SQL] Reorder the window partition expressions by expression stats - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/14 00:18:27 UTC, 0 replies.
- [GitHub] [spark] linhongliu-db commented on pull request #40403: [SPARK-42754][SQL][UI] Fix backward compatibility issue in nested SQL execution - posted by "linhongliu-db (via GitHub)" <gi...@apache.org> on 2023/03/14 00:25:43 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40402: [SPARK-42020][CONNECT][PYTHON] Support UserDefinedType in Spark Connect - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/14 00:33:48 UTC, 6 replies.
- [GitHub] [spark] cloud-fan commented on pull request #18990: [SPARK-21782][Core] Repartition creates skews when numPartitions is a power of 2 - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/14 01:02:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40401: [SPARK-42773][DOCS][PYTHON] Minor update to 3.4.0 version change message for Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/14 01:04:08 UTC, 1 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40097: [SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/14 01:16:17 UTC, 10 replies.
- [GitHub] [spark] wangshengjie123 commented on pull request #40391: [SPARK-42766][YARN] YarnAllocator filter excluded nodes when launching containers - posted by "wangshengjie123 (via GitHub)" <gi...@apache.org> on 2023/03/14 01:22:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40402: [SPARK-42020][CONNECT][PYTHON] Support UserDefinedType in Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/14 01:28:38 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #40394: [SPARK-42771][SQL] Refactor HiveGenericUDF - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/14 01:34:24 UTC, 9 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40402: [SPARK-42020][CONNECT][PYTHON] Support UserDefinedType in Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/14 01:37:07 UTC, 3 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #40097: [SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/14 01:39:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40142: [SPARK-41171][SQL] Infer and push down window limit through window if partitionSpec is empty - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/14 01:42:20 UTC, 0 replies.
- [GitHub] [spark] gerashegalov commented on a diff in pull request #40372: [SPARK-42752][PYSPARK][SQL] Make PySpark exceptions printable during initialization - posted by "gerashegalov (via GitHub)" <gi...@apache.org> on 2023/03/14 01:50:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40385: [SPARK-42753] ReusedExchange refers to non-existent nodes - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/14 02:21:14 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40401: [SPARK-42773][DOCS][PYTHON] Minor update to 3.4.0 version change message for Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/14 02:24:17 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40388: [SPARK-42765][CONNECT][PYTHON] Regulate the import path of `pandas_udf` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/14 03:05:25 UTC, 3 replies.
- [GitHub] [spark] StevenChenDatabricks commented on pull request #40385: [SPARK-42753] ReusedExchange refers to non-existent nodes - posted by "StevenChenDatabricks (via GitHub)" <gi...@apache.org> on 2023/03/14 03:13:00 UTC, 3 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40405: [WIP][SPARK-42340][CONNECT][PYTHON] Implement `GroupedData.applyInPandas` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/14 03:20:10 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40360: [SPARK-42741][SQL] Do not unwrap casts in binary comparison when literal is null - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/14 03:25:17 UTC, 1 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #40396: [SPARK-42772][SQL] Change the default value of JDBC options about push down to true - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/03/14 03:41:47 UTC, 0 replies.
- [GitHub] [spark] otterc commented on pull request #40393: []SPARK-40082] - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/03/14 03:53:59 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40404: [SPARK-42777][SQL] Support converting TimestampNTZ catalog stats to plan stats - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/14 03:59:37 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40404: [SPARK-42777][SQL] Support converting TimestampNTZ catalog stats to plan stats - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/14 04:00:20 UTC, 0 replies.
- [GitHub] [spark] gatorsmile commented on pull request #40336: [SPARK-42706][SQL][DOCS] Document the Spark SQL error classes in user-facing documentation. - posted by "gatorsmile (via GitHub)" <gi...@apache.org> on 2023/03/14 04:12:32 UTC, 0 replies.
- [GitHub] [spark] gatorsmile commented on pull request #40216: [SPARK-42593][PS] Deprecate & remove the APIs that will be removed in pandas 2.0. - posted by "gatorsmile (via GitHub)" <gi...@apache.org> on 2023/03/14 04:18:01 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40406: [SPARK-42101][SQL][FOLLOWUP] Improve TableCacheQueryStage with CoalesceShufflePartitions - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/14 04:23:27 UTC, 0 replies.
- [GitHub] [spark] Stove-hust commented on pull request #40393: []SPARK-40082] - posted by "Stove-hust (via GitHub)" <gi...@apache.org> on 2023/03/14 04:30:34 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40395: [SPARK-42770][CONNECT] Add `truncatedTo(ChronoUnit.MICROS)` to make `SQLImplicitsTestSuite` in Java 17 daily test GA task pass - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/14 04:44:08 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40395: [SPARK-42770][CONNECT] Add `truncatedTo(ChronoUnit.MICROS)` to make `SQLImplicitsTestSuite` in Java 17 daily test GA task pass - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/14 05:13:10 UTC, 1 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40407: [SPARK-42778][SQL] QueryStageExec should respect supportsRowBased - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/14 06:15:31 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40406: [SPARK-42101][SQL][FOLLOWUP] Improve TableCacheQueryStage with CoalesceShufflePartitions - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/14 06:19:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40297: [SPARK-42412][WIP] Initial PR of Spark connect ML - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 06:19:51 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40407: [SPARK-42778][SQL] QueryStageExec should respect supportsRowBased - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/14 06:20:52 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40353: [SPARK-42732][PYSPARK][CONNECT] Support spark connect session getActiveSession method - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 06:22:31 UTC, 0 replies.
- [GitHub] [spark] thousandhu commented on a diff in pull request #40363: [SPARK_42744] delete uploaded file when job finish for k8s - posted by "thousandhu (via GitHub)" <gi...@apache.org> on 2023/03/14 06:22:51 UTC, 1 replies.
- [GitHub] [spark] thousandhu commented on pull request #40363: [SPARK_42744] delete uploaded file when job finish for k8s - posted by "thousandhu (via GitHub)" <gi...@apache.org> on 2023/03/14 06:27:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40324: [SPARK-42496][CONNECT][DOCS] Adding Spark Connect to the Spark 3.4 documentation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 06:38:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40324: [SPARK-42496][CONNECT][DOCS] Adding Spark Connect to the Spark 3.4 documentation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 06:39:08 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #40388: [SPARK-42765][CONNECT][PYTHON] Enable importing `pandas_udf` from `pyspark.sql.connect.functions` - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/14 07:42:04 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40388: [SPARK-42765][CONNECT][PYTHON] Enable importing `pandas_udf` from `pyspark.sql.connect.functions` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/14 07:42:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40388: [SPARK-42765][CONNECT][PYTHON] Enable importing `pandas_udf` from `pyspark.sql.connect.functions` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 07:57:30 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40388: [SPARK-42765][CONNECT][PYTHON] Enable importing `pandas_udf` from `pyspark.sql.connect.functions` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 08:07:21 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #40408: [SPARK-42780][BUILD] Upgrade `Tink` to 1.8.0 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/14 08:15:32 UTC, 0 replies.
- [GitHub] [spark] 1511351836 closed pull request #40380: [SPARK-42781][DOCS][PYTHON] provide one format for writing to kafka - posted by "1511351836 (via GitHub)" <gi...@apache.org> on 2023/03/14 08:16:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40408: [SPARK-42780][BUILD] Upgrade `Tink` to 1.8.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 08:31:18 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40282: [SPARK-42672][PYTHON][DOCS] Document error class list - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/14 08:47:24 UTC, 1 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40409: [SPARK-42782][SQL][TESTS] Port the tests for get_json_object from the Apache Hive project - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/14 08:47:29 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40409: [SPARK-42782][SQL][TESTS] Port the tests for get_json_object from the Apache Hive project - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/14 08:53:35 UTC, 6 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40410: [SPARK-42783][SQL] Infer window group limit should run as late as possible - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/14 08:56:25 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40142: [SPARK-41171][SQL] Infer and push down window limit through window if partitionSpec is empty - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/14 09:00:20 UTC, 0 replies.
- [GitHub] [spark] 1511351836 opened a new pull request, #40411: [SPARK-42781][DOCS][PYTHON] provide one format for writing to kafka - posted by "1511351836 (via GitHub)" <gi...@apache.org> on 2023/03/14 09:05:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40389: [SPARK-42767][CONNECT][TESTS] Add a precondition to start connect server fallback with `in-memory` and auto ignored some tests strongly depend on hive - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 09:08:11 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40389: [SPARK-42767][CONNECT][TESTS] Add a precondition to start connect server fallback with `in-memory` and auto ignored some tests strongly depend on hive - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/14 09:09:46 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40377: [SPARK-42757][CONNECT] Implement textFile for DataFrameReader - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 09:11:16 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40377: [SPARK-42757][CONNECT] Implement textFile for DataFrameReader - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/14 09:11:31 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40377: [SPARK-42757][CONNECT] Implement textFile for DataFrameReader - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/14 09:12:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40368: [SPARK-42748][CONNECT] Server-side Artifact Management - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 09:13:11 UTC, 2 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #40368: [SPARK-42748][CONNECT] Server-side Artifact Management - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/03/14 09:20:01 UTC, 9 replies.
- [GitHub] [spark] Stove-hust opened a new pull request, #40412: [SPARK-42784] should still create subDir when the number of subDir in merge dir is less than conf - posted by "Stove-hust (via GitHub)" <gi...@apache.org> on 2023/03/14 09:25:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40370: [SPARK-42620][PS] Add `inclusive` parameter for (DataFrame|Series).between_time - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 09:26:49 UTC, 0 replies.
- [GitHub] [spark] Stove-hust commented on pull request #40412: [SPARK-42784] should still create subDir when the number of subDir in merge dir is less than conf - posted by "Stove-hust (via GitHub)" <gi...@apache.org> on 2023/03/14 09:28:16 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40358: [SPARK-42733][CONNECT][Followup] Write without path or table - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 09:31:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40358: [SPARK-42733][CONNECT][Followup] Write without path or table - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 09:31:37 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40413: [WIP]Typed selected - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/14 09:54:47 UTC, 0 replies.
- [GitHub] [spark] zwangsheng opened a new pull request, #40414: [SPARK #42785][K8S][Core] When spark submit without `--deploy-mode`, avoid facing NPE in Kubernetes Case - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/03/14 10:01:15 UTC, 0 replies.
- [GitHub] [spark] zwangsheng commented on pull request #40414: [SPARK #42785][K8S][Core] When spark submit without `--deploy-mode`, avoid facing NPE in Kubernetes Case - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/03/14 10:02:00 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40415: [Do not merge] Add JDBC to DataFrameWriter - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/14 10:20:07 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40414: [SPARK #42785][K8S][Core] When spark submit without `--deploy-mode`, avoid facing NPE in Kubernetes Case - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/14 10:20:17 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #40353: [SPARK-42732][PYSPARK][CONNECT] Support spark connect session getActiveSession method - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/14 10:33:41 UTC, 0 replies.
- [GitHub] [spark] shrprasa commented on pull request #40414: [SPARK #42785][K8S][Core] When spark submit without `--deploy-mode`, avoid facing NPE in Kubernetes Case - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/03/14 10:38:19 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40413: [WIP]Typed Select - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/14 10:39:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40326: [SPARK-42708][DOCS] Improve doc about protobuf java file can't be indexed. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 11:05:21 UTC, 1 replies.
- [GitHub] [spark] zwangsheng commented on pull request #40414: [SPARK-42785][K8S][Core] When spark submit without `--deploy-mode`, avoid facing NPE in Kubernetes Case - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/03/14 11:05:22 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40413: [WIP]Typed Select - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/14 11:05:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40409: [SPARK-42782][SQL][TESTS] Port the tests for get_json_object from the Apache Hive project - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 11:11:07 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40353: [SPARK-42732][PYSPARK][CONNECT] Support spark connect session getActiveSession method - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 11:11:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40416: [SPARK-42731][CONNECT][DOCS] Document Spark Connect configurations - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 11:20:36 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40409: [SPARK-42782][SQL][TESTS] Hive compatibility check for get_json_object - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/14 11:24:48 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #40377: [SPARK-42757][CONNECT] Implement textFile for DataFrameReader - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/14 12:00:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40408: [SPARK-42780][BUILD] Upgrade `Tink` to 1.8.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/14 12:01:37 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40297: [SPARK-42412][WIP] Initial PR of Spark connect ML - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/14 12:04:48 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #40416: [SPARK-42731][CONNECT][DOCS] Document Spark Connect configurations - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/14 12:18:38 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #40326: [SPARK-42708][DOCS] Improve doc about protobuf java file can't be indexed. - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/03/14 12:19:29 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40396: [SPARK-42772][SQL] Change the default value of JDBC options about push down to true - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/14 12:22:44 UTC, 4 replies.
- [GitHub] [spark] wangyum closed pull request #40360: [SPARK-42741][SQL] Do not unwrap casts in binary comparison when literal is null - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/14 12:32:04 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40406: [SPARK-42101][SQL][FOLLOWUP] Improve TableCacheQueryStage with CoalesceShufflePartitions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/14 12:59:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40406: [SPARK-42101][SQL][FOLLOWUP] Improve TableCacheQueryStage with CoalesceShufflePartitions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/14 13:01:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40407: [SPARK-42778][SQL] QueryStageExec should respect supportsRowBased - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/14 13:03:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40407: [SPARK-42778][SQL] QueryStageExec should respect supportsRowBased - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/14 13:03:48 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40372: [SPARK-42752][PYSPARK][SQL] Make PySpark exceptions printable during initialization - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/14 13:30:25 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40372: [SPARK-42752][PYSPARK][SQL] Make PySpark exceptions printable during initialization - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/14 13:30:48 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40398: [MINOR][DOCS] Update `translate` docblock - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/14 13:35:30 UTC, 1 replies.
- [GitHub] [spark] shrprasa commented on pull request #40414: [SPARK-42785][K8S][Core] When spark submit without `--deploy-mode`, avoid facing NPE in Kubernetes Case - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/03/14 13:35:48 UTC, 0 replies.
- [GitHub] [spark] MaicoTimmerman commented on pull request #40338: [MINOR][PYTHON] Change TypeVar to private symbols - posted by "MaicoTimmerman (via GitHub)" <gi...@apache.org> on 2023/03/14 14:00:39 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40417: [SPARK-42778][SQL][3.4] QueryStageExec should respect supportsRowBased - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/14 14:01:30 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #40122: [SPARK-42349][PYTHON] Support pandas cogroup with multiple df - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/03/14 14:03:58 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40418: [WIP][SPARK-42790][SQL] Abstract the excluded method for better test for JDBC docker tests. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/14 14:08:48 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40419: [SPARK-42789][SQL] Rewrite multiple GetJsonObjects to a JsonTuple if their json expressions are the same - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/14 14:58:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40414: [SPARK-42785][K8S][Core] When spark submit without `--deploy-mode`, avoid facing NPE in Kubernetes Case - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/14 15:49:30 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40395: [SPARK-42770][CONNECT] Add `truncatedTo(ChronoUnit.MICROS)` to make `SQLImplicitsTestSuite` in Java 17 daily test GA task pass - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/14 15:53:24 UTC, 0 replies.
- [GitHub] [spark] dzhigimont opened a new pull request, #40420: [SPARK-42617][PS] Support `isocalendar` from the pandas 2.0.0 - posted by "dzhigimont (via GitHub)" <gi...@apache.org> on 2023/03/14 15:59:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40409: [SPARK-42782][SQL][TESTS] Hive compatibility check for get_json_object - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/14 16:04:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40403: [SPARK-42754][SQL][UI] Fix backward compatibility issue in nested SQL execution - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/14 16:07:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40408: [SPARK-42780][BUILD] Upgrade `Tink` to 1.8.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/14 16:37:12 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #40421: [SPARK-42779][SQL] Allow V2 writes to indicate advisory shuffle partition size - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/03/14 16:48:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40358: [SPARK-42733][CONNECT][Followup] Write without path or table - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/14 16:49:34 UTC, 8 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #40421: [SPARK-42779][SQL] Allow V2 writes to indicate advisory shuffle partition size - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/03/14 16:51:42 UTC, 3 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #40421: [SPARK-42779][SQL] Allow V2 writes to indicate advisory shuffle partition size - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/03/14 16:54:24 UTC, 5 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40368: [SPARK-42748][CONNECT] Server-side Artifact Management - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/14 16:57:59 UTC, 12 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40410: [SPARK-42783][SQL] Infer window group limit should run as late as possible - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/14 17:03:20 UTC, 3 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40358: [SPARK-42733][CONNECT][Followup] Write without path or table - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/14 17:19:31 UTC, 4 replies.
- [GitHub] [spark] NarekDW opened a new pull request, #40422: [MINOR] Use getParameterCount function instead of getParameterTypes.length - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/03/14 18:08:06 UTC, 0 replies.
- [GitHub] [spark] rithwik-db opened a new pull request, #40423: [SPARK-41775][PYTHON][FOLLOW-UP] Torch distributor multiple gpus per task - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/03/14 18:41:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40424: [SPARK-42793][CONNECT] `connect` module requires `build_profile_flags` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/14 22:05:01 UTC, 0 replies.
- [GitHub] [spark] huanliwang-db opened a new pull request, #40425: [SPARK-42794][SS] Increase the lockAcquireTimeoutMs to 2 minutes for acquiring the RocksDB state store in Structure Streaming - posted by "huanliwang-db (via GitHub)" <gi...@apache.org> on 2023/03/14 22:14:34 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on a diff in pull request #40425: [SPARK-42794][SS] Increase the lockAcquireTimeoutMs to 2 minutes for acquiring the RocksDB state store in Structure Streaming - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/03/14 22:16:47 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40426: [SPARK-42796][SQL] Support accessing TimestampNTZ columns in CachedBatch - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/14 22:45:07 UTC, 0 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #40427: [SPARK-42792][SS] Add support for WRITE_FLUSH_BYTES for RocksDB used in streaming stateful operators - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/03/14 22:46:27 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #40427: [SPARK-42792][SS] Add support for WRITE_FLUSH_BYTES for RocksDB used in streaming stateful operators - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/03/14 22:47:17 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40097: [SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:31:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40097: [SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:31:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40416: [SPARK-42731][CONNECT][DOCS] Document Spark Connect configurations - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:40:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40416: [SPARK-42731][CONNECT][DOCS] Document Spark Connect configurations - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:40:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40416: [SPARK-42731][CONNECT][DOCS] Document Spark Connect configurations - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:40:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40377: [SPARK-42757][CONNECT] Implement textFile for DataFrameReader - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:41:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40377: [SPARK-42757][CONNECT] Implement textFile for DataFrameReader - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:41:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40338: [MINOR][PYTHON] Change TypeVar to private symbols - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:43:49 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40424: [SPARK-42793][CONNECT] `connect` module requires `build_profile_flags` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/14 23:54:09 UTC, 2 replies.
- [GitHub] [spark] wangyum commented on pull request #40419: [SPARK-42789][SQL] Rewrite multiple GetJsonObjects to a JsonTuple if their json expressions are the same - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/14 23:55:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40424: [SPARK-42793][CONNECT] `connect` module requires `build_profile_flags` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:55:58 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40424: [SPARK-42793][CONNECT] `connect` module requires `build_profile_flags` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:56:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40420: [SPARK-42617][PS] Support `isocalendar` from the pandas 2.0.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:56:37 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40420: [SPARK-42617][PS] Support `isocalendar` from the pandas 2.0.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/14 23:57:10 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40423: [SPARK-41775][PYTHON][FOLLOW-UP] Torch distributor multiple gpus per task - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 00:25:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40425: [SPARK-42794][SS] Increase the lockAcquireTimeoutMs to 2 minutes for acquiring the RocksDB state store in Structure Streaming - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 00:26:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40282: [SPARK-42672][PYTHON][DOCS] Document error class list - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 00:38:39 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #40388: [SPARK-42765][CONNECT][PYTHON] Enable importing `pandas_udf` from `pyspark.sql.connect.functions` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/15 00:42:14 UTC, 0 replies.
- [GitHub] [spark] allanf-db opened a new pull request, #40428: Grammatical improvements - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/03/15 00:57:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40396: [SPARK-42772][SQL] Change the default value of JDBC options about push down to true - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/15 01:02:27 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40410: [SPARK-42783][SQL] Infer window group limit should run as late as possible - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/15 01:18:23 UTC, 4 replies.
- [GitHub] [spark] chenhao-db opened a new pull request, #40429: [SPARK-42775][SQL] Throw exception when ApproximatePercentile result doesn't fit into output decimal type. - posted by "chenhao-db (via GitHub)" <gi...@apache.org> on 2023/03/15 01:20:59 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40417: [SPARK-42778][SQL][3.4] QueryStageExec should respect supportsRowBased - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/15 01:22:07 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40417: [SPARK-42778][SQL][3.4] QueryStageExec should respect supportsRowBased - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/15 01:44:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40426: [SPARK-42796][SQL] Support accessing TimestampNTZ columns in CachedBatch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 01:47:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40426: [SPARK-42796][SQL] Support accessing TimestampNTZ columns in CachedBatch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 01:48:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40324: [SPARK-42496][CONNECT][DOCS] Adding Spark Connect to the Spark 3.4 documentation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 01:56:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40336: [SPARK-42706][SQL][DOCS] Document the Spark SQL error classes in user-facing documentation. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/15 02:11:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40422: [MINOR] Use getParameterCount function instead of getParameterTypes.length - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 02:32:42 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40425: [SPARK-42794][SS] Increase the lockAcquireTimeoutMs to 2 minutes for acquiring the RocksDB state store in Structure Streaming - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/15 02:36:31 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40425: [SPARK-42794][SS] Increase the lockAcquireTimeoutMs to 2 minutes for acquiring the RocksDB state store in Structure Streaming - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/15 02:37:26 UTC, 0 replies.
- [GitHub] [spark] Stove-hust commented on pull request #40393: [SPARK-40082] Schedule mergeFinalize when push merge shuffleMapStage retry but no running tasks - posted by "Stove-hust (via GitHub)" <gi...@apache.org> on 2023/03/15 02:45:25 UTC, 9 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40430: [SPARK-42798][BUILD] Upgrade protobuf-java to 3.22.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 02:58:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40428: [SPARK-42797][CONNECT][DOCS] Grammatical improvements for Spark Connect content - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 03:33:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40431: [SPARK-42799][BUILD] Update SBT build `xercesImpl` version to match with `pom.xml` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/15 03:33:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40428: [SPARK-42797][CONNECT][DOCS] Grammatical improvements for Spark Connect content - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 03:33:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40388: [SPARK-42765][CONNECT][PYTHON] Enable importing `pandas_udf` from `pyspark.sql.connect.functions` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 03:35:02 UTC, 0 replies.
- [GitHub] [spark] StevenChenDatabricks commented on a diff in pull request #40385: [SPARK-42753] ReusedExchange refers to non-existent nodes - posted by "StevenChenDatabricks (via GitHub)" <gi...@apache.org> on 2023/03/15 03:37:16 UTC, 8 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40431: [SPARK-42799][BUILD] Update SBT build `xercesImpl` version to match with `pom.xml` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/15 03:38:01 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40432: [SPARK-42800][CONNECT][PYTHON][ML] Implement ml function `{array_to_vector, vector_to_array}` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/15 04:01:42 UTC, 0 replies.
- [GitHub] [spark] otterc commented on pull request #40393: [SPARK-40082] Schedule mergeFinalize when push merge shuffleMapStage retry but no running tasks - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/03/15 04:02:34 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40432: [SPARK-42800][CONNECT][PYTHON][ML] Implement ml function `{array_to_vector, vector_to_array}` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/15 04:05:33 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40396: [SPARK-42772][SQL] Change the default value of JDBC options about push down to true - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/15 04:09:59 UTC, 7 replies.
- [GitHub] [spark] harupy commented on a diff in pull request #40297: [SPARK-42412][WIP] Initial PR of Spark connect ML - posted by "harupy (via GitHub)" <gi...@apache.org> on 2023/03/15 05:00:14 UTC, 5 replies.
- [GitHub] [spark] itholic opened a new pull request, #40433: [SPARK-42706][SQL][DOCS][3.4] Document the Spark SQL error classes in user-facing documentation - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/15 05:10:24 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40433: [SPARK-42706][SQL][DOCS][3.4] Document the Spark SQL error classes in user-facing documentation - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/15 05:11:08 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40385: [SPARK-42753] ReusedExchange refers to non-existent nodes - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/15 05:12:10 UTC, 8 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40434: [SPARK-42801][CONNECT][TESTS] Ignore flaky test in Java 8 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/15 05:53:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40434: [SPARK-42801][CONNECT][TESTS] Ignore flaky `write jdbc` test of `ClientE2ETestSuite` on Java 8 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/15 06:05:04 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40389: [SPARK-42767][CONNECT][TESTS] Add a precondition to start connect server fallback with `in-memory` and auto ignored some tests strongly depend on hive - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 06:07:07 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40434: [SPARK-42801][CONNECT][TESTS] Ignore flaky `write jdbc` test of `ClientE2ETestSuite` on Java 8 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/15 06:28:06 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40433: [SPARK-42706][SQL][DOCS][3.4] Document the Spark SQL error classes in user-facing documentation - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/15 06:48:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40433: [SPARK-42706][SQL][DOCS][3.4] Document the Spark SQL error classes in user-facing documentation - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/15 06:51:58 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40427: [SPARK-42792][SS] Add support for WRITE_FLUSH_BYTES for RocksDB used in streaming stateful operators - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/15 06:53:20 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40427: [SPARK-42792][SS] Add support for WRITE_FLUSH_BYTES for RocksDB used in streaming stateful operators - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/15 06:53:56 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on pull request #40422: [SPARK-42803] Use getParameterCount function instead of getParameterTypes.length - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/03/15 07:16:45 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40431: [SPARK-42799][BUILD] Update SBT build `xercesImpl` version to match with `pom.xml` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/15 07:39:35 UTC, 0 replies.
- [GitHub] [spark] allanf-db opened a new pull request, #40435: [SPARK-42496][CONNECT][DOCS] Addressing feedback to remove last ">>>" and adding type(spark) example - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/03/15 08:08:07 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40410: [SPARK-42783][SQL] Infer window group limit should run as late as possible - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/15 08:13:08 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40431: [SPARK-42799][BUILD] Update SBT build `xercesImpl` version to match with `pom.xml` - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/15 08:13:40 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40160: [SPARK-41725][CONNECT] Eager Execution of DF.sql() - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 08:57:37 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40160: [SPARK-41725][CONNECT] Eager Execution of DF.sql() - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/15 08:59:32 UTC, 0 replies.
- [GitHub] [spark] dzhigimont opened a new pull request, #40436: [SPARK-42619][PS] Add `show_counts` parameter for DataFrame.info - posted by "dzhigimont (via GitHub)" <gi...@apache.org> on 2023/03/15 09:09:35 UTC, 0 replies.
- [GitHub] [spark] Yikf opened a new pull request, #40437: [SPARK-41259][SQL] SparkSQLDriver Output schema and result string should be consistent - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/03/15 09:42:42 UTC, 0 replies.
- [GitHub] [spark] Yikf closed pull request #38795: [SPARK-41259][SQL] Spark-sql cli query results should correspond to schema - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/03/15 09:43:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40410: [SPARK-42783][SQL] Infer window group limit should run as late as possible - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/15 10:13:09 UTC, 0 replies.
- [GitHub] [spark] lordk911 commented on pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by "lordk911 (via GitHub)" <gi...@apache.org> on 2023/03/15 10:14:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40394: [SPARK-42771][SQL] Refactor HiveGenericUDF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/15 10:15:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40394: [SPARK-42771][SQL] Refactor HiveGenericUDF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/15 10:16:18 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40438: [WIP][CONNECT] Catalog - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 10:22:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40438: [WIP][CONNECT] Catalog - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 10:23:00 UTC, 0 replies.
- [GitHub] [spark] Yikf commented on pull request #40437: [SPARK-41259][SQL] SparkSQLDriver Output schema and result string should be consistent - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/03/15 10:34:52 UTC, 2 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #40353: [SPARK-42732][PYTHON][CONNECT] Update spark connect session `getOrCreate` behavior to check existing global `_active_spark_session` first - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/15 10:40:47 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #40439: [SPARK-42807][CORE] Apply custom log URL pattern for yarn-client AM log URL in SHS - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/03/15 10:51:17 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #40440: [SPARK-42808][CORE] Avoid getting availableProcessors every time in MapOutputTrackerMaster#getStatistics - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/03/15 10:59:55 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40441: Spark 42809 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/15 11:13:59 UTC, 0 replies.
- [GitHub] [spark] panbingkun closed pull request #40441: Spark 42809 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/15 11:14:48 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40442: [SPARK-42809][BUILD] Upgrade scala-maven-plugin from 4.8.0 to 4.8.1 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/15 11:16:45 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #40389: [SPARK-42767][CONNECT][TESTS] Add a precondition to start connect server fallback with `in-memory` and auto ignored some tests strongly depend on hive - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/03/15 11:18:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39239: [SPARK-41730][PYTHON] Set tz to UTC while converting of timestamps to python's datetime - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 11:25:23 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39239: [SPARK-41730][PYTHON] Set tz to UTC while converting of timestamps to python's datetime - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 11:25:55 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40389: [SPARK-42767][CONNECT][TESTS] Add a precondition to start connect server fallback with `in-memory` and auto ignored some tests strongly depend on hive - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 11:36:30 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40338: [MINOR][PYTHON] Change TypeVar to private symbols - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 11:38:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40432: [SPARK-42800][CONNECT][PYTHON][ML] Implement ml function `{array_to_vector, vector_to_array}` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 11:48:32 UTC, 8 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40432: [SPARK-42800][CONNECT][PYTHON][ML] Implement ml function `{array_to_vector, vector_to_array}` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/15 11:50:02 UTC, 8 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40410: [SPARK-42783][SQL] Infer window group limit should run as late as possible - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/15 12:01:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40410: [SPARK-42783][SQL] Infer window group limit should run as late as possible - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/15 12:01:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40436: [SPARK-42619][PS] Add `show_counts` parameter for DataFrame.info - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 12:04:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40435: [SPARK-42496][CONNECT][DOCS][FOLLOW-UP] Addressing feedback to remove last ">>>" and adding type(spark) example - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 12:04:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40435: [SPARK-42496][CONNECT][DOCS][FOLLOW-UP] Addressing feedback to remove last ">>>" and adding type(spark) example - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 12:04:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40422: [SPARK-42803] Use getParameterCount function instead of getParameterTypes.length - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 12:11:30 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #40397: [SPARK-42052][SQL] Codegen Support for HiveSimpleUDF - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/15 12:24:51 UTC, 1 replies.
- [GitHub] [spark] zero323 commented on pull request #40338: [MINOR][PYTHON] Change TypeVar to private symbols - posted by "zero323 (via GitHub)" <gi...@apache.org> on 2023/03/15 12:25:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40438: [WIP][SPARK-42806][CONNECT] Catalog - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 12:41:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40397: [SPARK-42052][SQL] Codegen Support for HiveSimpleUDF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/15 12:46:08 UTC, 6 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40442: [SPARK-42809][BUILD] Upgrade scala-maven-plugin from 4.8.0 to 4.8.1 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 13:26:41 UTC, 7 replies.
- [GitHub] [spark] vicennial opened a new pull request, #40443: [SPARK-42812][CONNECT] Add client_type to AddArtifactsRequest protobuf message - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/03/15 13:30:53 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40440: [SPARK-42808][CORE] Avoid getting availableProcessors every time in `MapOutputTrackerMaster#getStatistics` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 13:35:32 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #40397: [SPARK-42052][SQL] Codegen Support for HiveSimpleUDF - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/15 14:09:56 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40403: [SPARK-42754][SQL][UI] Fix backward compatibility issue in nested SQL execution - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 14:13:45 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #40444: [SPARK-42813][K8S] Print application info when waitAppCompletion is false - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/15 14:17:19 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #40444: [SPARK-42813][K8S] Print application info when waitAppCompletion is false - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/15 14:22:17 UTC, 4 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40445: [SPARK-42814][BUILD] Upgrade some maven plugins - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/15 15:15:11 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40446: [SPARK-42815][SQL] Subexpression elimination support shortcut conditional expression - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/15 15:42:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40444: [SPARK-42813][K8S] Print application info when waitAppCompletion is false - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/15 15:44:24 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40410: [SPARK-42783][SQL] Infer window group limit should run as late as possible - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/15 16:02:24 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #40444: [SPARK-42813][K8S] Print application info when waitAppCompletion is false - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/15 16:45:23 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #40447: [SPARK-42816] Support Max Message size up to 128MB - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/15 16:45:34 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #40447: [SPARK-42816] Support Max Message size up to 128MB - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/15 16:56:23 UTC, 1 replies.
- [GitHub] [spark] jdferreira commented on pull request #40398: [MINOR][DOCS] Update `translate` docblock - posted by "jdferreira (via GitHub)" <gi...@apache.org> on 2023/03/15 17:05:35 UTC, 0 replies.
- [GitHub] [spark] otterc opened a new pull request, #40448: Logging the shuffle service name once in ApplicationMaster - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/03/15 17:48:43 UTC, 0 replies.
- [GitHub] [spark] otterc commented on pull request #40448: Logging the shuffle service name once in ApplicationMaster - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/03/15 17:55:40 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #40449: [SPARK-42791][SQL] Create a new golden file test framework for analysis - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/15 18:19:17 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39239: [SPARK-41730][PYTHON] Set tz to UTC while converting of timestamps to python's datetime - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/15 18:21:06 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on pull request #40422: [SPARK-42803][CORE][SQL][ML] Use getParameterCount function instead of getParameterTypes.length - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/03/15 18:31:38 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #40449: [SPARK-42791][SQL] Create a new golden file test framework for analysis - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/15 19:34:03 UTC, 4 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40449: [SPARK-42791][SQL] Create a new golden file test framework for analysis - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/15 20:33:34 UTC, 2 replies.
- [GitHub] [spark] rithwik-db commented on pull request #40423: [SPARK-41775][PYTHON][FOLLOW-UP] Torch distributor multiple gpus per task - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/03/15 20:46:16 UTC, 0 replies.
- [GitHub] [spark] rithwik-db closed pull request #40423: [SPARK-41775][PYTHON][FOLLOW-UP] Torch distributor multiple gpus per task - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/03/15 20:46:27 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40450: [SPARK-42818][CONNECT][PYTHON] Implement DataFrameReader/Writer.jdbc - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/15 22:01:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40442: [SPARK-42809][BUILD] Upgrade scala-maven-plugin from 4.8.0 to 4.8.1 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 23:59:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40442: [SPARK-42809][BUILD] Upgrade scala-maven-plugin from 4.8.0 to 4.8.1 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/15 23:59:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40447: [SPARK-42816] Support Max Message size up to 128MB - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 00:06:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40448: [SPARK-42817][CORE] Logging the shuffle service name once in ApplicationMaster - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 00:13:30 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37725: [DO-NOT-MERGE] Exceptions without error classes in SQL golden files - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/16 00:21:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40447: [SPARK-42816][CONNECT] Support Max Message size up to 128MB - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/16 00:23:17 UTC, 2 replies.
- [GitHub] [spark] cloud-fan closed pull request #40396: [SPARK-42772][SQL] Change the default value of JDBC options about push down to true - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/16 01:04:25 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40446: [SPARK-42815][SQL] Subexpression elimination support shortcut conditional expression - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/16 01:13:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40450: [SPARK-42818][CONNECT][PYTHON] Implement DataFrameReader/Writer.jdbc - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 01:38:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40450: [SPARK-42818][CONNECT][PYTHON] Implement DataFrameReader/Writer.jdbc - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 01:38:28 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40451: [SPARK-42818][CONNECT][PYTHON][FOLLOWUP] Add versionchanged - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/16 01:57:28 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40447: [SPARK-42816][CONNECT] Support Max Message size up to 128MB - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/16 02:13:46 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40446: [SPARK-42815][SQL] Subexpression elimination support shortcut conditional expression - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/16 02:21:42 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40452: [MINOR] Add comments of `xercesImpl` upgrade precautions in `pom.xml` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/16 02:27:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40431: [SPARK-42799][BUILD] Update SBT build `xercesImpl` version to match with `pom.xml` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/16 02:28:13 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #40446: [SPARK-42815][SQL] Subexpression elimination support shortcut conditional expression - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/16 02:38:29 UTC, 1 replies.
- [GitHub] [spark] williamhyun opened a new pull request, #40453: [SPARK-42820][BUILD] Update ORC to 1.8.3 - posted by "williamhyun (via GitHub)" <gi...@apache.org> on 2023/03/16 02:41:58 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40450: [SPARK-42818][CONNECT][PYTHON] Implement DataFrameReader/Writer.jdbc - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/16 02:44:18 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40454: [SPARK-42821][SQL] Remove unused parameters in splitFiles methods - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/16 02:59:25 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40451: [SPARK-42818][CONNECT][PYTHON][FOLLOWUP] Add versionchanged - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/16 03:18:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40451: [SPARK-42818][CONNECT][PYTHON][FOLLOWUP] Add versionchanged - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/16 03:19:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40452: [MINOR] Add comments of `xercesImpl` upgrade precautions in `pom.xml` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/16 04:01:45 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40448: [SPARK-42817][CORE] Logging the shuffle service name once in ApplicationMaster - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/16 04:02:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40448: [SPARK-42817][CORE] Logging the shuffle service name once in ApplicationMaster - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/16 04:03:57 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40389: [SPARK-42767][CONNECT][TESTS] Add a precondition to start connect server fallback with `in-memory` and auto ignored some tests strongly depend on hive - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 04:04:32 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40453: [SPARK-42820][BUILD] Update ORC to 1.8.3 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/16 04:07:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40452: [MINOR] Add comments of `xercesImpl` upgrade precautions in `pom.xml` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/16 04:07:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40452: [MINOR] Add comments of `xercesImpl` upgrade precautions in `pom.xml` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/16 04:13:07 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40452: [MINOR] Add comments of `xercesImpl` upgrade precautions in `pom.xml` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/16 04:13:34 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40446: [SPARK-42815][SQL] Subexpression elimination support shortcut expression - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/16 04:15:16 UTC, 1 replies.
- [GitHub] [spark] shuwang21 commented on pull request #40448: [SPARK-42817][CORE] Logging the shuffle service name once in ApplicationMaster - posted by "shuwang21 (via GitHub)" <gi...@apache.org> on 2023/03/16 04:27:58 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #40446: [SPARK-42815][SQL] Subexpression elimination support shortcut expression - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/16 04:30:07 UTC, 9 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #40432: [SPARK-42800][CONNECT][PYTHON][ML] Implement ml function `{array_to_vector, vector_to_array}` - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/16 04:30:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40453: [SPARK-42820][BUILD] Update ORC to 1.8.3 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/16 04:58:24 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40453: [SPARK-42820][BUILD] Update ORC to 1.8.3 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/16 05:00:34 UTC, 0 replies.
- [GitHub] [spark] zhouyejoe commented on pull request #40412: [SPARK-42784] should still create subDir when the number of subDir in merge dir is less than conf - posted by "zhouyejoe (via GitHub)" <gi...@apache.org> on 2023/03/16 05:29:45 UTC, 1 replies.
- [GitHub] [spark] navinvishy commented on a diff in pull request #38947: [SPARK-41233][SQL][PYTHON] Add `array_prepend` function - posted by "navinvishy (via GitHub)" <gi...@apache.org> on 2023/03/16 05:36:38 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40447: [SPARK-42816][CONNECT] Support Max Message size up to 128MB - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/16 05:59:09 UTC, 3 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #40455: [SPARK-42819][SS] Add support for setting max_write_buffer_number and write_buffer_size for RocksDB used in streaming - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/03/16 06:25:13 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #40455: [SPARK-42819][SS] Add support for setting max_write_buffer_number and write_buffer_size for RocksDB used in streaming - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/03/16 06:29:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40443: [SPARK-42812][CONNECT] Add client_type to AddArtifactsRequest protobuf message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 06:29:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40443: [SPARK-42812][CONNECT] Add client_type to AddArtifactsRequest protobuf message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 06:29:58 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40432: [SPARK-42800][CONNECT][PYTHON][ML] Implement ml function `{array_to_vector, vector_to_array}` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/16 07:04:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39239: [SPARK-41730][PYTHON] Set tz to UTC while converting of timestamps to python's datetime - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/16 08:45:41 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40438: [WIP][SPARK-42806][CONNECT] Add `Catalog` support - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/16 09:12:08 UTC, 6 replies.
- [GitHub] [spark] Surbhi-Vijay commented on pull request #40171: [SPARK-42598][TEST] Refactor TPCH schema to separate file similar to TPCDS for code reuse - posted by "Surbhi-Vijay (via GitHub)" <gi...@apache.org> on 2023/03/16 09:12:12 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40456: [SPARK-42720][PS][SQL] Uses expression for distributed-sequence default index instead of plan - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 09:19:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40456: [SPARK-42720][PS][SQL] Uses expression for distributed-sequence default index instead of plan - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 09:20:07 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40270: [WIP][SPARK-42662][CONNECT][PYTHON][PS] Support `withSequenceColumn` as PySpark DataFrame internal function. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 09:20:24 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40171: [SPARK-42598][TEST] Refactor TPCH schema to separate file similar to TPCDS for code reuse - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/16 09:23:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40456: [SPARK-42720][PS][SQL] Uses expression for distributed-sequence default index instead of plan - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/16 09:36:46 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38947: [SPARK-41233][SQL][PYTHON] Add `array_prepend` function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/16 09:51:50 UTC, 1 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40457: [SPARK-42823][SQL] spark-sql shell supports multipart namespaces for initialization - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/16 10:38:59 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40455: [SPARK-42819][SS] Add support for setting max_write_buffer_number and write_buffer_size for RocksDB used in streaming - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/16 10:51:12 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40455: [SPARK-42819][SS] Add support for setting max_write_buffer_number and write_buffer_size for RocksDB used in streaming - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/16 10:52:37 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40438: [SPARK-42806][CONNECT] Add `Catalog` support - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/16 11:06:49 UTC, 5 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40438: [SPARK-42806][CONNECT] Add `Catalog` support - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/16 11:23:26 UTC, 13 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38947: [SPARK-41233][SQL][PYTHON] Add `array_prepend` function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 11:35:39 UTC, 1 replies.
- [GitHub] [spark] navinvishy opened a new pull request, #38947: [SPARK-41233][SQL][PYTHON] Add `array_prepend` function - posted by "navinvishy (via GitHub)" <gi...@apache.org> on 2023/03/16 11:36:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38947: [SPARK-41233][SQL][PYTHON] Add `array_prepend` function - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/16 11:37:21 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40447: [SPARK-42816][CONNECT] Support Max Message size up to 128MB - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/16 12:25:57 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40458: [SPARK-42824][CONNECT][PYTHON] Provide a clear error message for unsupported JVM attributes. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/16 15:13:46 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40458: [SPARK-42824][CONNECT][PYTHON] Provide a clear error message for unsupported JVM attributes. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/16 15:21:55 UTC, 0 replies.
- [GitHub] [spark] allanf-db commented on pull request #40458: [SPARK-42824][CONNECT][PYTHON] Provide a clear error message for unsupported JVM attributes. - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/03/16 15:43:27 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40458: [SPARK-42824][CONNECT][PYTHON] Provide a clear error message for unsupported JVM attributes. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/16 15:49:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40454: [SPARK-42821][SQL] Remove unused parameters in splitFiles methods - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/16 16:34:00 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #40459: [SPARK-42826][PS][DOCS] Add migration note for API changes - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/16 16:44:31 UTC, 0 replies.
- [GitHub] [spark] j03wang opened a new pull request, #40460: [SPARK-42828] More explicit Python type annotations for GroupedData - posted by "j03wang (via GitHub)" <gi...@apache.org> on 2023/03/16 19:18:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40459: [SPARK-42826][PS][DOCS] Add migration notes for update to supported pandas version. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/16 19:19:11 UTC, 0 replies.
- [GitHub] [spark] j03wang commented on pull request #40460: [SPARK-42828] More explicit Python type annotations for GroupedData - posted by "j03wang (via GitHub)" <gi...@apache.org> on 2023/03/16 19:22:11 UTC, 0 replies.
- [GitHub] [spark] ritikam2 commented on a diff in pull request #40116: [SPARK-41391][SQL] The output column name of groupBy.agg(count_distinct) is incorrect - posted by "ritikam2 (via GitHub)" <gi...@apache.org> on 2023/03/16 19:26:46 UTC, 4 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #40449: [SPARK-42791][SQL] Create a new golden file test framework for analysis - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/16 20:18:42 UTC, 5 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40389: [SPARK-42767][CONNECT][TESTS] Add a precondition to start connect server fallback with `in-memory` and auto ignored some tests strongly depend on hive - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/16 20:24:08 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #40449: [SPARK-42791][SQL] Create a new golden file test framework for analysis - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/16 21:22:33 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40448: [SPARK-42817][CORE] Logging the shuffle service name once in ApplicationMaster - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/16 21:27:41 UTC, 0 replies.
- [GitHub] [spark] otterc commented on pull request #40448: [SPARK-42817][CORE] Logging the shuffle service name once in ApplicationMaster - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/03/16 21:29:45 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #40421: [SPARK-42779][SQL] Allow V2 writes to indicate advisory shuffle partition size - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/03/16 23:44:08 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #40421: [SPARK-42779][SQL] Allow V2 writes to indicate advisory shuffle partition size - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/03/16 23:44:45 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37725: [DO-NOT-MERGE] Exceptions without error classes in SQL golden files - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/17 00:20:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40459: [SPARK-42826][PS][DOCS] Add migration notes for update to supported pandas version. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/17 00:43:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40459: [SPARK-42826][PS][DOCS] Add migration notes for update to supported pandas version. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/17 00:43:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40458: [SPARK-42824][CONNECT][PYTHON] Provide a clear error message for unsupported JVM attributes. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/17 01:15:31 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #40454: [SPARK-42821][SQL] Remove unused parameters in splitFiles methods - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/17 01:20:46 UTC, 1 replies.
- [GitHub] [spark] gatorsmile commented on a diff in pull request #40458: [SPARK-42824][CONNECT][PYTHON] Provide a clear error message for unsupported JVM attributes - posted by "gatorsmile (via GitHub)" <gi...@apache.org> on 2023/03/17 01:33:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40458: [SPARK-42824][CONNECT][PYTHON] Provide a clear error message for unsupported JVM attributes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/17 02:12:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40458: [SPARK-42824][CONNECT][PYTHON] Provide a clear error message for unsupported JVM attributes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/17 02:13:10 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40418: [SPARK-42790][SQL] Abstract the excluded method for better test for JDBC docker tests. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/17 02:36:39 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #40312: [SPARK-42695][SQL] Skew join handling in stream side of broadcast hash join - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/17 02:41:27 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40454: [SPARK-42821][SQL] Remove unused parameters in splitFiles methods - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/17 02:55:26 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #40461: [SPARK-42831][SQL] Show result expressions in AggregateExec - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/03/17 03:00:49 UTC, 0 replies.
- [GitHub] [spark] panbingkun closed pull request #40454: [SPARK-42821][SQL] Remove unused parameters in splitFiles methods - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/17 03:14:45 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40462: [SPARK-42832][SQL] Remove repartition if it is the child of LocalLimit - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/17 03:20:26 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40457: [SPARK-42823][SQL] spark-sql shell supports multipart namespaces for initialization - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/17 03:25:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40457: [SPARK-42823][SQL] spark-sql shell supports multipart namespaces for initialization - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 03:27:20 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40457: [SPARK-42823][SQL] `spark-sql` shell supports multipart namespaces for initialization - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 03:29:25 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40457: [SPARK-42823][SQL] `spark-sql` shell supports multipart namespaces for initialization - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/17 03:33:10 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40445: [SPARK-42814][BUILD] Upgrade maven plugins to latest versions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 03:33:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40457: [SPARK-42823][SQL] `spark-sql` shell supports multipart namespaces for initialization - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 03:33:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40440: [SPARK-42808][CORE] Avoid getting availableProcessors every time in `MapOutputTrackerMaster#getStatistics` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 03:36:03 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40456: [SPARK-42720][PS][SQL] Uses expression for distributed-sequence default index instead of plan - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 03:37:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40417: [SPARK-42778][SQL][3.4] QueryStageExec should respect supportsRowBased - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 03:39:20 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40421: [SPARK-42779][SQL] Allow V2 writes to indicate advisory shuffle partition size - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 03:41:27 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40361: [SPARK_42742]access apiserver by pod env - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 03:42:49 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40456: [SPARK-42720][PS][SQL] Uses expression for distributed-sequence default index instead of plan - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/17 03:43:19 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40457: [SPARK-42823][SQL] `spark-sql` shell supports multipart namespaces for initialization - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/17 03:50:28 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40421: [SPARK-42779][SQL] Allow V2 writes to indicate advisory shuffle partition size - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/17 04:26:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40418: [SPARK-42790][SQL] Abstract the excluded method for better test for JDBC docker tests. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/17 04:50:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40445: [SPARK-42814][BUILD] Upgrade maven plugins to latest versions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/17 04:57:53 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40393: [SPARK-40082] Schedule mergeFinalize when push merge shuffleMapStage retry but no running tasks - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/17 05:52:19 UTC, 7 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40463: [SPARK-42557][CONNECT][FOLLOWUP] Remove `broadcast` exclude `ProblemFilters` from mima check - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/17 06:07:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40449: [SPARK-42791][SQL] Create a new golden file test framework for analysis - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/17 06:23:18 UTC, 6 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40421: [SPARK-42779][SQL] Allow V2 writes to indicate advisory shuffle partition size - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/17 07:23:04 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40171: [SPARK-42598][TEST] Refactor TPCH schema to separate file similar to TPCDS for code reuse - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/17 07:46:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40446: [SPARK-42815][SQL] Subexpression elimination support shortcut expression - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/17 07:47:37 UTC, 7 replies.
- [GitHub] [spark] alkis opened a new pull request, #40464: [SPARK-XXXXX] scheduler micro opts - posted by "alkis (via GitHub)" <gi...@apache.org> on 2023/03/17 08:10:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38947: [SPARK-41233][SQL][PYTHON] Add `array_prepend` function - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/17 08:23:15 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40437: [SPARK-41259][SQL] SparkSQLDriver Output schema and result string should be consistent - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/17 08:24:40 UTC, 2 replies.
- [GitHub] [spark] LucaCanali commented on pull request #39127: [SPARK-41585][YARN] The Spark exclude node functionality for YARN should work independently of dynamic allocation - posted by "LucaCanali (via GitHub)" <gi...@apache.org> on 2023/03/17 08:44:03 UTC, 0 replies.
- [GitHub] [spark] kazuyukitanimura opened a new pull request, #40465: [SPARK-42833][SQL] Refactor `applyExtensions` in `SparkSession` - posted by "kazuyukitanimura (via GitHub)" <gi...@apache.org> on 2023/03/17 08:44:20 UTC, 0 replies.
- [GitHub] [spark] lsgrep commented on pull request #37738: add Support Java Class with circular references - posted by "lsgrep (via GitHub)" <gi...@apache.org> on 2023/03/17 08:46:13 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40463: [SPARK-42557][CONNECT][FOLLOWUP] Remove `broadcast` `ProblemFilters.exclude` rule from mima check - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/17 08:56:45 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #40462: [SPARK-42832][SQL] Remove repartition if it is the child of LocalLimit - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/17 09:07:12 UTC, 7 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40462: [SPARK-42832][SQL] Remove repartition if it is the child of LocalLimit - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/17 09:44:03 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40466: [SPARK-42835][SQL][TESTS] Add test cases for Column.explain - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/17 09:58:02 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40418: [SPARK-42790][SQL] Abstract the excluded method for better test for JDBC docker tests. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/17 10:00:25 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37738: add Support Java Class with circular references - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/17 10:49:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40456: [SPARK-42720][PS][SQL] Uses expression for distributed-sequence default index instead of plan - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/17 10:54:00 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40467: [WIP][SPARK-42584][CONNECT] Improve output of Column.explain - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/17 12:26:04 UTC, 0 replies.
- [GitHub] [spark] unical1988 opened a new pull request, #40468: changed error class name _LEGACY_ERROR_TEMP_2000 - posted by "unical1988 (via GitHub)" <gi...@apache.org> on 2023/03/17 15:17:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40464: [SPARK-XXXXX] scheduler micro opts - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/17 15:21:58 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40442: [SPARK-42809][BUILD] Upgrade scala-maven-plugin from 4.8.0 to 4.8.1 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/17 17:18:44 UTC, 4 replies.
- [GitHub] [spark] kazuyukitanimura commented on pull request #40465: [SPARK-42833][SQL] Refactor `applyExtensions` in `SparkSession` - posted by "kazuyukitanimura (via GitHub)" <gi...@apache.org> on 2023/03/17 17:38:44 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40465: [SPARK-42833][SQL] Refactor `applyExtensions` in `SparkSession` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 17:40:18 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40438: [SPARK-42806][CONNECT] Add `Catalog` support - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/17 17:54:05 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40463: [SPARK-42557][CONNECT][FOLLOWUP] Remove `broadcast` `ProblemFilters.exclude` rule from mima check - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/17 18:11:53 UTC, 1 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40469: [SPARK-42848][CONNECT][PYTHON] Implement DataFraem.registerTempTable - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/17 18:14:58 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40438: [SPARK-42806][CONNECT] Add `Catalog` support - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/17 18:43:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40438: [SPARK-42806][CONNECT] Add `Catalog` support - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/17 18:43:52 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40469: [SPARK-42848][CONNECT][PYTHON] Implement DataFraem.registerTempTable - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/17 18:50:45 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40463: [SPARK-42557][CONNECT][FOLLOWUP] Remove `broadcast` `ProblemFilters.exclude` rule from mima check - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/17 18:52:26 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40466: [SPARK-42835][SQL][TESTS] Add test cases for Column.explain - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/17 18:57:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40421: [SPARK-42779][SQL] Allow V2 writes to indicate advisory shuffle partition size - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 20:10:43 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40465: [SPARK-42833][SQL] Refactor `applyExtensions` in `SparkSession` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/17 20:13:22 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40470: [SPARK-41818][SPARK-41843][CONNECT][PYTHON][TESTS] Enable more parity tests - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/17 21:29:42 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #39950: [SPARK-42388][SQL] Avoid parquet footer reads twice when no filters in vectorized reader - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/03/17 21:55:24 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #40269: [DOC] Updating the Style for the Spark Docs based on the Webpage - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/17 22:01:09 UTC, 5 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40413: [SPARK-42786][Connect] Typed Select - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/17 22:13:36 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40269: [DOC] Updating the Style for the Spark Docs based on the Webpage - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/17 22:24:02 UTC, 0 replies.
- [GitHub] [spark] gerashegalov commented on pull request #40372: [SPARK-42752][PYSPARK][SQL] Make PySpark exceptions printable during initialization - posted by "gerashegalov (via GitHub)" <gi...@apache.org> on 2023/03/17 22:32:45 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #40402: [SPARK-42020][CONNECT][PYTHON] Support UserDefinedType in Spark Connect - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/17 22:43:39 UTC, 2 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40471: [SPARK-42850][SQL] Remove duplicated rule CombineFilters in Optimizer - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/17 23:05:42 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40471: [SPARK-42850][SQL] Remove duplicated rule CombineFilters in Optimizer - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/17 23:05:55 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40472: [SPARK-42247][CONNECT][PYTHON] Fix UserDefinedFunction to have returnType - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/17 23:35:36 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38948: [SPARK-41419][K8S] Decrement PVC_COUNTER when the pod deletion happens - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/18 00:18:46 UTC, 0 replies.
- [GitHub] [spark] rednaxelafx opened a new pull request, #40473: [SPARK-42851][SQL] Guard EquivalentExpressions.addExpr() with supportedExpression() - posted by "rednaxelafx (via GitHub)" <gi...@apache.org> on 2023/03/18 01:16:18 UTC, 0 replies.
- [GitHub] [spark] rednaxelafx commented on pull request #40473: [SPARK-42851][SQL] Guard EquivalentExpressions.addExpr() with supportedExpression() - posted by "rednaxelafx (via GitHub)" <gi...@apache.org> on 2023/03/18 01:25:36 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on pull request #40466: [SPARK-42835][SQL][TESTS] Add test cases for Column.explain - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/18 02:29:38 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40422: [SPARK-42803][CORE][SQL][ML] Use getParameterCount function instead of getParameterTypes.length - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/18 02:46:38 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40422: [SPARK-42803][CORE][SQL][ML] Use getParameterCount function instead of getParameterTypes.length - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/18 02:46:40 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40269: [DOC] Updating the Style for the Spark Docs based on the Webpage - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/18 05:18:29 UTC, 4 replies.
- [GitHub] [spark] srielau opened a new pull request, #40474: [SPARK-42849] [WIP] [SQL] Session Variables - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/03/18 06:13:42 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #40473: [SPARK-42851][SQL] Guard EquivalentExpressions.addExpr() with supportedExpression() - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/18 08:38:49 UTC, 1 replies.
- [GitHub] [spark] Kimahriman commented on pull request #40473: [SPARK-42851][SQL] Guard EquivalentExpressions.addExpr() with supportedExpression() - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/03/18 13:11:34 UTC, 1 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #40475: [SPARK-42852][SQL] Revert NamedLambdaVariable related changes from EquivalentExpressions - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/18 16:00:18 UTC, 0 replies.
- [GitHub] [spark] VindhyaG commented on pull request #40462: [SPARK-42832][SQL] Remove repartition if it is the child of LocalLimit - posted by "VindhyaG (via GitHub)" <gi...@apache.org> on 2023/03/18 17:38:49 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38951: [SPARK-41416][SQL] Rewrite self join in in predicate to aggregate - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/19 00:21:53 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38948: [SPARK-41419][K8S] Decrement PVC_COUNTER when the pod deletion happens - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/19 00:21:55 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/19 00:21:56 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38714: [WIP][SPARK-41141]. avoid introducing a new aggregate expression in the analysis phase when subquery is referencing it - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/19 00:21:57 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40476: [WIP][MINOR][BUILD] Remove unused properties in pom file - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/19 13:02:54 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40476: [WIP][MINOR][BUILD] Remove unused properties in pom file - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/19 13:08:16 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 opened a new pull request, #40477: [SPARK-42805][WIP] `DeduplicateRelations` rule show process `LOGICAL_RDD` - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/03/19 14:20:37 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #39775: [SPARK-42219][CORE] Introducing a config to close all active SparkContexts after the Main method has finished - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/19 14:29:43 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #40476: [WIP][MINOR][BUILD] Remove unused properties in pom file - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/19 15:27:46 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #40440: [SPARK-42808][CORE] Avoid getting availableProcessors every time in `MapOutputTrackerMaster#getStatistics` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/19 15:35:56 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/19 17:45:23 UTC, 8 replies.
- [GitHub] [spark] pan3793 commented on pull request #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/19 18:14:15 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #40269: [SPARK-42853][DOC] Updating the Style for the Spark Docs based on the Webpage - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/19 18:30:21 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40269: [SPARK-42853][DOC] Updating the Style for the Spark Docs based on the Webpage - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/19 18:34:00 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40269: [SPARK-42853][DOC] Updating the Style for the Spark Docs based on the Webpage - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/19 18:34:25 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40470: [SPARK-41818][SPARK-41843][CONNECT][PYTHON][TESTS] Enable more parity tests - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/19 18:42:21 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #40478: [SPARK-42779][SQL][FOLLOWUP] Allow V2 writes to indicate advisory shuffle partition size - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/03/19 19:40:46 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #40478: [SPARK-42779][SQL][FOLLOWUP] Allow V2 writes to indicate advisory shuffle partition size - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/03/19 19:41:07 UTC, 1 replies.
- [GitHub] [spark] alkis commented on pull request #40464: [MINOR] scheduler micro opts - posted by "alkis (via GitHub)" <gi...@apache.org> on 2023/03/19 23:46:11 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38951: [SPARK-41416][SQL] Rewrite self join in in predicate to aggregate - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/20 00:20:51 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/20 00:20:52 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38714: [WIP][SPARK-41141]. avoid introducing a new aggregate expression in the analysis phase when subquery is referencing it - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/20 00:20:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38661: [SPARK-41085][SQL] Support Bit manipulation function COUNTSET - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/20 00:20:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40402: [SPARK-42020][CONNECT][PYTHON] Support UserDefinedType in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:24:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40402: [SPARK-42020][CONNECT][PYTHON] Support UserDefinedType in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:24:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40469: [SPARK-42848][CONNECT][PYTHON] Implement DataFrame.registerTempTable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:25:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40469: [SPARK-42848][CONNECT][PYTHON] Implement DataFrame.registerTempTable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:25:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40470: [SPARK-41818][SPARK-41843][CONNECT][PYTHON][TESTS] Enable more parity tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:25:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40472: [SPARK-42247][CONNECT][PYTHON] Fix UserDefinedFunction to have returnType - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:26:00 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40470: [SPARK-41818][SPARK-41843][CONNECT][PYTHON][TESTS] Enable more parity tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:26:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40472: [SPARK-42247][CONNECT][PYTHON] Fix UserDefinedFunction to have returnType - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:29:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40464: [MINOR] scheduler micro opts - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:39:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40464: [MINOR] scheduler micro opts - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:39:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40470: [SPARK-41818][SPARK-41843][CONNECT][PYTHON][TESTS] Enable more parity tests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/20 00:42:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40466: [SPARK-42835][SQL][TESTS] Add test cases for `Column.explain` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:42:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40468: changed error class name _LEGACY_ERROR_TEMP_2000 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:43:00 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40466: [SPARK-42835][SQL][TESTS] Add test cases for `Column.explain` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/20 00:44:04 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40471: [SPARK-42850][SQL] Remove duplicated rule CombineFilters in Optimizer - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:47:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40471: [SPARK-42850][SQL] Remove duplicated rule CombineFilters in Optimizer - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:48:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40475: [SPARK-42852][SQL] Revert NamedLambdaVariable related changes from EquivalentExpressions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:54:13 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40475: [SPARK-42852][SQL] Revert NamedLambdaVariable related changes from EquivalentExpressions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 00:55:19 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #40479: [CONNECT][ML][WIP] Spark connect ml scala 1 - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/20 00:58:35 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40479: [CONNECT][ML][WIP] Spark connect ML for scala client - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/20 01:01:11 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40480: [SPARK-42508][BUILD][FOLLOW-UP] Exlcude org.apache.spark.ml.param.FloatParam$ for Scala 2.13 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 01:12:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40480: [SPARK-42508][BUILD][FOLLOW-UP] Exlcude org.apache.spark.ml.param.FloatParam$ for Scala 2.13 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 01:13:01 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40466: [SPARK-42835][SQL][TESTS] Add test cases for `Column.explain` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/20 01:24:33 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #40466: [SPARK-42835][SQL][TESTS] Add test cases for `Column.explain` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/20 01:24:43 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40481: [SPARK-42827][CONNECT] Support `functions#array_prepend` for Scala connect client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 01:47:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40468: changed error class name _LEGACY_ERROR_TEMP_2000 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 02:14:26 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #40476: [WIP][MINOR][BUILD] Remove unused properties in pom file - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/20 02:21:03 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #40476: [MINOR][BUILD] Remove unused properties in pom file - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/20 02:25:58 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40462: [SPARK-42832][SQL] Remove repartition if it is the child of LocalLimit - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/20 02:28:04 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40405: [SPARK-42340][CONNECT][PYTHON] Implement Grouped Map API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/20 02:33:41 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40405: [SPARK-42340][CONNECT][PYTHON] Implement Grouped Map API - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/20 02:37:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40480: [SPARK-42508][BUILD][FOLLOW-UP] Exlcude org.apache.spark.ml.param.FloatParam$ for Scala 2.13 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 02:39:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40481: [SPARK-42827][CONNECT] Support `functions#array_prepend` for Scala connect client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 04:46:49 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #40468: changed error class name _LEGACY_ERROR_TEMP_2000 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/20 04:55:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40482: Revert "[SPARK-42809][BUILD] Upgrade scala-maven-plugin from 4.8.0 to 4.8.1" - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 05:21:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40482: Revert "[SPARK-42809][BUILD] Upgrade scala-maven-plugin from 4.8.0 to 4.8.1" - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 05:28:23 UTC, 1 replies.
- [GitHub] [spark] gatorsmile commented on pull request #39691: [SPARK-31561][SQL] Add QUALIFY clause - posted by "gatorsmile (via GitHub)" <gi...@apache.org> on 2023/03/20 06:45:45 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40483: [MINOR][TEST] Fix spelling of 'regex' for RegexFilter - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/20 07:42:43 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #40482: Revert "[SPARK-42809][BUILD] Upgrade scala-maven-plugin from 4.8.0 to 4.8.1" - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/20 08:24:24 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40482: Revert "[SPARK-42809][BUILD] Upgrade scala-maven-plugin from 4.8.0 to 4.8.1" - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/20 08:25:02 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #40393: [SPARK-40082] Schedule mergeFinalize when push merge shuffleMapStage retry but no running tasks - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/20 08:48:23 UTC, 3 replies.
- [GitHub] [spark] Stove-hust commented on a diff in pull request #40393: [SPARK-40082] Schedule mergeFinalize when push merge shuffleMapStage retry but no running tasks - posted by "Stove-hust (via GitHub)" <gi...@apache.org> on 2023/03/20 08:49:33 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40481: [SPARK-42827][CONNECT] Support `functions#array_prepend` for Scala connect client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/20 08:50:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40481: [SPARK-42827][CONNECT] Support `functions#array_prepend` for Scala connect client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/20 08:50:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40479: [CONNECT][ML][WIP] Spark connect ML for scala client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/20 09:07:25 UTC, 3 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40484: [SPARK-42868][SQL] Support eliminate sorts in AQE Optimizer - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/20 09:11:40 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #40475: [SPARK-42852][SQL] Revert NamedLambdaVariable related changes from EquivalentExpressions - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/20 09:20:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40485: [SPARK-42870][CONNECT] Move `toCatalystValue` to `connect-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/20 10:27:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40483: [MINOR][TEST] Fix spelling of 'regex' for RegexFilter - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 10:41:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40483: [MINOR][TEST] Fix spelling of 'regex' for RegexFilter - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 10:41:39 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40467: [SPARK-42584][CONNECT] Improve output of `Column.explain` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/20 10:57:55 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40405: [SPARK-42340][CONNECT][PYTHON] Implement Grouped Map API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 11:03:52 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40405: [SPARK-42340][CONNECT][PYTHON] Implement Grouped Map API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 11:04:16 UTC, 0 replies.
- [GitHub] [spark] lyy-pineapple commented on a diff in pull request #38171: [SPARK-9213] [SQL] Improve regular expression performance (via joni) - posted by "lyy-pineapple (via GitHub)" <gi...@apache.org> on 2023/03/20 11:06:02 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40466: [SPARK-42835][SQL][TESTS] Add test cases for `Column.explain` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/20 11:16:28 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40486: [SPARK-42340][CONNECT][PYTHON] Implement Grouped Map API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/20 11:19:46 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40486: [SPARK-42340][CONNECT][PYTHON] Implement Grouped Map API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/20 11:22:41 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40487: [WIP] Implement CoGrouped Map API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/20 11:26:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40478: [SPARK-42779][SQL][FOLLOWUP] Allow V2 writes to indicate advisory shuffle partition size - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/20 11:28:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40478: [SPARK-42779][SQL][FOLLOWUP] Allow V2 writes to indicate advisory shuffle partition size - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/20 11:28:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39239: [SPARK-41730][PYTHON] Set tz to UTC while converting of timestamps to python's datetime - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/20 11:42:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40449: [SPARK-42791][SQL] Create a new golden file test framework for analysis - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/20 11:49:18 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #40449: [SPARK-42791][SQL] Create a new golden file test framework for analysis - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/20 11:50:03 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40484: [SPARK-42868][SQL] Support eliminate sorts in AQE Optimizer - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/20 11:58:21 UTC, 0 replies.
- [GitHub] [spark] jiamin13579 commented on pull request #26433: [SPARK-29771][K8S] Add configure to limit executor failures - posted by "jiamin13579 (via GitHub)" <gi...@apache.org> on 2023/03/20 11:59:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40456: [SPARK-42720][PS][SQL] Uses expression for distributed-sequence default index instead of plan - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 12:16:43 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #40488: [SPARK-42851][SQL] Replace EquivalentExpressions with mutable map in PhysicalAggregation - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/20 12:32:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40489: [SPARK-42871][BUILD] Upgrade slf4j to 2.0.7 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 12:43:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40490: [SPARK-42536][BUILD] Upgrade log4j2 to 2.20.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 12:49:50 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #40473: [SPARK-42851][SQL] Guard EquivalentExpressions.addExpr() with supportedExpression() - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/20 12:53:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40465: [SPARK-42833][SQL] Refactor `applyExtensions` in `SparkSession` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/20 12:59:00 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40463: [SPARK-42557][CONNECT][FOLLOWUP] Remove `broadcast` `ProblemFilters.exclude` rule from mima check - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/20 13:04:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40487: [WIP] Implement CoGrouped Map API - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 13:08:29 UTC, 1 replies.
- [GitHub] [spark] DHKold opened a new pull request, #40491: [SPARK-41006] Generate new ConfigMap names for each run - posted by "DHKold (via GitHub)" <gi...@apache.org> on 2023/03/20 13:38:53 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40485: [SPARK-42870][CONNECT] Move `toCatalystValue` to `connect-common` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 13:39:27 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40476: [MINOR][BUILD] Remove unused properties in pom file - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/20 13:41:36 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40476: [MINOR][BUILD] Remove unused properties in pom file - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/20 13:41:57 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40418: [SPARK-42790][SQL] Abstract the excluded method for better test for JDBC docker tests. - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/20 13:46:39 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40418: [SPARK-42790][SQL] Abstract the excluded method for better test for JDBC docker tests. - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/20 13:46:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40449: [SPARK-42791][SQL] Create a new golden file test framework for analysis - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 15:14:41 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40492: [SPARK-42791][SQL][FOLLOWUP] Re-generate golden files for `array_prepend` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 15:29:26 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #40488: [SPARK-42851][SQL] Replace EquivalentExpressions with mutable map in PhysicalAggregation - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/20 15:32:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40492: [SPARK-42791][SQL][FOLLOWUP] Re-generate golden files for `array_prepend` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/20 15:33:02 UTC, 2 replies.
- [GitHub] [spark] ruilibuaa opened a new pull request, #40493: modified for SPARK-42839: Assign a name to the error class _LEGACY_ER… - posted by "ruilibuaa (via GitHub)" <gi...@apache.org> on 2023/03/20 16:13:56 UTC, 0 replies.
- [GitHub] [spark] sudoliyang opened a new pull request, #40494: [MINOR][DOCS] Fix typos - posted by "sudoliyang (via GitHub)" <gi...@apache.org> on 2023/03/20 16:16:28 UTC, 0 replies.
- [GitHub] [spark] yabola opened a new pull request, #40495: only test for reading footer within file range - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/03/20 16:30:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40478: [SPARK-42779][SQL][FOLLOWUP] Allow V2 writes to indicate advisory shuffle partition size - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/20 16:42:40 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40467: [SPARK-42584][CONNECT] Improve output of `Column.explain` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/20 16:46:58 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40467: [SPARK-42584][CONNECT] Improve output of `Column.explain` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/20 16:48:40 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40494: [MINOR][DOCS] Fix typos - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/20 17:18:34 UTC, 0 replies.
- [GitHub] [spark] otterc commented on a diff in pull request #40393: [SPARK-40082] Schedule mergeFinalize when push merge shuffleMapStage retry but no running tasks - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/03/20 17:34:28 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40493: modified for SPARK-42839: Assign a name to the error class _LEGACY_ER… - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/20 18:11:01 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #40496: [SPARK-42874][SQL] Enable new golden file test framework for analysis for all input files - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/20 18:20:45 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40467: [SPARK-42584][CONNECT] Improve output of `Column.explain` - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/20 18:33:43 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40492: [SPARK-42791][SQL][FOLLOWUP] Re-generate golden files for `array_prepend` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/20 18:54:09 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40492: [SPARK-42791][SQL][FOLLOWUP] Re-generate golden files for `array_prepend` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/20 18:55:55 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #39127: [SPARK-41585][YARN] The Spark exclude node functionality for YARN should work independently of dynamic allocation - posted by "asfgit (via GitHub)" <gi...@apache.org> on 2023/03/20 19:02:16 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #39127: [SPARK-41585][YARN] The Spark exclude node functionality for YARN should work independently of dynamic allocation - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2023/03/20 19:02:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40485: [SPARK-42870][CONNECT] Move `toCatalystValue` to `connect-common` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 19:56:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40485: [SPARK-42870][CONNECT] Move `toCatalystValue` to `connect-common` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 19:56:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40496: [SPARK-42874][SQL] Enable new golden file test framework for analysis for all input files - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/20 20:03:45 UTC, 2 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40497: [SPARK-42875][CONNECT][PYTHON] Fix toPandas to handle timezone and map types properly - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/20 20:08:41 UTC, 0 replies.
- [GitHub] [spark] JohnTortugo commented on pull request #40225: [SPARK-42625][BUILD] Upgrade `zstd-jni` to 1.5.4-2 - posted by "JohnTortugo (via GitHub)" <gi...@apache.org> on 2023/03/20 22:09:20 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40498: [WIP] reader table API could also accept options - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/20 22:58:15 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40498: [WIP] reader table API could also accept options - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/20 23:10:45 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40498: [WIP] reader table API could also accept options - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/20 23:43:47 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40498: [WIP] reader table API could also accept options - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/20 23:55:17 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40498: [WIP] reader table API could also accept options - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/20 23:59:03 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38608: [SPARK-41080][SQL] Support Bit manipulation function SETBIT - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/21 00:18:40 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38661: [SPARK-41085][SQL] Support Bit manipulation function COUNTSET - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/21 00:18:40 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38534: [SPARK-38505][SQL] Make partial aggregation adaptive - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/21 00:18:43 UTC, 0 replies.
- [GitHub] [spark] rednaxelafx commented on pull request #40488: [SPARK-42851][SQL] Replace EquivalentExpressions with mutable map in PhysicalAggregation - posted by "rednaxelafx (via GitHub)" <gi...@apache.org> on 2023/03/21 00:38:10 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40499: [SPARK-42876][SQL] DataType's physicalDataType should be private[sql] - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/21 01:31:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40473: [SPARK-42851][SQL] Guard EquivalentExpressions.addExpr() with supportedExpression() - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/21 01:35:43 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40497: [SPARK-42875][CONNECT][PYTHON] Fix toPandas to handle timezone and map types properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/21 01:43:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40497: [SPARK-42875][CONNECT][PYTHON] Fix toPandas to handle timezone and map types properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/21 01:44:08 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #40487: [WIP] Implement CoGrouped Map API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/21 01:48:09 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40476: [MINOR][BUILD] Remove unused properties in pom file - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/21 01:49:29 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40467: [SPARK-42584][CONNECT] Improve output of `Column.explain` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/21 01:58:45 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40488: [SPARK-42851][SQL] Replace EquivalentExpressions with mutable map in PhysicalAggregation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/21 02:28:38 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40499: [SPARK-42876][SQL] DataType's physicalDataType should be private[sql] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/21 02:33:03 UTC, 0 replies.
- [GitHub] [spark] yabola commented on a diff in pull request #40495: test for reading footer within file range - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/03/21 02:39:59 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40499: [SPARK-42876][SQL] DataType's physicalDataType should be private[sql] - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/21 02:56:55 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40500: [WIP][ML][3.4] Make `IsotonicRegression.PointsAccumulator` private - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/21 03:17:34 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40270: [WIP][SPARK-42662][CONNECT][PYTHON][PS] Support `withSequenceColumn` as PySpark DataFrame internal function. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/21 03:41:15 UTC, 1 replies.
- [GitHub] [spark] itholic closed pull request #40270: [WIP][SPARK-42662][CONNECT][PYTHON][PS] Support `withSequenceColumn` as PySpark DataFrame internal function. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/21 03:41:16 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40500: [WIP][ML][3.4] Make `IsotonicRegression.PointsAccumulator` private - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/21 03:45:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40500: [WIP][ML][3.4] Make `IsotonicRegression.PointsAccumulator` private - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/21 03:45:50 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40485: [SPARK-42870][CONNECT] Move `toCatalystValue` to `connect-common` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/21 04:40:59 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40500: [SPARK-42864][ML][3.4] Make `IsotonicRegression.PointsAccumulator` private - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/21 04:48:29 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40501: [SPARK-42864][ML] Make IsotonicRegression.PointsAccumulator private - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/21 04:53:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40496: [SPARK-42874][SQL] Enable new golden file test framework for analysis for all input files - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/21 04:56:52 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #40496: [SPARK-42874][SQL] Enable new golden file test framework for analysis for all input files - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/21 04:58:27 UTC, 3 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #40495: test for reading footer within file range - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/03/21 05:13:24 UTC, 1 replies.
- [GitHub] [spark] rednaxelafx commented on a diff in pull request #40488: [SPARK-42851][SQL] Replace EquivalentExpressions with mutable map in PhysicalAggregation - posted by "rednaxelafx (via GitHub)" <gi...@apache.org> on 2023/03/21 05:21:15 UTC, 0 replies.
- [GitHub] [spark] sudoliyang commented on pull request #40494: [MINOR][DOCS] Fix typos - posted by "sudoliyang (via GitHub)" <gi...@apache.org> on 2023/03/21 05:21:30 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40500: [SPARK-42864][ML][3.4] Make `IsotonicRegression.PointsAccumulator` private - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/21 05:27:51 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #40500: [SPARK-42864][ML][3.4] Make `IsotonicRegression.PointsAccumulator` private - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/21 05:34:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40501: [SPARK-42864][ML] Make IsotonicRegression.PointsAccumulator private - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/21 05:39:28 UTC, 0 replies.
- [GitHub] [spark] yliou opened a new pull request, #40502: [SPARK-42829] [UI] add repeat identifier to cached RDD on stage page - posted by "yliou (via GitHub)" <gi...@apache.org> on 2023/03/21 05:43:32 UTC, 0 replies.
- [GitHub] [spark] yliou opened a new pull request, #40503: [SPARK-42830] [UI] Link skipped stages on Spark UI - posted by "yliou (via GitHub)" <gi...@apache.org> on 2023/03/21 06:04:26 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40498: [SPARK-42878][CONNECT] The table API in DataFrameReader could also accept options - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/21 06:12:29 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on pull request #40498: [SPARK-42878][CONNECT] The table API in DataFrameReader could also accept options - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/21 06:12:42 UTC, 2 replies.
- [GitHub] [spark] MaxGekk closed pull request #39332: [WIP][SPARK-40822][SQL] Stable derived column aliases - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/21 06:14:34 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40126: [SPARK-40822][SQL] Stable derived column aliases - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/21 06:14:35 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #40488: [SPARK-42851][SQL] Replace EquivalentExpressions with mutable map in PhysicalAggregation - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/21 07:10:20 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40498: [SPARK-42878][CONNECT] The table API in DataFrameReader could also accept options - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/21 07:21:56 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on a diff in pull request #40341: [WIP][SPARK-42715][SQL] Tips for Optimizing NegativeArraySizeException - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/03/21 07:29:17 UTC, 0 replies.
- [GitHub] [spark] frankliee opened a new pull request, #40504: [SPARK-42880] Update running-on-yarn.md for log4j2 - posted by "frankliee (via GitHub)" <gi...@apache.org> on 2023/03/21 07:30:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40504: [SPARK-42880] Update running-on-yarn.md to log4j2 syntax - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/21 07:32:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40494: [MINOR][DOCS] Fix typos - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/21 07:36:39 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40486: [SPARK-42340][CONNECT][PYTHON][3.4] Implement Grouped Map API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/21 07:42:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40505: [MINOR][DOCS] Remove SparkSession constructor invocation in the example - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/21 07:49:28 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #40505: [MINOR][DOCS] Remove SparkSession constructor invocation in the example - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/03/21 07:53:09 UTC, 0 replies.
- [GitHub] [spark] frankliee commented on pull request #40504: [SPARK-42880] Update running-on-yarn.md to log4j2 syntax - posted by "frankliee (via GitHub)" <gi...@apache.org> on 2023/03/21 07:54:00 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40506: [SPARK-42881][SQL] get_json_object Codegen Support - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/21 07:56:54 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #37588: [SPARK-33393][SQL] Support SHOW TABLE EXTENDED in v2 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/21 08:09:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/21 08:15:52 UTC, 2 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/03/21 08:15:57 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #37360: [SPARK-39931][PYTHON][WIP] Improve applyInPandas performance for very small groups - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/03/21 08:17:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40499: [SPARK-42876][SQL] DataType's physicalDataType should be private[sql] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/21 08:26:45 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #40499: [SPARK-42876][SQL] DataType's physicalDataType should be private[sql] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/21 08:27:00 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #40334: [SPARK-42716][SQL] DataSourceV2 supports reporting key-grouped partitioning without HasPartitionKey - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/03/21 08:29:44 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #40408: [SPARK-42780][BUILD] Upgrade `Tink` to 1.8.0 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/21 08:52:54 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40507: [SPARK-42662][CONNECT][PS] Add `_distributed_sequence_id` for distributed-sequence index. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/21 09:03:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40507: [SPARK-42662][CONNECT][PS] Add `_distributed_sequence_id` for distributed-sequence index. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/21 09:35:34 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40486: [SPARK-42340][CONNECT][PYTHON][3.4] Implement Grouped Map API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/21 09:59:37 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #40486: [SPARK-42340][CONNECT][PYTHON][3.4] Implement Grouped Map API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/21 09:59:38 UTC, 0 replies.
- [GitHub] [spark] zwangsheng commented on pull request #40118: [SPARK-26365][K8S] In kuberentes cluster mode, spark submit should pass driver exit code - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/03/21 10:03:34 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40507: [SPARK-42662][CONNECT][PS] Add `_distributed_sequence_id` for distributed-sequence index. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/21 10:54:21 UTC, 9 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/21 11:22:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40507: [SPARK-42662][CONNECT][PS] Add `_distributed_sequence_id` for distributed-sequence index. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/21 11:24:44 UTC, 0 replies.
- [GitHub] [spark] vkn1234 commented on pull request #40034: [SPARK-42447][INFRA] Remove Hadoop 2 GitHub Action job - posted by "vkn1234 (via GitHub)" <gi...@apache.org> on 2023/03/21 11:32:15 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/21 11:54:32 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #40505: [MINOR][DOCS] Remove SparkSession constructor invocation in the example - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/21 12:02:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40504: [SPARK-42880] Update running-on-yarn.md to log4j2 syntax - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/21 12:11:19 UTC, 1 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #40508: [MINOR][SQL][CONNECT][PYTHON] Clarify the comment of parameterized SQL args - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/21 12:47:56 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40508: [MINOR][SQL][CONNECT][PYTHON] Clarify the comment of parameterized SQL args - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/21 12:48:57 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 commented on pull request #40477: [SPARK-42805]`DeduplicateRelations` rule show process `LOGICAL_RDD` - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/03/21 13:15:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40473: [SPARK-42851][SQL] Guard EquivalentExpressions.addExpr() with supportedExpression() - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/21 13:28:05 UTC, 0 replies.
- [GitHub] [spark] peter-toth closed pull request #40488: [SPARK-42851][SQL] Replace EquivalentExpressions with mutable map in PhysicalAggregation - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/21 13:31:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40495: test reading footer within file range - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/21 14:21:11 UTC, 2 replies.
- [GitHub] [spark] srowen closed pull request #40490: [SPARK-42536][BUILD] Upgrade log4j2 to 2.20.0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/21 14:45:42 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40490: [SPARK-42536][BUILD] Upgrade log4j2 to 2.20.0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/21 14:45:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40490: [SPARK-42536][BUILD] Upgrade log4j2 to 2.20.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/21 14:46:11 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #40489: [SPARK-42871][BUILD] Upgrade slf4j to 2.0.7 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/21 14:46:32 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40440: [SPARK-42808][CORE] Avoid getting availableProcessors every time in `MapOutputTrackerMaster#getStatistics` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/21 14:57:19 UTC, 0 replies.
- [GitHub] [spark] yabola commented on a diff in pull request #40495: test reading footer within file range - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/03/21 14:57:59 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40440: [SPARK-42808][CORE] Avoid getting availableProcessors every time in `MapOutputTrackerMaster#getStatistics` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/21 14:58:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40489: [SPARK-42871][BUILD] Upgrade slf4j to 2.0.7 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/21 15:06:09 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40508: [MINOR][SQL][CONNECT][PYTHON] Clarify the comment of parameterized SQL args - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/21 15:06:35 UTC, 2 replies.
- [GitHub] [spark] vicennial commented on pull request #40368: [SPARK-42748][CONNECT] Server-side Artifact Management - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/03/21 15:19:22 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40508: [MINOR][SQL][CONNECT][PYTHON] Clarify the comment of parameterized SQL args - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/21 15:24:37 UTC, 5 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40504: [SPARK-42880][DOCS] Update running-on-yarn.md to log4j2 syntax - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/21 15:52:15 UTC, 1 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #38518: [SPARK-33349][K8S] Reset the executor pods watcher when we receive a version changed from k8s - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/03/21 15:54:20 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40498: [SPARK-42878][CONNECT] The table API in DataFrameReader could also accept options - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/21 16:05:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40444: [SPARK-42813][K8S] Print application info when waitAppCompletion is false - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/21 16:08:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40444: [SPARK-42813][K8S] Print application info when waitAppCompletion is false - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/21 16:09:30 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40509: [SPARK-42885][K8S][BUILD] Upgrade `kubernetes-client` to 6.5.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/21 16:15:04 UTC, 0 replies.
- [GitHub] [spark] unical1988 commented on pull request #40468: "[SPARK-42838][SQL] changed error class name _LEGACY_ERROR_TEMP_2000" - posted by "unical1988 (via GitHub)" <gi...@apache.org> on 2023/03/21 16:29:08 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #40508: [MINOR][SQL][CONNECT][PYTHON] Clarify the comment of parameterized SQL args - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/03/21 16:35:39 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40447: [SPARK-42816][CONNECT] Support Max Message size up to 128MB - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/21 17:05:25 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40509: [SPARK-42885][K8S][BUILD] Upgrade `kubernetes-client` to 6.5.1 - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/21 17:07:45 UTC, 0 replies.
- [GitHub] [spark] simonvanderveldt commented on pull request #38518: [SPARK-33349][K8S] Reset the executor pods watcher when we receive a version changed from k8s - posted by "simonvanderveldt (via GitHub)" <gi...@apache.org> on 2023/03/21 17:15:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40509: [SPARK-42885][K8S][BUILD] Upgrade `kubernetes-client` to 6.5.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/21 19:17:41 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40440: [SPARK-42808][CORE] Avoid getting availableProcessors every time in `MapOutputTrackerMaster#getStatistics` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/21 20:13:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40509: [SPARK-42885][K8S][BUILD] Upgrade `kubernetes-client` to 6.5.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/21 20:53:45 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40490: [SPARK-42536][BUILD] Upgrade log4j2 to 2.20.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/21 20:55:49 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #40490: [SPARK-42536][BUILD] Upgrade log4j2 to 2.20.0 - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/21 20:57:57 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #39294: [SPARK-41537][INFRA][TESTS] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/21 21:28:38 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39294: [SPARK-41537][INFRA][TESTS] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/21 21:31:49 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on pull request #39294: [SPARK-41537][INFRA][TESTS] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/21 21:40:34 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40496: [SPARK-42874][SQL] Enable new golden file test framework for analysis for all input files - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/21 21:45:30 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #40496: [SPARK-42874][SQL] Enable new golden file test framework for analysis for all input files - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/21 22:12:37 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40510: [SPARK-42889][CONNECT][PYTHON] Implement cache, persist, unpersist, and storageLevel - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/21 22:41:46 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40498: [SPARK-42878][CONNECT] The table API in DataFrameReader could also accept options - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/21 22:55:40 UTC, 2 replies.
- [GitHub] [spark] cnauroth opened a new pull request, #40511: [SPARK-42888][BUILD] Upgrade GCS connector from 2.2.7 to 2.2.11. - posted by "cnauroth (via GitHub)" <gi...@apache.org> on 2023/03/21 23:04:13 UTC, 0 replies.
- [GitHub] [spark] cnauroth commented on pull request #40511: [SPARK-42888][BUILD] Upgrade GCS connector from 2.2.7 to 2.2.11. - posted by "cnauroth (via GitHub)" <gi...@apache.org> on 2023/03/21 23:08:56 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38608: [SPARK-41080][SQL] Support Bit manipulation function SETBIT - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/22 00:17:29 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38534: [SPARK-38505][SQL] Make partial aggregation adaptive - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/22 00:17:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40505: [MINOR][DOCS] Remove SparkSession constructor invocation in the example - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 00:19:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40505: [MINOR][DOCS] Remove SparkSession constructor invocation in the example - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 00:19:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40507: [SPARK-42662][CONNECT][PS] Add proto message for pandas API on Spark default index - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 00:20:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40505: [MINOR][DOCS] Remove SparkSession constructor invocation in the example - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 00:20:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40507: [SPARK-42662][CONNECT][PS] Add proto message for pandas API on Spark default index - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 00:20:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40468: [SPARK-42838][SQL] changed error class name _LEGACY_ERROR_TEMP_2000 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 00:23:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40489: [SPARK-42871][BUILD] Upgrade slf4j to 2.0.7 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 00:24:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40489: [SPARK-42871][BUILD] Upgrade slf4j to 2.0.7 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 00:24:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40511: [SPARK-42888][BUILD] Upgrade `gcs-connector` to 2.2.11 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 01:00:45 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40511: [SPARK-42888][BUILD] Upgrade `gcs-connector` to 2.2.11 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 01:03:36 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #40511: [SPARK-42888][BUILD] Upgrade `gcs-connector` to 2.2.11 - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/03/22 01:13:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40447: [SPARK-42816][CONNECT] Support Max Message size up to 128MB - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 01:13:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40447: [SPARK-42816][CONNECT] Support Max Message size up to 128MB - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 01:13:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40510: [SPARK-42889][CONNECT][PYTHON] Implement cache, persist, unpersist, and storageLevel - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 01:18:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40510: [SPARK-42889][CONNECT][PYTHON] Implement cache, persist, unpersist, and storageLevel - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 01:18:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40494: [MINOR][DOCS] Fix typos - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 01:20:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40034: [SPARK-42447][INFRA] Remove Hadoop 2 GitHub Action job - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 01:24:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40385: [SPARK-42753] ReusedExchange refers to non-existent nodes - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/22 01:28:31 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40512: [SPARK-42892][SQL] Move sameType and relevant methods out of DataType - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/22 01:33:38 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40512: [SPARK-42892][SQL] Move sameType and relevant methods out of DataType - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/22 01:33:47 UTC, 0 replies.
- [GitHub] [spark] frankliee commented on a diff in pull request #40504: [SPARK-42880][DOCS] Update running-on-yarn.md to log4j2 syntax - posted by "frankliee (via GitHub)" <gi...@apache.org> on 2023/03/22 01:59:23 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40467: [SPARK-42584][CONNECT] Improve output of `Column.explain` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/22 02:13:00 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40513: Block Arrow-optimized Python UDFs - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/22 02:14:41 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #40393: [SPARK-40082] Schedule mergeFinalize when push merge shuffleMapStage retry but no running tasks - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/22 02:22:07 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #40496: [SPARK-42874][SQL] Enable new golden file test framework for analysis for all input files - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/22 02:26:53 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40496: [SPARK-42874][SQL] Enable new golden file test framework for analysis for all input files - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/22 02:37:40 UTC, 2 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40514: [SPARK-41233][CONNECT][PYTHON] Add array_prepend to Spark Connect Python client - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/22 02:44:58 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40515: [SPARK-42884][CONNECT] Add Ammonite REPL integration - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/22 03:06:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40368: [SPARK-42748][CONNECT] Server-side Artifact Management - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/22 03:33:16 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40413: [SPARK-42786][Connect] Typed Select - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/22 03:33:42 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40413: [SPARK-42786][Connect] Typed Select - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/22 03:34:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40514: [SPARK-41233][CONNECT][PYTHON] Add array_prepend to Spark Connect Python client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/22 03:55:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40397: [SPARK-42052][SQL] Codegen Support for HiveSimpleUDF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/22 04:14:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40397: [SPARK-42052][SQL] Codegen Support for HiveSimpleUDF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/22 04:15:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40512: [SPARK-42892][SQL] Move sameType and relevant methods out of DataType - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/22 04:18:58 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #40504: [SPARK-42880][DOCS] Update running-on-yarn.md to log4j2 syntax - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/22 04:33:32 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40504: [SPARK-42880][DOCS] Update running-on-yarn.md to log4j2 syntax - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/22 04:33:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40516: [SPARK-42894][CONNECT] Support `cache`/`persist`/`unpersist`/`storageLevel` for Scala connect client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/22 05:30:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40516: [SPARK-42894][CONNECT] Support `cache`/`persist`/`unpersist`/`storageLevel` for Spark connect jvm client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/22 05:40:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40516: [SPARK-42894][CONNECT] Support `cache`/`persist`/`unpersist`/`storageLevel` for Spark connect jvm client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/22 05:40:02 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40510: [SPARK-42889][CONNECT][PYTHON] Implement cache, persist, unpersist, and storageLevel - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/22 05:56:52 UTC, 1 replies.
- [GitHub] [spark] panbingkun closed pull request #40316: [SPARK-42679][CONNECT][PYTHON] createDataFrame doesn't work with non-nullable schema - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/22 06:19:05 UTC, 0 replies.
- [GitHub] [spark] yb12138 commented on a diff in pull request #40128: [SPARK-42466][K8S]: Cleanup k8s upload directory when job terminates - posted by "yb12138 (via GitHub)" <gi...@apache.org> on 2023/03/22 06:28:15 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40517: Revert "[SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common`" - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/22 06:37:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40517: Revert "[SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common`" - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/22 06:38:04 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40514: [SPARK-41233][CONNECT][PYTHON] Add array_prepend to Spark Connect Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 07:04:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40514: [SPARK-41233][CONNECT][PYTHON] Add array_prepend to Spark Connect Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 07:05:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40513: [SPARK-42893][PYTHON][3.4] Block Arrow-optimized Python UDFs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 07:05:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40513: [SPARK-42893][PYTHON][3.4] Block Arrow-optimized Python UDFs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 07:05:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40496: [SPARK-42874][SQL] Enable new golden file test framework for analysis for all input files - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 07:06:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40510: [SPARK-42889][CONNECT][PYTHON] Implement cache, persist, unpersist, and storageLevel - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 07:08:18 UTC, 0 replies.
- [GitHub] [spark] mskapilks commented on a diff in pull request #40266: [SPARK-42660][SQL] Infer filters for Join produced by IN and EXISTS clause (RewritePredicateSubquery rule) - posted by "mskapilks (via GitHub)" <gi...@apache.org> on 2023/03/22 07:31:15 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40508: [MINOR][SQL][CONNECT][PYTHON] Clarify the comment of parameterized SQL args - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/22 07:49:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40518: [SPARK-42889][CONNECT][FOLLOWUP] Move `StorageLevel` into a separate file to avoid potential file recursively imports - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/22 08:25:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40518: [SPARK-42889][CONNECT][PYTHON][FOLLOWUP] Move `StorageLevel` into a separate file to avoid potential file recursively imports - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/22 08:30:22 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40517: Revert "[SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common`" - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/22 09:09:16 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #40266: [SPARK-42660][SQL] Infer filters for Join produced by IN and EXISTS clause (RewritePredicateSubquery rule) - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/03/22 09:38:09 UTC, 3 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #40334: [SPARK-42716][SQL] DataSourceV2 supports reporting key-grouped partitioning without HasPartitionKey - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/22 10:25:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40519: [SPARK-42864][ML] Make IsotonicRegression.PointsAccumulator private - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/22 11:26:58 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #40520: [SPARK-42896][SQL][PYSPARK] Make `mapInPandas` / mapInArrow` support barrier mode execution - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/22 12:05:33 UTC, 0 replies.
- [GitHub] [spark] chong0929 opened a new pull request, #40521: [MINOR][DOCS][PYTHON] Update some urls about deprecated repository pyspark.pandas - posted by "chong0929 (via GitHub)" <gi...@apache.org> on 2023/03/22 13:05:04 UTC, 0 replies.
- [GitHub] [spark] chong0929 commented on pull request #40521: [MINOR][DOCS][PYTHON] Update some urls about deprecated repository pyspark.pandas - posted by "chong0929 (via GitHub)" <gi...@apache.org> on 2023/03/22 13:06:46 UTC, 1 replies.
- [GitHub] [spark] clownxc commented on pull request #40400: [SPARK-41359][SQL] Use `PhysicalDataType` instead of DataType in UnsafeRow - posted by "clownxc (via GitHub)" <gi...@apache.org> on 2023/03/22 13:09:07 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40518: [SPARK-42889][CONNECT][PYTHON][FOLLOWUP] Move `StorageLevel` into a separate file to avoid potential file recursively imports - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 13:14:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40520: [SPARK-42896][SQL][PYSPARK] Make `mapInPandas` / `mapInArrow` support barrier mode execution - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/22 13:19:39 UTC, 4 replies.
- [GitHub] [spark] panbingkun commented on pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/22 13:27:37 UTC, 2 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40520: [SPARK-42896][SQL][PYSPARK] Make `mapInPandas` / `mapInArrow` support barrier mode execution - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/22 13:42:22 UTC, 7 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #40522: [SPARK-42101][SQL][FOLLOWUP] Make QueryStageExec.resultOption and isMeterialized consistent - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/22 13:52:40 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40522: [SPARK-42101][SQL][FOLLOWUP] Make QueryStageExec.resultOption and isMeterialized consistent - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/22 13:53:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40522: [SPARK-42101][SQL][FOLLOWUP] Make QueryStageExec.resultOption and isMeterialized consistent - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/22 13:54:24 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40446: [SPARK-42815][SQL] Subexpression elimination support shortcut expression - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/22 14:18:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40446: [SPARK-42815][SQL] Subexpression elimination support shortcut expression - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/22 14:18:52 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #40523: [SPARK-42897][SQL] Avoid evaluate more than once for the variables from the left side in the FullOuter SMJ condition - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/03/22 14:19:46 UTC, 0 replies.
- [GitHub] [spark] revans2 opened a new pull request, #40524: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "revans2 (via GitHub)" <gi...@apache.org> on 2023/03/22 15:51:01 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40516: [SPARK-42894][CONNECT] Support `cache`/`persist`/`unpersist`/`storageLevel` for Spark connect jvm client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 16:53:35 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40516: [SPARK-42894][CONNECT] Support `cache`/`persist`/`unpersist`/`storageLevel` for Spark connect jvm client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 16:53:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40519: [SPARK-42864][ML] Make `IsotonicRegression.PointsAccumulator` private - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 16:56:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40519: [SPARK-42864][ML] Make `IsotonicRegression.PointsAccumulator` private - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 16:57:11 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40462: [SPARK-42832][SQL] Remove repartition if it is the child of LocalLimit - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 17:31:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40462: [SPARK-42832][SQL] Remove repartition if it is the child of LocalLimit - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 17:32:17 UTC, 4 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40518: [SPARK-42889][CONNECT][PYTHON][FOLLOWUP] Move `StorageLevel` into a separate file to avoid potential file recursively imports - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/22 17:34:57 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40515: [SPARK-42884][CONNECT] Add Ammonite REPL integration - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/22 17:51:19 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40515: [SPARK-42884][CONNECT] Add Ammonite REPL integration - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 18:06:50 UTC, 1 replies.
- [GitHub] [spark] gerashegalov commented on pull request #40524: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "gerashegalov (via GitHub)" <gi...@apache.org> on 2023/03/22 18:22:00 UTC, 0 replies.
- [GitHub] [spark] cnauroth commented on pull request #40511: [SPARK-42888][BUILD] Upgrade `gcs-connector` to 2.2.11 - posted by "cnauroth (via GitHub)" <gi...@apache.org> on 2023/03/22 18:44:23 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40525: [WIP][SPARK-42859][CONNECT][PS] Basic support for pandas API on Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/22 19:05:10 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40525: [WIP][SPARK-42859][CONNECT][PS] Basic support for pandas API on Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/22 19:10:52 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40512: [SPARK-42892][SQL] Move sameType and relevant methods out of DataType - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/22 21:04:02 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40526: [SPARK-42899][SQL] Fix DataFrame.to(schema) to handle the case where there is a non-nullable nested field in a nullable field - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/22 21:59:25 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40527: [SPARK-42900][CONNECT][PYTHON] Fix createDataFrame to respect inference and column names - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/22 22:32:21 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40487: [SPARK-42891][CONNECT][PYTHON] Implement CoGrouped Map API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/22 23:12:53 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40498: [SPARK-42878][CONNECT] The table API in DataFrameReader could also accept options - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/22 23:52:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40518: [SPARK-42889][CONNECT][PYTHON][FOLLOWUP] Move `StorageLevel` into a separate file to avoid potential file recursively imports - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/22 23:59:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40128: [SPARK-42466][K8S]: Cleanup k8s upload directory when job terminates - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/22 23:59:17 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #40518: [SPARK-42889][CONNECT][PYTHON][FOLLOWUP] Move `StorageLevel` into a separate file to avoid potential file recursively imports - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/23 00:06:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40368: [SPARK-42748][CONNECT] Server-side Artifact Management - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/23 00:06:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40521: [MINOR][DOCS][PYTHON] Update some urls about deprecated repository pyspark.pandas - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 00:11:46 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #39023: [SPARK-41459][SQL][3.3] fix thrift server operation log output is empty - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/23 00:20:01 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38965: [SPARK-41386][SQL] Improve small partition factor for rebalance - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/23 00:20:04 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38781: [SPARK-41246][core] Solve the problem of RddId negative - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/23 00:20:06 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38756: [SPARK-41220][SQL] Range partitioner sample supports column pruning - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/23 00:20:08 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #40520: [SPARK-42896][SQL][PYSPARK] Make `mapInPandas` / `mapInArrow` support barrier mode execution - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/23 00:40:15 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40520: [SPARK-42896][SQL][PYSPARK] Make `mapInPandas` / `mapInArrow` support barrier mode execution - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/23 00:49:17 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40518: [SPARK-42901][CONNECT][PYTHON] Move `StorageLevel` into a separate file to avoid potential `file recursively imports` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/23 00:57:15 UTC, 3 replies.
- [GitHub] [spark] gerashegalov commented on a diff in pull request #40515: [SPARK-42884][CONNECT] Add Ammonite REPL integration - posted by "gerashegalov (via GitHub)" <gi...@apache.org> on 2023/03/23 00:59:54 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40520: [SPARK-42896][SQL][PYSPARK] Make `mapInPandas` / `mapInArrow` support barrier mode execution - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 01:07:05 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #39947: [SPARK-40453][SPARK-41715][CONNECT] Take super class into account when throwing an exception - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/23 01:29:42 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40522: [SPARK-42101][SQL][FOLLOWUP] Make QueryStageExec.resultOption and isMeterialized consistent - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/23 01:43:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40519: [SPARK-42864][ML] Make `IsotonicRegression.PointsAccumulator` private - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/23 01:52:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40527: [SPARK-42900][CONNECT][PYTHON] Fix createDataFrame to respect inference and column names - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/23 01:54:51 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40526: [SPARK-42899][SQL] Fix DataFrame.to(schema) to handle the case where there is a non-nullable nested field in a nullable field - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 02:14:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40526: [SPARK-42899][SQL] Fix DataFrame.to(schema) to handle the case where there is a non-nullable nested field in a nullable field - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 02:15:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40520: [SPARK-42896][SQL][PYSPARK] Make `mapInPandas` / `mapInArrow` support barrier mode execution - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/23 02:22:42 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40528: [WIP][SPARK-42584][CONNECT] Improve output of Column.explain - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/23 02:36:31 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40528: [WIP][SPARK-42584][CONNECT] Improve output of Column.explain - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/23 02:38:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40521: [MINOR][DOCS][PYTHON] Update some urls about deprecated repository pyspark.pandas - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/23 02:40:57 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40528: [WIP][SPARK-42584][CONNECT] Improve output of Column.explain - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/23 02:43:47 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #40439: [SPARK-42807][CORE] Apply custom log URL pattern for yarn-client AM log URL in SHS - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/03/23 02:55:42 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40526: [SPARK-42899][SQL] Fix DataFrame.to(schema) to handle the case where there is a non-nullable nested field in a nullable field - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/23 03:01:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40258: [SPARK-42655][SQL] Incorrect ambiguous column reference error - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/23 03:08:34 UTC, 6 replies.
- [GitHub] [spark] yliou opened a new pull request, #40529: [SPARK-42890] [UI] add repeat identifier on SQL UI - posted by "yliou (via GitHub)" <gi...@apache.org> on 2023/03/23 03:23:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39947: [SPARK-40453][SPARK-41715][CONNECT] Take super class into account when throwing an exception - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 03:27:56 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40487: [SPARK-42891][CONNECT][PYTHON] Implement CoGrouped Map API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 03:38:22 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40487: [SPARK-42891][CONNECT][PYTHON] Implement CoGrouped Map API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 03:38:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40518: [SPARK-42901][CONNECT][PYTHON] Move `StorageLevel` into a separate file to avoid potential `file recursively imports` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 03:43:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40518: [SPARK-42901][CONNECT][PYTHON] Move `StorageLevel` into a separate file to avoid potential `file recursively imports` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 03:43:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40355: [SPARK-42604][CONNECT] Implement functions.typedlit - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/23 04:03:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40530: [SPARK-42903][PYTHON][DOCS] Avoid documenting None as as a return value in docstring - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 04:36:19 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39947: [SPARK-40453][SPARK-41715][CONNECT] Take super class into account when throwing an exception - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/23 04:55:55 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40531: [SPARK-42904][SQL] Char/Varchar Support for JDBC Catalog - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/23 05:02:40 UTC, 0 replies.
- [GitHub] [spark] panbingkun closed pull request #39524: [SPARK-41990][SQL] Fix bug for FieldReference - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/23 06:25:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40498: [SPARK-42878][CONNECT] The table API in DataFrameReader could also accept options - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 06:27:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40498: [SPARK-42878][CONNECT] The table API in DataFrameReader could also accept options - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 06:28:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40530: [SPARK-42903][PYTHON][DOCS] Avoid documenting None as as a return value in docstring - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 06:28:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40530: [SPARK-42903][PYTHON][DOCS] Avoid documenting None as as a return value in docstring - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 06:29:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40487: [SPARK-42891][CONNECT][PYTHON] Implement CoGrouped Map API - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/23 06:55:03 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40531: [SPARK-42904][SQL] Char/Varchar Support for JDBC Catalog - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/23 06:57:27 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40532: [SPARK-42903][PYTHON] Avoid documenting None as as a return value in docstring - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/23 07:22:02 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #40533: [SPARK-42906][K8S] Resource name prefix should start with an alphabetic character - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/23 07:29:15 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #40533: [SPARK-42906][K8S] Resource name prefix should start with an alphabetic character - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/23 07:39:23 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40534: [][PYTHON] Raise RuntimeError if SparkContext is not initialized when parsing DDL-formatted type strings - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/23 08:30:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40535: [SPARK-42907][CONNECT][PYTHON] Implement Avro functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/23 08:34:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40535: [SPARK-42907][CONNECT][PYTHON] Implement Avro functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/23 08:35:51 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40527: [SPARK-42900][CONNECT][PYTHON] Fix createDataFrame to respect inference and column names - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/23 08:45:49 UTC, 0 replies.
- [GitHub] [spark] melin commented on pull request #37814: [SPARK-40365][BUILD] Bump ANTLR runtime version from 4.8 to 4.9.3 - posted by "melin (via GitHub)" <gi...@apache.org> on 2023/03/23 08:49:01 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37814: [SPARK-40365][BUILD] Bump ANTLR runtime version from 4.8 to 4.9.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/23 08:52:20 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40533: [SPARK-42906][K8S] Resource name prefix should start with an alphabetic character - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/23 09:03:42 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #40533: [SPARK-42906][K8S] Resource name prefix should start with an alphabetic character - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/23 10:04:20 UTC, 6 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #40536: [SPARK-42895][CONNECT] Improve error messages for stopped Spark sessions - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/03/23 10:37:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40535: [SPARK-42907][CONNECT][PYTHON] Implement Avro functions - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/23 11:24:39 UTC, 3 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40528: [SPARK-42584][CONNECT] Improve output of Column.explain - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/23 11:45:36 UTC, 0 replies.
- [GitHub] [spark] thomasg19930417 commented on pull request #34542: [SPARK-37267][SQL] OptimizeSkewInRebalancePartitions support optimize non-root node - posted by "thomasg19930417 (via GitHub)" <gi...@apache.org> on 2023/03/23 12:20:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40532: [SPARK-42903][PYTHON][DOCS] Avoid documenting None as as a return value in docstring - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 12:29:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40532: [SPARK-42903][PYTHON][DOCS] Avoid documenting None as as a return value in docstring - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 12:29:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40534: [SPARK-42908][PYTHON] Raise RuntimeError if SparkContext is not initialized when parsing DDL-formatted type strings - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 12:31:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40466: [SPARK-42835][SQL][TESTS] Add test cases for `Column.explain` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/23 12:35:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40535: [SPARK-42907][CONNECT][PYTHON] Implement Avro functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 12:48:57 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39294: [SPARK-41537][INFRA][TESTS] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 12:54:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39294: [SPARK-41537][INFRA][TESTS] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 12:55:45 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39294: [SPARK-41537][INFRA][TESTS] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/23 12:57:03 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #39294: [SPARK-41537][INFRA][TESTS] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/23 13:03:45 UTC, 0 replies.
- [GitHub] [spark] JordanMLee commented on pull request #37251: [SPARK-39838][SQL] Preserve explicit empty column metadata - posted by "JordanMLee (via GitHub)" <gi...@apache.org> on 2023/03/23 13:50:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40515: [SPARK-42884][CONNECT] Add Ammonite REPL integration - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/23 14:26:34 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39124: [DON'T MERGE] Test build and test with hadoop 3.3.5-RC2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/23 17:27:19 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40533: [SPARK-42906][K8S] Resource name prefix should start with an alphabetic character - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/23 19:44:02 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40518: [SPARK-42901][CONNECT][PYTHON] Move `StorageLevel` into a separate file to avoid potential `file recursively imports` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/23 19:45:06 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #40537: [SPARK-42202][CONNECT][TEST][FOLLOWUP] Loop around command entry in SimpleSparkConnectService - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/03/23 19:46:08 UTC, 0 replies.
- [GitHub] [spark] steveloughran commented on a diff in pull request #39124: [DON'T MERGE] Test build and test with hadoop 3.3.5-RC2 - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2023/03/23 20:27:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39124: [DON'T MERGE] Test build and test with hadoop 3.3.5-RC2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/23 20:43:15 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40538: [SPARK-42911][PYTHON] Introduce more basic exceptions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/24 00:17:34 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #40538: [SPARK-42911][PYTHON] Introduce more basic exceptions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/24 00:17:58 UTC, 2 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #39023: [SPARK-41459][SQL][3.3] fix thrift server operation log output is empty - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/24 00:19:03 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38965: [SPARK-41386][SQL] Improve small partition factor for rebalance - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/24 00:19:04 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38781: [SPARK-41246][core] Solve the problem of RddId negative - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/24 00:19:05 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38756: [SPARK-41220][SQL] Range partitioner sample supports column pruning - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/24 00:19:06 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40538: [SPARK-42911][PYTHON] Introduce more basic exceptions - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/24 00:28:41 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40538: [SPARK-42911][PYTHON] Introduce more basic exceptions - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/24 00:29:29 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40355: [SPARK-42604][CONNECT] Implement functions.typedlit - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/24 00:36:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39124: [DON'T MERGE] Test build and test with hadoop 3.3.5 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/24 00:41:18 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40538: [SPARK-42911][PYTHON] Introduce more basic exceptions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/24 01:27:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40520: [SPARK-42896][SQL][PYTHON] Make `mapInPandas` / `mapInArrow` support barrier mode execution - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/24 01:46:59 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40537: [SPARK-42202][CONNECT][TEST][FOLLOWUP] Loop around command entry in SimpleSparkConnectService - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/24 01:56:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40537: [SPARK-42202][CONNECT][TEST][FOLLOWUP] Loop around command entry in SimpleSparkConnectService - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/24 01:56:37 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #37251: [SPARK-39838][SQL] Preserve explicit empty column metadata - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/24 01:58:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40536: [SPARK-42895][CONNECT] Improve error messages for stopped Spark sessions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/24 02:28:37 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39124: [DON'T MERGE] Test build and test with hadoop 3.3.5 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/24 03:11:26 UTC, 2 replies.
- [GitHub] [spark] beliefer closed pull request #40466: [SPARK-42835][SQL][TESTS] Add test cases for `Column.explain` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/24 03:17:09 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40539: [SPARK-42891][CONNECT][PYTHON][3.4] Implement CoGrouped Map API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/24 03:38:16 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40540: [SPARK-42914][PYTHON] Reuse `transformUnregisteredFunction` for `DistributedSequenceID`. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/24 04:08:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40540: [SPARK-42914][PYTHON] Reuse `transformUnregisteredFunction` for `DistributedSequenceID`. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/24 04:09:57 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39124: [SPARK-42913][BUILD] Upgrade Hadoop to 3.3.5 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/24 04:29:54 UTC, 3 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #40541: [SPARK-42861][SQL] Use private[sql] instead of protected[sql] to avoid generating API doc - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/24 04:58:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40541: [SPARK-42861][SQL] Use private[sql] instead of protected[sql] to avoid generating API doc - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/24 04:59:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40531: [SPARK-42904][SQL] Char/Varchar Support for JDBC Catalog - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/24 05:23:08 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40526: [SPARK-42899][SQL] Fix DataFrame.to(schema) to handle the case where there is a non-nullable nested field in a nullable field - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/24 05:37:45 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40541: [SPARK-42861][SQL] Use private[sql] instead of protected[sql] to avoid generating API doc - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/24 05:39:55 UTC, 2 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40531: [SPARK-42904][SQL] Char/Varchar Support for JDBC Catalog - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/24 05:41:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40541: [SPARK-42861][SQL] Use private[sql] instead of protected[sql] to avoid generating API doc - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/24 05:50:41 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #40522: [SPARK-42101][SQL][FOLLOWUP] Make QueryStageExec.resultOption and isMeterialized consistent - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/24 05:51:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40462: [SPARK-42832][SQL] Remove repartition if it is the child of LocalLimit - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/24 06:31:47 UTC, 5 replies.
- [GitHub] [spark] yaooqinn closed pull request #40531: [SPARK-42904][SQL] Char/Varchar Support for JDBC Catalog - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/24 06:43:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40400: [SPARK-41359][SQL] Use `PhysicalDataType` instead of DataType in UnsafeRow - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/24 06:44:41 UTC, 3 replies.
- [GitHub] [spark] yaooqinn closed pull request #40541: [SPARK-42861][SQL] Use private[sql] instead of protected[sql] to avoid generating API doc - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/24 06:45:56 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40541: [SPARK-42861][SQL] Use private[sql] instead of protected[sql] to avoid generating API doc - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/24 06:46:39 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40542: [SPARK-42915][SQL] Codegen Support for sentences - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/24 07:25:09 UTC, 0 replies.
- [GitHub] [spark] Yikf commented on a diff in pull request #40437: [SPARK-41259][SQL] SparkSQLDriver Output schema and result string should be consistent - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/03/24 07:58:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40540: [SPARK-42914][PYTHON] Reuse `transformUnregisteredFunction` for `DistributedSequenceID`. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/24 08:09:11 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40258: [SPARK-42655][SQL] Incorrect ambiguous column reference error - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/24 08:26:41 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40540: [SPARK-42914][PYTHON] Reuse `transformUnregisteredFunction` for `DistributedSequenceID`. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/24 08:41:24 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #40536: [SPARK-42895][CONNECT] Improve error messages for stopped Spark sessions - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/03/24 08:41:47 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng closed pull request #40539: [SPARK-42891][CONNECT][PYTHON][3.4] Implement CoGrouped Map API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/24 08:41:54 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40539: [SPARK-42891][CONNECT][PYTHON][3.4] Implement CoGrouped Map API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/24 08:41:54 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40543: [SPARK-42916][SQL] JDBCTableCatalog Keeps Char/Varchar meta on the read-side - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/24 09:10:23 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40543: [SPARK-42916][SQL] JDBCTableCatalog Keeps Char/Varchar meta on the read-side - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/24 09:53:44 UTC, 1 replies.
- [GitHub] [spark] shrprasa commented on a diff in pull request #40258: [SPARK-42655][SQL] Incorrect ambiguous column reference error - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/03/24 09:58:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40538: [SPARK-42911][PYTHON] Introduce more basic exceptions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/24 10:13:05 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40538: [SPARK-42911][PYTHON] Introduce more basic exceptions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/24 10:13:24 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40520: [SPARK-42896][SQL][PYTHON] Make `mapInPandas` / `mapInArrow` support barrier mode execution - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/24 10:21:14 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40544: [SPARK-42917][SQL] Correct getUpdateColumnNullabilityQuery for DerbyDialect - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/24 10:23:12 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40544: [SPARK-42917][SQL] Correct getUpdateColumnNullabilityQuery for DerbyDialect - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/24 10:29:30 UTC, 1 replies.
- [GitHub] [spark] johanl-db opened a new pull request, #40545: [WIP][SPARK-42918] Introduce abstractions to create constant and generated metadata fields - posted by "johanl-db (via GitHub)" <gi...@apache.org> on 2023/03/24 11:43:48 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #40544: [SPARK-42917][SQL] Correct getUpdateColumnNullabilityQuery for DerbyDialect - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/24 14:19:05 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #39124: [SPARK-42913][BUILD] Upgrade Hadoop to 3.3.5 - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/03/24 16:01:43 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40515: [SPARK-42884][CONNECT] Add Ammonite REPL integration - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/24 17:15:35 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40515: [SPARK-42884][CONNECT] Add Ammonite REPL integration - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/24 17:16:05 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40526: [SPARK-42899][SQL] Fix DataFrame.to(schema) to handle the case where there is a non-nullable nested field in a nullable field - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/24 17:35:17 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #40537: [SPARK-42202][CONNECT][TEST][FOLLOWUP] Loop around command entry in SimpleSparkConnectService - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/24 17:44:08 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #38357: [SPARK-40887][K8S] Allow Spark on K8s to integrate w/ Log Service - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/24 17:49:42 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #38357: [SPARK-40887][K8S] Allow Spark on K8s to integrate w/ Log Service - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/24 18:04:34 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40546: [SPARK-42899][SQL][FOLLOWUP] Project.reconcileColumnType should use KnownNotNull instead of AssertNotNull - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/24 18:07:29 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40547: [SPARK-42911][PYTHON][3.4] Introduce more basic exceptions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/24 18:28:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40543: [SPARK-42916][SQL] JDBCTableCatalog Keeps Char/Varchar meta on the read-side - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/24 19:22:05 UTC, 2 replies.
- [GitHub] [spark] revans2 commented on pull request #40524: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "revans2 (via GitHub)" <gi...@apache.org> on 2023/03/24 21:36:58 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40548: [Minor][Core] Remove unused variables and method in Spark listeners - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/24 21:41:21 UTC, 0 replies.
- [GitHub] [spark] chenhao-db commented on pull request #40429: [SPARK-42775][SQL] Throw exception when ApproximatePercentile result doesn't fit into output decimal type. - posted by "chenhao-db (via GitHub)" <gi...@apache.org> on 2023/03/24 22:55:52 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40549: [SPARK-42920][CONNECT][PYTHON] Enable tests for UDF with UDT - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/24 23:25:40 UTC, 0 replies.
- [GitHub] [spark] ryan-johnson-databricks opened a new pull request, #40550: [SPARK] LogicalPlan.metadataOutput always contains AttributeReference - posted by "ryan-johnson-databricks (via GitHub)" <gi...@apache.org> on 2023/03/24 23:49:37 UTC, 0 replies.
- [GitHub] [spark] ryan-johnson-databricks opened a new pull request, #40551: [SPARK] Project implements ExposesMetadataColumns - posted by "ryan-johnson-databricks (via GitHub)" <gi...@apache.org> on 2023/03/24 23:56:09 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39124: [SPARK-42913][BUILD] Upgrade Hadoop to 3.3.5 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/25 01:15:27 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40552: [SPARK-42921][SQL][TESTS] Split `timestampNTZ/datetime-special.sql` into w/ and w/o `ansi` for test - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/25 03:07:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40552: [SPARK-42921][SQL][TESTS] Split `timestampNTZ/datetime-special.sql` into w/ and w/o `ansi` for fix sql analyzer test - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/25 03:15:09 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #40552: [SPARK-42921][SQL][TESTS] Split `timestampNTZ/datetime-special.sql` into w/ and w/o `ansi` suffix to pass sql analyzer test in ansi mode - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/25 03:19:00 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40548: [Minor][Core] Remove unused variables and method in Spark listeners - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/25 04:34:36 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40533: [SPARK-42906][K8S] Resource name prefix should start with an alphabetic character - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/25 05:19:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40552: [SPARK-42921][SQL][TESTS] Split `timestampNTZ/datetime-special.sql` into w/ and w/o `ansi` suffix to pass sql analyzer test in ansi mode - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/25 05:39:08 UTC, 0 replies.
- [GitHub] [spark] Kimahriman commented on a diff in pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/03/25 13:09:27 UTC, 0 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #40545: [WIP][SPARK-42918] Generalize handling of metadata attributes in FileSourceStrategy - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/25 14:45:46 UTC, 0 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #39691: [SPARK-31561][SQL] Add QUALIFY clause - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/25 15:10:09 UTC, 0 replies.
- [GitHub] [spark] VindhyaG opened a new pull request, #40553: [SPARK-39722] [SQL] getString API for Dataset - posted by "VindhyaG (via GitHub)" <gi...@apache.org> on 2023/03/25 19:18:32 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40548: [Minor][Core] Remove unused variables and method in Spark listeners - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/25 20:30:59 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40548: [Minor][Core] Remove unused variables and method in Spark listeners - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/25 20:31:46 UTC, 0 replies.
- [GitHub] [spark] clownxc commented on a diff in pull request #40400: [SPARK-41359][SQL] Use `PhysicalDataType` instead of DataType in UnsafeRow - posted by "clownxc (via GitHub)" <gi...@apache.org> on 2023/03/26 03:30:16 UTC, 5 replies.
- [GitHub] [spark] VindhyaG commented on pull request #40553: [SPARK-39722] [SQL] getString API for Dataset - posted by "VindhyaG (via GitHub)" <gi...@apache.org> on 2023/03/26 05:33:52 UTC, 3 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #40525: [WIP][SPARK-42859][CONNECT][PS] Basic support for pandas API on Spark Connect - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/26 16:31:01 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40547: [SPARK-42911][PYTHON][3.4] Introduce more basic exceptions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/27 00:24:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40547: [SPARK-42911][PYTHON][3.4] Introduce more basic exceptions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/27 00:27:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40540: [SPARK-42914][PYTHON] Reuse `transformUnregisteredFunction` for `DistributedSequenceID`. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/27 00:28:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40549: [SPARK-42920][CONNECT][PYTHON] Enable tests for UDF with UDT - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/27 00:35:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40549: [SPARK-42920][CONNECT][PYTHON] Enable tests for UDF with UDT - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/27 00:35:43 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40554: [SPARK-42914][PYTHON][3.4] Reuse `transformUnregisteredFunction` for `DistributedSequenceID`. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/27 01:10:13 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40554: [SPARK-42914][PYTHON][3.4] Reuse `transformUnregisteredFunction` for `DistributedSequenceID`. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/27 01:10:41 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40549: [SPARK-42920][CONNECT][PYTHON] Enable tests for UDF with UDT - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/27 01:20:08 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #40520: [SPARK-42896][SQL][PYTHON] Make `mapInPandas` / `mapInArrow` support barrier mode execution - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/27 01:40:11 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #40520: [SPARK-42896][SQL][PYTHON] Make `mapInPandas` / `mapInArrow` support barrier mode execution - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/27 01:42:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40554: [SPARK-42914][PYTHON][3.4] Reuse `transformUnregisteredFunction` for `DistributedSequenceID`. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/27 01:43:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40554: [SPARK-42914][PYTHON][3.4] Reuse `transformUnregisteredFunction` for `DistributedSequenceID`. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/27 01:43:18 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40543: [SPARK-42916][SQL] JDBCTableCatalog Keeps Char/Varchar meta on the read-side - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/27 01:48:11 UTC, 5 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40552: [SPARK-42921][SQL][TESTS] Split `timestampNTZ/datetime-special.sql` into w/ and w/o `ansi` suffix to pass sql analyzer test in ansi mode - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/27 02:19:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40552: [SPARK-42921][SQL][TESTS] Split `timestampNTZ/datetime-special.sql` into w/ and w/o `ansi` suffix to pass sql analyzer test in ansi mode - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/27 02:20:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40552: [SPARK-42921][SQL][TESTS] Split `timestampNTZ/datetime-special.sql` into w/ and w/o `ansi` suffix to pass sql analyzer test in ansi mode - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/27 02:21:40 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40543: [SPARK-42916][SQL] JDBCTableCatalog Keeps Char/Varchar meta on the read-side - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/27 03:08:18 UTC, 1 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40555: [WIP][SPARK-42926][BUIILD][SQL] Upgrade Parquet to 1.12.4 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/27 03:10:13 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40546: [SPARK-42899][SQL][FOLLOWUP] Project.reconcileColumnType should use KnownNotNull instead of AssertNotNull - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/27 03:19:28 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40546: [SPARK-42899][SQL][FOLLOWUP] Project.reconcileColumnType should use KnownNotNull instead of AssertNotNull - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/27 03:19:38 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #40534: [SPARK-42908][PYTHON] Raise RuntimeError when SparkContext is required but not initialized - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/27 03:34:04 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40556: [SPARK-42927][CORE] Change the access scope of `o.a.spark.util.Iterators#size` to `private[spark]` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/27 03:56:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40508: [SPARK-42924][SQL][CONNECT][PYTHON] Clarify the comment of parameterized SQL args - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/27 05:53:34 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40508: [SPARK-42924][SQL][CONNECT][PYTHON] Clarify the comment of parameterized SQL args - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/27 05:54:30 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #40557: [SPARK-42928][SQL] Make resolvePersistentFunction synchronized - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/03/27 05:55:41 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #40557: [SPARK-42928][SQL] Make resolvePersistentFunction synchronized - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/03/27 05:55:48 UTC, 0 replies.
- [GitHub] [spark] anchovYu opened a new pull request, #40558: [WIP]Fix LCA having issue - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/03/27 06:21:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40557: [SPARK-42928][SQL] Make resolvePersistentFunction synchronized - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/27 06:23:12 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #37400: [SPARK-39957][CORE] Delay onDisconnected to enable Driver receives ExecutorExitCode - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/27 06:36:57 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #40559: [SPARK-42929] make mapInPandas / mapInArrow support "is_barrier" - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/27 07:02:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40560: [SPARK-42930][CORE] Change the access scope of `ProtobufSerDe` related implementations to `private[spark]` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/27 07:09:58 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40559: [SPARK-42929] make mapInPandas / mapInArrow support "is_barrier" - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/27 07:14:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40560: [SPARK-42930][CORE][SQL] Change the access scope of `ProtobufSerDe` related implementations to `private[spark]` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/27 07:19:06 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40560: [SPARK-42930][CORE][SQL] Change the access scope of `ProtobufSerDe` related implementations to `private[spark]` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/27 07:23:57 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #40557: [SPARK-42928][SQL] Make resolvePersistentFunction synchronized - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/03/27 07:44:49 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #40561: [SPARK-42931][SS] Introduce dropDuplicatesWithinWatermark - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/27 08:11:48 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40561: [SPARK-42931][SS] Introduce dropDuplicatesWithinWatermark - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/27 08:12:56 UTC, 7 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #40553: [SPARK-39722] [SQL] getString API for Dataset - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/27 09:16:01 UTC, 0 replies.
- [GitHub] [spark] ted-jenks commented on pull request #39907: [SPARK-42359][SQL] Support row skipping when reading CSV files - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/03/27 09:48:33 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #40559: [SPARK-42929] make mapInPandas / mapInArrow support "is_barrier" - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/27 09:50:42 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #40559: [SPARK-42929] make mapInPandas / mapInArrow support "is_barrier" - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/27 09:51:20 UTC, 0 replies.
- [GitHub] [spark] chong0929 opened a new pull request, #40562: [MINOR][DOCS] Update broken links for pyspark.pandas - posted by "chong0929 (via GitHub)" <gi...@apache.org> on 2023/03/27 10:21:00 UTC, 0 replies.
- [GitHub] [spark] chong0929 commented on pull request #40562: [MINOR][DOCS] Update broken links for pyspark.pandas - posted by "chong0929 (via GitHub)" <gi...@apache.org> on 2023/03/27 10:23:23 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40563: [SPARK-41232][SPARK-41233][FOLLOWUP] Refactor array_append and array_prepend with `RuntimeReplaceable` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/27 10:23:55 UTC, 0 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #40555: [SPARK-42926][BUILD][SQL] Upgrade Parquet to 1.12.4 - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/27 10:38:39 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #40564: [SPARK-42519] [Test] [Connect] Add more WriteTo tests after Scala Client session config is supported - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/03/27 10:48:37 UTC, 0 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #40474: [SPARK-42849] [WIP] [SQL] Session Variables - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/27 10:58:55 UTC, 0 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #39907: [SPARK-42359][SQL] Support row skipping when reading CSV files - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/27 11:13:07 UTC, 0 replies.
- [GitHub] [spark] steveloughran commented on pull request #39124: [SPARK-42913][BUILD] Upgrade Hadoop to 3.3.5 - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2023/03/27 11:13:41 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40528: [SPARK-42584][CONNECT] Improve output of Column.explain - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/27 11:35:45 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40555: [SPARK-42926][BUILD][SQL] Upgrade Parquet to 1.12.4 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/27 12:05:56 UTC, 1 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #40565: [WIP][SPARK-42873][SQL] Define Spark SQL types as keywords - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/27 13:54:24 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40566: [SPARK-42934][SQL][TESTS] Move `spark.hadoop.hadoop.security.key.provider.path` from `systemPropertyVariables` of `maven-surefire-plugin` to `systemProperties` of `scalatest-maven-plugin` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/27 13:59:34 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40566: [SPARK-42934][SQL][TESTS] Move `spark.hadoop.hadoop.security.key.provider.path` from `systemPropertyVariables` of `maven-surefire-plugin` to `systemProperties` of `scalatest-maven-plugin` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/27 13:59:50 UTC, 0 replies.
- [GitHub] [spark] zhmin opened a new pull request, #40567: [SPARK-42935] [SQL] Add union required distribution push down - posted by "zhmin (via GitHub)" <gi...@apache.org> on 2023/03/27 14:06:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40563: [SPARK-41232][SPARK-41233][FOLLOWUP] Refactor `array_append` and `array_prepend` with `RuntimeReplaceable` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/27 14:43:00 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39124: [SPARK-42913][BUILD] Upgrade Hadoop to 3.3.5 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/27 15:03:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39124: [SPARK-42913][BUILD] Upgrade Hadoop to 3.3.5 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/27 15:53:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40566: [SPARK-42934][BUILD] Move test property `spark.hadoop.hadoop.security.key.provider.path` from `maven-surefire-plugin` to `scalatest-maven-plugin` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/27 16:05:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40566: [SPARK-42934][BUILD] Move test property `spark.hadoop.hadoop.security.key.provider.path` from `maven-surefire-plugin` to `scalatest-maven-plugin` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/27 16:07:34 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40566: [SPARK-42934][BUILD] Move test property `spark.hadoop.hadoop.security.key.provider.path` from `maven-surefire-plugin` to `scalatest-maven-plugin` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/27 16:08:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40560: [SPARK-42930][CORE][SQL] Change the access scope of `ProtobufSerDe` related implementations to `private[protobuf]` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/27 16:27:12 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40566: [SPARK-42934][BUILD] Add `spark.hadoop.hadoop.security.key.provider.path` to `scalatest-maven-plugin` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/27 16:42:20 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40566: [SPARK-42934][BUILD] Add `spark.hadoop.hadoop.security.key.provider.path` to `scalatest-maven-plugin` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/27 16:42:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40560: [SPARK-42930][CORE][SQL] Change the access scope of `ProtobufSerDe` related implementations to `private[protobuf]` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/27 16:49:32 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40566: [SPARK-42934][BUILD] Add `spark.hadoop.hadoop.security.key.provider.path` to `scalatest-maven-plugin` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/27 16:50:30 UTC, 0 replies.
- [GitHub] [spark] johanl-db commented on a diff in pull request #40545: [SPARK-42918] Generalize handling of metadata attributes in FileSourceStrategy - posted by "johanl-db (via GitHub)" <gi...@apache.org> on 2023/03/27 17:23:02 UTC, 2 replies.
- [GitHub] [spark] anchovYu commented on pull request #40558: [SPARK-42936][SQL] Fix LCA bug when the having clause can be resolved directly by its child Aggregate - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/03/27 17:55:12 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40560: [SPARK-42930][CORE][SQL] Change the access scope of `ProtobufSerDe` related implementations to `private[protobuf]` - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/27 18:27:49 UTC, 0 replies.
- [GitHub] [spark] mridulm opened a new pull request, #40568: SPARK-42922: Move from Random to SecureRandom - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/27 18:31:45 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40568: SPARK-42922: Move from Random to SecureRandom - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/27 18:32:03 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40568: [SPARK-42922][SQL]: Move from Random to SecureRandom - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/27 18:58:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40568: [SPARK-42922][SQL] Move from Random to SecureRandom - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/27 20:34:13 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #40569: [SPARK-42937][SQL] `PlanSubqueries` should set `InSubqueryExec#shouldBroadcast` to true - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/03/27 22:02:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40533: [SPARK-42906][K8S] Replace a starting digit with `x` in resource name prefix - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/27 22:31:36 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40533: [SPARK-42906][K8S] Replace a starting digit with `x` in resource name prefix - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/27 22:32:43 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40561: [SPARK-42931][SS] Introduce dropDuplicatesWithinWatermark - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/27 23:31:53 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40521: [MINOR][DOCS][PYTHON] Update some urls about deprecated repository pyspark.pandas - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/27 23:42:39 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40570: [SPARK-41876][CONNECT][PYTHON] Implement DataFrame.toLocalIterator - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/27 23:49:30 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40520: [SPARK-42896][SQL][PYTHON] Make `mapInPandas` / `mapInArrow` support barrier mode execution - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/28 00:18:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40521: [MINOR][DOCS][PYTHON] Update some urls about deprecated repository pyspark.pandas - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/28 00:21:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40534: [SPARK-42908][PYTHON] Raise RuntimeError when SparkContext is required but not initialized - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 01:31:30 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40571: [SPARK-42896][SQL][PYTHON][FOLLOW-UP] Rename isBarrier to barrier, and correct docstring - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 01:43:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40571: [SPARK-42896][SQL][PYTHON][FOLLOW-UP] Rename isBarrier to barrier, and correct docstring - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 01:43:39 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40569: [SPARK-42937][SQL] `PlanSubqueries` should set `InSubqueryExec#shouldBroadcast` to true - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 01:45:24 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #40523: [SPARK-42897][SQL] Avoid evaluate more than once for the variables from the left side in the FullOuter SMJ condition - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/03/28 01:50:40 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #40569: [SPARK-42937][SQL] `PlanSubqueries` should set `InSubqueryExec#shouldBroadcast` to true - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/03/28 02:20:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40558: [SPARK-42936][SQL] Fix LCA bug when the having clause can be resolved directly by its child Aggregate - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/28 03:08:28 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40558: [SPARK-42936][SQL] Fix LCA bug when the having clause can be resolved directly by its child Aggregate - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/28 03:08:45 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng opened a new pull request, #40572: [SPARK-37677][CORE] Unzip could keep file permissions - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/03/28 03:22:31 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #40572: [SPARK-37677][CORE] Unzip could keep file permissions - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/03/28 03:26:07 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40571: [SPARK-42896][SQL][PYTHON][FOLLOW-UP] Rename isBarrier to barrier, and correct docstring - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 03:33:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40570: [SPARK-41876][CONNECT][PYTHON] Implement DataFrame.toLocalIterator - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 03:35:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40570: [SPARK-41876][CONNECT][PYTHON] Implement DataFrame.toLocalIterator - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 03:36:04 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40568: [SPARK-42922][SQL] Move from Random to SecureRandom - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/28 03:48:15 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40568: [SPARK-42922][SQL] Move from Random to SecureRandom - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/28 03:48:40 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on a diff in pull request #40569: [SPARK-42937][SQL] `PlanSubqueries` should set `InSubqueryExec#shouldBroadcast` to true - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/03/28 03:49:06 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40573: [SPARK-42943][SQL] Use LONGTEXT instead of TEXT for StringType for effective length - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/28 03:59:47 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40573: [SPARK-42943][SQL] Use LONGTEXT instead of TEXT for StringType for effective length - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/28 04:21:34 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40573: [SPARK-42943][SQL] Use LONGTEXT instead of TEXT for StringType for effective length - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/28 04:21:54 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40536: [SPARK-42895][CONNECT] Improve error messages for stopped Spark sessions - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/28 04:30:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40572: [SPARK-37677][CORE] Unzip could keep file permissions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 05:26:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40572: [SPARK-37677][CORE] Unzip could keep file permissions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 05:26:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40534: [SPARK-42908][PYTHON] Raise RuntimeError when SparkContext is required but not initialized - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 05:35:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40534: [SPARK-42908][PYTHON] Raise RuntimeError when SparkContext is required but not initialized - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 05:35:48 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40574: [SPARK-42942][SQL] Support coalesce table cache stage partitions - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/28 05:42:49 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on pull request #40569: [SPARK-42937][SQL] `PlanSubqueries` should set `InSubqueryExec#shouldBroadcast` to true - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/03/28 06:05:35 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40574: [SPARK-42942][SQL] Support coalesce table cache stage partitions - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/28 06:08:15 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40569: [SPARK-42937][SQL] `PlanSubqueries` should set `InSubqueryExec#shouldBroadcast` to true - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/28 06:37:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40562: [MINOR][DOCS] Update broken links for pyspark.pandas - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 06:49:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40562: [MINOR][DOCS] Update broken links for pyspark.pandas - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 06:49:52 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #40575: [SPARK-42945][CONNECT] Support PYSPARK_JVM_STACKTRACE_ENABLED in Spark Connect - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/03/28 08:24:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40575: [SPARK-42945][CONNECT] Support PYSPARK_JVM_STACKTRACE_ENABLED in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/28 08:27:05 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #40575: [SPARK-42945][CONNECT] Support PYSPARK_JVM_STACKTRACE_ENABLED in Spark Connect - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/03/28 08:35:17 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40557: [SPARK-42928][SQL] Make resolvePersistentFunction synchronized - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/28 08:43:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40557: [SPARK-42928][SQL] Make resolvePersistentFunction synchronized - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/28 08:43:17 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40568: [SPARK-42922][SQL] Move from Random to SecureRandom - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/28 08:53:30 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40576: [SPARK-42946][SQL] Redact sensitive data which is nested by variable substitution - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/28 08:58:24 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40568: [SPARK-42922][SQL] Move from Random to SecureRandom - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/28 09:27:08 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40556: [SPARK-42927][CORE] Change the access scope of `o.a.spark.util.Iterators#size` to `private[util]` - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/03/28 09:54:25 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40576: [SPARK-42946][SQL] Redact sensitive data which is nested by variable substitution - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/28 10:43:14 UTC, 1 replies.
- [GitHub] [spark] liujiayi771 opened a new pull request, #40577: [SPARK-42947][SQL] Spark Thriftserver LDAP should not use DN pattern if user contains domain - posted by "liujiayi771 (via GitHub)" <gi...@apache.org> on 2023/03/28 12:01:03 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #40573: [SPARK-42943][SQL] Use LONGTEXT instead of TEXT for StringType for effective length - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/28 12:05:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40491: [SPARK-41006][K8S] Generate new ConfigMap names for each run - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/28 12:24:36 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40569: [SPARK-42937][SQL] `PlanSubqueries` should set `InSubqueryExec#shouldBroadcast` to true - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/28 12:32:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40569: [SPARK-42937][SQL] `PlanSubqueries` should set `InSubqueryExec#shouldBroadcast` to true - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/28 12:42:17 UTC, 0 replies.
- [GitHub] [spark] DHKold commented on pull request #40491: [SPARK-41006][K8S] Generate new ConfigMap names for each run - posted by "DHKold (via GitHub)" <gi...@apache.org> on 2023/03/28 12:45:18 UTC, 0 replies.
- [GitHub] [spark] liujiayi771 commented on pull request #40577: [SPARK-42947][SQL] Spark Thriftserver LDAP should not use DN pattern if user contains domain - posted by "liujiayi771 (via GitHub)" <gi...@apache.org> on 2023/03/28 12:46:15 UTC, 3 replies.
- [GitHub] [spark] DHKold commented on a diff in pull request #40491: [SPARK-41006][K8S] Generate new ConfigMap names for each run - posted by "DHKold (via GitHub)" <gi...@apache.org> on 2023/03/28 12:47:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40491: [SPARK-41006][K8S] Generate new ConfigMap names for each run - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/28 12:49:15 UTC, 0 replies.
- [GitHub] [spark] tamama commented on pull request #37206: [SPARK-39696][CORE] Ensure Concurrent r/w `TaskMetrics` not throw Exception - posted by "tamama (via GitHub)" <gi...@apache.org> on 2023/03/28 13:10:54 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40574: [SPARK-42942][SQL] Support coalesce table cache stage partitions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/28 13:34:02 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40577: [SPARK-42947][SQL] Spark Thriftserver LDAP should not use DN pattern if user contains domain - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/28 13:43:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40576: [SPARK-42946][SQL] Redact sensitive data which is nested by variable substitution - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/28 13:49:40 UTC, 0 replies.
- [GitHub] [spark] ryan-johnson-databricks commented on a diff in pull request #40545: [SPARK-42918] Generalize handling of metadata attributes in FileSourceStrategy - posted by "ryan-johnson-databricks (via GitHub)" <gi...@apache.org> on 2023/03/28 13:52:54 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #40556: [SPARK-42927][CORE] Change the access scope of `o.a.spark.util.Iterators#size` to `private[util]` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/28 14:07:03 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40556: [SPARK-42927][CORE] Change the access scope of `o.a.spark.util.Iterators#size` to `private[util]` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/28 14:07:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40556: [SPARK-42927][CORE] Change the access scope of `o.a.spark.util.Iterators#size` to `private[util]` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/28 14:07:55 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40575: [SPARK-42945][CONNECT] Support PYSPARK_JVM_STACKTRACE_ENABLED in Spark Connect - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/28 14:18:37 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #40578: [SPARK-42949][SQL] Simplify code for NAAJ - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/28 14:28:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37206: [SPARK-39696][CORE] Ensure Concurrent r/w `TaskMetrics` not throw Exception - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/28 14:31:21 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #40578: [SPARK-42949][SQL] Simplify code for NAAJ - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/28 14:33:03 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40577: [SPARK-42947][SQL] Spark Thriftserver LDAP should not use DN pattern if user contains domain - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/28 14:38:07 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on pull request #40577: [SPARK-42947][SQL] Spark Thriftserver LDAP should not use DN pattern if user contains domain - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/28 14:50:54 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40579: [SPARK-42929][CONNECT][FOLLOWUP] Rename isBarrier to barrier - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/28 18:16:07 UTC, 0 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #40232: [SPARK-42629][DOCS] Update the description of default data source in the document - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/28 18:30:23 UTC, 0 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #40575: [SPARK-42945][CONNECT] Support PYSPARK_JVM_STACKTRACE_ENABLED in Spark Connect - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/28 19:01:36 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40575: [SPARK-42945][CONNECT] Support PYSPARK_JVM_STACKTRACE_ENABLED in Spark Connect - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/28 19:04:36 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40575: [SPARK-42945][CONNECT] Support PYSPARK_JVM_STACKTRACE_ENABLED in Spark Connect - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/28 19:52:06 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #40575: [SPARK-42945][CONNECT] Support PYSPARK_JVM_STACKTRACE_ENABLED in Spark Connect - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/28 19:52:32 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40579: [SPARK-42929][CONNECT][FOLLOWUP] Rename isBarrier to barrier - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/28 19:55:44 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40400: [SPARK-41359][SQL] Use `PhysicalDataType` instead of DataType in UnsafeRow - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/28 19:59:57 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40564: [SPARK-42519] [Test] [Connect] Add More WriteTo Tests In Spark Connect Client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/28 20:00:25 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #40128: [SPARK-42466][K8S]: Cleanup k8s upload directory when job terminates - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/03/28 20:16:15 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #40118: [SPARK-26365][K8S] In kuberentes cluster mode, spark submit should pass driver exit code - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/03/28 20:18:06 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40580: [SPARK-42952][SQL] Simplify the parameter of analyzer rule PreprocessTableCreation and DataSourceAnalysis - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/28 20:44:13 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40564: [SPARK-42519] [Test] [Connect] Add More WriteTo Tests In Spark Connect Client - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/28 23:17:40 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40579: [SPARK-42929][CONNECT][FOLLOWUP] Rename isBarrier to barrier - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/29 00:26:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40579: [SPARK-42929][CONNECT][FOLLOWUP] Rename isBarrier to barrier - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/29 00:27:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40535: [SPARK-42907][CONNECT][PYTHON] Implement Avro functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/29 00:58:37 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40535: [SPARK-42907][CONNECT][PYTHON] Implement Avro functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/29 00:59:02 UTC, 0 replies.
- [GitHub] [spark] beliefer closed pull request #40291: [WIP][SPARK-42578][CONNECT] Add JDBC to DataFrameWriter - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/29 01:03:37 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40581: [SPARK-42953][Connect] Typed map, flatMap, mapPartitions - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/29 01:16:00 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40581: [SPARK-42953][Connect] Typed map, flatMap, mapPartitions - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/29 01:19:52 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on pull request #40581: [SPARK-42953][Connect] Typed map, flatMap, mapPartitions - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/29 01:21:12 UTC, 0 replies.
- [GitHub] [spark] zhmin commented on pull request #40567: [SPARK-42935] [SQL] Add union required distribution push down - posted by "zhmin (via GitHub)" <gi...@apache.org> on 2023/03/29 01:34:09 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40576: [SPARK-42946][SQL] Redact sensitive data which is nested by variable substitution - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/29 01:44:14 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #40574: [SPARK-42942][SQL] Support coalesce table cache stage partitions - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/29 01:50:10 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40581: [SPARK-42953][Connect] Typed filter, map, flatMap, mapPartitions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/29 02:09:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40581: [SPARK-42953][Connect] Typed filter, map, flatMap, mapPartitions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/29 02:18:23 UTC, 4 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #40564: [SPARK-42519] [Test] [Connect] Add More WriteTo Tests In Spark Connect Client - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/03/29 02:46:35 UTC, 3 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #40581: [SPARK-42953][Connect] Typed filter, map, flatMap, mapPartitions - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/03/29 02:49:36 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40563: [SPARK-41232][SPARK-41233][FOLLOWUP] Refactor `array_append` and `array_prepend` with `RuntimeReplaceable` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/29 03:14:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40582: [SPARK-42954][PYTHON][CONNECT] Add `YearMonthIntervalType` to PySpark and Spark Connect Python Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/29 03:38:00 UTC, 0 replies.
- [GitHub] [spark] zml1206 commented on pull request #40182: [SPARK-42588][SQL] Collapse two adjacent windows with the equivalent partition/order expressions in two withColumn() - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/03/29 03:46:17 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40583: [SPARK-42955][SQL] Skip classifyException and wrap AnalysisException for SparkThrowable - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/29 03:58:16 UTC, 0 replies.
- [GitHub] [spark] navinvishy commented on a diff in pull request #40563: [SPARK-41232][SPARK-41233][FOLLOWUP] Refactor `array_append` and `array_prepend` with `RuntimeReplaceable` - posted by "navinvishy (via GitHub)" <gi...@apache.org> on 2023/03/29 04:03:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40118: [SPARK-26365][K8S] In kuberentes cluster mode, spark submit should pass driver exit code - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/29 04:15:54 UTC, 7 replies.
- [GitHub] [spark] pan3793 commented on pull request #40578: [SPARK-42949][SQL] Simplify code for NAAJ - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/29 04:21:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40565: [SPARK-42873][SQL] Define Spark SQL types as keywords - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/29 04:24:00 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40563: [SPARK-41232][SPARK-41233][FOLLOWUP] Refactor `array_append` and `array_prepend` with `RuntimeReplaceable` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/03/29 04:28:10 UTC, 7 replies.
- [GitHub] [spark] yaooqinn closed pull request #40576: [SPARK-42946][SQL] Redact sensitive data which is nested by variable substitution - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/29 04:37:26 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #40561: [SPARK-42931][SS] Introduce dropDuplicatesWithinWatermark - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/03/29 04:47:47 UTC, 12 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40584: [SPARK-42956][CONNECT] Support avro functions for Scala client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/29 05:01:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40584: [SPARK-42956][CONNECT] Support avro functions for Scala client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/29 05:02:49 UTC, 3 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #40578: [SPARK-42949][SQL] Simplify code for NAAJ - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/29 05:21:15 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40578: [SPARK-42949][SQL] Simplify code for NAAJ - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/29 05:21:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40563: [SPARK-41232][SPARK-41233][FOLLOWUP] Refactor `array_append` and `array_prepend` with `RuntimeReplaceable` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/29 05:21:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40438: [SPARK-42806][SPARK-42811][CONNECT] Add `Catalog` support - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/29 05:30:13 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40585: [SPARK-42957][INFRA] `release-build.sh` should not remove SBOM artifacts - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/29 05:31:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40585: [SPARK-42957][INFRA] `release-build.sh` should not remove SBOM artifacts - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/29 05:34:00 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40565: [SPARK-42873][SQL] Define Spark SQL types as keywords - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/29 05:56:27 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #40561: [SPARK-42931][SS] Introduce dropDuplicatesWithinWatermark - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/29 06:06:43 UTC, 34 replies.
- [GitHub] [spark] zwangsheng commented on a diff in pull request #40118: [SPARK-26365][K8S] In kuberentes cluster mode, spark submit should pass driver exit code - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/03/29 06:29:39 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40585: [SPARK-42957][INFRA] `release-build.sh` should not remove SBOM artifacts - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/29 06:30:36 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40585: [SPARK-42957][INFRA] `release-build.sh` should not remove SBOM artifacts - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/29 06:30:49 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40585: [SPARK-42957][INFRA] `release-build.sh` should not remove SBOM artifacts - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/29 06:32:30 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40578: [SPARK-42949][SQL] Simplify code for NAAJ - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/29 06:38:05 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #40578: [SPARK-42949][SQL] Simplify code for NAAJ - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/29 06:38:48 UTC, 0 replies.
- [GitHub] [spark] rangadi opened a new pull request, #40586: [SPARK-42939] Core streaming Python API for Spark Connect - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/03/29 06:43:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40400: [SPARK-41359][SQL] Use `PhysicalDataType` instead of DataType in UnsafeRow - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/29 06:50:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40400: [SPARK-41359][SQL] Use `PhysicalDataType` instead of DataType in UnsafeRow - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/29 06:51:26 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #40586: [SPARK-42939] Core streaming Python API for Spark Connect - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/03/29 06:56:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40584: [SPARK-42956][CONNECT] Support avro functions for Scala client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/29 07:19:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40584: [SPARK-42956][CONNECT] Support avro functions for Scala client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/29 07:20:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40587: [SPARK-42957][INFRA][FOLLOWUP] Use 'cyclonedx' instead of file extensions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/29 08:02:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40536: [SPARK-42895][CONNECT] Improve error messages for stopped Spark sessions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/29 08:02:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40536: [SPARK-42895][CONNECT] Improve error messages for stopped Spark sessions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/29 08:02:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40587: [SPARK-42957][INFRA][FOLLOWUP] Use 'cyclonedx' instead of file extensions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/29 08:09:09 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40587: [SPARK-42957][INFRA][FOLLOWUP] Use 'cyclonedx' instead of file extensions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/29 08:09:52 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40580: [SPARK-42952][SQL] Simplify the parameter of analyzer rule PreprocessTableCreation and DataSourceAnalysis - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/29 08:15:37 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40580: [SPARK-42952][SQL] Simplify the parameter of analyzer rule PreprocessTableCreation and DataSourceAnalysis - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/29 08:16:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40582: [SPARK-42954][PYTHON][CONNECT] Add `YearMonthIntervalType` to PySpark and Spark Connect Python Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/29 09:40:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40582: [SPARK-42954][PYTHON][CONNECT] Add `YearMonthIntervalType` to PySpark and Spark Connect Python Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/29 09:40:42 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40588: [SPARK-42964][SQL] PosgresDialect '42P07' also means table already exists - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/29 11:49:25 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40589: [SPARK-38697][SQL] Extend SparkSessionExtensions to inject rules into AQE query stage optimizer - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/29 11:53:59 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40589: [SPARK-38697][SQL] Extend SparkSessionExtensions to inject rules into AQE query stage optimizer - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/29 11:56:32 UTC, 0 replies.
- [GitHub] [spark] Kwafoor commented on a diff in pull request #40294: [SPARK-40610][SQL] Support unwrap date type to string type - posted by "Kwafoor (via GitHub)" <gi...@apache.org> on 2023/03/29 13:13:24 UTC, 0 replies.
- [GitHub] [spark] yabola closed pull request #40495: test reading footer within file range - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/03/29 13:18:10 UTC, 0 replies.
- [GitHub] [spark] VindhyaG commented on a diff in pull request #40553: [SPARK-39722] [SQL] getString API for Dataset - posted by "VindhyaG (via GitHub)" <gi...@apache.org> on 2023/03/29 13:22:25 UTC, 6 replies.
- [GitHub] [spark] infoankitp commented on a diff in pull request #40563: [SPARK-41232][SPARK-41233][FOLLOWUP] Refactor `array_append` and `array_prepend` with `RuntimeReplaceable` - posted by "infoankitp (via GitHub)" <gi...@apache.org> on 2023/03/29 13:47:16 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40565: [SPARK-42873][SQL] Define Spark SQL types as keywords - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/29 15:50:59 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #40565: [SPARK-42873][SQL] Define Spark SQL types as keywords - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/29 15:52:22 UTC, 0 replies.
- [GitHub] [spark] tomvanbussel opened a new pull request, #40590: [SPARK-42631][CONNECT][FOLLOW-UP] Expose Column.expr to extensions - posted by "tomvanbussel (via GitHub)" <gi...@apache.org> on 2023/03/29 17:06:48 UTC, 0 replies.
- [GitHub] [spark] paul-laffon-dd opened a new pull request, #40591: [SPARK-42950][CORE] Add exit code in SparkListenerApplicationEnd - posted by "paul-laffon-dd (via GitHub)" <gi...@apache.org> on 2023/03/29 17:15:25 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40581: [SPARK-42953][Connect] Typed filter, map, flatMap, mapPartitions - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/03/29 17:28:55 UTC, 2 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #40586: [SPARK-42939][SS][CONNECT] Core streaming Python API for Spark Connect - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/03/29 17:49:06 UTC, 3 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #40586: [SPARK-42939][SS][CONNECT] Core streaming Python API for Spark Connect - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/03/29 18:08:23 UTC, 9 replies.
- [GitHub] [spark] jiangxb1987 opened a new pull request, #40592: [SPARK-42967] Fix SparkListenerTaskStart.stageAttemptId when a task is started after the stage is cancelled - posted by "jiangxb1987 (via GitHub)" <gi...@apache.org> on 2023/03/29 18:37:49 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40590: [SPARK-42631][CONNECT][FOLLOW-UP] Expose Column.expr to extensions - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/29 18:51:06 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40581: [SPARK-42953][Connect] Typed filter, map, flatMap, mapPartitions - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/29 19:21:58 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40586: [SPARK-42939][SS][CONNECT] Core streaming Python API for Spark Connect - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/29 19:23:13 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40590: [SPARK-42631][CONNECT][FOLLOW-UP] Expose Column.expr to extensions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/29 19:27:46 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40590: [SPARK-42631][CONNECT][FOLLOW-UP] Expose Column.expr to extensions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/29 19:28:53 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #40593: [WIP][SQL] Defined typed literal constructors as keywords - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/29 19:58:28 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40594: [SPARK-42970][CONNECT][PYTHON][TESTS] Reuse pyspark.sql.tests.test_arrow test cases - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/29 22:34:15 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #36677: [SPARK-39296][CORE][SQL] Replcace `Array.toString` with `Array.mkString` - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/03/29 23:18:35 UTC, 1 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #36529: [SPARK-39102][CORE][SQL][DSTREAM] Add checkstyle rules to disabled use of Guava's `Files.createTempDir()` - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/03/29 23:24:10 UTC, 2 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #36529: [SPARK-39102][CORE][SQL][DSTREAM] Add checkstyle rules to disabled use of Guava's `Files.createTempDir()` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/03/29 23:37:31 UTC, 3 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #39102: [SPARK-41555][SQL] Multi sparkSession should share single SQLAppStatusStore - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/30 00:19:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40594: [SPARK-42970][CONNECT][PYTHON][TESTS] Reuse pyspark.sql.tests.test_arrow test cases - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 00:34:02 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40594: [SPARK-42970][CONNECT][PYTHON][TESTS] Reuse pyspark.sql.tests.test_arrow test cases - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 00:34:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #36677: [SPARK-39296][CORE][SQL] Replcace `Array.toString` with `Array.mkString` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/30 00:35:36 UTC, 1 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40595: [SPARK-42970][CONNECT][PYTHON][TESTS][3.4] Reuse pyspark.sql.tests.test_arrow test cases - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/30 01:46:42 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #40594: [SPARK-42970][CONNECT][PYTHON][TESTS] Reuse pyspark.sql.tests.test_arrow test cases - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/30 01:47:13 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40589: [SPARK-38697][SQL] Extend SparkSessionExtensions to inject rules into AQE query stage optimizer - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/30 01:49:18 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40596: [SPARK-42973][CONNECT][BUILD] Upgrade buf to v1.16.0 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/03/30 01:56:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40597: [SPARK-42971][CORE] Change to print `workdir` if `appDirs` is null when worker handle `WorkDirCleanup` event - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/30 02:21:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40597: [SPARK-42971][CORE] Change to print `workdir` if `appDirs` is null when worker handle `WorkDirCleanup` event - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/30 02:39:44 UTC, 1 replies.
- [GitHub] [spark] wangyum closed pull request #40294: [SPARK-40610][SQL] Support unwrap date type to string type - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/30 02:46:59 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #40589: [SPARK-38697][SQL] Extend SparkSessionExtensions to inject rules into AQE query stage optimizer - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/03/30 03:23:27 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #36529: [SPARK-39102][CORE][SQL][DSTREAM] Add checkstyle rules to disabled use of Guava's `Files.createTempDir()` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/30 03:56:00 UTC, 12 replies.
- [GitHub] [spark] itholic commented on pull request #39702: [SPARK-41487][SQL] Assign name to _LEGACY_ERROR_TEMP_1020 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/30 04:20:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40598: [SPARK-42974][CORE] Restore `Utils#createTempDir` use `ShutdownHookManager#registerShutdownDeleteDir` to cleanup tempDir - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/30 04:48:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40597: [SPARK-42971][CORE] Change to print `workdir` if `appDirs` is null when worker handle `WorkDirCleanup` event - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 04:50:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40597: [SPARK-42971][CORE] Change to print `workdir` if `appDirs` is null when worker handle `WorkDirCleanup` event - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 04:50:42 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40599: [SPARK-42907][TESTS][FOLLOWUP] Avro functions doctest cleanup - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/30 04:58:28 UTC, 0 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #40600: [SPARK-42968][SS] Add option to skip commit coordinator as part of StreamingWrite API for DSv2 sources/sinks - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/03/30 05:25:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40599: [SPARK-42907][TESTS][FOLLOWUP] Avro functions doctest cleanup - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 05:26:27 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #40600: [SPARK-42968][SS] Add option to skip commit coordinator as part of StreamingWrite API for DSv2 sources/sinks - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/03/30 05:26:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40599: [SPARK-42907][TESTS][FOLLOWUP] Avro functions doctest cleanup - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 05:26:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40598: [SPARK-42974][CORE] Restore `Utils#createTempDir` use `ShutdownHookManager#registerShutdownDeleteDir` to cleanup tempDir - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/30 05:59:03 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40601: [SPARK-42975][SQL] Cast result type to timestamp type for string +/- interval - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/30 06:25:10 UTC, 0 replies.
- [GitHub] [spark] zsxwing commented on a diff in pull request #40561: [SPARK-42931][SS] Introduce dropDuplicatesWithinWatermark - posted by "zsxwing (via GitHub)" <gi...@apache.org> on 2023/03/30 06:25:14 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40525: [SPARK-42859][CONNECT][PS] Basic support for pandas API on Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/30 06:36:14 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40437: [SPARK-41259][SQL] SparkSQLDriver Output schema and result string should be consistent - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/30 06:46:03 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40601: [SPARK-42975][SQL] Cast result type to timestamp type for string +/- interval - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/30 06:46:11 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40437: [SPARK-41259][SQL] SparkSQLDriver Output schema and result string should be consistent - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/30 06:53:11 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40602: [SPARK-42978][SQL] Derby&PG: RENAME cannot qualify a new-table-Name with a schema-Name - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/30 07:01:45 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40602: [SPARK-42978][SQL] Derby&PG: RENAME cannot qualify a new-table-Name with a schema-Name - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/30 07:10:08 UTC, 0 replies.
- [GitHub] [spark] ScrapCodes commented on pull request #40553: [SPARK-39722] [SQL] getString API for Dataset - posted by "ScrapCodes (via GitHub)" <gi...@apache.org> on 2023/03/30 07:14:52 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40586: [SPARK-42939][SS][CONNECT] Core streaming Python API for Spark Connect - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/30 07:38:01 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40593: [WIP][SQL] Define typed literal constructors as keywords - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/30 07:58:17 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #40603: [MINOR][CONNECT] Adding Proto Debug String to Job Description. - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/30 08:00:53 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40601: [SPARK-42975][SQL] Cast result type to timestamp type for string +/- interval - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/30 08:01:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40437: [SPARK-41259][SQL] SparkSQLDriver Output schema and result string should be consistent - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/30 08:09:06 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40593: [WIP][SQL] Define typed literal constructors as keywords - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/30 08:23:40 UTC, 2 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #40604: Revert "[SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles" - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/30 08:43:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40604: Revert "[SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles" - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/30 08:43:22 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40598: [SPARK-42974][CORE] Restore `Utils#createTempDir` use `ShutdownHookManager#registerShutdownDeleteDir` to cleanup tempDir - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/30 08:54:57 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/30 08:59:03 UTC, 5 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/03/30 09:05:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40605: [SPARK-42958][CONNECT] Refactor `CheckConnectJvmClientCompatibility` to compare client and avro module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/30 09:08:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40300: [SPARK-42683] Automatically rename conflicting metadata columns - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/30 09:09:34 UTC, 0 replies.
- [GitHub] [spark] lyy-pineapple commented on pull request #38171: [SPARK-9213] [SQL] Improve regular expression performance (via joni) - posted by "lyy-pineapple (via GitHub)" <gi...@apache.org> on 2023/03/30 09:27:48 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #40606: Debugging is awesome - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/03/30 09:30:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40605: [SPARK-42958][CONNECT] Refactor `connect-jvm-client-mima-check` to support mima check with avro module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/30 09:49:18 UTC, 0 replies.
- [GitHub] [spark] huangxiaopingRD commented on a diff in pull request #40232: [SPARK-42629][DOCS] Update the description of default data source in the document - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2023/03/30 09:49:58 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40607: [WIP][ML] Make Torch Distributor support Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/30 10:15:15 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/30 10:16:57 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40607: [WIP][ML] Make Torch Distributor support Spark Connect - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/03/30 10:36:47 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40607: [WIP][ML] Make Torch Distributor support Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/30 10:39:57 UTC, 2 replies.
- [GitHub] [spark] cloud-fan closed pull request #40604: Revert "[SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles" - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/30 11:43:49 UTC, 0 replies.
- [GitHub] [spark] martin-kokos commented on pull request #39941: [MINOR][DOCS] Add link to Hadoop docs - posted by "martin-kokos (via GitHub)" <gi...@apache.org> on 2023/03/30 12:38:49 UTC, 0 replies.
- [GitHub] [spark] martin-kokos closed pull request #39941: [MINOR][DOCS] Add link to Hadoop docs - posted by "martin-kokos (via GitHub)" <gi...@apache.org> on 2023/03/30 12:38:50 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40600: [SPARK-42968][SS] Add option to skip commit coordinator as part of StreamingWrite API for DSv2 sources/sinks - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/30 12:48:20 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40600: [SPARK-42968][SS] Add option to skip commit coordinator as part of StreamingWrite API for DSv2 sources/sinks - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/03/30 12:49:00 UTC, 0 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #40567: [SPARK-42935] [SQL] Add union required distribution push down - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/30 13:23:48 UTC, 0 replies.
- [GitHub] [spark] juanvisoler opened a new pull request, #40608: SPARK-35198 - posted by "juanvisoler (via GitHub)" <gi...@apache.org> on 2023/03/30 14:04:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40604: Revert "[SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles" - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/30 14:27:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40593: [SPARK-42979][SQL] Define literal constructors as keywords - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/30 14:43:12 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40593: [SPARK-42979][SQL] Define literal constructors as keywords - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/30 14:44:07 UTC, 0 replies.
- [GitHub] [spark] juanvisoler commented on pull request #40608: [SPARK-35198][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "juanvisoler (via GitHub)" <gi...@apache.org> on 2023/03/30 15:06:09 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #40609: [SPARK-42316][SQL] Assign name to _LEGACY_ERROR_TEMP_2044 - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/03/30 15:27:33 UTC, 0 replies.
- [GitHub] [spark] ivoson opened a new pull request, #40610: [SPARK-42626][CONNECT] Add Destructive Iterator for SparkResult - posted by "ivoson (via GitHub)" <gi...@apache.org> on 2023/03/30 15:53:09 UTC, 0 replies.
- [GitHub] [spark] arturobernalg commented on pull request #40608: [SPARK-35198][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "arturobernalg (via GitHub)" <gi...@apache.org> on 2023/03/30 17:19:40 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40611: [SPARK-42981][CONNECT] Add direct arrow serialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/30 17:55:13 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40611: [SPARK-42981][CONNECT] Add direct arrow serialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/30 17:57:23 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40611: [SPARK-42981][CONNECT] Add direct arrow serialization - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/03/30 18:09:35 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40609: [SPARK-42316][SQL] Assign name to _LEGACY_ERROR_TEMP_2044 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/30 18:26:41 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #40587: [SPARK-42957][INFRA][FOLLOWUP] Use 'cyclonedx' instead of file extensions - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/03/30 18:27:08 UTC, 0 replies.
- [GitHub] [spark] shrprasa commented on pull request #40363: [SPARK_42744] delete uploaded file when job finish for k8s - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/03/30 19:34:16 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40612: [SPARK-42969][CONNECT][TESTS] Fix the comparison the result with Arrow optimization enabled/disabled - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/30 21:32:20 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40592: [SPARK-42967][CORE][3.2][3.3][3.4] Fix SparkListenerTaskStart.stageAttemptId when a task is started after the stage is cancelled - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/30 22:47:39 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40592: [SPARK-42967][CORE][3.2][3.3][3.4] Fix SparkListenerTaskStart.stageAttemptId when a task is started after the stage is cancelled - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/30 22:48:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40612: [SPARK-42969][CONNECT][TESTS] Fix the comparison the result with Arrow optimization enabled/disabled - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 23:45:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40612: [SPARK-42969][CONNECT][TESTS] Fix the comparison the result with Arrow optimization enabled/disabled - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 23:45:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40595: [SPARK-42970][CONNECT][PYTHON][TESTS][3.4] Reuse pyspark.sql.tests.test_arrow test cases - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 23:46:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40595: [SPARK-42970][CONNECT][PYTHON][TESTS][3.4] Reuse pyspark.sql.tests.test_arrow test cases - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 23:46:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40608: [SPARK-35198][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 23:50:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40591: [SPARK-42950][CORE] Add exit code in SparkListenerApplicationEnd - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/30 23:55:27 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39937: [SPARK-42309][SQL] Introduce `INCOMPATIBLE_DATA_TO_TABLE` and sub classes. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/31 00:01:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40612: [SPARK-42969][CONNECT][TESTS] Fix the comparison the result with Arrow optimization enabled/disabled - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/31 00:08:53 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #39130: [SPARK-xxxxx][DOCUMENTATION][PYTHON] Fix grammar in docstring for toDF(). - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/31 00:19:08 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #39102: [SPARK-41555][SQL] Multi sparkSession should share single SQLAppStatusStore - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/03/31 00:19:10 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40613: Revert "[SPARK-39204][CORE] Change `Utils.createTempDir` and `Utils.createDirectory` call the same logic method in `JavaUtils` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/31 01:23:53 UTC, 0 replies.
- [GitHub] [spark] lucaspompeun opened a new pull request, #40614: correction of protobuf sql docuentation - posted by "lucaspompeun (via GitHub)" <gi...@apache.org> on 2023/03/31 01:24:51 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40613: Revert "[SPARK-39204][CORE] Change `Utils.createTempDir` and `Utils.createDirectory` call the same logic method in `JavaUtils` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/31 01:28:26 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40601: [SPARK-42975][SQL] Cast result type to timestamp type for string +/- interval - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/31 01:29:40 UTC, 1 replies.
- [GitHub] [spark] lucaspompeun commented on pull request #40614: [SPARK-42987][DOCS] Correction of protobuf sql documentation - posted by "lucaspompeun (via GitHub)" <gi...@apache.org> on 2023/03/31 01:32:21 UTC, 0 replies.
- [GitHub] [spark] RyanBerti opened a new pull request, #40615: [SPARK-16484][SQL] Add support for Datasketches HllSketch - posted by "RyanBerti (via GitHub)" <gi...@apache.org> on 2023/03/31 01:38:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #40598: [SPARK-42974][CORE] Restore `Utils#createTempDir` use `ShutdownHookManager#registerShutdownDeleteDir` to cleanup tempDir - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/31 01:40:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40613: [SPARK-42974][CORE] Separate Implementation of `Utils.createTempDir` and `JavaUtils.createTempDir` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/31 01:45:08 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40602: [SPARK-42978][SQL] Derby&PG: RENAME cannot qualify a new-table-Name with a schema-Name - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/31 02:17:11 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40613: [SPARK-42974][CORE] Restore `Utils.createTempDir` to use the `ShutdownHookManager` and clean up `JavaUtils.createTempDir` method. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/31 02:37:20 UTC, 6 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40610: [SPARK-42626][CONNECT] Add Destructive Iterator for SparkResult - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/03/31 02:40:51 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #40561: [SPARK-42931][SS] Introduce dropDuplicatesWithinWatermark - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/31 03:00:07 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #40601: [SPARK-42975][SQL] Cast result type to timestamp type for string +/- interval - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/31 03:09:45 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40601: [SPARK-42975][SQL] Cast result type to timestamp type for string +/- interval - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/03/31 03:15:43 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #40609: [SPARK-42316][SQL] Assign name to _LEGACY_ERROR_TEMP_2044 - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/03/31 03:22:49 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40545: [SPARK-42918] Generalize handling of metadata attributes in FileSourceStrategy - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/31 03:30:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40602: [SPARK-42978][SQL] Derby&PG: RENAME cannot qualify a new-table-Name with a schema-Name - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/31 03:39:50 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #32987: [SPARK-35564][SQL] Support subexpression elimination for conditionally evaluated expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/31 03:49:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40589: [SPARK-38697][SQL] Extend SparkSessionExtensions to inject rules into AQE query stage optimizer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/31 03:53:26 UTC, 6 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40583: [SPARK-42955][SQL] Skip classifyException and wrap AnalysisException for SparkThrowable - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/31 03:53:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40610: [SPARK-42626][CONNECT] Add Destructive Iterator for SparkResult - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/31 03:54:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40605: [SPARK-42958][CONNECT] Refactor `connect-jvm-client-mima-check` to support mima check with avro module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/31 03:59:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40116: [SPARK-41391][SQL] The output column name of groupBy.agg(count_distinct) is incorrect - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/31 04:13:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40583: [SPARK-42955][SQL] Skip classifyException and wrap AnalysisException for SparkThrowable - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/31 04:19:18 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40583: [SPARK-42955][SQL] Skip classifyException and wrap AnalysisException for SparkThrowable - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/31 04:22:28 UTC, 0 replies.
- [GitHub] [spark] ivoson commented on a diff in pull request #40610: [SPARK-42626][CONNECT] Add Destructive Iterator for SparkResult - posted by "ivoson (via GitHub)" <gi...@apache.org> on 2023/03/31 04:39:34 UTC, 0 replies.
- [GitHub] [spark] juanvisoler commented on a diff in pull request #40608: [SPARK-35198][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "juanvisoler (via GitHub)" <gi...@apache.org> on 2023/03/31 05:38:07 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40616: [SPARK-42991][SQL] Disable string type +/- interval in ANSI mode - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/31 07:00:28 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40617: [SPARK-42992][PYTHON] Introduce PySparkRuntimeError - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/31 07:01:30 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40616: [SPARK-42991][SQL] Disable string type +/- interval in ANSI mode - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/03/31 07:02:13 UTC, 0 replies.
- [GitHub] [spark] thyecust opened a new pull request, #40618: fix typo in StorageLevel __eq__() - posted by "thyecust (via GitHub)" <gi...@apache.org> on 2023/03/31 07:06:09 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40523: [SPARK-42897][SQL] Avoid evaluate more than once for the variables from the left side in the FullOuter SMJ condition - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/31 07:14:38 UTC, 0 replies.
- [GitHub] [spark] thyecust closed pull request #40618: fix typo in StorageLevel __eq__() - posted by "thyecust (via GitHub)" <gi...@apache.org> on 2023/03/31 07:16:23 UTC, 0 replies.
- [GitHub] [spark] thyecust opened a new pull request, #40619: fix typo in StorageLevel __eq__() - posted by "thyecust (via GitHub)" <gi...@apache.org> on 2023/03/31 07:16:56 UTC, 0 replies.
- [GitHub] [spark] thyecust opened a new pull request, #40620: fix typo in pyspark/pandas/config.py - posted by "thyecust (via GitHub)" <gi...@apache.org> on 2023/03/31 07:20:53 UTC, 0 replies.
- [GitHub] [spark] liangyu-1 opened a new pull request, #40621: Fix ExecutorAllocationManager cannot allocate new instances when all … - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/03/31 07:28:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40616: [SPARK-42991][SQL] Disable string type +/- interval in ANSI mode - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/31 07:29:18 UTC, 0 replies.
- [GitHub] [spark] thyecust opened a new pull request, #40622: fix typo in ResourceRequest.equals() - posted by "thyecust (via GitHub)" <gi...@apache.org> on 2023/03/31 07:31:30 UTC, 0 replies.
- [GitHub] [spark] liangyu-1 commented on pull request #40621: Fix ExecutorAllocationManager cannot allocate new instances when all … - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/03/31 07:44:00 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #40602: [SPARK-42978][SQL] Derby&PG: RENAME cannot qualify a new-table-Name with a schema-Name - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/03/31 08:06:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40622: fix typo in ResourceRequest.equals() - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/03/31 08:18:00 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40525: [SPARK-42859][CONNECT][PS] Basic support for pandas API on Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/03/31 09:45:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40611: [SPARK-42981][CONNECT] Add direct arrow serialization - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/03/31 09:47:36 UTC, 3 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #40623: [WIP][SQL] Parameterized `sql()` with literal args - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/03/31 10:00:16 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40624: [SPARK-42995][CONNECT][PYTHON] Migrate Spark Connect DataFrame errors into error class - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/31 10:39:28 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40525: [SPARK-42859][CONNECT][PS] Basic support for pandas API on Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/03/31 10:44:05 UTC, 4 replies.
- [GitHub] [spark] zhmin commented on a diff in pull request #40567: [SPARK-42935] [SQL] Add union required distribution push down - posted by "zhmin (via GitHub)" <gi...@apache.org> on 2023/03/31 10:45:25 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40545: [SPARK-42918] Generalize handling of metadata attributes in FileSourceStrategy - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/31 12:51:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40545: [SPARK-42918] Generalize handling of metadata attributes in FileSourceStrategy - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/03/31 12:52:18 UTC, 0 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #40574: [SPARK-42942][SQL] Support coalesce table cache stage partitions - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/31 13:39:28 UTC, 0 replies.
- [GitHub] [spark] jaceklaskowski commented on a diff in pull request #40607: [SPARK-42993][ML][CONNECT] Make PyTorch Distributor support Spark Connect - posted by "jaceklaskowski (via GitHub)" <gi...@apache.org> on 2023/03/31 14:11:10 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40589: [SPARK-38697][SQL] Extend SparkSessionExtensions to inject rules into AQE query stage optimizer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/31 16:48:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40596: [SPARK-42973][CONNECT][BUILD] Upgrade buf to v1.16.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/03/31 16:53:04 UTC, 0 replies.
- [GitHub] [spark] arunsimhateachmint opened a new pull request, #40625: [TDE-88] Access control on spark-3.3.2 - posted by "arunsimhateachmint (via GitHub)" <gi...@apache.org> on 2023/03/31 18:10:51 UTC, 0 replies.
- [GitHub] [spark] arunsimhateachmint closed pull request #40625: [TDE-88] Access control on spark-3.3.2 - posted by "arunsimhateachmint (via GitHub)" <gi...@apache.org> on 2023/03/31 18:14:00 UTC, 0 replies.
- [GitHub] [spark] ritikam2 commented on pull request #18994: [SPARK-21784][SQL] Adds support for defining informational primary key and foreign key constraints using ALTER TABLE DDL. - posted by "ritikam2 (via GitHub)" <gi...@apache.org> on 2023/03/31 18:40:51 UTC, 0 replies.
- [GitHub] [spark] clownxc opened a new pull request, #40626: [SPARK-42860][SQL] Add analysed logical mode in org.apache.spark.sql.execution.ExplainMode - posted by "clownxc (via GitHub)" <gi...@apache.org> on 2023/03/31 18:50:37 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40622: fix typo in ResourceRequest.equals() - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/03/31 19:10:53 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #40615: [WIP][SPARK-16484][SQL] Add support for Datasketches HllSketch - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/03/31 20:01:40 UTC, 0 replies.
- [GitHub] [spark] RyanBerti commented on a diff in pull request #40615: [WIP][SPARK-16484][SQL] Add support for Datasketches HllSketch - posted by "RyanBerti (via GitHub)" <gi...@apache.org> on 2023/03/31 21:36:41 UTC, 3 replies.
- [GitHub] [spark] WweiL commented on pull request #40586: [SPARK-42939][SS][CONNECT] Core streaming Python API for Spark Connect - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/03/31 22:37:38 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40627: [SPARK-42998][CONNECT][PYTHON] Fix DataFrame.collect with null struct - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/03/31 22:45:07 UTC, 0 replies.