- [GitHub] [spark] gengliangwang opened a new pull request, #39827: [SPARK-36180][SQL] Support TimestampNTZ type in Hive metastore - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/01 00:00:52 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39827: [SPARK-36180][SQL] Support TimestampNTZ type in Hive metastore - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/01 00:01:20 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #39825: [SPARK-42261][SPARK-42260][K8S] Log Allocation Stalls and Trigger Allocation event without blocking on snapshot - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/02/01 00:09:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39789: [SPARK-42228][BUILD][CONNECT] Add shade and relocation rule of grpc to connect-client-jvm module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/01 00:24:30 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39821: [SPARK-42253][PYTHON] Add test for detecting duplicated error class - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/01 00:46:18 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39789: [SPARK-42228][BUILD][CONNECT] Add shade and relocation rule of grpc to connect-client-jvm module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/01 01:02:00 UTC, 11 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39824: [SPARK-42259][SQL] ResolveGroupingAnalytics should take care of Python UDAF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 01:03:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39821: [SPARK-42253][PYTHON] Add test for detecting duplicated error class - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/01 01:05:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39747: [SPARK-42191][SQL] Support udf 'luhn_check' - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 01:16:30 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39747: [SPARK-42191][SQL] Support udf 'luhn_check' - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 01:16:43 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39799: [SPARK-42232][SQL] Rename error class: `UNSUPPORTED_FEATURE.JDBC_TRANSACTION` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 01:19:02 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39820: [SPARK-42249][SQL] Refining html link for documentation in error messages. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/01 01:23:32 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #37479: [SPARK-40045][SQL]Optimize the order of filtering predicates - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/01 01:23:42 UTC, 2 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39805: [SPARK-42238][SQL] Introduce new error class: `INCOMPATIBLE_JOIN_TYPES` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 01:27:52 UTC, 0 replies.
- [GitHub] [spark] weicm commented on pull request #39097: [SPARK-42169] Implement code generation for to_csv function (StructsToCsv) - posted by "weicm (via GitHub)" <gi...@apache.org> on 2023/02/01 01:49:08 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/01 01:52:52 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #39555: [SPARK-42051][SQL] Codegen Support for HiveGenericUDF - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/01 01:56:35 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #39555: [SPARK-42051][SQL] Codegen Support for HiveGenericUDF - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/01 01:56:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39827: [SPARK-36180][SQL] Support TimestampNTZ type in Hive metastore - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 01:58:40 UTC, 0 replies.
- [GitHub] [spark] melin commented on pull request #39626: An automatic caching solution for Spark - posted by "melin (via GitHub)" <gi...@apache.org> on 2023/02/01 02:24:27 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39828: [3.4][SPARK-42158][SQL] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:27:07 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39706: [SPARK-42158][SQL] Integrate `_LEGACY_ERROR_TEMP_1003` into `FIELD_NOT_FOUND` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:27:22 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39829: [3.4][SPARK-41489][SQL] Assign name to _LEGACY_ERROR_TEMP_2415 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:30:19 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39701: [SPARK-41489][SQL] Assign name to _LEGACY_ERROR_TEMP_2415 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:30:26 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39830: [SPARK-41490][SQL] Assign name to _LEGACY_ERROR_TEMP_2441 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:32:00 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39700: [SPARK-41490][SQL] Assign name to _LEGACY_ERROR_TEMP_2441 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:32:08 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39831: [SPARK-42239][SQL] Integrate `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:33:56 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39806: [SPARK-42239][SQL] Integrate `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:34:02 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39832: [SPARK-42229][CORE] Migrate `SparkCoreErrors` into error classes - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:35:19 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39791: [SPARK-42229][CORE] Migrate `SparkCoreErrors` into error classes - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:35:31 UTC, 0 replies.
- [GitHub] [spark] sadikovi closed pull request #39660: [SPARK-42128][SQL] Support TOP (N) for MS SQL Server dialect as an alternative to Limit pushdown - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/01 02:36:54 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/01 02:38:19 UTC, 11 replies.
- [GitHub] [spark] itholic opened a new pull request, #39833: [3.4][SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 (and 1177) - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:41:26 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39705: [SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 (and 1177) - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:41:48 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39705: [SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 (and 1177) - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 02:45:59 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 02:55:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39555: [SPARK-42051][SQL] Codegen Support for HiveGenericUDF - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/01 03:35:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39834: [WIP][CONNEC][TESTS] Use an available ephemeral port for Spark Connect server in testing - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/01 03:42:46 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39827: [SPARK-36180][SQL] Support TimestampNTZ type in Hive metastore - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/01 04:06:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39835: [SPARK-42268][CONNECT][PYTHON] Add UserDefinedType in protos - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/01 04:47:10 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39836: [SPARK-41931][SQL][FOLLOWUP] Refine example more useful - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 05:08:26 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39705: [SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 (and 1177) - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 05:08:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39811: [SPARK-42242][BUILD] Upgrade `snappy-java` to 1.1.9.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/01 05:37:29 UTC, 2 replies.
- [GitHub] [spark] erenavsarogullari commented on pull request #39037: [SPARK-41214][SQL] Fix AQE cache does not update plan and metrics - posted by "erenavsarogullari (via GitHub)" <gi...@apache.org> on 2023/02/01 05:45:19 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39837: [SPARK-42254][SQL] Assign name to _LEGACY_ERROR_TEMP_1117 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 05:48:53 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39837: [SPARK-42254][SQL] Assign name to _LEGACY_ERROR_TEMP_1117 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 05:49:25 UTC, 0 replies.
- [GitHub] [spark] ulysses-you closed pull request #39556: [SPARK-42049][SQL] Improve AliasAwareOutputExpression - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/01 05:51:15 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #39838: [SPARK-42270][SQL] Sort merge join may oom when right match rows are very large - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/02/01 05:59:20 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #39835: [SPARK-42268][CONNECT][PYTHON] Add UserDefinedType in protos - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/01 06:03:26 UTC, 5 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39814: [SPARK-42271][CONNECT][PYTHON] Reuse UDF test cases under `pyspark.sql.tests` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/01 06:14:40 UTC, 1 reply.
- [GitHub] [spark] dongjoon-hyun closed pull request #39811: [SPARK-42242][BUILD] Upgrade `snappy-java` to 1.1.9.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/01 06:14:48 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #39759: [SPARK-36124][SQL] Support subqueries with correlation through INTERSECT/EXCEPT - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/01 06:22:42 UTC, 1 reply.
- [GitHub] [spark] HyukjinKwon commented on pull request #39834: [SPARK-42272][CONNEC][TESTS] Use an available ephemeral port for Spark Connect server in testing - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/01 06:40:05 UTC, 1 reply.
- [GitHub] [spark] EnricoMi commented on pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/02/01 06:46:05 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39839: [SPARK-42255][SQL] Assign name to _LEGACY_ERROR_TEMP_2430 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 06:51:34 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39839: [SPARK-42255][SQL] Assign name to _LEGACY_ERROR_TEMP_2430 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 06:52:16 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on pull request #39823: [SPARK-42257][CORE] Remove unused variable external sorter - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/02/01 06:59:20 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37588: [SPARK-33393][SQL] Support SHOW TABLE EXTENDED in v2 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/01 07:06:58 UTC, 1 reply.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39839: [SPARK-42255][SQL] Assign name to _LEGACY_ERROR_TEMP_2430 - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 07:18:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39835: [SPARK-42268][CONNECT][PYTHON] Add UserDefinedType in protos - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/01 07:18:52 UTC, 3 replies.
- [GitHub] [spark] itholic commented on pull request #39836: [SPARK-41931][SQL][FOLLOWUP] Refine example more useful - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 07:31:39 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39839: [SPARK-42255][SQL] Assign name to _LEGACY_ERROR_TEMP_2430 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 07:32:33 UTC, 4 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/01 07:37:04 UTC, 1 reply.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39840: [SPARK-42273][CONNECT][TESTS] Skip Spark Connect tests if dependencies are not installed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/01 07:38:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39840: [SPARK-42273][CONNECT][TESTS] Skip Spark Connect tests if dependencies are not installed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/01 07:40:42 UTC, 1 reply.
- [GitHub] [spark] anchovYu commented on pull request #39773: [SPARK-42217][SQL] Support implicit lateral column alias in queries with Window - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/02/01 07:42:17 UTC, 1 reply.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by "Ngone51 (via GitHub)" <gi...@apache.org> on 2023/02/01 07:43:04 UTC, 20 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39799: [SPARK-42232][SQL] Rename error class: `UNSUPPORTED_FEATURE.JDBC_TRANSACTION` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/01 07:51:05 UTC, 1 reply.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39841: [SPARK-42274][BUILD] Upgrade `compress-lzf` to 1.1.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/01 07:57:01 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39841: [SPARK-42274][BUILD] Upgrade `compress-lzf` to 1.1.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/01 08:10:53 UTC, 2 replies.
- [GitHub] [spark] kelvinjian-db opened a new pull request, #39842: [SPARK-42115][SQL] Push down limit through Python UDFs - posted by "kelvinjian-db (via GitHub)" <gi...@apache.org> on 2023/02/01 08:16:39 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #39843: [SPARK-39347] [SS] Bug fix for time window calculation when event time < 0 - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/02/01 08:19:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39844: [SPARK-42275][CONNECT][PYTHON] Avoid using built-in list, dict in static typing - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/01 08:35:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39845: [SPARK-42277][CORE] Use RocksDB for `spark.history.store.hybridStore.diskBackend` by default - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/01 08:44:53 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39845: [SPARK-42277][CORE] Use RocksDB for `spark.history.store.hybridStore.diskBackend` by default - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/01 08:52:17 UTC, 5 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39846: [SPARK-42278][SQL] DS V2 pushdown supports supports JDBC dialects compile `SortOrder` by themselves - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/01 09:00:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39653: [SPARK-42115][SQL][PYTHON] Push down limit through Python UDFs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/01 09:23:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39653: [SPARK-42115][SQL][PYTHON] Push down limit through Python UDFs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/01 09:23:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39834: [SPARK-42272][CONNEC][TESTS] Use an available ephemeral port for Spark Connect server in testing - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/01 09:27:13 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39842: [SPARK-42115][SQL] Push down limit through Python UDFs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 09:34:57 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39824: [SPARK-42259][SQL] ResolveGroupingAnalytics should take care of Python UDAF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 09:35:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39824: [SPARK-42259][SQL] ResolveGroupingAnalytics should take care of Python UDAF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 09:36:26 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39827: [WIP][SPARK-36180][SQL] Support TimestampNTZ type in Hive metastore - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/01 09:49:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39840: [SPARK-42273][CONNECT][TESTS] Skip Spark Connect tests if dependencies are not installed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/01 10:09:12 UTC, 1 reply.
- [GitHub] [spark] dongjoon-hyun closed pull request #39841: [SPARK-42274][BUILD] Upgrade `compress-lzf` to 1.1.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/01 10:12:24 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on a diff in pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/02/01 10:24:04 UTC, 6 replies.
- [GitHub] [spark] NarekDW commented on pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/02/01 10:29:02 UTC, 1 reply.
- [GitHub] [spark] beliefer commented on pull request #39846: [SPARK-42278][SQL] DS V2 pushdown supports supports JDBC dialects compile `SortOrder` by themselves - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/01 11:12:07 UTC, 1 reply.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39847: [SPARK-42279][PS][TESTS] Simplify `test_resample` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/01 12:03:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39124: [DON'T MERGE] Test build and test with hadoop 3.3.5-RC0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/01 12:08:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39846: [SPARK-42278][SQL] DS V2 pushdown supports supports JDBC dialects compile `SortOrder` by themselves - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 12:17:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39846: [SPARK-42278][SQL] DS V2 pushdown supports supports JDBC dialects compile `SortOrder` by themselves - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 12:18:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39848: [SPARK-42276][BUILD][CONNECT] Add `ServicesResourceTransformer` rule to connect server module shade configuration - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/01 13:03:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39849: [SPARK-42282][PS][TESTS] Split `pyspark.pandas.tests.test_groupby` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/01 13:13:03 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on a diff in pull request #39759: [SPARK-36124][SQL] Support subqueries with correlation through INTERSECT/EXCEPT - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/02/01 13:15:41 UTC, 8 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39848: [SPARK-42276][BUILD][CONNECT] Add `ServicesResourceTransformer` rule to connect server module shade configuration - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/01 13:25:01 UTC, 3 replies.
- [GitHub] [spark] vicennial opened a new pull request, #39850: [SPARK-42283][CONNECT][SCALA] Simple Scalar Scala UDFs - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/02/01 13:31:47 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #39851: [MINOR][SQL] Enhance data type check error message - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/01 13:41:54 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #39850: [SPARK-42283][CONNECT][SCALA] Simple Scalar Scala UDFs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/01 13:45:18 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/01 13:45:23 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #39850: [SPARK-42283][CONNECT][SCALA] Simple Scalar Scala UDFs - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/02/01 13:48:54 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39847: [SPARK-42279][PS][TESTS] Simplify `pyspark.pandas.tests.test_resample` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/01 13:50:43 UTC, 1 reply.
- [GitHub] [spark] hvanhovell commented on pull request #39848: [SPARK-42276][BUILD][CONNECT] Add `ServicesResourceTransformer` rule to connect server module shade configuration - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/01 13:55:53 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39852: [SPARK-42281][SQL] Update Debugging PySpark documents to show error message properly - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/01 13:57:23 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #39853: WIP watermark propagate simulator - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/01 14:06:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39841: [SPARK-42274][BUILD] Upgrade `compress-lzf` to 1.1.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/01 14:11:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39555: [SPARK-42051][SQL] Codegen Support for HiveGenericUDF - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/01 14:12:57 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39839: [SPARK-42255][SQL] Assign name to _LEGACY_ERROR_TEMP_2430 - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/01 14:41:44 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #39773: [SPARK-42217][SQL] Support implicit lateral column alias in queries with Window - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/01 14:44:50 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39837: [SPARK-42254][SQL] Assign name to _LEGACY_ERROR_TEMP_1117 - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/01 14:49:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 14:51:09 UTC, 15 replies.
- [GitHub] [spark] planga82 commented on a diff in pull request #39826: [SPARK-42262][SQL] Table schema changes via V2SessionCatalog with HiveExternalCatalog - posted by "planga82 (via GitHub)" <gi...@apache.org> on 2023/02/01 15:01:31 UTC, 4 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/01 15:18:57 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39831: [3.4][SPARK-42239][SQL] Integrate `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/01 15:40:51 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39831: [3.4][SPARK-42239][SQL] Integrate `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/01 15:44:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39832: [3.4][SPARK-42229][CORE] Migrate `SparkCoreErrors` into error classes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/01 15:45:10 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39832: [3.4][SPARK-42229][CORE] Migrate `SparkCoreErrors` into error classes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/01 15:46:54 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39833: [3.4][SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 (and 1177) - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/01 15:48:01 UTC, 0 replies.
- [GitHub] [spark] Daniel-Davies commented on a diff in pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by "Daniel-Davies (via GitHub)" <gi...@apache.org> on 2023/02/01 15:51:25 UTC, 3 replies.
- [GitHub] [spark] MaxGekk closed pull request #39833: [3.4][SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 (and 1177) - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/01 15:54:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39508: [SPARK-41985][SQL] Centralize more column resolution rules - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 16:24:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39508: [SPARK-41985][SQL] Centralize more column resolution rules - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/01 16:24:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #39854: [SPARK-42284][CONNECT] Make sure connect server assembly is built before running client tests - SBT - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/01 16:51:05 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39854: [SPARK-42284][CONNECT] Make sure connect server assembly is built before running client tests - SBT - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/01 16:54:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39854: [SPARK-42284][CONNECT] Make sure connect server assembly is built before running client tests - SBT - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/01 16:56:40 UTC, 1 reply.
- [GitHub] [spark] srowen commented on a diff in pull request #39801: [MINOR][DOCS][SQL] Fix FoldablePropagation rule document - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/01 17:21:06 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #39775: [SPARK-42219][CORE] Introducing a config to close all active SparkContexts after the Main method has finished - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/02/01 18:06:40 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39789: [SPARK-42228][BUILD][CONNECT] Add shade and relocation rule of grpc to connect-client-jvm module - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/01 18:10:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #39789: [SPARK-42228][BUILD][CONNECT] Add shade and relocation rule of grpc to connect-client-jvm module - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/01 18:10:40 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39850: [SPARK-42283][CONNECT][SCALA] Simple Scalar Scala UDFs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/01 18:12:04 UTC, 2 replies.
- [GitHub] [spark] hvanhovell closed pull request #39850: [SPARK-42283][CONNECT][SCALA] Simple Scalar Scala UDFs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/01 18:12:47 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #38419: [SPARK-40945][SQL] Support built-in function to truncate numbers - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/02/01 18:37:20 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39773: [SPARK-42217][SQL] Support implicit lateral column alias in queries with Window - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/01 18:51:11 UTC, 2 replies.
- [GitHub] [spark] RunyaoChen opened a new pull request, #39855: [SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "RunyaoChen (via GitHub)" <gi...@apache.org> on 2023/02/01 20:05:46 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #38419: [SPARK-40945][SQL] Support built-in function to truncate numbers - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/01 20:55:29 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39856: [SPARK-38829][SQL] Introduce conf spark.sql.parquet.inferTimestampNTZ.enabled for TimestampNTZ inference on Parquet - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/01 21:54:44 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39856: [SPARK-38829][SQL] Introduce conf spark.sql.parquet.inferTimestampNTZ.enabled for TimestampNTZ inference on Parquet - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/01 21:54:59 UTC, 3 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39855: [SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/01 22:30:42 UTC, 1 reply.
- [GitHub] [spark] gengliangwang commented on pull request #39855: [SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/01 22:38:05 UTC, 0 replies.
- [GitHub] [spark] RunyaoChen commented on pull request #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "RunyaoChen (via GitHub)" <gi...@apache.org> on 2023/02/01 22:45:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39845: [SPARK-42277][CORE] Use RocksDB for `spark.history.store.hybridStore.diskBackend` by default - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/01 22:59:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39820: [SPARK-42249][SQL] Refining html link for documentation in error messages. - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/01 23:02:19 UTC, 0 replies.
- [GitHub] [spark] RunyaoChen commented on a diff in pull request #39855: [SPARK-42286][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "RunyaoChen (via GitHub)" <gi...@apache.org> on 2023/02/01 23:03:19 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #36158: [SPARK-38829][SQL] Add a configuration flag to enable TIMESTAMP_NTZ support in Parquet data source - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/02 00:00:17 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38330: [SPARK-40868][SQL] Avoid introducing too many partitions when bucketed scan disabled by sql planner - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/02 00:20:21 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #39856: [SPARK-38829][SQL] Introduce conf spark.sql.parquet.inferTimestampNTZ.enabled for TimestampNTZ inference on Parquet - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/02 00:26:05 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39827: [WIP][SPARK-36180][SQL] Support TimestampNTZ type in Hive metastore - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/02 00:40:08 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #39851: [MINOR][SQL] Enhance data type check error message - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/02 00:45:20 UTC, 1 reply.
- [GitHub] [spark] gengliangwang commented on pull request #39827: [WIP][SPARK-36180][SQL] Support TimestampNTZ type in Hive metastore - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/02 00:52:23 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39842: [SPARK-42115][SQL] Push down limit through Python UDFs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/02 01:01:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39849: [SPARK-42282][PS][TESTS] Split `pyspark.pandas.tests.test_groupby` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/02 01:01:21 UTC, 1 reply.
- [GitHub] [spark] HyukjinKwon commented on pull request #39854: [SPARK-42284][CONNECT] Make sure connect server assembly is built before running client tests - SBT - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 01:08:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39854: [SPARK-42284][CONNECT] Make sure connect server assembly is built before running client tests - SBT - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 01:08:37 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39851: [MINOR][SQL] Enhance data type check error message - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/02 01:19:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39856: [SPARK-38829][SQL] Introduce conf spark.sql.parquet.inferTimestampNTZ.enabled for TimestampNTZ inference on Parquet - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/02 01:19:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39847: [SPARK-42279][PS][TESTS] Simplify `pyspark.pandas.tests.test_resample` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 01:37:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39847: [SPARK-42279][PS][TESTS] Simplify `pyspark.pandas.tests.test_resample` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 01:37:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39844: [SPARK-42275][CONNECT][PYTHON] Avoid using built-in list, dict in static typing - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 01:38:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39844: [SPARK-42275][CONNECT][PYTHON] Avoid using built-in list, dict in static typing - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 01:38:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39836: [SPARK-41931][SQL][FOLLOWUP] Refine example more useful - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 01:40:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39836: [SPARK-41931][SQL][FOLLOWUP] Refine example more useful - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 01:40:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39835: [SPARK-42268][CONNECT][PYTHON] Add UserDefinedType in protos - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 01:41:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39835: [SPARK-42268][CONNECT][PYTHON] Add UserDefinedType in protos - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 01:41:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39843: [SPARK-39347] [SS] Bug fix for time window calculation when event time < 0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 01:43:30 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/02 01:59:36 UTC, 14 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #39853: WIP watermark propagate simulator - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/02 02:29:22 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39843: [SPARK-39347] [SS] Bug fix for time window calculation when event time < 0 - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/02 02:34:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39615: [SPARK-42093][SQL] Move JavaTypeInference to AgnosticEncoders - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/02 02:52:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39615: [SPARK-42093][SQL] Move JavaTypeInference to AgnosticEncoders - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/02 02:53:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39849: [SPARK-42282][PS][TESTS] Split `pyspark.pandas.tests.test_groupby` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/02 02:58:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39814: [SPARK-42271][CONNECT][PYTHON] Reuse UDF test cases under `pyspark.sql.tests` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 03:00:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39814: [SPARK-42271][CONNECT][PYTHON] Reuse UDF test cases under `pyspark.sql.tests` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 03:01:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39840: [SPARK-42273][CONNECT][TESTS] Skip Spark Connect tests if dependencies are not installed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 03:02:26 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #39843: [SPARK-39347] [SS] Bug fix for time window calculation when event time < 0 - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/02 03:09:33 UTC, 0 replies.
- [GitHub] [spark] yabola commented on a diff in pull request #39687: [SPARK-41470][SQL] Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/02/02 03:17:09 UTC, 6 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/02 03:38:56 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/02 04:22:26 UTC, 3 replies.
- [GitHub] [spark] sadikovi commented on pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/02 04:24:09 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39827: [WIP][SPARK-36180][SQL] Support TimestampNTZ type in Hive metastore - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/02 04:47:54 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39857: [SPARK-42287][CONNECT][BUILD] Refactor `assembly / assemblyExcludedJars` rule in `SparkConnectClient` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/02 05:42:05 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39856: [SPARK-38829][SQL] Introduce conf spark.sql.parquet.inferTimestampNTZ.enabled for TimestampNTZ inference on Parquet - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/02 05:53:21 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39773: [SPARK-42217][SQL] Support implicit lateral column alias in queries with Window - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/02 05:56:58 UTC, 0 replies.
- [GitHub] [spark] Yikf opened a new pull request, #39858: [SPARK-42288] Expose file path if reading failed - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/02/02 06:37:32 UTC, 0 replies.
- [GitHub] [spark] Yikf commented on pull request #39858: [SPARK-42288] Expose file path if reading failed - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/02/02 06:37:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39829: [3.4][SPARK-41489][SQL] Assign name to _LEGACY_ERROR_TEMP_2415 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/02 06:53:20 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39829: [3.4][SPARK-41489][SQL] Assign name to _LEGACY_ERROR_TEMP_2415 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/02 06:54:51 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/02 07:05:58 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39830: [3.4][SPARK-41490][SQL] Assign name to _LEGACY_ERROR_TEMP_2441 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/02 07:12:51 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39830: [3.4][SPARK-41490][SQL] Assign name to _LEGACY_ERROR_TEMP_2441 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/02 07:13:57 UTC, 0 replies.
- [GitHub] [spark] zhmin closed pull request #39801: [MINOR][DOCS][SQL] Fix FoldablePropagation rule document - posted by "zhmin (via GitHub)" <gi...@apache.org> on 2023/02/02 07:14:24 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39828: [3.4][SPARK-42158][SQL] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/02 07:15:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39828: [3.4][SPARK-42158][SQL] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/02 07:26:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39828: [3.4][SPARK-42158][SQL] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 07:28:12 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39859: [3.4][SPARK-42158][SQL] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/02 07:45:22 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39828: [3.4][SPARK-42158][SQL] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/02 07:45:38 UTC, 0 replies.
- [GitHub] [spark] itholic closed pull request #39828: [3.4][SPARK-42158][SQL] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/02 07:45:42 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39859: [3.4][SPARK-42158][SQL] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/02 07:46:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #32361: [SPARK-35240][SS] Use CheckpointFileManager for checkpoint file manipulation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/02 08:35:05 UTC, 1 reply.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #21048: [SPARK-23966][SS] Refactoring all checkpoint file writing logic in a common CheckpointFileManager interface - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/02 08:36:30 UTC, 0 replies.
- [GitHub] [spark] wangyepeng2 commented on pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "wangyepeng2 (via GitHub)" <gi...@apache.org> on 2023/02/02 08:41:15 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39835: [SPARK-42268][CONNECT][PYTHON] Add UserDefinedType in protos - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/02 08:54:32 UTC, 1 reply.
- [GitHub] [spark] HyukjinKwon commented on pull request #39852: [SPARK-42281][PYTHON][DOCS] Update Debugging PySpark documents to show error message properly - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 09:05:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39852: [SPARK-42281][PYTHON][DOCS] Update Debugging PySpark documents to show error message properly - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/02 09:05:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39799: [SPARK-42232][SQL] Rename error class: `UNSUPPORTED_FEATURE.JDBC_TRANSACTION` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/02 09:16:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39799: [SPARK-42232][SQL] Rename error class: `UNSUPPORTED_FEATURE.JDBC_TRANSACTION` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/02 09:16:46 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39805: [SPARK-42238][SQL] Introduce new error class: `INCOMPATIBLE_JOIN_TYPES` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/02 09:22:38 UTC, 1 reply.
- [GitHub] [spark] MaxGekk commented on pull request #39851: [MINOR][SQL] Enhance data type check error message - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/02 09:40:31 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39850: [SPARK-42283][CONNECT][SCALA] Simple Scalar Scala UDFs - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/02 10:12:42 UTC, 1 reply.
- [GitHub] [spark] soxofaan commented on pull request #39516: [SPARK-41989][PYTHON] Avoid breaking logging config from pyspark.pandas - posted by "soxofaan (via GitHub)" <gi...@apache.org> on 2023/02/02 10:20:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39516: [SPARK-41989][PYTHON] Avoid breaking logging config from pyspark.pandas - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/02 10:50:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39820: [SPARK-42249][SQL] Refining html link for documentation in error messages. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/02 11:11:10 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #39860: Standardize registered pickled Python UDFs - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/02 12:09:47 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #39860: Standardize registered pickled Python UDFs - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/02 12:18:26 UTC, 1 reply.
- [GitHub] [spark] yeachan153 opened a new pull request, #39861: [WIP][SPARK-42291] Enable dropping of columns for non V2 tables - posted by "yeachan153 (via GitHub)" <gi...@apache.org> on 2023/02/02 13:23:29 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on pull request #39719: [SPARK-42169] [SQL] Implement code generation for to_csv function (StructsToCsv) - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/02/02 14:19:01 UTC, 1 reply.
- [GitHub] [spark] srowen commented on pull request #39381: [SPARK-41554] fix changing of Decimal scale when scale decreased by m… - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/02 14:29:38 UTC, 2 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39843: [SPARK-39347] [SS] Bug fix for time window calculation when event time < 0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/02 14:43:25 UTC, 2 replies.
- [GitHub] [spark] tgravescs commented on pull request #39845: [SPARK-42277][CORE] Use RocksDB for `spark.history.store.hybridStore.diskBackend` by default - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2023/02/02 15:26:07 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on pull request #39855: [SPARK-42286][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/02/02 16:05:30 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/02 16:55:30 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/02 16:57:58 UTC, 0 replies.
- [GitHub] [spark] fe2s commented on pull request #39381: [SPARK-41554] fix changing of Decimal scale when scale decreased by m… - posted by "fe2s (via GitHub)" <gi...@apache.org> on 2023/02/02 17:11:54 UTC, 0 replies.
- [GitHub] [spark] RunyaoChen commented on pull request #39855: [SPARK-42286][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "RunyaoChen (via GitHub)" <gi...@apache.org> on 2023/02/02 18:02:11 UTC, 4 replies.
- [GitHub] [spark] deepyaman opened a new pull request, #39862: [MINOR][CORE][PYTHON][SQL][PS] Fix argument name in error message - posted by "deepyaman (via GitHub)" <gi...@apache.org> on 2023/02/02 19:24:01 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39855: [SPARK-42286][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/02 19:32:36 UTC, 1 reply.
- [GitHub] [spark] db-scnakandala commented on pull request #39722: [SPARK-42162] Introduce MultiCommutativeOp expression as a memory optimization for canonicalizing large trees of commutative expressions - posted by "db-scnakandala (via GitHub)" <gi...@apache.org> on 2023/02/02 19:32:53 UTC, 3 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #39863: [SPARK-42294][SQL] Include column default values in DESCRIBE output for V2 tables - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/02 20:47:22 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #39863: [SPARK-42294][SQL] Include column default values in DESCRIBE output for V2 tables - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/02 20:48:04 UTC, 3 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39863: [SPARK-42294][SQL] Include column default values in DESCRIBE output for V2 tables - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/02 20:54:07 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39863: [SPARK-42294][SQL] Include column default values in DESCRIBE output for V2 tables - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/02 20:54:17 UTC, 2 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #39863: [SPARK-42294][SQL] Include column default values in DESCRIBE output for V2 tables - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/02 21:31:00 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #39864: [SPARK-42295][CONNECT][TEST] Tear down the test cleanly - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/02 22:28:20 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38330: [SPARK-40868][SQL] Avoid introducing too many partitions when bucketed scan disabled by sql planner - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/03 00:20:53 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38183: [SPARK-32288][UI] Add failure summary for failed tasks in stage page - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/03 00:20:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39864: [SPARK-42295][CONNECT][TEST] Tear down the test cleanly - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 00:24:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39864: [SPARK-42295][CONNECT][TEST] Tear down the test cleanly - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 00:24:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39862: [MINOR][CORE][PYTHON][SQL][PS] Fix argument name in error message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 01:00:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39862: [MINOR][CORE][PYTHON][SQL][PS] Fix argument name in error message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 01:00:17 UTC, 0 replies.
- [GitHub] [spark] kecheung commented on pull request #39779: [SPARK-42222][SQL][3.3] Make error clearer when table not found in SupportsCatalogOptions catalog - posted by "kecheung (via GitHub)" <gi...@apache.org> on 2023/02/03 01:14:57 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39865: [SPARK-42052][SQL] Codegen Support for HiveSimpleUDF - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/03 01:58:09 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/03 02:01:38 UTC, 2 replies.
- [GitHub] [spark] itholic commented on pull request #39859: [SPARK-42158][SQL][3.4] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 02:25:55 UTC, 0 replies.
- [GitHub] [spark] dcoliversun commented on pull request #39306: [SPARK-41781][K8S] Add the ability to create pvc before creating driver/executor pod - posted by "dcoliversun (via GitHub)" <gi...@apache.org> on 2023/02/03 02:36:35 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #39866: [WIP] Fix the client dependency jars - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/03 02:48:05 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39805: [SPARK-42238][SQL] Introduce new error class: `INCOMPATIBLE_JOIN_TYPES` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 02:49:03 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39837: [SPARK-42254][SQL] Assign name to _LEGACY_ERROR_TEMP_1117 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 02:52:29 UTC, 1 reply.
- [GitHub] [spark] itholic commented on a diff in pull request #39702: [SPARK-41487][SQL] Assign name to _LEGACY_ERROR_TEMP_1020 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 02:54:10 UTC, 0 replies.
- [GitHub] [spark] weiyuyilia commented on pull request #39802: [SPARK-42237][SQL] Change binary to unsupported dataType in CSV format - posted by "weiyuyilia (via GitHub)" <gi...@apache.org> on 2023/02/03 02:59:15 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #39866: [WIP] Fix the client dependency jars - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/03 03:01:50 UTC, 8 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39867: [SPARK-41985][SQL][FOLLOWUP] Remove alias in GROUP BY only when the expr is resolved - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/03 04:48:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39867: [SPARK-41985][SQL][FOLLOWUP] Remove alias in GROUP BY only when the expr is resolved - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/03 04:48:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39859: [SPARK-42158][SQL][3.4] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/03 04:53:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39859: [SPARK-42158][SQL][3.4] Integrate _LEGACY_ERROR_TEMP_1003 into FIELD_NOT_FOUND - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/03 04:53:58 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39868: [SPARK-42296][SQL] Apply spark.sql.inferTimestampNTZInDataSources.enabled on JDBC data source - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/03 05:01:17 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39868: [SPARK-42296][SQL] Apply spark.sql.inferTimestampNTZInDataSources.enabled on JDBC data source - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/03 05:01:30 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39868: [SPARK-42296][SQL] Apply spark.sql.inferTimestampNTZInDataSources.enabled on JDBC data source - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/03 05:03:01 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39855: [SPARK-42286][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/03 05:06:29 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39863: [SPARK-42294][SQL] Include column default values in DESCRIBE output for V2 tables - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/03 05:11:37 UTC, 0 replies.
- [GitHub] [spark] Yikf closed pull request #39858: [SPARK-42288] Expose file path if reading failed - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/02/03 05:42:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39857: [SPARK-42287][CONNECT][BUILD] Optimize the packaging strategy of connect client module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 05:42:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39807: [WIP][SPARK-42240][INFRA][CONNECT][TESTS] Move `ClientE2ETestSuite` into a separate module and add new GA task to test shaded jvm client with maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 05:44:46 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39869: [SPARK-42297][SQL] Assign name to _LEGACY_ERROR_TEMP_2412 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 05:44:50 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #39843: [SPARK-39347] [SS] Bug fix for time window calculation when event time < 0 - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/02/03 06:00:41 UTC, 4 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #39870: [SPARK-42331][SQL] Fix metadata col can not been resolved - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/03 06:37:10 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39860: Standardize registered pickled Python UDFs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/03 06:43:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39802: [SPARK-42237][SQL] Change binary to unsupported dataType in CSV format - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 06:49:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39802: [SPARK-42237][SQL] Change binary to unsupported dataType in CSV format - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 06:49:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38625: [MINOR][DOCS][PYTHON][PS] Fix the `.groupby()` method docstring - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 06:50:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38625: [MINOR][DOCS][PYTHON][PS] Fix the `.groupby()` method docstring - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 06:52:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39516: [SPARK-41989][PYTHON] Avoid breaking logging config from pyspark.pandas - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 06:52:53 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39871: [SPARK-42301][SQL] Assign name to _LEGACY_ERROR_TEMP_1129 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 06:55:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39866: [WIP] Fix the client dependency jars - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 06:58:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39865: [SPARK-42052][SQL] Codegen Support for HiveSimpleUDF - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 06:59:20 UTC, 1 reply.
- [GitHub] [spark] itholic opened a new pull request, #39872: [SPARK-42302][SQL] Assign name to _LEGACY_ERROR_TEMP_2135 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 07:17:54 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39851: [MINOR][SQL] Enhance data type check error message - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/03 07:36:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39866: [WIP] Fix the client dependency jars - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 07:45:11 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #39857: [WIP][SPARK-42287][CONNECT][BUILD] Optimize the packaging strategy of connect client module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 07:45:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39633: [SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 07:50:47 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39865: [SPARK-42052][SQL] Codegen Support for HiveSimpleUDF - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 07:51:34 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39869: [SPARK-42297][SQL] Assign name to _LEGACY_ERROR_TEMP_2412 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 08:35:26 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39871: [SPARK-42301][SQL] Assign name to _LEGACY_ERROR_TEMP_1129 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 08:35:44 UTC, 2 replies.
- [GitHub] [spark] itholic commented on pull request #39872: [SPARK-42302][SQL] Assign name to _LEGACY_ERROR_TEMP_2135 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 08:36:01 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39866: [WIP] Fix the client dependency jars - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 08:39:02 UTC, 12 replies.
- [GitHub] [spark] itholic opened a new pull request, #39873: [SPARK-42303][SQL] Assign name to _LEGACY_ERROR_TEMP_1326 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 08:44:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 08:48:44 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/03 08:51:21 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39870: [SPARK-42331][SQL] Fix metadata col can not been resolved - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/03 09:01:30 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39874: [SPARK-42334][CONNECT] Make sure connect client assembly and sql package is built before running client tests - SBT - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 09:13:00 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39875: [SPARK-42305][SQL] Integrate `_LEGACY_ERROR_TEMP_1229` into `DECIMAL_PRECISION_EXCEEDS_MAX_PRECISION` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 09:16:49 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #39876: [SPARK-42333][SQL] Change log level to debug when fetching result set from SparkExecuteStatementOperation - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/03 09:21:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39874: [SPARK-42334][CONNECT][BUILD] Make sure connect client assembly and sql package is built before running client tests - SBT - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 09:39:39 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39877: [SPARK-42306][SQL] Integrate `_LEGACY_ERROR_TEMP_1317` into `UNRESOLVED_COLUMN.WITH_SUGGESTION` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 09:49:36 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39877: [SPARK-42306][SQL] Integrate `_LEGACY_ERROR_TEMP_1317` into `UNRESOLVED_COLUMN.WITH_SUGGESTION` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 09:53:03 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39875: [SPARK-42305][SQL] Integrate `_LEGACY_ERROR_TEMP_1229` into `DECIMAL_PRECISION_EXCEEDS_MAX_PRECISION` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 09:53:22 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39873: [SPARK-42303][SQL] Assign name to _LEGACY_ERROR_TEMP_1326 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 09:54:37 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39820: [SPARK-42249][SQL] Refining html link for documentation in error messages. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/03 10:10:25 UTC, 2 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/02/03 10:39:08 UTC, 1 replies.
- [GitHub] [spark] awdavidson commented on a diff in pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by "awdavidson (via GitHub)" <gi...@apache.org> on 2023/02/03 10:48:34 UTC, 1 replies.
- [GitHub] [spark] wayneguow opened a new pull request, #39878: [SPARK-42335][SQL] Add a legacy config for restoring written comment option behavior in CSV dataSource - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/03 10:55:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39839: [SPARK-42255][SQL] Assign name to _LEGACY_ERROR_TEMP_2430 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/03 11:02:31 UTC, 0 replies.
- [GitHub] [spark] wayneguow commented on pull request #39878: [SPARK-42335][SQL] Add a legacy config for restoring written comment option behavior in CSV dataSource - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/03 11:30:31 UTC, 3 replies.
- [GitHub] [spark] wayneguow commented on a diff in pull request #39878: [SPARK-42335][SQL] Add a legacy config for restoring written comment option behavior in CSV dataSource - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/03 11:35:50 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/03 12:09:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39795: [SPARK-42234][SQL] Rename error class: `UNSUPPORTED_FEATURE.REPEATED_PIVOT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/03 12:16:12 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39795: [SPARK-42234][SQL] Rename error class: `UNSUPPORTED_FEATURE.REPEATED_PIVOT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/03 12:17:14 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39867: [SPARK-41985][SQL][FOLLOWUP] Remove alias in GROUP BY only when the expr is resolved - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/03 12:40:12 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39867: [SPARK-41985][SQL][FOLLOWUP] Remove alias in GROUP BY only when the expr is resolved - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/03 12:40:44 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/03 12:47:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39870: [SPARK-42331][SQL] Fix metadata col can not been resolved - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/03 12:59:21 UTC, 9 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #39722: [SPARK-42162] Introduce MultiCommutativeOp expression as a memory optimization for canonicalizing large trees of commutative expressions - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/03 13:40:19 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #39722: [SPARK-42162] Introduce MultiCommutativeOp expression as a memory optimization for canonicalizing large trees of commutative expressions - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/03 13:41:46 UTC, 4 replies.
- [GitHub] [spark] srowen commented on pull request #39878: [SPARK-42335][SQL] Add a legacy config for restoring written comment option behavior in CSV dataSource - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/03 13:53:43 UTC, 3 replies.
- [GitHub] [spark] smallzhongfeng opened a new pull request, #39879: [SPARK-42336] Use OpenHashMap instead of HashMap in ResourceAllocator - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/02/03 14:11:35 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39872: [SPARK-42302][SQL] Assign name to _LEGACY_ERROR_TEMP_2135 - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/03 15:21:57 UTC, 1 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39871: [SPARK-42301][SQL] Assign name to _LEGACY_ERROR_TEMP_1129 - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/03 15:28:48 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/03 15:29:23 UTC, 0 replies.
- [GitHub] [spark] SiarheiFedartsou opened a new pull request, #39880: typo: StogeLevel -> StorageLevel - posted by "SiarheiFedartsou (via GitHub)" <gi...@apache.org> on 2023/02/03 15:58:37 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39813: [SPARK-41554] fix changing of Decimal scale when scale decreased by m… - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/03 16:49:24 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39813: [SPARK-41554] fix changing of Decimal scale when scale decreased by m… - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/03 16:49:29 UTC, 0 replies.
- [GitHub] [spark] steveloughran commented on pull request #39185: [SPARK-41551][SQL] Dynamic/absolute path support in PathOutputCommitters - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2023/02/03 17:10:20 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39879: [SPARK-42336][CORE] Use OpenHashMap instead of HashMap in ResourceAllocator - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 17:15:46 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39838: [SPARK-42270][SQL] Sort merge join may oom when right match rows are very large - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 17:32:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39838: [SPARK-42270][SQL] Sort merge join may oom when right match rows are very large - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 17:37:47 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #39633: [SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/03 17:44:18 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39633: [SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 17:51:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39879: [SPARK-42336][CORE] Use OpenHashMap instead of HashMap in ResourceAllocator - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 17:59:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39876: [SPARK-42333][SQL] Change log level to debug when fetching result set from SparkExecuteStatementOperation - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 18:01:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39876: [SPARK-42333][SQL] Change log level to debug when fetching result set from SparkExecuteStatementOperation - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 18:01:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39868: [SPARK-42296][SQL] Apply spark.sql.inferTimestampNTZInDataSources.enabled on JDBC data source - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 18:05:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39868: [SPARK-42296][SQL] Apply spark.sql.inferTimestampNTZInDataSources.enabled on JDBC data source - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 18:06:34 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39571: [SPARK-42064][SQL] Implement bloom filter join hint - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 18:09:10 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39876: [SPARK-42333][SQL] Change log level to debug when fetching result set from SparkExecuteStatementOperation - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 18:09:45 UTC, 1 replies.
- [GitHub] [spark] NarekDW commented on a diff in pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/02/03 18:31:01 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39770: [WIP][SPARK-42206][CORE] Omit "Task Executor Metrics" field in eventlogs if values are all zero - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/03 18:38:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39876: [SPARK-42333][SQL] Change log level to debug when fetching result set from SparkExecuteStatementOperation - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 18:47:37 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #39880: typo: StogeLevel -> StorageLevel - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/03 19:45:33 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on a diff in pull request #39869: [SPARK-42297][SQL] Assign name to _LEGACY_ERROR_TEMP_2412 - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/02/03 19:46:38 UTC, 0 replies.
- [GitHub] [spark] sigmod commented on pull request #39571: [SPARK-42064][SQL] Implement bloom filter join hint - posted by "sigmod (via GitHub)" <gi...@apache.org> on 2023/02/03 20:30:35 UTC, 0 replies.
- [GitHub] [spark] andylam-db commented on pull request #39571: [SPARK-42064][SQL] Implement bloom filter join hint - posted by "andylam-db (via GitHub)" <gi...@apache.org> on 2023/02/03 20:47:50 UTC, 0 replies.
- [GitHub] [spark] ben-zhang commented on pull request #38433: [SPARK-40943][SQL] Make the MSCK keyword optional in REPAIR TABLE commands - posted by "ben-zhang (via GitHub)" <gi...@apache.org> on 2023/02/03 21:03:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38433: [SPARK-40943][SQL] Make the MSCK keyword optional in REPAIR TABLE commands - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 22:13:09 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #39687: [SPARK-41470][SQL] SPJ: Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/03 23:10:53 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38433: [SPARK-40943][SQL] Make `MSCK` keyword optional in `REPAIR TABLE` syntax - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/03 23:48:09 UTC, 5 replies.
- [GitHub] [spark] ben-zhang commented on pull request #38433: [SPARK-40943][SQL] Make `MSCK` keyword optional in `REPAIR TABLE` syntax - posted by "ben-zhang (via GitHub)" <gi...@apache.org> on 2023/02/03 23:59:54 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38183: [SPARK-32288][UI] Add failure summary for failed tasks in stage page - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/04 00:19:30 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38047: [SPARK-40609][SQL] Casts types according to bucket info for Equality expressions - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/04 00:19:31 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38032: [WIP][SPARK-40597][CORE] local mode should respect TASK_MAX_FAILURES like all other cluster managers - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/04 00:19:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39881: [SPARK-42341][SQL][TESTS] Fix JoinSelectionHelperSuite and PlanStabilitySuite to use explicit broadcast threshold - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/04 00:53:01 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39881: [SPARK-42341][SQL][TESTS] Fix JoinSelectionHelperSuite and PlanStabilitySuite to use explicit broadcast threshold - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/04 00:55:12 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39872: [SPARK-42302][SQL] Assign name to _LEGACY_ERROR_TEMP_2135 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/04 01:06:35 UTC, 0 replies.
- [GitHub] [spark] wayneguow commented on pull request #39819: [SPARK-42252][CORE] Deprecate spark.shuffle.unsafe.file.output.buffer and add a new config - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/04 01:20:20 UTC, 1 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #38419: [SPARK-40945][SQL] Support built-in function to truncate numbers - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/02/04 01:21:41 UTC, 3 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39871: [SPARK-42301][SQL] Assign name to _LEGACY_ERROR_TEMP_1129 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/04 01:29:36 UTC, 1 replies.
- [GitHub] [spark] ueshin opened a new pull request, #39882: Introduce base hierarchy to exceptions. - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/04 01:32:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/04 02:30:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39883: [SPARK-42343][CORE] Suppress `IOException` warnings in `handleBlockRemovalFailure` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/04 03:23:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39883: [SPARK-42343][CORE] Ignore `IOException` in `handleBlockRemovalFailure` if SparkContext is stopped - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/04 03:32:44 UTC, 8 replies.
- [GitHub] [spark] ninebigbig opened a new pull request, #39884: [SPARK-42344][K8S] Change the default size of the CONFIG_MAP_MAXSIZE - posted by "ninebigbig (via GitHub)" <gi...@apache.org> on 2023/02/04 05:41:40 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39883: [SPARK-42343][CORE] Ignore `IOException` in `handleBlockRemovalFailure` if SparkContext is stopped - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/04 06:12:04 UTC, 4 replies.
- [GitHub] [spark] yabola commented on a diff in pull request #39687: [SPARK-41470][SQL] SPJ: Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/02/04 06:29:12 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on a diff in pull request #39879: [SPARK-42336][CORE] Use OpenHashMap instead of HashMap in ResourceAllocator - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/02/04 07:02:40 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39869: [SPARK-42297][SQL] Assign name to _LEGACY_ERROR_TEMP_2412 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/04 08:15:27 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39885: [SPARK-42345][SQL] Rename TimestampNTZ inference conf as spark.sql.sources.timestampNTZTypeInference.enabled - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/04 08:49:02 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39869: [SPARK-42297][SQL] Assign name to _LEGACY_ERROR_TEMP_2412 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/04 08:59:24 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39869: [SPARK-42297][SQL] Assign name to _LEGACY_ERROR_TEMP_2412 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/04 09:00:07 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39805: [SPARK-42238][SQL] Introduce new error class: `INCOMPATIBLE_JOIN_TYPES` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/04 10:15:50 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38433: [SPARK-40943][SQL] Make `MSCK` keyword optional in `REPAIR TABLE` syntax - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/04 10:23:40 UTC, 1 replies.
- [GitHub] [spark] ivoson commented on a diff in pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by "ivoson (via GitHub)" <gi...@apache.org> on 2023/02/04 11:25:31 UTC, 35 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/04 12:18:28 UTC, 2 replies.
- [GitHub] [spark] MaxGekk closed pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/04 12:19:24 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/02/04 13:05:32 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38433: [SPARK-40943][SQL] Make `MSCK` keyword optional in `REPAIR TABLE` syntax - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/04 13:26:27 UTC, 0 replies.
- [GitHub] [spark] ninebigbig commented on pull request #39884: [SPARK-42344][K8S] Change the default size of the CONFIG_MAP_MAXSIZE - posted by "ninebigbig (via GitHub)" <gi...@apache.org> on 2023/02/04 14:32:49 UTC, 2 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39879: [SPARK-42336][CORE] Use OpenHashMap instead of HashMap in ResourceAllocator - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/04 15:23:18 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39884: [SPARK-42344][K8S] Change the default size of the CONFIG_MAP_MAXSIZE - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/04 16:56:46 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39881: [SPARK-42341][SQL][TESTS] Fix JoinSelectionHelperSuite and PlanStabilitySuite to use explicit broadcast threshold - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/04 17:02:52 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39881: [SPARK-42341][SQL][TESTS] Fix JoinSelectionHelperSuite and PlanStabilitySuite to use explicit broadcast threshold - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/04 17:18:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39881: [SPARK-42341][SQL][TESTS] Fix JoinSelectionHelperSuite and PlanStabilitySuite to use explicit broadcast threshold - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/04 18:10:28 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39881: [SPARK-42341][SQL][TESTS] Fix JoinSelectionHelperSuite and PlanStabilitySuite to use explicit broadcast threshold - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/04 18:11:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39875: [SPARK-42305][SQL] Integrate `_LEGACY_ERROR_TEMP_1229` into `DECIMAL_PRECISION_EXCEEDS_MAX_PRECISION` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/04 18:28:00 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39837: [SPARK-42254][SQL] Assign name to _LEGACY_ERROR_TEMP_1117 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/04 19:08:02 UTC, 0 replies.
- [GitHub] [spark] SiarheiFedartsou commented on pull request #39880: typo: StogeLevel -> StorageLevel - posted by "SiarheiFedartsou (via GitHub)" <gi...@apache.org> on 2023/02/04 19:35:26 UTC, 3 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39885: [SPARK-42345][SQL] Rename TimestampNTZ inference conf as spark.sql.sources.timestampNTZTypeInference.enabled - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/04 21:39:40 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #39883: [SPARK-42343][CORE] Ignore `IOException` in `handleBlockRemovalFailure` if SparkContext is stopped - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/04 21:50:49 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39883: [SPARK-42343][CORE] Ignore `IOException` in `handleBlockRemovalFailure` if SparkContext is stopped - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/04 21:56:35 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38268: [SPARK-40804][SQL] Missing handling a catalog name in destination tables in RenameTableExec - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/05 00:21:28 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38047: [SPARK-40609][SQL] Casts types according to bucket info for Equality expressions - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/05 00:21:30 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38032: [WIP][SPARK-40597][CORE] local mode should respect TASK_MAX_FAILURES like all other cluster managers - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/05 00:21:31 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #37588: [SPARK-33393][SQL] Support SHOW TABLE EXTENDED in v2 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/05 01:38:56 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #39874: [SPARK-42334][CONNECT][BUILD] Make sure connect client assembly and sql package is built before running client tests - SBT - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/05 01:42:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39883: [SPARK-42343][CORE] Ignore `IOException` in `handleBlockRemovalFailure` if SparkContext is stopped - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/05 03:44:23 UTC, 0 replies.
- [GitHub] [spark] ninebigbig commented on a diff in pull request #39884: [SPARK-42344][K8S] Change the default size of the CONFIG_MAP_MAXSIZE - posted by "ninebigbig (via GitHub)" <gi...@apache.org> on 2023/02/05 03:58:44 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39884: [SPARK-42344][K8S] Change the default size of the CONFIG_MAP_MAXSIZE - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/05 05:07:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39883: [SPARK-42343][CORE] Ignore `IOException` in `handleBlockRemovalFailure` if SparkContext is stopped - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/05 05:16:11 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 07:15:30 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39885: [SPARK-42345][SQL] Rename TimestampNTZ inference conf as spark.sql.sources.timestampNTZTypeInference.enabled - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/05 07:43:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39885: [SPARK-42345][SQL] Rename TimestampNTZ inference conf as spark.sql.sources.timestampNTZTypeInference.enabled - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/05 07:43:55 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39719: [SPARK-42169] [SQL] Implement code generation for to_csv function (StructsToCsv) - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/05 07:55:33 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39886: [SPARK-42348][SQL] Add new SQLSTATE - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 08:53:58 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39886: [SPARK-42348][SQL] Add new SQLSTATE - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 08:56:03 UTC, 1 replies.
- [GitHub] [spark] smallzhongfeng commented on a diff in pull request #39879: [SPARK-42336][CORE] Use `getOrElse()` instead of `contains()` in ResourceAllocator - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/02/05 09:32:36 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39820: [SPARK-42249][SQL] Refining html link for documentation in error messages. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/05 09:39:35 UTC, 2 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38861: [SPARK-41294][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1203 / 1168 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 09:46:58 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39875: [SPARK-42305][SQL] Integrate `_LEGACY_ERROR_TEMP_1229` into `DECIMAL_PRECISION_EXCEEDS_MAX_PRECISION` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 09:56:04 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #39887: [SPARK-42346][SQL] Rewrite distinct aggregates after subquery merge - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/05 10:18:28 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38588: [SPARK-41086][SQL] Consolidate SecondArgumentXXX error to INVALID_PARAMETER_VALUE - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/05 10:21:11 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #39887: [SPARK-42346][SQL] Rewrite distinct aggregates after subquery merge - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/05 10:25:36 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39888: [SPARK-42320][SQL] Assign name to _LEGACY_ERROR_TEMP_2188 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 10:42:28 UTC, 0 replies.
- [GitHub] [spark] RobinL commented on pull request #39887: [SPARK-42346][SQL] Rewrite distinct aggregates after subquery merge - posted by "RobinL (via GitHub)" <gi...@apache.org> on 2023/02/05 10:44:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39884: [SPARK-42344][K8S] Change the default size of the CONFIG_MAP_MAXSIZE - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/05 11:08:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39884: [SPARK-42344][K8S] Change the default size of the CONFIG_MAP_MAXSIZE - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/05 11:10:10 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39889: [SPARK-42315][SQL] Assign name to _LEGACY_ERROR_TEMP_(2091|2092) - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 11:18:36 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39890: [SPARK-42314][SQL] Assign name to _LEGACY_ERROR_TEMP_2127 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 11:52:00 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #39887: [SPARK-42346][SQL] Rewrite distinct aggregates after subquery merge - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/05 11:52:40 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39891: [SPARK-42318][SQL] Assign name to _LEGACY_ERROR_TEMP_(2123|2125) - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 12:34:50 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38588: [SPARK-41086][SQL] Consolidate SecondArgumentXXX error to INVALID_PARAMETER_VALUE - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 13:02:10 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39820: [SPARK-42249][SQL] Refining html link for documentation in error messages. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 13:10:55 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #39888: [SPARK-42320][SQL] Assign name to _LEGACY_ERROR_TEMP_2188 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 13:12:55 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39889: [SPARK-42315][SQL] Assign name to _LEGACY_ERROR_TEMP_(2091|2092) - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 13:13:25 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39890: [SPARK-42314][SQL] Assign name to _LEGACY_ERROR_TEMP_2127 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 13:13:48 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39891: [SPARK-42318][SPARK-42319][SQL] Assign name to _LEGACY_ERROR_TEMP_(2123|2125) - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 13:14:27 UTC, 3 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #39887: [SPARK-42346][SQL] Rewrite distinct aggregates after subquery merge - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/05 14:11:58 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39843: [SPARK-39347][SS] Bug fix for time window calculation when event time < 0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/05 15:10:36 UTC, 0 replies.
- [GitHub] [spark] Kimahriman commented on pull request #39572: [SPARK-39979][SQL] Add option to use large variable width vectors for arrow UDF operations - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/02/05 16:03:29 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39872: [SPARK-42302][SQL] Assign name to _LEGACY_ERROR_TEMP_2135 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/05 17:44:59 UTC, 1 replies.
- [GitHub] [spark] WweiL commented on pull request #39843: [SPARK-39347][SS] Bug fix for time window calculation when event time < 0 - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/02/05 17:56:48 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39890: [SPARK-42314][SQL] Assign name to _LEGACY_ERROR_TEMP_2127 - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/05 19:22:28 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39889: [SPARK-42315][SQL] Assign name to _LEGACY_ERROR_TEMP_(2091|2092) - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/05 19:29:59 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39888: [SPARK-42320][SQL] Assign name to _LEGACY_ERROR_TEMP_2188 - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/05 19:33:40 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39886: [SPARK-42348][SQL] Add new SQLSTATE - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/05 19:36:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/05 21:04:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/05 21:05:06 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/05 23:48:52 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38389: [MINOR][DOCS][PYTHON] Fix the truncation of API reference in several DataTypes - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/06 00:19:18 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38381: [SPARK-40793][SQL] Fix the LogicalRelation computeStats for Row-level Runtime Filtering cannot be applied - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/06 00:19:20 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38268: [SPARK-40804][SQL] Missing handling a catalog name in destination tables in RenameTableExec - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/06 00:19:23 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39889: [SPARK-42315][SQL] Assign name to _LEGACY_ERROR_TEMP_(2091|2092) - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/06 00:19:56 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39890: [SPARK-42314][SQL] Assign name to _LEGACY_ERROR_TEMP_2127 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/06 00:32:02 UTC, 6 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39870: [SPARK-42331][SQL] Fix metadata col can not been resolved - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/06 01:43:39 UTC, 8 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #32361: [SPARK-35240][SS] Use CheckpointFileManager for checkpoint file manipulation - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/06 02:00:21 UTC, 0 replies.
- [GitHub] [spark] huaxingao opened a new pull request, #39892: [SPARK-40045][SQL] Optimize the order of filtering predicates - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/06 02:07:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39886: [SPARK-42348][SQL] Add new SQLSTATE - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/06 02:33:27 UTC, 0 replies.
- [GitHub] [spark] huaxingao closed pull request #37479: [SPARK-40045][SQL] Optimize the order of filtering predicates - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/06 02:33:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39886: [SPARK-42348][SQL] Add new SQLSTATE - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/06 02:35:29 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39843: [SPARK-39347][SS] Bug fix for time window calculation when event time < 0 - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/06 02:37:12 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #39843: [SPARK-39347][SS] Bug fix for time window calculation when event time < 0 - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/06 02:47:40 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39879: [SPARK-42336][CORE] Use `getOrElse()` instead of `contains()` in ResourceAllocator - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/06 02:55:05 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39879: [SPARK-42336][CORE] Use `getOrElse()` instead of `contains()` in ResourceAllocator - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/06 02:55:16 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39790: [SPARK-42094][PS] Support `fill_value` for `ps.Series.(add|radd)` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/06 03:04:15 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #39879: [SPARK-42336][CORE] Use `getOrElse()` instead of `contains()` in ResourceAllocator - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/02/06 03:09:48 UTC, 0 replies.
- [GitHub] [spark] rangareddy commented on a diff in pull request #38875: [SPARK-40988][SQL][TEST] Test case for insert partition should verify value - posted by "rangareddy (via GitHub)" <gi...@apache.org> on 2023/02/06 04:34:16 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #39892: [SPARK-40045][SQL] Optimize the order of filtering predicates - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/06 04:34:17 UTC, 1 replies.
- [GitHub] [spark] rangareddy commented on pull request #39515: [SPARK-38743][SQL][TEST] Test the error class: MISSING_STATIC_PARTITION_COLUMN - posted by "rangareddy (via GitHub)" <gi...@apache.org> on 2023/02/06 04:42:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39893: [SPARK-42350][SQL][K8S] Replace `get().getOrElse` with `getOrElse` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/06 05:08:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39894: [SPARK-42351][CORE] Protobuf serializer for FsHistoryProviderMetadata - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/06 05:40:30 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #39895: [SPARK-40149][SQL][FOLLOWUP] Avoid adding extra Project in AddMetadataColumns - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/06 06:38:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39896: [SPARK-42352][BUILD] Upgrade maven to 3.8.7 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/06 07:24:35 UTC, 0 replies.
- [GitHub] [spark] chaoqin-li1123 opened a new pull request, #39897: [WIP][SPARK-42353][SS] Cleanup orphan sst and log files in RocksDB checkpoint directory - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2023/02/06 07:58:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/06 08:00:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/06 08:01:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/06 08:04:46 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39898: [SPARK-42354][BUILD] Upgrade jackson to 2.14.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/06 08:09:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/06 08:10:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39899: [SPARK-42355][BUILD] Upgrade some maven-plugins - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/06 08:19:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/06 09:34:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39614: [SPARK-42002][CONNECT][PYTHON] Implement DataFrameWriterV2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/06 09:39:40 UTC, 0 replies.
- [GitHub] [spark] awdavidson commented on pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by "awdavidson (via GitHub)" <gi...@apache.org> on 2023/02/06 09:39:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39614: [SPARK-42002][CONNECT][PYTHON] Implement DataFrameWriterV2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/06 09:40:04 UTC, 0 replies.
- [GitHub] [spark] LorenzoMartini commented on pull request #32921: [SPARK-35779][SQL] Dynamic filtering for Data Source V2 - posted by "LorenzoMartini (via GitHub)" <gi...@apache.org> on 2023/02/06 10:16:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39900: [SPARK-42357][CORE] Log `exitCode` when `SparkContext.stop` starts - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/06 10:16:42 UTC, 0 replies.
- [GitHub] [spark] awdavidson opened a new pull request, #39901: [SPARK-40819][SQL] Backport Timestamp nanos regression to 3.3 - posted by "awdavidson (via GitHub)" <gi...@apache.org> on 2023/02/06 10:21:53 UTC, 0 replies.
- [GitHub] [spark] santosh-d3vpl3x opened a new pull request, #39902: [SPARK-42349][PYTHON] Support pandas cogroup with multiple df - posted by "santosh-d3vpl3x (via GitHub)" <gi...@apache.org> on 2023/02/06 10:24:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39894: [SPARK-42351][CORE] Protobuf serializer for FsHistoryProviderMetadata - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/06 10:28:15 UTC, 0 replies.
- [GitHub] [spark] awdavidson closed pull request #39901: [SPARK-40819][SQL] Backport Timestamp nanos regression to 3.3 - posted by "awdavidson (via GitHub)" <gi...@apache.org> on 2023/02/06 10:41:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39888: [SPARK-42320][SQL] Assign name to _LEGACY_ERROR_TEMP_2188 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 10:47:53 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39888: [SPARK-42320][SQL] Assign name to _LEGACY_ERROR_TEMP_2188 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 10:48:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39872: [SPARK-42302][SQL] Assign name to _LEGACY_ERROR_TEMP_2135 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 10:51:22 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39872: [SPARK-42302][SQL] Assign name to _LEGACY_ERROR_TEMP_2135 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 10:51:51 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39889: [SPARK-42315][SQL] Assign name to _LEGACY_ERROR_TEMP_(2091|2092) - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 10:52:39 UTC, 1 replies.
- [GitHub] [spark] bozhang2820 opened a new pull request, #39903: [SPARK-42358][CORE] Send ExecutorUpdated with the message argument in Master.removeWorker - posted by "bozhang2820 (via GitHub)" <gi...@apache.org> on 2023/02/06 10:54:15 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39515: [SPARK-38743][SQL][TEST] Test the error class: MISSING_STATIC_PARTITION_COLUMN - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 10:57:16 UTC, 0 replies.
- [GitHub] [spark] awdavidson opened a new pull request, #39904: [SPARK-40819][SQL] Backport Timestamp nanos regression to 3.3 - posted by "awdavidson (via GitHub)" <gi...@apache.org> on 2023/02/06 11:58:08 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39633: [SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/06 12:08:13 UTC, 2 replies.
- [GitHub] [spark] wangyum closed pull request #39887: [SPARK-42346][SQL] Rewrite distinct aggregates after subquery merge - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/06 12:37:10 UTC, 0 replies.
- [GitHub] [spark] awdavidson opened a new pull request, #39905: [SPARK-40819][SQL] Backport Timestamp nanos regression to 3.2 - posted by "awdavidson (via GitHub)" <gi...@apache.org> on 2023/02/06 12:39:25 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on a diff in pull request #39894: [SPARK-42351][CORE] Protobuf serializer for FsHistoryProviderMetadata - posted by "techaddict (via GitHub)" <gi...@apache.org> on 2023/02/06 12:43:56 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39873: [SPARK-42303][SQL] Assign name to _LEGACY_ERROR_TEMP_1326 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 12:48:01 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39887: [SPARK-42346][SQL] Rewrite distinct aggregates after subquery merge - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/06 12:54:53 UTC, 0 replies.
- [GitHub] [spark] wayneguow opened a new pull request, #39906: [SPARK-41962][MINOR][CORE] Update the order of imports in class SpecificParquetRecordReaderBase - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/06 13:21:53 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #39887: [SPARK-42346][SQL] Rewrite distinct aggregates after subquery merge - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/06 13:27:14 UTC, 0 replies.
- [GitHub] [spark] ted-jenks opened a new pull request, #39907: [WIP][SPARK-42359][SQL] Support row skipping when reading CSV files - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/06 14:18:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39906: [SPARK-41962][MINOR][SQL] Update the order of imports in class SpecificParquetRecordReaderBase - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/06 14:26:15 UTC, 0 replies.
- [GitHub] [spark] wayneguow commented on a diff in pull request #39906: [SPARK-41962][MINOR][SQL] Update the order of imports in class SpecificParquetRecordReaderBase - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/06 14:33:35 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #39908: [SPARK-42360][SQL] Transform LeftOuter join with IsNull filter on right side to Anti join - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/02/06 14:44:24 UTC, 0 replies.
- [GitHub] [spark] luhenry opened a new pull request, #39909: Fix constructor for java.nio.DirectByteBuffer - posted by "luhenry (via GitHub)" <gi...@apache.org> on 2023/02/06 15:36:21 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39839: [SPARK-42255][SQL] Assign name to _LEGACY_ERROR_TEMP_2430 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 15:43:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39839: [SPARK-42255][SQL] Assign name to _LEGACY_ERROR_TEMP_2430 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 15:44:22 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39909: Fix constructor for java.nio.DirectByteBuffer - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/06 15:55:40 UTC, 0 replies.
- [GitHub] [spark] luhenry commented on pull request #39909: Fix constructor for java.nio.DirectByteBuffer - posted by "luhenry (via GitHub)" <gi...@apache.org> on 2023/02/06 16:00:11 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39890: [SPARK-42314][SQL] Assign name to _LEGACY_ERROR_TEMP_2127 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 16:08:39 UTC, 2 replies.
- [GitHub] [spark] srowen commented on pull request #39909: Fix constructor for java.nio.DirectByteBuffer - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/06 16:48:02 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #39891: [SPARK-42318][SPARK-42319][SQL] Assign name to _LEGACY_ERROR_TEMP_(2123|2125) - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/06 16:54:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39900: [SPARK-42357][CORE] Log `exitCode` when `SparkContext.stop` starts - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/06 17:50:42 UTC, 1 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39900: [SPARK-42357][CORE] Log `exitCode` when `SparkContext.stop` starts - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/06 18:04:00 UTC, 1 replies.
- [GitHub] [spark] sunchao closed pull request #39687: [SPARK-41470][SQL] SPJ: Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/06 18:04:56 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #39687: [SPARK-41470][SQL] SPJ: Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/06 18:06:41 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #39910: [SPARK-42337][SQL] Add error class CREATE_PERSISTENT_OBJECT_OVER_TEMP_OBJECT - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/06 18:18:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39900: [SPARK-42357][CORE] Log `exitCode` when `SparkContext.stop` starts - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/06 18:21:06 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39877: [SPARK-42306][SQL] Integrate `_LEGACY_ERROR_TEMP_1317` into `UNRESOLVED_COLUMN.WITH_SUGGESTION` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 18:54:21 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39910: [SPARK-42337][SQL] Add error class CREATE_PERSISTENT_OBJECT_OVER_TEMP_OBJECT - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 18:59:20 UTC, 2 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39633: [SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/06 19:10:16 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #39870: [SPARK-42331][SQL] Fix metadata col can not been resolved - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/06 19:15:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39900: [SPARK-42357][CORE] Log `exitCode` when `SparkContext.stop` starts - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/06 19:24:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39687: [SPARK-41470][SQL] SPJ: Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/06 19:25:49 UTC, 0 replies.
- [GitHub] [spark] clubycoder opened a new pull request, #39911: [SPARK-36478][SQL] Optimize out unused left-outer Join under Project - posted by "clubycoder (via GitHub)" <gi...@apache.org> on 2023/02/06 19:45:27 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #39633: [SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/06 20:49:54 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39702: [SPARK-41487][SQL] Assign name to _LEGACY_ERROR_TEMP_1020 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/06 21:01:17 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #39912: [SPARK-42362][BUILD] Upgrade `kubernetes-client` to 6.4.1 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/06 21:29:53 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on a diff in pull request #39825: [SPARK-42261][SPARK-42260][K8S] Log Allocation Stalls and Trigger Allocation event without blocking on snapshot - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/02/06 22:16:00 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #39912: [SPARK-42362][BUILD] Upgrade `kubernetes-client` to 6.4.1 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/06 22:32:32 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #39825: [SPARK-42261][SPARK-42260][K8S] Log Allocation Stalls and Trigger Allocation event without blocking on snapshot - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/02/06 22:33:54 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38389: [MINOR][DOCS][PYTHON] Fix the truncation of API reference in several DataTypes - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/07 00:20:43 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38381: [SPARK-40793][SQL] Fix the LogicalRelation computeStats for Row-level Runtime Filtering cannot be applied - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/07 00:20:43 UTC, 0 replies.
- [GitHub] [spark] yabola commented on pull request #39687: [SPARK-41470][SQL] SPJ: Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/02/07 00:22:03 UTC, 0 replies.
- [GitHub] [spark] allisonport-db commented on a diff in pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "allisonport-db (via GitHub)" <gi...@apache.org> on 2023/02/07 00:31:42 UTC, 19 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39905: [SPARK-40819][SQL] Backport Timestamp nanos regression to 3.2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 00:33:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39904: [SPARK-40819][SQL][3.3] Backport Timestamp nanos regression to 3.3 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 00:34:15 UTC, 0 replies.
- [GitHub] [spark] clubycoder commented on pull request #39911: [SPARK-36478][SQL][WIP] Optimize out unused left-outer Join under Project - posted by "clubycoder (via GitHub)" <gi...@apache.org> on 2023/02/07 00:37:42 UTC, 1 replies.
- [GitHub] [spark] clubycoder closed pull request #39911: [SPARK-36478][SQL][WIP] Optimize out unused left-outer Join under Project - posted by "clubycoder (via GitHub)" <gi...@apache.org> on 2023/02/07 01:09:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39913: [SPARK-42268][CONNECT][PYTHON][TESTS][FOLLOWUPS] Add more tests for `UserDefinedType` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/07 01:29:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39914: [SPARK-40532][CONNECT] Add Python Version into Python UDF message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 02:03:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39914: [SPARK-40532][CONNECT] Add Python Version into Python UDF message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 02:03:55 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39914: [SPARK-40532][CONNECT] Add Python Version into Python UDF message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 02:04:39 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39915: [SPARK-42364][PS][TESTS] Split 'pyspark.pandas.tests.test_dataframe' - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/07 02:17:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39916: [SPARK-42363][CONNECT] Remove SparkSession.register_udf - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 02:18:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39916: [SPARK-42363][CONNECT] Remove SparkSession.register_udf - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 02:18:59 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39912: [SPARK-42362][BUILD] Upgrade `kubernetes-client` to 6.4.1 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 02:19:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39895: [SPARK-40149][SQL][FOLLOWUP] Avoid adding extra Project in AddMetadataColumns - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 02:19:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39895: [SPARK-40149][SQL][FOLLOWUP] Avoid adding extra Project in AddMetadataColumns - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 02:22:11 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39917: [SPARK-42365][PS][TESTS] Split 'pyspark.pandas.tests.test_ops_on_diff_frames' - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/07 02:35:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39914: [SPARK-40532][CONNECT] Add Python Version into Python UDF message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 02:37:48 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39894: [SPARK-42351][CORE] Protobuf serializer for FsHistoryProviderMetadata - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/07 02:40:24 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39900: [SPARK-42357][CORE] Log `exitCode` when `SparkContext.stop` starts - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/07 02:50:00 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #39918: [SPARK-42366][SHUFFLE] Log shuffle data corruption diagnose cause - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/02/07 02:55:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39909: Fix constructor for java.nio.DirectByteBuffer - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/07 02:57:57 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39896: [SPARK-42352][BUILD] Upgrade maven to 3.9.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/07 03:00:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39919: [SPARK-41600][SPARK-41623][SPARK-41612][CONNECT] Implement Catalog.cacheTable, isCached and uncache - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 03:10:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39919: [SPARK-41600][SPARK-41623][SPARK-41612][CONNECT] Implement Catalog.cacheTable, isCached and uncache - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 03:10:52 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39919: [SPARK-41600][SPARK-41623][SPARK-41612][CONNECT] Implement Catalog.cacheTable, isCached and uncache - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 03:43:04 UTC, 2 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39873: [SPARK-42303][SQL] Assign name to _LEGACY_ERROR_TEMP_1326 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/07 03:43:05 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39912: [SPARK-42362][BUILD] Upgrade `kubernetes-client` to 6.4.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 03:52:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39920: [SPARK-41716][CONNECT] Remove the JIRA (and rename _catalog_to_pandas to _execute_and_fetch) in Catalog - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 04:15:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39920: [SPARK-41716][CONNECT] Remove the JIRA (and rename _catalog_to_pandas to _execute_and_fetch) in Catalog - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 04:16:11 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39702: [SPARK-41487][SQL] Assign name to _LEGACY_ERROR_TEMP_1020 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/07 04:20:47 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/07 04:56:15 UTC, 28 replies.
- [GitHub] [spark] wayneguow commented on a diff in pull request #39906: [WIP][SPARK-41962][MINOR][SQL] Update the order of imports in class SpecificParquetRecordReaderBase - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/07 04:57:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39918: [SPARK-42366][SHUFFLE] Log shuffle data corruption diagnose cause - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 04:59:00 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39918: [SPARK-42366][SHUFFLE] Log shuffle data corruption diagnose cause - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/07 05:02:26 UTC, 0 replies.
- [GitHub] [spark] wayneguow commented on a diff in pull request #39878: [SPARK-42335][SQL] Pass the comment option through to univocity if users set it explicitly in CSV dataSource - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/07 05:02:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39920: [SPARK-41716][CONNECT] Rename _catalog_to_pandas to _execute_and_fetch in Catalog - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 05:03:01 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39913: [SPARK-42268][CONNECT][PYTHON][TESTS][FOLLOWUP] Add `test_simple_udt` for `UserDefinedType` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 05:04:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39913: [SPARK-42268][CONNECT][PYTHON][TESTS][FOLLOWUP] Add `test_simple_udt` for `UserDefinedType` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 05:04:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39914: [SPARK-40532][CONNECT] Add Python Version into Python UDF message - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 05:06:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39898: [SPARK-42354][BUILD] Upgrade jackson to 2.14.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 05:08:34 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39898: [SPARK-42354][BUILD] Upgrade jackson to 2.14.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 05:08:57 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #39918: [SPARK-42366][SHUFFLE] Log shuffle data corruption diagnose cause - posted by "Ngone51 (via GitHub)" <gi...@apache.org> on 2023/02/07 05:09:46 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39633: [SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 05:20:46 UTC, 0 replies.
- [GitHub] [spark] wayneguow commented on pull request #39878: [SPARK-42335][SQL] Pass the comment option through to univocity if users set it explicitly in CSV dataSource - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/07 05:29:31 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39916: [SPARK-42363][CONNECT] Remove SparkSession.register_udf - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 05:31:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39916: [SPARK-42363][CONNECT] Remove SparkSession.register_udf - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 05:31:34 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39915: [SPARK-42364][PS][TESTS] Split 'pyspark.pandas.tests.test_dataframe' - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 05:39:09 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #39918: [SPARK-42366][SHUFFLE] Log shuffle data corruption diagnose cause - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/02/07 06:00:54 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39898: [SPARK-42354][BUILD] Upgrade jackson to 2.14.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/07 06:03:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39919: [SPARK-41600][SPARK-41623][SPARK-41612][CONNECT] Implement Catalog.cacheTable, isCached and uncache - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 06:27:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39918: [SPARK-42366][SHUFFLE] Log shuffle data corruption diagnose cause - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 06:30:48 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39914: [SPARK-40532][CONNECT] Add Python Version into Python UDF message - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 06:31:20 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39921: [SPARK-42368][INFRA][TESTS] Ignore SparkRemoteFileTest K8s IT test case in GitHub Action - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 06:36:59 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39904: [SPARK-40819][SQL][3.3] Timestamp nanos behaviour regression - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/07 06:39:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39921: [SPARK-42368][INFRA][TESTS] Exclude SparkRemoteFileTest from GitHub Action K8s IT job - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 06:43:21 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39920: [SPARK-41716][CONNECT] Rename _catalog_to_pandas to _execute_and_fetch in Catalog - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 06:45:14 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #39877: [SPARK-42306][SQL] Integrate `_LEGACY_ERROR_TEMP_1317` into `UNRESOLVED_COLUMN.WITH_SUGGESTION` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/07 06:45:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39914: [SPARK-40532][CONNECT] Add Python Version into Python UDF message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 06:45:44 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39896: [SPARK-42352][BUILD] Upgrade maven to 3.8.7 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/07 06:48:59 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39922: [SPARK-41708][SQL][FOLLOWUP] Do not insert columnar to row transition before write command - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 06:55:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39922: [SPARK-41708][SQL][FOLLOWUP] Do not insert columnar to row transition before write command - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 06:56:05 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39922: [SPARK-41708][SQL][FOLLOWUP] Do not insert columnar to row transition before write command - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 06:56:06 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39891: [SPARK-42318][SPARK-42319][SQL] Assign name to _LEGACY_ERROR_TEMP_(2123|2125) - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/07 07:01:15 UTC, 2 replies.
- [GitHub] [spark] wankunde opened a new pull request, #39923: [SPARK-39851][SQL] Improve join stats estimation if one side can keep uniqueness - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/02/07 07:04:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39906: [SPARK-41962][MINOR][SQL] Update the order of imports in class SpecificParquetRecordReaderBase - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 07:11:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39906: [SPARK-41962][MINOR][SQL] Update the order of imports in class SpecificParquetRecordReaderBase - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 07:11:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39924: [SPARK-41708][SQL][TEST][FOLLOWUP] Match non-space chars in path string - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 07:12:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39924: [SPARK-41708][SQL][TEST][FOLLOWUP] Match non-space chars in path string - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 07:13:27 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39924: [SPARK-41708][SQL][TEST][FOLLOWUP] Match non-space chars in path string - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 07:13:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39922: [SPARK-41708][SQL][FOLLOWUP] Do not insert columnar to row transition before write command - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 07:29:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39921: [SPARK-42368][INFRA][TESTS] Exclude SparkRemoteFileTest from GitHub Action K8s IT job - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 07:34:39 UTC, 0 replies.
- [GitHub] [spark] awdavidson commented on a diff in pull request #39904: [SPARK-40819][SQL][3.3] Timestamp nanos behaviour regression - posted by "awdavidson (via GitHub)" <gi...@apache.org> on 2023/02/07 07:39:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39925: [SPARK-41812][SPARK-41823][CONNECT][SQL][PYTHON] Resolve ambiguous columns issue in `Join` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/07 07:58:46 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39922: [SPARK-41708][SQL][FOLLOWUP] Do not insert columnar to row transition before write command - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 08:00:06 UTC, 0 replies.
- [GitHub] [spark] luhenry commented on pull request #39909: [SPARK-42369][Common] Fix constructor for java.nio.DirectByteBuffer - posted by "luhenry (via GitHub)" <gi...@apache.org> on 2023/02/07 08:11:50 UTC, 0 replies.
- [GitHub] [spark] ted-jenks opened a new pull request, #39926: [SQL] Remove repeated function in CSVExprUtils - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/07 08:13:38 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39902: [SPARK-42349][PYTHON]Support pandas cogroup with multiple df - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/02/07 08:25:19 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39734: [WIP][SPARK-41812][SPARK-41823][CONNECT][PYTHON] Fix ambiguous columns issue in Join - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/07 08:25:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39734: [WIP][SPARK-41812][SPARK-41823][CONNECT][PYTHON] Fix ambiguous columns issue in Join - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/07 08:25:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39914: [SPARK-40532][CONNECT] Add Python Version into Python UDF message - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 08:31:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39920: [SPARK-41716][CONNECT] Rename _catalog_to_pandas to _execute_and_fetch in Catalog - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 08:34:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39917: [SPARK-42365][PS][TESTS] Split 'pyspark.pandas.tests.test_ops_on_diff_frames' - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 08:35:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39917: [SPARK-42365][PS][TESTS] Split 'pyspark.pandas.tests.test_ops_on_diff_frames' - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 08:35:55 UTC, 0 replies.
- [GitHub] [spark] ted-jenks opened a new pull request, #39927: [WIP][SQL] Remove unused blank line removal from CSVExprUtils - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/07 08:44:29 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39866: [SPARK-42287][CONNECT][BUILD] Fix the client dependency jars - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/07 08:46:46 UTC, 12 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39917: [SPARK-42365][PS][TESTS] Split 'pyspark.pandas.tests.test_ops_on_diff_frames' - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/07 08:49:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39915: [SPARK-42364][PS][TESTS] Split 'pyspark.pandas.tests.test_dataframe' - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/07 08:49:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39925: [SPARK-41812][SPARK-41823][CONNECT][SQL][PYTHON] Resolve ambiguous columns issue in `Join` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/07 09:10:47 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39926: [WIP][SQL] Remove repeated function in CSVExprUtils - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 09:12:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39927: [WIP][SQL] Remove unused blank line removal from CSVExprUtils - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 09:14:48 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on pull request #39918: [SPARK-42366][SHUFFLE] Log shuffle data corruption diagnose cause - posted by "Ngone51 (via GitHub)" <gi...@apache.org> on 2023/02/07 09:22:38 UTC, 1 replies.
- [GitHub] [spark] ted-jenks commented on a diff in pull request #39927: [WIP][SQL] Remove unused blank line removal from CSVExprUtils - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/07 09:27:45 UTC, 3 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39922: [SPARK-41708][SQL][FOLLOWUP] Do not insert columnar to row transition before write command - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/07 09:53:36 UTC, 0 replies.
- [GitHub] [spark] santosh-d3vpl3x commented on pull request #39902: [SPARK-42349][PYTHON]Support pandas cogroup with multiple df - posted by "santosh-d3vpl3x (via GitHub)" <gi...@apache.org> on 2023/02/07 10:23:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39928: [WIP][SPARK-42371][CONNECT] Add scripts to start and stop Spark Connect server - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 10:31:14 UTC, 0 replies.
- [GitHub] [spark] ted-jenks commented on a diff in pull request #39926: [WIP][SQL] Remove repeated function in CSVExprUtils - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/07 10:42:56 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #39929: [SPARK-42372][SQL] Improve performance of HiveGenericUDTF by making inputProjection instantiate once - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/07 10:46:55 UTC, 0 replies.
- [GitHub] [spark] ted-jenks closed pull request #39927: [WIP][SQL] Remove unused blank line removal from CSVExprUtils - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/07 10:47:21 UTC, 0 replies.
- [GitHub] [spark] ted-jenks closed pull request #39926: [WIP][SQL] Remove repeated function in CSVExprUtils - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/07 10:47:39 UTC, 0 replies.
- [GitHub] [spark] ted-jenks opened a new pull request, #39926: [WIP][SQL] Remove repeated function in CSVExprUtils - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/07 10:48:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39926: [WIP][SQL] Remove skipComments function in CSVExprUtils - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 11:20:28 UTC, 0 replies.
- [GitHub] [spark] ted-jenks closed pull request #39926: [WIP][SQL] Remove skipComments function in CSVExprUtils - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/07 11:26:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39927: [SPARK-42373][SQL] Remove unused blank line removal from CSVExprUtils - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/07 11:47:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39922: [SPARK-41708][SQL][FOLLOWUP] Do not insert columnar to row transition before write command - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 12:13:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39906: [SPARK-41962][MINOR][SQL] Update the order of imports in class SpecificParquetRecordReaderBase - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 12:33:19 UTC, 0 replies.
- [GitHub] [spark] wayneguow opened a new pull request, #39906: [SPARK-41962][MINOR][SQL] Update the order of imports in class SpecificParquetRecordReaderBase - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/07 12:33:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39906: [SPARK-41962][MINOR][SQL] Update the order of imports in class SpecificParquetRecordReaderBase - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 12:33:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39894: [SPARK-42351][CORE] Protobuf serializer for FsHistoryProviderMetadata - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/07 12:41:16 UTC, 1 replies.
- [GitHub] [spark] peter-toth commented on pull request #38038: [SPARK-42136] Refactor BroadcastHashJoinExec output partitioning calculation - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/07 12:51:06 UTC, 1 replies.
- [GitHub] [spark] ted-jenks commented on pull request #39927: [SPARK-42373][SQL] Remove unused blank line removal from CSVExprUtils - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/07 13:01:19 UTC, 1 replies.
- [GitHub] [spark] ted-jenks commented on a diff in pull request #39927: [SPARK-42373][SQL] Remove unused blank line removal from CSVExprUtils - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/07 13:03:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38038: [SPARK-42136] Refactor BroadcastHashJoinExec output partitioning calculation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 13:28:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38038: [SPARK-42136] Refactor BroadcastHashJoinExec output partitioning calculation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 13:29:24 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by "Ngone51 (via GitHub)" <gi...@apache.org> on 2023/02/07 13:35:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/07 13:40:55 UTC, 17 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39929: [SPARK-42372][SQL] Improve performance of HiveGenericUDTF by making inputProjection instantiate once - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/07 13:44:06 UTC, 2 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #39903: [SPARK-42358][CORE] Send ExecutorUpdated with the message argument in Master.removeWorker - posted by "Ngone51 (via GitHub)" <gi...@apache.org> on 2023/02/07 13:44:50 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39930: [Do not merged][SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/07 14:14:23 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #39929: [SPARK-42372][SQL] Improve performance of HiveGenericUDTF by making inputProjection instantiate once - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/07 14:25:39 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #39882: [SPARK-42342][CONNECT] Introduce base hierarchy to exceptions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/07 14:43:27 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/07 15:14:23 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/07 15:15:42 UTC, 10 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39909: [SPARK-42369][CORE] Fix constructor for java.nio.DirectByteBuffer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 15:45:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39909: [SPARK-42369][CORE] Fix constructor for java.nio.DirectByteBuffer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 15:46:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39825: [SPARK-42261][SPARK-42260][K8S] Log Allocation Stalls and Trigger Allocation event without blocking on snapshot - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/07 15:57:01 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #39878: [SPARK-42335][SQL] Pass the comment option through to univocity if users set it explicitly in CSV dataSource - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/07 16:41:55 UTC, 1 replies.
- [GitHub] [spark] khalidmammadov opened a new pull request, #39932: [WIP][MINOR] Code clean up in org.apache.spark.storage - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/02/07 22:35:30 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #39933: [SPARK-42377][Connect] Test framework for Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/07 23:33:04 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39933: [SPARK-42377][Connect] Test framework for Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/07 23:33:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39790: [SPARK-42094][PS] Support `fill_value` for `ps.Series.(add|radd)` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 00:16:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39790: [SPARK-42094][PS] Support `fill_value` for `ps.Series.(add|radd)` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 00:17:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39896: [SPARK-42352][BUILD] Upgrade maven to 3.8.7 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 00:17:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39896: [SPARK-42352][BUILD] Upgrade maven to 3.8.7 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 00:18:00 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38233: [SPARK-40781][CORE] Explain exit code 137 as killed due to OOM - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/08 00:19:19 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38068: [SPARK-40409] spark-sql supports reading `ByteType` data of avro serde - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/08 00:19:22 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39815: [SPARK-42244][PYTHON] Refine error classes and messages - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/08 00:34:17 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #39928: [SPARK-42371][CONNECT] Add scripts to start and stop Spark Connect server - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/08 00:35:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39933: [SPARK-42377][Connect] Test framework for Spark Connect Scala Client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 00:40:13 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #39933: [SPARK-42377][CONNECT][TESTS] Test framework for Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/08 00:44:29 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39916: [SPARK-42363][CONNECT] Remove SparkSession.register_udf - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 00:45:57 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #32361: [SPARK-35240][SS] Use CheckpointFileManager for checkpoint file manipulation - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/08 00:47:24 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39928: [SPARK-42371][CONNECT] Add scripts to start and stop Spark Connect server - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/08 00:52:05 UTC, 5 replies.
- [GitHub] [spark] vinodkc commented on pull request #39449: [SPARK-40688][SQL] Support data masking built-in function 'mask_first_n' - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/02/08 01:16:10 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39934: [SPARK-42378][CONNECT][PYTHON] Make `DataFrame.select` support `a.*` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 01:20:29 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39820: [SPARK-42249][SQL] Refining html link for documentation in error messages. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 01:28:36 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39837: [SPARK-42254][SQL] Assign name to _LEGACY_ERROR_TEMP_1117 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 01:32:49 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39837: [SPARK-42254][SQL] Assign name to _LEGACY_ERROR_TEMP_1117 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 01:33:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39871: [SPARK-42301][SQL] Assign name to _LEGACY_ERROR_TEMP_1129 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 01:35:40 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39871: [SPARK-42301][SQL] Assign name to _LEGACY_ERROR_TEMP_1129 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 01:36:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39815: [SPARK-42244][PYTHON] Refine error classes and messages - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 01:44:17 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #39929: [SPARK-42372][SQL] Improve performance of HiveGenericUDTF by making inputProjection instantiate once - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/08 01:44:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39815: [SPARK-42244][PYTHON] Refine error classes and messages - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 01:44:49 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #21048: [SPARK-23966][SS] Refactoring all checkpoint file writing logic in a common CheckpointFileManager interface - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/08 02:05:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39904: [SPARK-40819][SQL][3.3] Timestamp nanos behaviour regression - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 02:07:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39905: [SPARK-40819][SQL][3.2] Timestamp nanos behaviour regression - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 02:07:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39904: [SPARK-40819][SQL][3.3] Timestamp nanos behaviour regression - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 02:08:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39905: [SPARK-40819][SQL][3.2] Timestamp nanos behaviour regression - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 02:08:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 02:09:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39932: [WIP][MINOR] Code clean up in org.apache.spark.storage - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/08 02:13:25 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #39924: [SPARK-41708][SQL][TEST][FOLLOWUP] Match non-space chars in path string - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/08 02:17:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39933: [SPARK-42377][CONNECT][TESTS] Test framework for Spark Connect Scala Client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/08 02:20:13 UTC, 4 replies.
- [GitHub] [spark] pan3793 commented on pull request #39928: [SPARK-42371][CONNECT] Add scripts to start and stop Spark Connect server - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/02/08 02:28:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39928: [SPARK-42371][CONNECT] Add scripts to start and stop Spark Connect server - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 02:37:20 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39928: [SPARK-42371][CONNECT] Add scripts to start and stop Spark Connect server - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 02:39:46 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39935: [SPARK-42244][PYTHON][FOLLOWUP] Fix error messages to keep the consistency - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/08 02:45:36 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #39936: [MINOR][SS] Use fs.exists in FileSystemBasedCheckpointFileManager.exists - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/08 03:04:50 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39936: [MINOR][SS] Use FileSystem.exists in FileSystemBasedCheckpointFileManager.exists - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/08 03:05:26 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39860: Standardize registered pickled Python UDFs - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/08 03:08:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39933: [SPARK-42377][CONNECT][TESTS] Test framework for Spark Connect Scala Client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/08 04:06:38 UTC, 4 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39936: [SPARK-42379][SS] Use FileSystem.exists in FileSystemBasedCheckpointFileManager.exists - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/08 04:40:06 UTC, 3 replies.
- [GitHub] [spark] navinvishy commented on pull request #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "navinvishy (via GitHub)" <gi...@apache.org> on 2023/02/08 04:56:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39825: [SPARK-42261][SPARK-42260][K8S] Log Allocation Stalls and Trigger Allocation event without blocking on snapshot - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/08 04:58:25 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39892: [SPARK-40045][SQL]Optimize the order of filtering predicates - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/08 05:04:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39892: [SPARK-40045][SQL]Optimize the order of filtering predicates - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/08 05:06:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39903: [SPARK-42358][CORE] Send ExecutorUpdated with the message argument in Master.removeWorker - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/08 05:09:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39903: [SPARK-42358][CORE] Send ExecutorUpdated with the message argument in Master.removeWorker - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/08 05:09:32 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39889: [SPARK-42315][SQL] Assign name to _LEGACY_ERROR_TEMP_(2091|2092) - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 05:26:09 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39882: [SPARK-42342][PYTHON][CONNECT] Introduce base hierarchy to exceptions - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/08 05:36:18 UTC, 1 reply.
- [GitHub] [spark] yaooqinn closed pull request #39929: [SPARK-42372][SQL] Improve performance of HiveGenericUDTF by making inputProjection instantiate once - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/08 05:36:55 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #39929: [SPARK-42372][SQL] Improve performance of HiveGenericUDTF by making inputProjection instantiate once - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/08 05:38:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39934: [SPARK-42378][CONNECT][PYTHON] Make `DataFrame.select` support `a.*` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 06:43:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39934: [SPARK-42378][CONNECT][PYTHON] Make `DataFrame.select` support `a.*` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 06:44:13 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39937: [SPARK-42309][SQL] Assign name to _LEGACY_ERROR_TEMP_1204 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/08 06:53:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39938: [SPARK-42267][CONNECT][PYTHON] `DataFrame.join` should standardize the JoinType string - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 07:06:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39938: [SPARK-42267][CONNECT][PYTHON] `DataFrame.join` should standardize the JoinType string - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 07:06:38 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #39882: [SPARK-42342][PYTHON][CONNECT] Introduce base hierarchy to exceptions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/08 07:27:38 UTC, 0 replies.
- [GitHub] [spark] ivoson commented on pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by "ivoson (via GitHub)" <gi...@apache.org> on 2023/02/08 08:10:21 UTC, 1 reply.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39939: [SPARK-42381][CONNECT][PYTHON] `CreateDataFrame` should accept objects - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 08:24:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39938: [SPARK-42267][CONNECT][PYTHON] `DataFrame.join` should standardize the JoinType string - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 08:28:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39938: [SPARK-42267][CONNECT][PYTHON] `DataFrame.join` should standardize the JoinType string - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 08:29:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39940: [SPARK-42267][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable `test_udf_in_filter_on_top_of_outer_join ` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 08:37:24 UTC, 0 replies.
- [GitHub] [spark] martin-kokos opened a new pull request, #39941: [DOCS] Add link to Hadoop docs - posted by "martin-kokos (via GitHub)" <gi...@apache.org> on 2023/02/08 09:33:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39939: [SPARK-42381][CONNECT][PYTHON] `CreateDataFrame` should accept objects - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 11:16:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39939: [SPARK-42381][CONNECT][PYTHON] `CreateDataFrame` should accept objects - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 11:16:44 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39940: [SPARK-42267][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable `test_udf_in_filter_on_top_of_outer_join ` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 11:18:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39940: [SPARK-42267][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable `test_udf_in_filter_on_top_of_outer_join ` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 11:19:42 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39860: [SPARK-42210][CONNECT][PYTHON] Standardize registered pickled Python UDFs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/08 11:26:13 UTC, 1 reply.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39860: [SPARK-42210][CONNECT][PYTHON] Standardize registered pickled Python UDFs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 11:45:54 UTC, 1 reply.
- [GitHub] [spark] HyukjinKwon commented on pull request #39882: [SPARK-42342][PYTHON][CONNECT] Introduce base hierarchy to exceptions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 11:46:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39882: [SPARK-42342][PYTHON][CONNECT] Introduce base hierarchy to exceptions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 11:46:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39860: [SPARK-42210][CONNECT][PYTHON] Standardize registered pickled Python UDFs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 11:47:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39935: [SPARK-42244][PYTHON][FOLLOWUP] Fix error messages to keep the consistency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 11:49:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39935: [SPARK-42244][PYTHON][FOLLOWUP] Fix error messages to keep the consistency - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 11:49:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39942: [WIP] refine default column value framework - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/08 11:56:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39936: [SPARK-42379][SS] Use FileSystem.exists in FileSystemBasedCheckpointFileManager.exists - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 11:56:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39924: [SPARK-41708][SQL][TEST][FOLLOWUP] Match non-space chars in path string - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 11:58:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39924: [SPARK-41708][SQL][TEST][FOLLOWUP] Match non-space chars in path string - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/08 11:59:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39936: [SPARK-42379][SS] Use FileSystem.exists in FileSystemBasedCheckpointFileManager.exists - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/08 12:05:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39873: [SPARK-42303][SQL] Assign name to _LEGACY_ERROR_TEMP_1326 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 14:05:00 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39873: [SPARK-42303][SQL] Assign name to _LEGACY_ERROR_TEMP_1326 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 14:06:19 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/08 14:17:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/08 14:18:23 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39875: [SPARK-42305][SQL] Integrate `_LEGACY_ERROR_TEMP_1229` into `DECIMAL_PRECISION_EXCEEDS_MAX_PRECISION` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 14:23:04 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39875: [SPARK-42305][SQL] Integrate `_LEGACY_ERROR_TEMP_1229` into `DECIMAL_PRECISION_EXCEEDS_MAX_PRECISION` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 14:23:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/08 14:26:40 UTC, 29 replies.
- [GitHub] [spark] awdavidson opened a new pull request, #39943: [SPARK-40819][SQL] Update version for nanos as long in SqlConf - posted by "awdavidson (via GitHub)" <gi...@apache.org> on 2023/02/08 14:44:04 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39925: [SPARK-41812][SPARK-41823][CONNECT][SQL][PYTHON] Resolve ambiguous columns issue in `Join` - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/08 14:57:53 UTC, 1 reply.
- [GitHub] [spark] LuciferYang opened a new pull request, #39944: [SPARK-42383][CORE] Protobuf serializer for `RocksDB.TypeAliases` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/08 15:14:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39943: [SPARK-40819][SQL] Update version for nanos as long in SqlConf - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/08 15:27:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/08 15:29:45 UTC, 1 reply.
- [GitHub] [spark] viirya commented on pull request #39936: [SPARK-42379][SS] Use FileSystem.exists in FileSystemBasedCheckpointFileManager.exists - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/08 16:49:25 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39937: [SPARK-42309][SQL] Assign name to _LEGACY_ERROR_TEMP_1204 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/08 17:02:31 UTC, 1 reply.
- [GitHub] [spark] bersprockets opened a new pull request, #39945: [SPARK-42384][SQL] Check for null input in generated code for mask function - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/02/08 17:11:41 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39941: [DOCS] Add link to Hadoop docs - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/08 17:12:46 UTC, 2 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #39866: [SPARK-42287][CONNECT][BUILD] Fix the client dependency jars - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/08 17:17:23 UTC, 3 replies.
- [GitHub] [spark] steveloughran commented on pull request #39124: [DON'T MERGE] Test build and test with hadoop 3.3.5-RC0 - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2023/02/08 17:30:42 UTC, 0 replies.
- [GitHub] [spark] awdavidson commented on pull request #39943: [SPARK-40819][SQL][FOLLOWUP] Update SqlConf version for nanosAsLong configuration - posted by "awdavidson (via GitHub)" <gi...@apache.org> on 2023/02/08 17:42:49 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39890: [SPARK-42314][SQL] Assign name to _LEGACY_ERROR_TEMP_2127 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 17:46:17 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39890: [SPARK-42314][SQL] Assign name to _LEGACY_ERROR_TEMP_2127 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 17:47:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39945: [SPARK-42384][SQL] Check for null input in generated code for mask function - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/08 17:57:08 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39937: [SPARK-42309][SQL] Assign name to _LEGACY_ERROR_TEMP_1204 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 17:58:52 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39937: [SPARK-42309][SQL] Assign name to _LEGACY_ERROR_TEMP_1204 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/08 18:02:08 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39946: [SPARK-42310][SQL] Assign name to _LEGACY_ERROR_TEMP_1289 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/08 18:02:32 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39880: typo: StogeLevel -> StorageLevel - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/08 18:14:58 UTC, 1 reply.
- [GitHub] [spark] srowen commented on pull request #39893: [SPARK-42350][SQL][K8S][SS] Replcace `get().getOrElse` with `getOrElse` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/08 18:15:49 UTC, 1 reply.
- [GitHub] [spark] bersprockets commented on a diff in pull request #39945: [SPARK-42384][SQL] Check for null input in generated code for mask function - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/02/08 18:29:05 UTC, 1 reply.
- [GitHub] [spark] MaxGekk closed pull request #39891: [SPARK-42318][SPARK-42319][SQL] Assign name to _LEGACY_ERROR_TEMP_(2123|2125) - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 19:18:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39946: [SPARK-42310][SQL] Assign name to _LEGACY_ERROR_TEMP_1289 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/08 21:07:27 UTC, 1 reply.
- [GitHub] [spark] srowen closed pull request #39878: [SPARK-42335][SQL] Pass the comment option through to univocity if users set it explicitly in CSV dataSource - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/08 21:12:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39906: [SPARK-41962][MINOR][SQL] Update the order of imports in class SpecificParquetRecordReaderBase - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/08 21:31:39 UTC, 4 replies.
- [GitHub] [spark] wayneguow commented on pull request #39906: [SPARK-41962][MINOR][SQL] Update the order of imports in class SpecificParquetRecordReaderBase - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/08 21:38:32 UTC, 1 reply.
- [GitHub] [spark] hvanhovell commented on pull request #39933: [SPARK-42377][CONNECT][TESTS] Test framework for Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/08 21:51:51 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #39936: [SPARK-42379][SS] Use FileSystem.exists in FileSystemBasedCheckpointFileManager.exists - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/08 22:12:33 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39941: [MINOR][DOCS] Add link to Hadoop docs - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/08 23:44:26 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39943: [SPARK-40819][SQL][FOLLOWUP] Update SqlConf version for nanosAsLong configuration - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/09 00:02:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39943: [SPARK-40819][SQL][FOLLOWUP] Update SqlConf version for nanosAsLong configuration - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/09 00:02:52 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38070: [SPARK-38004][PYTHON] Mangle dupe cols documentation - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/09 00:20:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38233: [SPARK-40781][CORE] Explain exit code 137 as killed due to OOM - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/09 00:20:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38068: [SPARK-40409] spark-sql supports reading `ByteType` data of avro serde - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/09 00:20:16 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37369: [SPARK-39942][PYTHON][PS] Need to verify the input nums is integer in nsmallest func - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/09 00:20:17 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37232: [SPARK-39821][PYTHON][PS] Fix error during using DatetimeIndex - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/09 00:20:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38223: [SPARK-40770][PYTHON] Improved error messages for applyInPandas for schema mismatch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/09 00:52:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38223: [SPARK-40770][PYTHON] Improved error messages for applyInPandas for schema mismatch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/09 00:52:23 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/09 01:18:39 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39860: [SPARK-42210][CONNECT][PYTHON] Standardize registered pickled Python UDFs - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/09 01:44:10 UTC, 1 reply.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39925: [SPARK-41812][SPARK-41823][CONNECT][SQL][PYTHON] Resolve ambiguous columns issue in `Join` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/09 01:45:54 UTC, 21 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39893: [SPARK-42350][SQL][K8S][SS] Replcace `get().getOrElse` with `getOrElse` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/09 02:13:02 UTC, 1 reply.
- [GitHub] [spark] beliefer commented on a diff in pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/09 02:22:23 UTC, 3 replies.
- [GitHub] [spark] srowen closed pull request #39893: [SPARK-42350][SQL][K8S][SS] Replcace `get().getOrElse` with `getOrElse` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/09 02:45:41 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39899: [SPARK-42355][BUILD] Upgrade some maven-plugins - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/09 02:46:35 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39899: [SPARK-42355][BUILD] Upgrade some maven-plugins - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/09 02:46:42 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39941: [MINOR][DOCS] Add link to Hadoop docs - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/09 02:48:35 UTC, 1 reply.
- [GitHub] [spark] itholic commented on pull request #39946: [SPARK-42310][SQL] Assign name to _LEGACY_ERROR_TEMP_1289 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/09 03:06:56 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39946: [SPARK-42310][SQL] Assign name to _LEGACY_ERROR_TEMP_1289 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/09 03:10:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39899: [SPARK-42355][BUILD] Upgrade some maven-plugins - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/09 03:12:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39947: [SPARK-40453][SPARK-41715][CONNECT] Take super class into account when throwing an exception - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/09 03:20:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39722: [SPARK-42162] Introduce MultiCommutativeOp expression as a memory optimization for canonicalizing large trees of commutative expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/09 03:49:35 UTC, 4 replies.
- [GitHub] [spark] db-scnakandala commented on a diff in pull request #39722: [SPARK-42162] Introduce MultiCommutativeOp expression as a memory optimization for canonicalizing large trees of commutative expressions - posted by "db-scnakandala (via GitHub)" <gi...@apache.org> on 2023/02/09 04:41:35 UTC, 1 reply.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #39897: [WIP][SPARK-42353][SS] Cleanup orphan sst and log files in RocksDB checkpoint directory - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/09 04:52:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39947: [SPARK-40453][SPARK-41715][CONNECT] Take super class into account when throwing an exception - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/09 05:22:34 UTC, 10 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39948: [SPARK-42385][BUILD] Upgrade RoaringBitmap to 0.9.39 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/09 05:36:05 UTC, 0 replies.
- [GitHub] [spark] chaoqin-li1123 commented on a diff in pull request #39897: [WIP][SPARK-42353][SS] Cleanup orphan sst and log files in RocksDB checkpoint directory - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2023/02/09 06:23:04 UTC, 0 replies.
- [GitHub] [spark] chaoqin-li1123 commented on pull request #39897: [WIP][SPARK-42353][SS] Cleanup orphan sst and log files in RocksDB checkpoint directory - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2023/02/09 06:23:28 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39949: [SPARK-42386][SQL] Rewrite HiveGenericUDF with Invoke - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/09 06:41:19 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #39865: [SPARK-42052][SQL] Codegen Support for HiveSimpleUDF - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/09 06:48:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39947: [SPARK-40453][SPARK-41715][CONNECT] Take super class into account when throwing an exception - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/09 07:55:16 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39948: [SPARK-42385][BUILD] Upgrade RoaringBitmap to 0.9.39 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/09 08:14:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39948: [SPARK-42385][BUILD] Upgrade RoaringBitmap to 0.9.39 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/09 08:18:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39949: [SPARK-42386][SQL] Rewrite HiveGenericUDF with Invoke - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/09 08:38:25 UTC, 0 replies.
- [GitHub] [spark] yabola opened a new pull request, #39950: SPARK-42388 Avoid unnecessary parquet footer reads when no filters in vectorized reader - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/02/09 08:48:55 UTC, 0 replies.
- [GitHub] [spark] yabola commented on a diff in pull request #39950: [SPARK-42388][SQL] Avoid unnecessary parquet footer reads when no filters in vectorized reader - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/02/09 09:00:58 UTC, 0 replies.
- [GitHub] [spark] martin-kokos commented on a diff in pull request #39941: [MINOR][DOCS] Add link to Hadoop docs - posted by "martin-kokos (via GitHub)" <gi...@apache.org> on 2023/02/09 09:04:53 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #37787: [SPARK-40323][BUILD] Update ORC to 1.8.0 - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/02/09 09:28:10 UTC, 1 reply.
- [GitHub] [spark] wangyum commented on a diff in pull request #39908: [SPARK-42360][SQL] Transform LeftOuter join with IsNull filter on right side to Anti join - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/09 09:51:42 UTC, 3 replies.
- [GitHub] [spark] xinrong-meng closed pull request #39860: [SPARK-42210][CONNECT][PYTHON] Standardize registered pickled Python UDFs - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/09 10:18:29 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39951: [SPARK-42312][SQL] Assign name to _LEGACY_ERROR_TEMP_0042 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/09 10:59:02 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39952: [SPARK-40770][PYTHON][FOLLOW-UP] Improved error messages for mapInPandas for schema mismatch - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/02/09 11:02:27 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39953: [WIP][SPARK-42313][SQL] Integrate `_LEGACY_ERROR_TEMP_1152` into `UNSUPPORTED_SAVE_MODE.EXISTENT_PATH` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/09 11:09:57 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #39949: [SPARK-42386][SQL] Rewrite HiveGenericUDF with Invoke - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/09 11:13:34 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39954: [SPARK-42289][SQL] DS V2 pushdown could let JDBC dialect decide to push down offset and limit - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/09 13:35:22 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39952: [SPARK-40770][PYTHON][FOLLOW-UP] Improved error messages for mapInPandas for schema mismatch - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/02/09 14:04:34 UTC, 1 reply.
- [GitHub] [spark] LuciferYang opened a new pull request, #39955: [SPARK-42389][CORE][TESTS] Clean up the remaining ui path after test when `LIVE_UI_LOCAL_STORE_DIR` is configured - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/09 15:20:10 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #39918: [SPARK-42366][SHUFFLE] Log shuffle data corruption diagnose cause - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/09 16:36:23 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39918: [SPARK-42366][SHUFFLE] Log shuffle data corruption diagnose cause - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/09 16:37:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #39955: [WIP][SPARK-42389][CORE][TESTS] Ensure Live UI dir be cleaned up after testing - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/09 17:02:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39956: [SPARK-42389][CORE][TESTS] Ensure Live UI dir be cleaned up after test when `LIVE_UI_LOCAL_STORE_DIR` is configured - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/09 17:06:53 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39956: [SPARK-42389][CORE][TESTS] Ensure Live UI dir be cleaned up after test when `LIVE_UI_LOCAL_STORE_DIR` is configured - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/09 17:16:04 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #39897: [SPARK-42353][SS] Cleanup orphan sst and log files in RocksDB checkpoint directory - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/09 17:21:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37787: [SPARK-40323][BUILD] Update ORC to 1.8.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/09 17:24:25 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39956: [SPARK-42389][CORE][TESTS] Ensure Live UI dir be cleaned up after test when `LIVE_UI_LOCAL_STORE_DIR` is configured - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/09 17:54:17 UTC, 5 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39894: [SPARK-42351][CORE] Protobuf serializer for FsHistoryProviderMetadata - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/09 19:41:39 UTC, 0 replies.
- [GitHub] [spark] ben-zhang commented on a diff in pull request #38433: [SPARK-40943][SQL] Make `MSCK` keyword optional in `REPAIR TABLE` syntax - posted by "ben-zhang (via GitHub)" <gi...@apache.org> on 2023/02/09 21:17:59 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #39957: [SPARK-42338][CONNECT] Add details to non-fatal errors to raise a proper exception in the Python client - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/09 22:51:26 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #39957: [SPARK-42338][CONNECT] Add details to non-fatal errors to raise a proper exception in the Python client - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/09 22:55:15 UTC, 0 replies.
- [GitHub] [spark] viirya opened a new pull request, #39958: MINOR: Fix setTimeoutTimestamp doc - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/09 23:05:28 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39897: [SPARK-42353][SS] Cleanup orphan sst and log files in RocksDB checkpoint directory - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/09 23:21:25 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #39897: [SPARK-42353][SS] Cleanup orphan sst and log files in RocksDB checkpoint directory - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/09 23:22:16 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #39947: [SPARK-40453][SPARK-41715][CONNECT] Take super class into account when throwing an exception - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/09 23:41:18 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39957: [SPARK-42338][CONNECT] Add details to non-fatal errors to raise a proper exception in the Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/09 23:48:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39957: [SPARK-42338][CONNECT] Add details to non-fatal errors to raise a proper exception in the Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/09 23:48:59 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38452: [SPARK-40802][SQL] Resolve JDBCRelation's schema with preparing the statement - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/10 00:21:10 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38070: [SPARK-38004][PYTHON] Mangle dupe cols documentation - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/10 00:21:12 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37369: [SPARK-39942][PYTHON][PS] Need to verify the input nums is integer in nsmallest func - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/10 00:21:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37232: [SPARK-39821][PYTHON][PS] Fix error during using DatetimeIndex - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/10 00:21:16 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39954: [SPARK-42289][SQL] DS V2 pushdown could let JDBC dialect decide to push down offset and limit - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/10 00:46:10 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #39958: [MINOR][SS]: Fix setTimeoutTimestamp doc - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/10 01:08:03 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39959: [SPARK-42390][CONNECT][BUILD] Upgrade buf from 1.13.1 to 1.14.0 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/10 01:26:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39947: [SPARK-40453][SPARK-41715][CONNECT] Take super class into account when throwing an exception - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/10 01:50:11 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #39960: [SPARK-41963][CONNECT] Fix DataFrame.unpivot to raise the same error class when the `values` argument is empty - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/10 02:02:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39960: [SPARK-41963][CONNECT] Fix DataFrame.unpivot to raise the same error class when the `values` argument is empty - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/10 02:16:17 UTC, 0 replies.
- [GitHub] [spark] viirya closed pull request #39958: [MINOR][SS]: Fix setTimeoutTimestamp doc - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/10 02:17:37 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39951: [SPARK-42312][SQL] Assign name to _LEGACY_ERROR_TEMP_0042 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/10 02:24:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #39894: [SPARK-42351][CORE] Protobuf serializer for FsHistoryProviderMetadata - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/10 02:28:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39944: [SPARK-42383][CORE] Protobuf serializer for `RocksDB.TypeAliases` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/10 02:28:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #39944: [SPARK-42383][CORE] Protobuf serializer for `RocksDB.TypeAliases` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/10 02:28:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39045: [SPARK-41507][SQL] Correct group of collection_funcs and fix description of `ExpressionDescription#group` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/10 02:30:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39848: [SPARK-42276][BUILD][CONNECT] Add `ServicesResourceTransformer` rule to connect server module shade configuration - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/10 02:58:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39848: [SPARK-42276][BUILD][CONNECT] Add `ServicesResourceTransformer` rule to connect server module shade configuration - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/10 02:58:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39925: [SPARK-41812][SPARK-41823][CONNECT][SQL][PYTHON] Resolve ambiguous columns issue in `Join` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/10 03:02:47 UTC, 17 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39961: [SPARK-42391][CORE][TESTS] Close live `AppStore` in the finally block for test cases - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/10 03:04:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39942: [WIP] refine default column value framework - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/10 03:25:09 UTC, 10 replies.
- [GitHub] [spark] smallzhongfeng opened a new pull request, #39962: [SPARK-42392] Add a new case of TriggeredByExecutorDecommissionInfo to remove unnecessary param - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/02/10 04:24:03 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39963: [SPARK-42324][SQL] Assign name to _LEGACY_ERROR_TEMP_1001 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/10 05:57:49 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39963: [SPARK-42324][SQL] Assign name to _LEGACY_ERROR_TEMP_1001 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/10 05:58:12 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #39964: [SPARK-42269][CONNECT][PYTHON] Support complex return types in DDL strings - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/10 06:18:18 UTC, 0 replies.
- [GitHub] [spark] yabola commented on pull request #39950: [SPARK-42388][SQL] Avoid unnecessary parquet footer reads when no filters in vectorized reader - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/02/10 06:25:16 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39965: [SPARK-42325][SQL] Assign name to _LEGACY_ERROR_TEMP_0035 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/10 06:27:41 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39965: [SPARK-42325][SQL] Assign name to _LEGACY_ERROR_TEMP_0035 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/10 06:31:05 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on a diff in pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/02/10 06:36:37 UTC, 6 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #39962: [SPARK-42392][CORE] Add a new case of TriggeredByExecutorDecommissionInfo to remove unnecessary param - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/02/10 07:34:58 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/10 07:53:18 UTC, 54 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #39966: [SPARK-42394][SQL] Fix the usage information of bin/spark-sql --help - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/10 09:03:11 UTC, 0 replies.
- [GitHub] [spark] ninebigbig opened a new pull request, #39967: [SPARK-42395][K8S] The code logic of the configmap max size validation lacks extra content - posted by "ninebigbig (via GitHub)" <gi...@apache.org> on 2023/02/10 09:57:10 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39937: [SPARK-42309][SQL] Introduce `INCOMPATIBLE_DATA_TO_TABLE` and sub classes. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/10 11:21:40 UTC, 1 replies.
- [GitHub] [spark] yaooqinn closed pull request #39966: [SPARK-42394][SQL] Fix the usage information of bin/spark-sql --help - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/10 11:25:23 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #39966: [SPARK-42394][SQL] Fix the usage information of bin/spark-sql --help - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/10 11:26:07 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #39045: [SPARK-41507][SQL] Correct group of collection_funcs and fix description of `ExpressionDescription#group` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/10 11:54:47 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #39902: [SPARK-42349][PYTHON] Support pandas cogroup with multiple df - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/02/10 13:38:09 UTC, 2 replies.
- [GitHub] [spark] santosh-d3vpl3x commented on a diff in pull request #39902: [SPARK-42349][PYTHON] Support pandas cogroup with multiple df - posted by "santosh-d3vpl3x (via GitHub)" <gi...@apache.org> on 2023/02/10 15:10:34 UTC, 8 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39722: [SPARK-42162] Introduce MultiCommutativeOp expression as a memory optimization for canonicalizing large trees of commutative expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/10 15:55:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39722: [SPARK-42162] Introduce MultiCommutativeOp expression as a memory optimization for canonicalizing large trees of commutative expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/10 15:57:09 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39963: [SPARK-42324][SQL] Assign name to _LEGACY_ERROR_TEMP_1001 - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/10 19:24:36 UTC, 2 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39951: [SPARK-42312][SQL] Assign name to _LEGACY_ERROR_TEMP_0042 - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/10 19:58:14 UTC, 0 replies.
- [GitHub] [spark] SiHuoGe commented on a diff in pull request #38703: [SPARK-41191][SQL] Cache Table is not working while nested caches exist - posted by "SiHuoGe (via GitHub)" <gi...@apache.org> on 2023/02/10 20:15:21 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #39968: [SPARK-42265][SPARK-41820][CONNECT] Fix createTempView and its variations to work with not analyzed plans - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/10 20:49:42 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #39969: [SPARK-42396][BUILD] Upgrade `Apache Kafka` to 3.4.0 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/10 21:25:41 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #39969: [WIP][SPARK-42396][BUILD] Upgrade `Apache Kafka` to 3.4.0 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/10 22:14:45 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #39970: [SPARK-42401][SQL] Set `containsNull` correctly in the data type for array_insert/array_append - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/02/10 23:35:40 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on a diff in pull request #39970: [SPARK-42401][SQL] Set `containsNull` correctly in the data type for array_insert/array_append - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/02/10 23:47:24 UTC, 5 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38452: [SPARK-40802][SQL] Resolve JDBCRelation's schema with preparing the statement - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/11 00:17:12 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #39971: [SPARK-42402][CONNECT] Support parameterized SQL by `sql()` - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/11 00:41:49 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #39971: [SPARK-42402][CONNECT] Support parameterized SQL by `sql()` - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/11 00:43:29 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #36885: [SPARK-39489][CORE] Improve event logging JsonProtocol performance by using Jackson instead of Json4s - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/11 01:08:21 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #36885: [SPARK-39489][CORE] Improve event logging JsonProtocol performance by using Jackson instead of Json4s - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/11 01:13:44 UTC, 3 replies.
- [GitHub] [spark] itholic commented on pull request #39953: [WIP][SPARK-42313][SQL] Assign name to `_LEGACY_ERROR_TEMP_1152` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/11 01:14:12 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39963: [SPARK-42324][SQL] Assign name to _LEGACY_ERROR_TEMP_1001 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/11 01:21:48 UTC, 8 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39951: [SPARK-42312][SQL] Assign name to _LEGACY_ERROR_TEMP_0042 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/11 01:28:03 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39972: [SPARK-42403][CORE] Handle StackTraces based on no Java files - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/11 01:31:59 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on pull request #36885: [SPARK-39489][CORE] Improve event logging JsonProtocol performance by using Jackson instead of Json4s - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/02/11 01:33:25 UTC, 2 replies.
- [GitHub] [spark] JoshRosen opened a new pull request, #39973: [SPARK-42403][CORE] JsonProtocol should handle null Json strings - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/02/11 02:09:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39972: [SPARK-42403][CORE] Handle StackTraces based on no Java files - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/11 02:25:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39973: [SPARK-42403][CORE] JsonProtocol should handle null JSON strings - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/11 02:26:34 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39973: [SPARK-42403][CORE] JsonProtocol should handle null JSON strings - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/11 02:29:25 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39971: [SPARK-42402][CONNECT] Support parameterized SQL by `sql()` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/11 02:40:02 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on pull request #39973: [SPARK-42403][CORE] JsonProtocol should handle null JSON strings - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/02/11 02:55:45 UTC, 0 replies.
- [GitHub] [spark] ninebigbig commented on a diff in pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "ninebigbig (via GitHub)" <gi...@apache.org> on 2023/02/11 03:42:01 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39973: [SPARK-42403][CORE] JsonProtocol should handle null JSON strings - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/11 05:54:54 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39953: [WIP][SPARK-42313][SQL] Assign name to `_LEGACY_ERROR_TEMP_1152` - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/11 06:03:20 UTC, 0 replies.
- [GitHub] [spark] Dam1029 commented on pull request #38518: [SPARK-33349][K8S] Reset the executor pods watcher when we receive a version changed from k8s - posted by "Dam1029 (via GitHub)" <gi...@apache.org> on 2023/02/11 06:39:08 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39953: [WIP][SPARK-42313][SQL] Assign name to `_LEGACY_ERROR_TEMP_1152` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/11 07:12:47 UTC, 0 replies.
- [GitHub] [spark] yabola commented on a diff in pull request #39950: [SPARK-42388][SQL] Avoid parquet footer reads when no filters in vectorized reader - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/02/11 09:51:22 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on pull request #39932: [SPARK-42400] Code clean up in org.apache.spark.storage - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/02/11 12:58:51 UTC, 1 replies.
- [GitHub] [spark] wayneguow opened a new pull request, #39974: [SPARK-39142][SPARK-42235][PYTHON] Add overloads in pandas function stub file - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/11 13:44:41 UTC, 0 replies.
- [GitHub] [spark] Daniel-Davies opened a new pull request, #39975: improve array insert documentation - posted by "Daniel-Davies (via GitHub)" <gi...@apache.org> on 2023/02/11 13:59:50 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 opened a new pull request, #39976: [SPARK-42034] QueryExecutionListener and Observation API do not work with `foreach` / `reduce` / `foreachPartition` action. - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/02/11 16:59:51 UTC, 0 replies.
- [GitHub] [spark] wayneguow commented on pull request #39974: [SPARK-39142][SPARK-42235][PYTHON] Add overloads in pandas function stub file - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/11 17:31:31 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38262: [SPARK-40801][BUILD] Upgrade `Apache commons-text` to 1.10 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/11 20:44:48 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #34376: [SPARK-37105][TEST] Pass all UTs in `sql/hive` with Java 17 - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/11 22:35:10 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #34376: [SPARK-37105][TEST] Pass all UTs in `sql/hive` with Java 17 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/11 22:37:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39971: [SPARK-42402][CONNECT] Support parameterized SQL by `sql()` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/12 00:04:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39971: [SPARK-42402][CONNECT] Support parameterized SQL by `sql()` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/12 00:05:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39968: [SPARK-42265][SPARK-41820][CONNECT] Fix createTempView and its variations to work with not analyzed plans - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/12 00:05:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39968: [SPARK-42265][SPARK-41820][CONNECT] Fix createTempView and its variations to work with not analyzed plans - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/12 00:06:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39960: [SPARK-41963][CONNECT] Fix DataFrame.unpivot to raise the same error class when the `values` argument is empty - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/12 00:06:50 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39964: [SPARK-42269][CONNECT][PYTHON] Support complex return types in DDL strings - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/12 00:09:51 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39946: [SPARK-42310][SQL] Assign name to _LEGACY_ERROR_TEMP_1289 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/12 00:10:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39946: [SPARK-42310][SQL] Assign name to _LEGACY_ERROR_TEMP_1289 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/12 00:11:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39959: [SPARK-42390][CONNECT][BUILD] Upgrade buf from 1.13.1 to 1.14.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/12 00:11:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39959: [SPARK-42390][CONNECT][BUILD] Upgrade buf from 1.13.1 to 1.14.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/12 00:12:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38324: Protobuf generate V2 and V3 protos and extend tests. - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/12 00:21:55 UTC, 0 replies.
- [GitHub] [spark] felixcheung commented on pull request #36337: [SPARK-39008][BUILD] Change ASF as a single author in Spark distribution - posted by "felixcheung (via GitHub)" <gi...@apache.org> on 2023/02/12 00:25:03 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39977: [DO-NOT-MERGE][TEST] LEGACY_2332 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/12 04:13:22 UTC, 0 replies.
- [GitHub] [spark] williamhyun opened a new pull request, #39978: [SPARK-42408][CORE] Register DoubleType to KryoSerializer - posted by "williamhyun (via GitHub)" <gi...@apache.org> on 2023/02/12 04:16:54 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #39933: [SPARK-42377][CONNECT][TESTS] Test framework for Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/12 04:25:51 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39979: [SPARK-42326][SQL] Integrate `_LEGACY_ERROR_TEMP_2099` into `UNSUPPORTED_DATATYPE` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/12 05:05:39 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39980: [SPARK-42327][SQL] Assign name to `_LEGACY_ERROR_TEMP_2177` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/12 05:32:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39978: [SPARK-42408][CORE] Register DoubleType to KryoSerializer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/12 06:36:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39978: [SPARK-42408][CORE] Register DoubleType to KryoSerializer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/12 06:37:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39969: [SPARK-42396][BUILD] Upgrade `Apache Kafka` to 3.4.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/12 06:40:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39981: [SPARK-42409][BUILD] Upgrade ZSTD-JNI to 1.5.4-1 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/12 12:39:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #39956: [SPARK-42389][CORE][TESTS] Ensure Live UI dir be cleaned up after test when `LIVE_UI_LOCAL_STORE_DIR` is configured - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/12 12:43:08 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39951: [SPARK-42312][SQL] Assign name to _LEGACY_ERROR_TEMP_0042 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/12 14:12:32 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39951: [SPARK-42312][SQL] Assign name to _LEGACY_ERROR_TEMP_0042 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/12 14:13:18 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39953: [SPARK-42313][SQL] Assign name to `_LEGACY_ERROR_TEMP_1152` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/12 14:22:51 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39953: [SPARK-42313][SQL] Assign name to `_LEGACY_ERROR_TEMP_1152` - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/12 14:35:57 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39932: [SPARK-42400] Code clean up in org.apache.spark.storage - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/12 15:43:10 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39932: [SPARK-42400] Code clean up in org.apache.spark.storage - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/12 15:43:30 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39963: [SPARK-42324][SQL] Assign name to _LEGACY_ERROR_TEMP_1001 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/12 15:48:59 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39965: [SPARK-42325][SQL] Assign name to _LEGACY_ERROR_TEMP_0035 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/12 16:33:18 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39979: [SPARK-42326][SQL] Integrate `_LEGACY_ERROR_TEMP_2099` into `UNSUPPORTED_DATATYPE` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/12 17:30:29 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39980: [SPARK-42327][SQL] Assign name to `_LEGACY_ERROR_TEMP_2177` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/12 17:56:21 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39981: [SPARK-42409][BUILD] Upgrade ZSTD-JNI to 1.5.4-1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/12 21:11:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39981: [SPARK-42409][BUILD] Upgrade ZSTD-JNI to 1.5.4-1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/12 21:11:30 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39933: [SPARK-42377][CONNECT][TESTS] Test framework for Spark Connect Scala Client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/12 21:24:42 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39933: [SPARK-42377][CONNECT][TESTS] Test framework for Spark Connect Scala Client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/12 23:17:29 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39982: [SPARK-42410][CONNECT][TESTS] Support Scala 2.12/2.13 tests in `connect` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/12 23:34:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39982: [SPARK-42410][CONNECT][TESTS] Support Scala 2.12/2.13 tests in `connect` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/12 23:37:38 UTC, 5 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38486: [SPARK-41000][SQL] Make CommandResult extend Command trait - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/13 00:20:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38324: Protobuf generate V2 and V3 protos and extend tests. - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/13 00:20:51 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38071: [SPARK-36290][SQL] Pull out complex join condition - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/13 00:20:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39983: [SPARK-40453][SPARK-41715][CONNECT][TESTS] Skip freqItems doctest due to Scala 2.13 failure - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 00:41:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39947: [SPARK-40453][SPARK-41715][CONNECT] Take super class into account when throwing an exception - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 00:41:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39983: [SPARK-40453][SPARK-41715][CONNECT][TESTS][FOLLOWUP] Skip freqItems doctest due to Scala 2.13 failure - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 00:42:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39983: [SPARK-40453][SPARK-41715][CONNECT][TESTS][FOLLOWUP] Skip freqItems doctest due to Scala 2.13 failure - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 00:43:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39983: [SPARK-40453][SPARK-41715][CONNECT][TESTS][FOLLOWUP] Skip freqItems doctest due to Scala 2.13 failure - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 00:43:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39960: [SPARK-41963][CONNECT] Fix DataFrame.unpivot to raise the same error class when the `values` argument is empty - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 00:45:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39970: [SPARK-42401][SQL] Set `containsNull` correctly in the data type for array_insert/array_append - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 00:49:18 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39976: [SPARK-42034] QueryExecutionListener and Observation API do not work with `foreach` / `reduce` / `foreachPartition` action. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 00:50:54 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39974: [SPARK-39142][SPARK-42235][PYTHON] Add overloads in pandas function stub file - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 00:55:49 UTC, 1 replies.
- [GitHub] [spark] yabola commented on pull request #39950: [SPARK-42388][SQL] Avoid parquet footer reads when no filters in vectorized reader - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/02/13 01:29:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39982: [SPARK-42410][CONNECT][TESTS] Support Scala 2.12/2.13 tests in `connect` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 01:31:35 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #39984: [SPARK-42263][CONNECT][PYTHON] Implement `spark.catalog.registerFunction` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/13 02:08:21 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #39985: [WIP] Initial prototype implementation of PySpark ML via Spark connect - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/02/13 02:11:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39986: [SPARK-42410][CONNECT][TESTS][FOLLOWUP] Fix PlanGenerationTestSuite together - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 02:22:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39986: [SPARK-42410][CONNECT][TESTS][FOLLOWUP] Fix PlanGenerationTestSuite together - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 02:26:36 UTC, 1 replies.
- [GitHub] [spark-docker] Yikun opened a new pull request, #28: Add awesome-spark-docker.md - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/02/13 02:30:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37487: [SPARK-40053][CORE][SQL][TESTS] Add `assume` to dynamic cancel cases which requiring Python runtime environment - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/13 02:54:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39981: [SPARK-42409][BUILD] Upgrade ZSTD-JNI to 1.5.4-1 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/13 02:54:42 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37487: [SPARK-40053][CORE][SQL][TESTS] Add `assume` to dynamic cancel cases which requiring Python runtime environment - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/13 02:55:12 UTC, 1 replies.
- [GitHub] [spark-docker] Yikun opened a new pull request, #29: Test on 3.3.2-rc1 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/02/13 02:57:12 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39986: [SPARK-42410][CONNECT][TESTS][FOLLOWUP] Fix `PlanGenerationTestSuite` together - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 02:57:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39986: [SPARK-42410][CONNECT][TESTS][FOLLOWUP] Fix `PlanGenerationTestSuite` together - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 02:58:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39986: [SPARK-42410][CONNECT][TESTS][FOLLOWUP] Fix `PlanGenerationTestSuite` together - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/13 03:28:34 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39961: [SPARK-42391][CORE][TESTS] Close live `AppStatusStore` in the finally block for test cases - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/13 03:33:02 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39975: [SPARK-42405][SQL] Improve array insert documentation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/13 03:40:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39866: [SPARK-42287][CONNECT][BUILD] Fix the client dependency jars - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/13 03:54:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39961: [SPARK-42391][CORE][TESTS] Close live `AppStatusStore` in the finally block for test cases - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 04:15:00 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39961: [SPARK-42391][CORE][TESTS] Close live `AppStatusStore` in the finally block for test cases - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 04:15:39 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39979: [SPARK-42326][SQL] Integrate `_LEGACY_ERROR_TEMP_2099` into `UNSUPPORTED_DATATYPE` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/13 04:35:14 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39870: [SPARK-42331][SQL] Fix metadata col can not been resolved - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/13 04:43:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39870: [SPARK-42331][SQL] Fix metadata col can not been resolved - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/13 04:43:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39973: [SPARK-42403][CORE] JsonProtocol should handle null JSON strings - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/13 04:47:18 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39953: [SPARK-42313][SQL] Assign name to `_LEGACY_ERROR_TEMP_1152` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/13 04:53:18 UTC, 0 replies.
- [GitHub] [spark] wayneguow commented on a diff in pull request #39974: [SPARK-39142][SPARK-42235][PYTHON] Add overloads in pandas function stub file - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/13 04:54:26 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #39976: [SPARK-42034] QueryExecutionListener and Observation API do not work with `foreach` / `reduce` / `foreachPartition` action. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/13 04:59:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39976: [SPARK-42034] QueryExecutionListener and Observation API do not work with `foreach` / `reduce` / `foreachPartition` action. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 05:09:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39976: [SPARK-42034] QueryExecutionListener and Observation API do not work with `foreach` / `reduce` / `foreachPartition` action. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 05:10:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39964: [SPARK-42269][CONNECT][PYTHON] Support complex return types in DDL strings - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 05:10:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39986: [SPARK-42410][CONNECT][TESTS][FOLLOWUP] Fix `PlanGenerationTestSuite` together - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 05:23:11 UTC, 0 replies.
- [GitHub] [spark] rithwik-db opened a new pull request, #39987: [SPARK-41775][PYTHON][FOLLOWUP] Updating error message for training using PyTorch functions - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/02/13 05:28:26 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #39986: [SPARK-42410][CONNECT][TESTS][FOLLOWUP] Fix `PlanGenerationTestSuite` together - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/13 05:44:18 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #39908: [SPARK-42360][SQL] Transform LeftOuter join with IsNull filter on right side to Anti join - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/02/13 05:47:11 UTC, 2 replies.
- [GitHub] [spark] ninebigbig commented on pull request #39967: [SPARK-42395][K8S]The code logic of the configmap max size validation lacks extra content - posted by "ninebigbig (via GitHub)" <gi...@apache.org> on 2023/02/13 05:53:31 UTC, 4 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39988: [SPARK-42416][SQL] Dateset.show() should not resolve the analyzed logical plan again - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/13 05:54:04 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39989: [SPARK-42417][BUILD] Upgrade `netty` from 4.1.87.Final to 4.1.88.Final - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/13 06:00:49 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #39039: [SPARK-40776][SQL][PROTOBUF][DOCS] Spark-Protobuf docs - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/13 06:05:49 UTC, 2 replies.
- [GitHub] [spark] bersprockets commented on pull request #39970: [SPARK-42401][SQL] Set `containsNull` correctly in the data type for array_insert/array_append - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/02/13 06:07:32 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39990: [WIP][SPARK-42415][SQL] The built-in dialects support OFFSET and paging query. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/13 06:24:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39970: [SPARK-42401][SQL] Set `containsNull` correctly in the data type for array_insert/array_append - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 06:49:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39970: [SPARK-42401][SQL] Set `containsNull` correctly in the data type for array_insert/array_append - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 06:49:57 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39991: [SPARK-42419][CONNECT][PYTHON] Migrate into error framework for Spark Connect Column API. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/13 06:52:22 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39977: [SPARK-42323][SQL] Assign name to `_LEGACY_ERROR_TEMP_2332` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/13 06:55:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39991: [SPARK-42419][CONNECT][PYTHON] Migrate into error framework for Spark Connect Column API. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 07:07:57 UTC, 3 replies.
- [GitHub] [spark] allanf-db opened a new pull request, #39992: [WIP][SPARK-42418][DOCS][PYSPARK] PySpark documentation updates to improve discoverability and add more guidance - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/02/13 07:35:36 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/13 07:39:05 UTC, 1 replies.
- [GitHub] [spark] williamhyun opened a new pull request, #39993: [SPARK-42420][CORE] Register WriteTaskResult, BasicWriteTaskStats, an… - posted by "williamhyun (via GitHub)" <gi...@apache.org> on 2023/02/13 08:04:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39925: [SPARK-41812][SPARK-41823][CONNECT][SQL][PYTHON] Resolve ambiguous columns issue in `Join` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/13 08:22:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39994: [SPARK-42422][BUILD] Upgrade `maven-shade-plugin` to 3.4.1 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/13 09:11:15 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39977: [SPARK-42323][SQL] Assign name to `_LEGACY_ERROR_TEMP_2332` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/13 09:17:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39995: [WIP][CONNECT] Initial configuration implementation - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/13 09:28:00 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #39994: [SPARK-42422][BUILD] Upgrade `maven-shade-plugin` to 3.4.1 - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/02/13 09:52:48 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39994: [SPARK-42422][BUILD] Upgrade `maven-shade-plugin` to 3.4.1 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 10:24:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39989: [SPARK-42417][BUILD] Upgrade `netty` from 4.1.87.Final to 4.1.88.Final - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 10:29:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39989: [SPARK-42417][BUILD] Upgrade `netty` from 4.1.87.Final to 4.1.88.Final - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 10:29:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39987: [SPARK-41775][PYTHON][FOLLOWUP] Updating error message for training using PyTorch functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 10:32:20 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39977: [SPARK-42323][SQL] Assign name to `_LEGACY_ERROR_TEMP_2332` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/13 10:41:31 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39953: [SPARK-42313][SQL] Assign name to `_LEGACY_ERROR_TEMP_1152` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/13 11:06:22 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39963: [SPARK-42324][SQL] Assign name to _LEGACY_ERROR_TEMP_1001 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/13 11:15:53 UTC, 1 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #39996: [SPARK-42423][SQL] Add metadata column file block start and length - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/13 11:22:00 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39754: [SPARK-42199][SQL] Fix issues around Dataset.groupByKey - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/02/13 12:16:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39992: [WIP][SPARK-42418][DOCS][PYSPARK] PySpark documentation updates to improve discoverability and add more guidance - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/13 13:00:10 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 commented on pull request #39976: [SPARK-42034] QueryExecutionListener and Observation API do not work with `foreach` / `reduce` / `foreachPartition` action. - posted by "zzzzming95 (via GitHub)" <gi...@apache.org> on 2023/02/13 13:05:18 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #39985: [SPARK-42412][WIP] Initial prototype implementation of PySpark ML via Spark connect - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/02/13 13:11:03 UTC, 24 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39982: [SPARK-42410][CONNECT][TESTS] Support Scala 2.12/2.13 tests in `connect` module - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/13 13:12:00 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39994: [SPARK-42422][BUILD] Upgrade `maven-shade-plugin` to 3.4.1 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/13 14:27:55 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39994: [SPARK-42422][BUILD] Upgrade `maven-shade-plugin` to 3.4.1 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/13 14:27:56 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/13 14:30:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39993: [SPARK-42420][CORE] Register WriteTaskResult, BasicWriteTaskStats, and ExecutedWriteSummary to KryoSerializer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 15:32:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39993: [SPARK-42420][CORE] Register WriteTaskResult, BasicWriteTaskStats, and ExecutedWriteSummary to KryoSerializer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/13 15:33:39 UTC, 0 replies.
- [GitHub] [spark] allanf-db commented on pull request #39992: [WIP][SPARK-42418][DOCS][PYSPARK] PySpark documentation updates to improve discoverability and add more guidance - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/02/13 16:20:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39997: [SPARK-42424][YARN] Remove unused method `setEnvFromInputString` from YarnSparkHadoopUtil - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/13 16:21:58 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #37588: [SPARK-33393][SQL] Support SHOW TABLE EXTENDED in v2 - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/02/13 16:23:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39994: [SPARK-42422][BUILD] Upgrade `maven-shade-plugin` to 3.4.1 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/13 16:26:44 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39980: [SPARK-42327][SQL] Assign name to `_LEGACY_ERROR_TEMP_2177` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/13 16:34:06 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #39928: [SPARK-42371][CONNECT] Add scripts to start and stop Spark Connect server - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2023/02/13 16:40:58 UTC, 0 replies.
- [GitHub] [spark] jiwq opened a new pull request, #39998: [SPARK-42421][CORE] Use the utils to get the switch for dynamic allocation used in local checkpoint - posted by "jiwq (via GitHub)" <gi...@apache.org> on 2023/02/13 18:07:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39999: WIP - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/13 18:37:51 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39988: [SPARK-42416][SQL] Dateset operations should not resolve the analyzed logical plan again - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/13 18:57:27 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39988: [SPARK-42416][SQL] Dateset operations should not resolve the analyzed logical plan again - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/13 18:58:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39987: [SPARK-41775][PYTHON][FOLLOW-UP] Updating error message for training using PyTorch functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 19:02:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39987: [SPARK-41775][PYTHON][FOLLOW-UP] Updating error message for training using PyTorch functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/13 19:03:13 UTC, 0 replies.
- [GitHub] [spark] chaoqin-li1123 commented on a diff in pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2023/02/13 19:14:20 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/13 19:29:21 UTC, 9 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #39910: [SPARK-42337][SQL] Add error class CREATE_PERSISTENT_OBJECT_OVER_TEMP_OBJECT - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/13 21:51:48 UTC, 0 replies.
- [GitHub] [spark] Daniel-Davies commented on a diff in pull request #39975: [SPARK-42405][SQL] Improve array insert documentation - posted by "Daniel-Davies (via GitHub)" <gi...@apache.org> on 2023/02/13 23:08:01 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40000: [SPARK-41818][SPARK-42000][CONNECT] Fix saveAsTable to find the default source - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/13 23:10:06 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40000: [SPARK-41818][SPARK-42000][CONNECT] Fix saveAsTable to find the default source - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/13 23:13:01 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40001: [SPARK-42427][SQL] ANSI MODE: Conv should return an error if the internal conversion overflows - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/13 23:41:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40000: [SPARK-41818][SPARK-42000][CONNECT] Fix saveAsTable to find the default source - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 00:05:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40000: [SPARK-41818][SPARK-42000][CONNECT] Fix saveAsTable to find the default source - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 00:06:35 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38486: [SPARK-41000][SQL] Make CommandResult extend Command trait - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/14 00:20:39 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38071: [SPARK-36290][SQL] Pull out complex join condition - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/14 00:20:40 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39942: [SPARK-42398][SQL] Refine default column value framework - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/14 00:39:40 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40002: [SPARK-41999][CONNECT][PYTHON] Fix bucketBy to porperly use the first column name - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/14 00:43:12 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #39942: [SPARK-42398][SQL] Refine default column value framework - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/14 01:34:37 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #39942: [SPARK-42398][SQL] Refine default column value framework - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/14 01:35:14 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39992: [WIP][SPARK-42418][DOCS][PYSPARK] PySpark documentation updates to improve discoverability and add more guidance - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 02:04:01 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39975: [SPARK-42405][SQL] Improve array insert documentation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/14 02:19:48 UTC, 1 replies.
- [GitHub] [spark] allanf-db commented on a diff in pull request #39992: [WIP][SPARK-42418][DOCS][PYSPARK] PySpark documentation updates to improve discoverability and add more guidance - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/02/14 02:55:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 03:13:02 UTC, 3 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40003: Standardize __repr__ of CommonInlineUserDefinedFunction - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/14 03:28:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40002: [SPARK-41999][CONNECT][PYTHON] Fix bucketBy/sortBy to properly use the first column name - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 03:55:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40002: [SPARK-41999][CONNECT][PYTHON] Fix bucketBy/sortBy to properly use the first column name - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 03:55:39 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39963: [SPARK-42324][SQL] Assign name to _LEGACY_ERROR_TEMP_1001 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/14 05:22:46 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40004: [SPARK-42429][Build][IntelliJ] Fix a sbt compile error for `getArgument` when using IntelliJ - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/14 05:36:30 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #39984: [SPARK-42263][CONNECT][PYTHON] Implement `spark.catalog.registerFunction` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/14 05:56:34 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39984: [SPARK-42263][CONNECT][PYTHON] Implement `spark.catalog.registerFunction` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/14 05:57:40 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40005: [SPARK-42430][SQL][DOC] Add documentation for TimestampNTZ type - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/14 06:25:00 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #40005: [SPARK-42430][SQL][DOC] Add documentation for TimestampNTZ type - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/14 06:25:45 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40006: [SPARK-41818][SPARK-42000][CONNECT][FOLLOWUP] Fix leaked test case - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/14 06:54:55 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40007: [SPARK-42324][CONNECT][TESTS][FOLLOW-UP] Fix failed test - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 07:02:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39963: [SPARK-42324][SQL] Assign name to _LEGACY_ERROR_TEMP_1001 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 07:03:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40007: [SPARK-41818][SPARK-42000][CONNECT][TESTS][FOLLOWUP] Fix failed test in SparkConnectProtoSuite - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 07:09:39 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40008: [SPARK-42431][CONNECT] Avoid calling `output` before analysis in `Union` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 07:28:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40007: [SPARK-41818][SPARK-42000][CONNECT][TESTS][FOLLOWUP] Fix failed test in SparkConnectProtoSuite - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 07:35:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40006: [SPARK-41818][SPARK-42000][CONNECT][FOLLOWUP] Fix leaked test case - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 07:38:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40006: [SPARK-41818][SPARK-42000][CONNECT][FOLLOWUP] Fix leaked test case - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 07:39:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40004: [SPARK-42429][BUILD] Fix a sbt compile error for `getArgument` when using IntelliJ - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 07:55:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40004: [SPARK-42429][BUILD] Fix a sbt compile error for `getArgument` when using IntelliJ - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 07:56:03 UTC, 0 replies.
- [GitHub] [spark] wayneguow opened a new pull request, #40009: [SPARK-40238][PYTHON] Support scaleUpFactor and initialNumPartition in pySpark RDD take API - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/14 08:02:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40010: [SPARK-42433][PYTHON] `array_insert` should accept literal parameters - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 08:32:47 UTC, 0 replies.
- [GitHub] [spark] rangadi opened a new pull request, #40011: [SPARK-42406][PROTOBUF] Fix recursive depth setting for Protobuf functions - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/14 08:33:40 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #40011: [SPARK-42406][PROTOBUF] Fix recursive depth setting for Protobuf functions - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/14 08:34:42 UTC, 1 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #40011: [SPARK-42406][PROTOBUF] Fix recursive depth setting for Protobuf functions - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/14 08:41:28 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40012: [SPARK-42434][PYTHON][CONNECT] `array_append` should accept `Any` value - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 08:45:22 UTC, 0 replies.
- [GitHub] [spark] wayneguow commented on pull request #40009: [SPARK-40238][PYTHON] Support scaleUpFactor and initialNumPartition in pySpark RDD take API - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/02/14 08:50:39 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40003: [SPARK-42428][CONNECT][PYTHON] Standardize __repr__ of CommonInlineUserDefinedFunction - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 09:05:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40003: [SPARK-42428][CONNECT][PYTHON] Standardize __repr__ of CommonInlineUserDefinedFunction - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 09:06:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39988: [SPARK-42416][SQL] Dataset operations should not resolve the analyzed logical plan again - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/14 09:06:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39942: [SPARK-42398][SQL] Refine default column value framework - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/14 09:28:12 UTC, 7 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39996: [SPARK-42423][SQL] Add metadata column file block start and length - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/14 09:54:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40013: [SPARK-42367][CONNECT][PYTHON] `DataFrame.drop` should handle duplicated columns properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 10:19:10 UTC, 0 replies.
- [GitHub] [spark] jkylling closed pull request #39024: [SPARK-41480][SQL][DOCS] Update documentation for input_file_name - posted by "jkylling (via GitHub)" <gi...@apache.org> on 2023/02/14 10:23:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40013: [WIP][SPARK-42367][CONNECT][PYTHON] `DataFrame.drop` should handle duplicated columns properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/14 11:46:42 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40012: [SPARK-42434][PYTHON][CONNECT] `array_append` should accept `Any` value - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 12:57:37 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #40014: [SPARK-42435][UI] Update DataTables to 1.13.2 - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/14 12:57:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40012: [SPARK-42434][PYTHON][CONNECT] `array_append` should accept `Any` value - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 12:57:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40008: [SPARK-42431][CONNECT] Avoid calling `LogicalPlan.output` before analysis in `Union` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 12:58:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40008: [SPARK-42431][CONNECT] Avoid calling `LogicalPlan.output` before analysis in `Union` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 12:58:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40010: [SPARK-42433][PYTHON][CONNECT] Add `array_insert` to Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 12:59:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40010: [SPARK-42433][PYTHON][CONNECT] Add `array_insert` to Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 12:59:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39992: [SPARK-42418][DOCS][PYTHON] PySpark documentation updates to improve discoverability and add more guidance - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 12:59:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39992: [SPARK-42418][DOCS][PYTHON] PySpark documentation updates to improve discoverability and add more guidance - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/14 13:00:17 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #40014: [SPARK-42435][UI] Update DataTables to 1.13.2 - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/14 13:00:22 UTC, 1 replies.
- [GitHub] [spark] khalidmammadov opened a new pull request, #40015: [WIP][SPARK-42437][PySpark][Connect] PySpark catalog.cacheTable will allow to specify storage level - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/02/14 13:59:52 UTC, 0 replies.
- [GitHub] [spark] ted-jenks closed pull request #39927: [SPARK-42373][SQL] Remove unused blank line removal from CSVExprUtils - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/14 14:39:43 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #40016: [SPARK-42436][SQL] Improve multiTransform to generate alternatives dynamically - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/14 14:55:40 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 commented on a diff in pull request #39996: [SPARK-42423][SQL] Add metadata column file block start and length - posted by "Yaohua628 (via GitHub)" <gi...@apache.org> on 2023/02/14 15:13:44 UTC, 1 replies.
- [GitHub] [spark] peter-toth commented on pull request #40014: [SPARK-42435][UI] Update DataTables to 1.13.2 - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/14 15:14:45 UTC, 2 replies.
- [GitHub] [spark] peter-toth commented on pull request #40016: [SPARK-42436][SQL] Improve multiTransform to generate alternatives dynamically - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/14 15:16:37 UTC, 1 replies.
- [GitHub] [spark] peter-toth commented on pull request #38035: [WIP][SPARK-42438][SQL] Improve constraint propagation using multiTransform - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/14 15:17:06 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #38035: [WIP][SPARK-42438][SQL] Improve constraint propagation using multiTransform - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/14 15:21:40 UTC, 0 replies.
- [GitHub] [spark] SandishKumarHN commented on pull request #39039: [SPARK-40776][SQL][PROTOBUF][DOCS] Spark-Protobuf docs - posted by "SandishKumarHN (via GitHub)" <gi...@apache.org> on 2023/02/14 16:25:20 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #40008: [SPARK-42431][CONNECT] Avoid calling `LogicalPlan.output` before analysis in `Union` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/14 16:25:39 UTC, 0 replies.
- [GitHub] [spark] LorenzoMartini opened a new pull request, #40017: Make createJobDescription in FileWrite.toBatch not lazy - posted by "LorenzoMartini (via GitHub)" <gi...@apache.org> on 2023/02/14 17:02:16 UTC, 0 replies.
- [GitHub] [spark] chanansh commented on pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "chanansh (via GitHub)" <gi...@apache.org> on 2023/02/14 17:02:37 UTC, 0 replies.
- [GitHub] [spark] LorenzoMartini closed pull request #40017: [SPARK-42439] [SQL] Make createJobDescription in FileWrite.toBatch not lazy - posted by "LorenzoMartini (via GitHub)" <gi...@apache.org> on 2023/02/14 17:03:13 UTC, 0 replies.
- [GitHub] [spark] LorenzoMartini opened a new pull request, #40018: [SPARK-42439] [SQL] In v2 writes, make createJobDescription in FileWrite.toBatch not lazy - posted by "LorenzoMartini (via GitHub)" <gi...@apache.org> on 2023/02/14 17:12:06 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40019: [SPARK-42440][CONNECT] Initial set of Dataframe APIs for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/14 17:33:57 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40019: [SPARK-42440][CONNECT] Initial set of Dataframe APIs for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/14 17:34:24 UTC, 2 replies.
- [GitHub] [spark] Daniel-Davies commented on pull request #39975: [SPARK-42405][SQL] Improve array insert documentation - posted by "Daniel-Davies (via GitHub)" <gi...@apache.org> on 2023/02/14 17:44:01 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39985: [SPARK-42412][WIP] Initial prototype implementation of PySpark ML via Spark connect - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/14 18:39:18 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40005: [SPARK-42430][SQL][DOC] Add documentation for TimestampNTZ type - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/14 19:23:23 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40005: [SPARK-42430][SQL][DOC] Add documentation for TimestampNTZ type - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/14 19:25:10 UTC, 0 replies.
- [GitHub] [spark] rithwik-db opened a new pull request, #40020: [][SPARK][FOLLOWUP] Updating docs for readability - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/02/14 19:30:43 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #38922: [SPARK-41396][SQL][PROTOBUF] OneOf field support and recursion checks - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/14 19:44:15 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39988: [SPARK-42416][SQL] Dataset operations should not resolve the analyzed logical plan again - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/14 20:01:45 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40021: [SPARK-42342][PYTHON][CONNECT][TEST] Fix FunctionsParityTests.test_raise_error to call the proper test - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/14 20:03:36 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #40011: [SPARK-42406][PROTOBUF] Fix recursive depth setting for Protobuf functions - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/14 20:15:22 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39985: [SPARK-42412][WIP] Initial prototype implementation of PySpark ML via Spark connect - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/14 21:01:26 UTC, 3 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40011: [SPARK-42406][PROTOBUF] Fix recursive depth setting for Protobuf functions - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/14 23:11:59 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40011: [SPARK-42406][PROTOBUF] Fix recursive depth setting for Protobuf functions - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/14 23:12:33 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40022: [SPARK-42442][SQL] Use spark.sql.timestampType for data source inference - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/14 23:29:45 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40022: [SPARK-42442][SQL] Use spark.sql.timestampType for data source inference - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/14 23:32:58 UTC, 0 replies.
- [GitHub] [spark] kecheung closed pull request #39779: [SPARK-42222][SQL][3.3] Make error clearer when table not found in SupportsCatalogOptions catalog - posted by "kecheung (via GitHub)" <gi...@apache.org> on 2023/02/14 23:59:20 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40023: [SPARK-42443][SQL] Remove unused object in DataFrameAggregateSuite - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 00:18:33 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40023: [SPARK-42443][SQL] Remove unused object in DataFrameAggregateSuite - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 00:18:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40020: [SPARK-41775][PYTHON][FOLLOW-UP] Updating docs for readability - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 00:21:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40020: [SPARK-41775][PYTHON][FOLLOW-UP] Updating docs for readability - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 00:22:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40021: [SPARK-42342][PYTHON][CONNECT][TEST] Fix FunctionsParityTests.test_raise_error to call the proper test - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 00:22:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40021: [SPARK-42342][PYTHON][CONNECT][TEST] Fix FunctionsParityTests.test_raise_error to call the proper test - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 00:23:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40023: [SPARK-42443][SQL] Remove unused object in DataFrameAggregateSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 00:24:02 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40023: [SPARK-42443][SQL] Remove unused object in DataFrameAggregateSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 00:24:34 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40008: [SPARK-42431][CONNECT] Avoid calling `LogicalPlan.output` before analysis in `Union` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/15 00:34:25 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40024: [SPARK-42426][CONNECT] Fix DataFrameWriter.insertInto to call the corresponding method instead of saveAsTable - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/15 00:44:14 UTC, 0 replies.
- [GitHub] [spark] rkkorlapati-db commented on a diff in pull request #39571: [SPARK-42064][SQL] Implement bloom filter join hint - posted by "rkkorlapati-db (via GitHub)" <gi...@apache.org> on 2023/02/15 00:47:12 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40025: [CONNECT] Adding SparkSession#read - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/15 01:28:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40008: [SPARK-42431][CONNECT] Avoid calling `LogicalPlan.output` before analysis in `Union` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/15 01:34:21 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40019: [SPARK-42440][CONNECT] Initial set of Dataframe APIs for Scala Client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 01:38:42 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40019: [SPARK-42440][CONNECT] Initial set of Dataframe APIs for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/15 01:39:37 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39996: [SPARK-42423][SQL] Add metadata column file block start and length - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/15 02:20:26 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40014: [SPARK-42435][UI] Update DataTables to 1.13.2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 02:29:02 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #40026: [SPARK-42401][SQL][FOLLOWUP] Always set `containsNull=true` for `array_insert` - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/02/15 02:34:03 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40025: [CONNECT] Adding SparkSession#read - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/15 02:50:14 UTC, 3 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40025: [CONNECT] Adding SparkSession#read - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/15 02:53:50 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39997: [SPARK-42424][YARN] Remove unused method `setEnvFromInputString` from YarnSparkHadoopUtil - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/15 03:06:57 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39997: [SPARK-42424][YARN] Remove unused method `setEnvFromInputString` from YarnSparkHadoopUtil - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/15 03:07:55 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39997: [SPARK-42424][YARN] Remove unused method `setEnvFromInputString` from YarnSparkHadoopUtil - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/15 03:15:03 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40027: [SPARK-42441][CONNECT] Scala Client add Column APIs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/15 03:44:56 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #39923: [SPARK-39851][SQL] Improve join stats estimation if one side can keep uniqueness - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/15 03:48:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39512: [SPARK-41986][SQL] Introduce shuffle on SinglePartition - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/15 03:50:38 UTC, 0 replies.
- [GitHub] [spark] haoyanzhang opened a new pull request, #40028: [SPARK-38324][SQL] The second range is not [0, 59] in the day time ANSI interval - posted by "haoyanzhang (via GitHub)" <gi...@apache.org> on 2023/02/15 03:51:43 UTC, 1 replies.
- [GitHub] [spark] haoyanzhang closed pull request #40028: [SPARK-38324][SQL] The second range is not [0, 59] in the day time ANSI interval - posted by "haoyanzhang (via GitHub)" <gi...@apache.org> on 2023/02/15 03:59:40 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #39512: [SPARK-41986][SQL] Introduce shuffle on SinglePartition - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/15 04:04:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40029: [SPARK-42431][CONNECT][FOLLOWUP] Use `Distinct` to delay analysis for `Union` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/15 04:04:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40024: [SPARK-42426][CONNECT] Fix DataFrameWriter.insertInto to call the corresponding method instead of saveAsTable - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/15 04:16:12 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40030: [SPARK-42002][CONNECT][FOLLOWUP] Remove unused imports - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/15 05:30:15 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40024: [SPARK-42426][CONNECT] Fix DataFrameWriter.insertInto to call the corresponding method instead of saveAsTable - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/15 05:32:50 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40031: [SPARK-42445][R] Fix SparkR install.spark function - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/15 05:52:01 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40001: [SPARK-42427][SQL] ANSI MODE: Conv should return an error if the internal conversion overflows - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/15 05:56:59 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40001: [SPARK-42427][SQL] ANSI MODE: Conv should return an error if the internal conversion overflows - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/15 05:57:25 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39997: [SPARK-42424][YARN] Remove unused declarations from Yarn module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/15 06:55:19 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40031: [SPARK-42445][R] Fix SparkR `install.spark` function - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/15 07:03:23 UTC, 4 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40031: [SPARK-42445][R] Fix SparkR `install.spark` function - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/15 07:07:13 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #40031: [SPARK-42445][R] Fix SparkR `install.spark` function - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/15 07:09:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40031: [SPARK-42445][R] Fix SparkR `install.spark` function - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/15 07:13:24 UTC, 0 replies.
- [GitHub] [spark] allanf-db opened a new pull request, #40032: [SPARK-42446][DOCS][PYTHON] Updating PySpark documentation to enhance usability - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/02/15 08:17:41 UTC, 0 replies.
- [GitHub] [spark] allanf-db commented on pull request #40032: [SPARK-42446][DOCS][PYTHON] Updating PySpark documentation to enhance usability - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/02/15 08:30:18 UTC, 0 replies.
- [GitHub] [spark] haoyanzhang opened a new pull request, #40033: [SPARK-38324][SQL] The second range is not [0, 59] in the day time ANSI interval - posted by "haoyanzhang (via GitHub)" <gi...@apache.org> on 2023/02/15 08:33:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40034: [SPARK-42447][INFRA] Remove Hadoop 2 GitHub Action job - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/15 08:39:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40034: [SPARK-42447][INFRA] Remove Hadoop 2 GitHub Action job - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/15 08:41:02 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39945: [SPARK-42384][SQL] Check for null input in generated code for mask function - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/15 08:49:22 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40014: [SPARK-42435][UI] Update DataTables to 1.13.2 - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/15 08:58:01 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40014: [SPARK-42435][UI] Update DataTables to 1.13.2 - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/15 08:58:42 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 opened a new pull request, #40035: [SPARK-41151][FOLLOW-UP][SQL] Improve the doc of `_metadata` generated columns nullability implementation - posted by "Yaohua628 (via GitHub)" <gi...@apache.org> on 2023/02/15 09:06:44 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 commented on pull request #40035: [SPARK-41151][FOLLOW-UP][SQL] Improve the doc of `_metadata` generated columns nullability implementation - posted by "Yaohua628 (via GitHub)" <gi...@apache.org> on 2023/02/15 09:07:00 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40005: [SPARK-42430][SQL][DOC] Add documentation for TimestampNTZ type - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/15 09:12:08 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40036: [SPARK-42448][SQL] Fix spark sql shell prompt for current db - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/15 10:29:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40029: [SPARK-42431][CONNECT][FOLLOWUP] Use `Distinct` to delay analysis for `Union` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:31:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40029: [SPARK-42431][CONNECT][FOLLOWUP] Use `Distinct` to delay analysis for `Union` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:32:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40034: [SPARK-42447][INFRA] Remove Hadoop 2 GitHub Action job - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:32:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40034: [SPARK-42447][INFRA] Remove Hadoop 2 GitHub Action job - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:32:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40036: [SPARK-42448][SQL] Fix spark sql shell prompt for current db - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:33:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40036: [SPARK-42448][SQL] Fix spark sql shell prompt for current db - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:33:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40035: [SPARK-41151][FOLLOW-UP][SQL] Improve the doc of `_metadata` generated columns nullability implementation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:34:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40032: [SPARK-42446][DOCS][PYTHON] Updating PySpark documentation to enhance usability - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:36:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40032: [SPARK-42446][DOCS][PYTHON] Updating PySpark documentation to enhance usability - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:36:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40030: [SPARK-42002][CONNECT][FOLLOWUP] Remove unused imports - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:37:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40030: [SPARK-42002][CONNECT][FOLLOWUP] Remove unused imports - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:37:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40030: [SPARK-42002][CONNECT][FOLLOWUP] Remove unused imports - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:37:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40026: [SPARK-42401][SQL][FOLLOWUP] Always set `containsNull=true` for `array_insert` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:38:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40026: [SPARK-42401][SQL][FOLLOWUP] Always set `containsNull=true` for `array_insert` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:38:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40022: [SPARK-42442][SQL] Use spark.sql.timestampType for data source inference - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:39:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40022: [SPARK-42442][SQL] Use spark.sql.timestampType for data source inference - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 11:39:50 UTC, 0 replies.
- [GitHub] [spark] olaky commented on pull request #40035: [SPARK-41151][FOLLOW-UP][SQL] Improve the doc of `_metadata` generated columns nullability implementation - posted by "olaky (via GitHub)" <gi...@apache.org> on 2023/02/15 12:05:53 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40030: [SPARK-42002][CONNECT][FOLLOWUP] Remove unused imports - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/15 12:21:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40029: [SPARK-42431][CONNECT][FOLLOWUP] Use `Distinct` to delay analysis for `Union` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/15 12:22:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40037: [MINOR][PYTHON][DOCS] Add `applyInPandasWithState` to API references - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/15 12:32:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40037: [MINOR][PYTHON][DOCS] Add `applyInPandasWithState` to API references - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/15 12:33:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40032: [SPARK-42446][DOCS][PYTHON] Updating PySpark documentation to enhance usability - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/15 12:36:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40016: [SPARK-42436][SQL] Improve multiTransform to generate alternatives dynamically - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/15 12:56:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40016: [SPARK-42436][SQL] Improve multiTransform to generate alternatives dynamically - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/15 12:57:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39975: [SPARK-42405][SQL] Improve array insert documentation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/15 13:00:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40038: [WIP][CONNECT] Test `RootAllocator` memory leak - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/15 14:03:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40038: [WIP][CONNECT] Test `RootAllocator` memory leak - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/15 14:06:43 UTC, 9 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40037: [MINOR][PYTHON][DOCS] Add `applyInPandasWithState` to API references - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 14:08:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40037: [MINOR][PYTHON][DOCS] Add `applyInPandasWithState` to API references - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/15 14:08:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40036: [SPARK-42448][SQL] Fix spark sql shell prompt for current db - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/15 14:12:44 UTC, 8 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40034: [SPARK-42447][INFRA] Remove Hadoop 2 GitHub Action job - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/15 14:23:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39062: [SPARK-41516] [SQL] Allow jdbc dialects to override the query used to create a table - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/15 14:24:50 UTC, 8 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40027: [SPARK-42441][CONNECT] Scala Client add Column APIs - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/15 14:26:50 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39997: [SPARK-42424][YARN] Remove unused declarations from Yarn module - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/15 14:27:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39997: [SPARK-42424][YARN] Remove unused declarations from Yarn module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/15 14:28:11 UTC, 3 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40027: [SPARK-42441][CONNECT] Scala Client add Column APIs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/15 14:52:52 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40019: [SPARK-42440][CONNECT] Initial set of Dataframe APIs for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/15 14:54:21 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40038: [WIP][CONNECT] Test `RootAllocator` memory leak - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/15 15:38:54 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40039: [SPARK-42451][SQL][TESTS] Simplifies the filter conditions of `testingVersions` in `HiveExternalCatalogVersionsSuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/15 16:09:30 UTC, 0 replies.
- [GitHub] [spark] NarekDW opened a new pull request, #40040: [WIP][SPARK-42399] [SQL] Support big numbers for conv function (get rid of overflow) - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/02/15 16:26:07 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on a diff in pull request #40040: [WIP][SPARK-42399] [SQL] Support big numbers for conv function (get rid of overflow) - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/02/15 16:32:43 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on pull request #40038: [WIP][CONNECT] Test `RootAllocator` memory leak - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/15 17:12:28 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40031: [SPARK-42445][R] Fix SparkR `install.spark` function - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/15 17:31:28 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40041: [SPARK-42453][CONNECT] Implement function max in Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 18:35:58 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40041: [SPARK-42453][CONNECT] Implement function max in Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 18:36:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40041: [SPARK-42453][CONNECT] Implement function max in Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/15 18:38:06 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40027: [SPARK-42441][CONNECT] Scala Client add Column APIs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/15 18:41:04 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40027: [SPARK-42441][CONNECT] Scala Client add Column APIs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/15 18:43:04 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40041: [SPARK-42453][CONNECT] Implement function max in Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 18:53:36 UTC, 0 replies.
- [GitHub] [spark] YannisSismanis commented on a diff in pull request #39759: [SPARK-36124][SQL] Support subqueries with correlation through INTERSECT/EXCEPT - posted by "YannisSismanis (via GitHub)" <gi...@apache.org> on 2023/02/15 19:32:24 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40025: [CONNECT] Adding SparkSession#read - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/15 19:37:34 UTC, 1 replies.
- [GitHub] [spark] christoph110 commented on pull request #28946: [SPARK-32123][PYSPARK] Setting `spark.sql.session.timeZone` only partially respected - posted by "christoph110 (via GitHub)" <gi...@apache.org> on 2023/02/15 19:53:58 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40024: [SPARK-42426][CONNECT] Fix DataFrameWriter.insertInto to call the corresponding method instead of saveAsTable - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 20:14:26 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40038: [WIP][CONNECT] Test `RootAllocator` memory leak - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 20:18:21 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on pull request #39759: [SPARK-36124][SQL] Support subqueries with correlation through INTERSECT/EXCEPT - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/02/15 20:23:51 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #39985: [SPARK-42412][WIP] Initial prototype implementation of PySpark ML via Spark connect - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 20:27:02 UTC, 6 replies.
- [GitHub] [spark] amaliujia commented on pull request #40015: [WIP][SPARK-42437][PySpark][Connect] PySpark catalog.cacheTable will allow to specify storage level - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 20:32:47 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 21:08:41 UTC, 2 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40042: [SPARK-42455][SQL] Rename JDBC option inferTimestampNTZType as preferTimestampNTZ - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/15 21:17:38 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40042: [SPARK-42455][SQL] Rename JDBC option inferTimestampNTZType as preferTimestampNTZ - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/15 21:17:47 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40025: [CONNECT] Adding SparkSession#read - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 21:27:00 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40043: [SPARK-39904][SQL][FollowUp] Rename CSV option `prefersDate` as `preferDate` - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/15 21:28:30 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40043: [SPARK-39904][SQL][FollowUp] Rename CSV option `prefersDate` as `preferDate` - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/15 21:28:41 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40025: [CONNECT] Adding SparkSession#read - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/15 21:40:08 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #40043: [SPARK-39904][SQL][FollowUp] Rename CSV option `prefersDate` as `preferDate` - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/15 22:05:00 UTC, 1 replies.
- [GitHub] [spark] allanf-db opened a new pull request, #40044: [SPARK-42456][DOCS][PYTHON] Consolidating the PySpark version upgrade note pages into a single page to make it easier to read - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/02/15 22:13:33 UTC, 0 replies.
- [GitHub] [spark] allanf-db commented on pull request #40044: [SPARK-42456][DOCS][PYTHON] Consolidating the PySpark version upgrade note pages into a single page to make it easier to read - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/02/15 22:20:49 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40025: [SPARK-42457][CONNECT] Adding SparkSession#read - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/15 22:32:38 UTC, 1 replies.
- [GitHub] [spark] rithwik-db opened a new pull request, #40045: [][PYTHON][FOLLOW-UP] Fix for gRPC version - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/02/16 00:11:30 UTC, 0 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #40045: [SPARK-41591][PYTHON][FOLLOW-UP] Remove gRPC version check for Distributor - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/02/16 00:13:26 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40025: [SPARK-42457][CONNECT] Adding SparkSession#read - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/16 00:22:37 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40045: [SPARK-41591][PYTHON][FOLLOW-UP] Remove gRPC version check for Distributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 00:22:42 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39945: [SPARK-42384][SQL] Check for null input in generated code for mask function - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/16 00:26:08 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39945: [SPARK-42384][SQL] Check for null input in generated code for mask function - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/16 00:26:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40043: [SPARK-39904][SQL][FollowUp] Rename CSV option `prefersDate` as `preferDate` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 00:27:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40042: [SPARK-42455][SQL] Rename JDBC option inferTimestampNTZType as preferTimestampNTZ - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 00:28:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40042: [SPARK-42455][SQL] Rename JDBC option inferTimestampNTZType as preferTimestampNTZ - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 00:28:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40024: [SPARK-42426][CONNECT] Fix DataFrameWriter.insertInto to call the corresponding method instead of saveAsTable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 00:29:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40024: [SPARK-42426][CONNECT] Fix DataFrameWriter.insertInto to call the corresponding method instead of saveAsTable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 00:29:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40027: [SPARK-42441][CONNECT] Scala Client add Column APIs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 00:32:32 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40046: [SPARK-41817][CONNECT][PYTHON][TEST] Enable the doctest pyspark.sql.connect.readwriter.DataFrameReader.option - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/16 01:02:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40047: [SPARK-42459][CONNECT] Create pyspark.sql.connect.utils to keep common codes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 01:04:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40044: [SPARK-42456][DOCS][PYTHON] Consolidating the PySpark version upgrade note pages into a single page to make it easier to read - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 01:34:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40044: [SPARK-42456][DOCS][PYTHON] Consolidating the PySpark version upgrade note pages into a single page to make it easier to read - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 01:35:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40041: [SPARK-42453][CONNECT] Implement function max in Scala client - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/16 01:43:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40041: [SPARK-42453][CONNECT] Implement function max in Scala client - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/16 01:43:26 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #39162: [WIP][SPARK-41615][CONNECT] Support Catalog.listDatabases - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/16 01:44:14 UTC, 0 replies.
- [GitHub] [spark] amaliujia closed pull request #39162: [WIP][SPARK-41615][CONNECT] Support Catalog.listDatabases - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/16 01:44:15 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40036: [SPARK-42448][SQL] Fix spark sql shell prompt for current db - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/16 01:45:05 UTC, 6 replies.
- [GitHub] [spark] Yaohua628 commented on a diff in pull request #40035: [SPARK-41151][FOLLOW-UP][SQL] Improve the doc of `_metadata` generated columns nullability implementation - posted by "Yaohua628 (via GitHub)" <gi...@apache.org> on 2023/02/16 01:59:41 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40038: [WIP][CONNECT] Test `RootAllocator` memory leak - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/16 02:10:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40046: [SPARK-41817][CONNECT][PYTHON][TEST] Enable the doctest pyspark.sql.connect.readwriter.DataFrameReader.option - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 02:31:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40046: [SPARK-41817][CONNECT][PYTHON][TEST] Enable the doctest pyspark.sql.connect.readwriter.DataFrameReader.option - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 02:32:13 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #40039: [SPARK-42451][SQL][TESTS] Simplifies the filter conditions of `testingVersions` in `HiveExternalCatalogVersionsSuite` - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/16 02:39:56 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40039: [SPARK-42451][SQL][TESTS] Simplifies the filter conditions of `testingVersions` in `HiveExternalCatalogVersionsSuite` - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/16 02:40:22 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39062: [SPARK-41516] [SQL] Allow jdbc dialects to override the query used to create a table - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/16 02:43:41 UTC, 2 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40048: [SPARK-42460][CONNECT] Clean-up results in ClientE2ETestSuite - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 02:52:21 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40048: [SPARK-42460][CONNECT] Clean-up results in ClientE2ETestSuite - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 02:53:37 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40048: [SPARK-42460][CONNECT] Clean-up results in ClientE2ETestSuite - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 02:53:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40039: [SPARK-42451][SQL][TESTS] Simplifies the filter conditions of `testingVersions` in `HiveExternalCatalogVersionsSuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/16 02:58:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40038: [WIP][CONNECT] Test `RootAllocator` memory leak - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/16 02:59:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #40038: [WIP][CONNECT] Test `RootAllocator` memory leak - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/16 02:59:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40047: [SPARK-42459][CONNECT] Create pyspark.sql.connect.utils to keep common codes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 03:04:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40047: [SPARK-42459][CONNECT] Create pyspark.sql.connect.utils to keep common codes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 03:04:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #40049: [SPARK-42398][SQL] Refine default column value DS v2 interface - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/16 03:10:21 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40050: [SPARK-42461][CONNECT] Scala Client implement first batch of functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 03:18:10 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40050: [SPARK-42461][CONNECT] Scala Client implement first batch of functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 03:18:36 UTC, 5 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40043: [SPARK-39904][SQL][FOLLOW-UP] Rename CSV option `prefersDate` as `preferDate` - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/16 03:23:43 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40048: [SPARK-42460][CONNECT] Clean-up results in ClientE2ETestSuite - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/16 03:31:19 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40051: [SPARK-42462][K8S] Prevent `docker-image-tool.sh` from publishing OCI manifests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/16 04:42:58 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #40051: [SPARK-42462][K8S] Prevent `docker-image-tool.sh` from publishing OCI manifests - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/16 04:56:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40051: [SPARK-42462][K8S] Prevent `docker-image-tool.sh` from publishing OCI manifests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/16 04:57:02 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #40025: [SPARK-42457][CONNECT] Adding SparkSession#read - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/16 05:04:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40051: [SPARK-42462][K8S] Prevent `docker-image-tool.sh` from publishing OCI manifests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/16 05:52:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40027: [SPARK-42441][CONNECT] Scala Client add Column APIs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/16 06:04:34 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #39962: [SPARK-42392][CORE] Add a new case of `TriggeredByExecutorDecommissionInfo` to remove unnecessary param - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/02/16 06:24:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38433: [SPARK-40943][SQL] Make `MSCK` keyword optional in `REPAIR TABLE` syntax - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/16 06:30:06 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38433: [SPARK-40943][SQL] Make `MSCK` keyword optional in `REPAIR TABLE` syntax - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/16 06:37:54 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40048: [SPARK-42460][CONNECT] Clean-up results in ClientE2ETestSuite - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/16 06:49:10 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40048: [SPARK-42460][CONNECT] Clean-up results in ClientE2ETestSuite - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/16 06:51:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40052: [SPARK-42463][YARN][TESTS] Clean up the third-party Java source code copy introduced by SPARK-27180 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/16 06:56:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38433: [SPARK-40943][SQL] Make `MSCK` keyword optional in `REPAIR TABLE` syntax - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/16 07:02:48 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40052: [SPARK-42463][YARN][TESTS] Clean up the third-party Java files copy introduced by SPARK-27180 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/16 07:35:52 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #40052: [SPARK-42463][YARN][TESTS] Clean up the third-party Java files copy introduced by SPARK-27180 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/16 07:43:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40045: [SPARK-41591][PYTHON][FOLLOW-UP] Remove gRPC version check for Distributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 07:53:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40045: [SPARK-41591][PYTHON][FOLLOW-UP] Remove gRPC version check for Distributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 07:54:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40045: [SPARK-41591][PYTHON][FOLLOW-UP] Remove gRPC version check for Distributor - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/16 08:24:34 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #39923: [SPARK-39851][SQL] Improve join stats estimation if one side can keep uniqueness - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/02/16 08:38:28 UTC, 0 replies.
- [GitHub] [spark] wankunde closed pull request #39457: [WIP][SPARK-41940][SQL] Infer IsNotNull constraints for complex join expressions - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/02/16 08:38:52 UTC, 0 replies.
- [GitHub] [spark] wankunde closed pull request #38176: [WIP][SPARK-40715][SQL] Support selecting shuffled hash join thought LocalMapThreshold is less than advisory partition size - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/02/16 08:39:24 UTC, 0 replies.
- [GitHub] [spark] ted-jenks commented on pull request #39907: [SPARK-42359][SQL] Support row skipping when reading CSV files - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/16 09:02:07 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39985: [SPARK-42412][WIP] Initial prototype implementation of PySpark ML via Spark connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/16 09:04:24 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37588: [SPARK-33393][SQL] Support SHOW TABLE EXTENDED in v2 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/16 09:49:54 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40033: [SPARK-38324][SQL] The second range is not [0, 59] in the day time ANSI interval - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/16 09:59:31 UTC, 3 replies.
- [GitHub] [spark] wangyum commented on pull request #39923: [SPARK-39851][SQL] Improve join stats estimation if one side can keep uniqueness - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/16 10:09:40 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39907: [SPARK-42359][SQL] Support row skipping when reading CSV files - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/16 10:45:58 UTC, 0 replies.
- [GitHub] [spark] LorenzoMartini commented on pull request #40018: [SPARK-42439][SQL] In v2 writes, make createJobDescription in FileWrite.toBatch not lazy - posted by "LorenzoMartini (via GitHub)" <gi...@apache.org> on 2023/02/16 10:46:57 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40053: Sql hive unused - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/16 12:48:18 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39991: [SPARK-42419][CONNECT][PYTHON] Migrate into error framework for Spark Connect Column API. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/16 12:48:38 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39796: [SPARK-39800][SQL][WIP] DataSourceV2: View Support - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/16 13:00:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39962: [SPARK-42392][CORE] Add a new case of `TriggeredByExecutorDecommissionInfo` to remove unnecessary param - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/16 13:23:03 UTC, 7 replies.
- [GitHub] [spark] ted-jenks commented on a diff in pull request #39907: [SPARK-42359][SQL] Support row skipping when reading CSV files - posted by "ted-jenks (via GitHub)" <gi...@apache.org> on 2023/02/16 14:16:11 UTC, 2 replies.
- [GitHub] [spark] nija-at opened a new pull request, #40054: pyspark: accept user_agent in spark connect's connection string - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/02/16 14:21:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40055: [SPARK-42464][CONNECT] Fix ProtoToPlanTestSuite for Scala 2.13 - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 14:28:26 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40055: [SPARK-42464][CONNECT] Fix ProtoToPlanTestSuite for Scala 2.13 - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 14:29:13 UTC, 1 replies.
- [GitHub] [spark] nija-at commented on a diff in pull request #40054: pyspark: accept user_agent in spark connect's connection string - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/02/16 14:31:37 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39997: [SPARK-42424][YARN] Remove unused declarations from Yarn module - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/16 15:25:55 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39997: [SPARK-42424][YARN] Remove unused declarations from Yarn module - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/16 15:25:57 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40052: [SPARK-42463][YARN][TESTS] Clean up the third-party Java files copy introduced by SPARK-27180 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/16 15:26:43 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40052: [SPARK-42463][YARN][TESTS] Clean up the third-party Java files copy introduced by SPARK-27180 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/16 15:26:48 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40054: pyspark: accept user_agent in spark connect's connection string - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 16:06:56 UTC, 1 replies.
- [GitHub] [spark] huangxiaopingRD commented on a diff in pull request #39062: [SPARK-41516] [SQL] Allow jdbc dialects to override the query used to create a table - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2023/02/16 16:42:00 UTC, 5 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40056: [SPARK-42465][CONNECT] ProtoToPlanTestSuite should use analyzed plans instead of parsed plans. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 17:42:03 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40025: [SPARK-42457][CONNECT] Adding SparkSession#read - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 18:23:06 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39866: [SPARK-42287][CONNECT][BUILD] Fix shading so that the JVM client jar can include all 3rd-party dependencies in the runtime. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 18:25:36 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #39866: [SPARK-42287][CONNECT][BUILD] Fix shading so that the JVM client jar can include all 3rd-party dependencies in the runtime. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 18:26:11 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40054: pyspark: accept user_agent in spark connect's connection string - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/16 18:29:20 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40034: [SPARK-42447][INFRA] Remove Hadoop 2 GitHub Action job - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/16 18:40:33 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40055: [SPARK-42464][CONNECT] Fix ProtoToPlanTestSuite for Scala 2.13 - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 18:47:14 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39979: [SPARK-42326][SQL] Integrate `_LEGACY_ERROR_TEMP_2099` into `UNSUPPORTED_DATATYPE` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/16 19:03:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39979: [SPARK-42326][SQL] Integrate `_LEGACY_ERROR_TEMP_2099` into `UNSUPPORTED_DATATYPE` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/16 19:04:36 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40054: pyspark: accept user_agent in spark connect's connection string - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/16 19:51:42 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40056: [SPARK-42465][CONNECT] ProtoToPlanTestSuite should use analyzed plans instead of parsed plans. - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/16 19:54:12 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40056: [SPARK-42465][CONNECT] ProtoToPlanTestSuite should use analyzed plans instead of parsed plans. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 20:09:28 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40056: [SPARK-42465][CONNECT] ProtoToPlanTestSuite should use analyzed plans instead of parsed plans. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 21:01:03 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40057: [SPARK-42468][CONNECT] Implement agg by (String, String)* - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/16 21:35:45 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40057: [SPARK-42468][CONNECT] Implement agg by (String, String)* - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/16 21:35:58 UTC, 0 replies.
- [GitHub] [spark] huaxingao opened a new pull request, #40058: [Spark-39859][SQL] Support v2 DESCRIBE TABLE EXTENDED for columns - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/16 21:43:13 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #39991: [SPARK-42419][CONNECT][PYTHON] Migrate into error framework for Spark Connect Column API. - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/16 21:56:25 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40054: pyspark: accept user_agent in spark connect's connection string - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/16 22:05:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40055: [SPARK-42464][CONNECT] Fix ProtoToPlanTestSuite for Scala 2.13 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/16 22:06:42 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40056: [SPARK-42465][CONNECT] ProtoToPlanTestSuite should use analyzed plans instead of parsed plans. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 22:33:20 UTC, 0 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #40059: [SPARK-42469][SQL] Update MSSQL Dialect to use parenthesis for TOP and add tests - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/16 22:38:36 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #40059: [SPARK-42469][SQL] Update MSSQL Dialect to use parentheses for TOP and add tests for Limit clause - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/16 22:42:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40056: [SPARK-42465][CONNECT] ProtoToPlanTestSuite should use analyzed plans instead of parsed plans. - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/16 22:43:28 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #40049: [SPARK-42398][SQL] Refine default column value DS v2 interface - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/16 22:45:14 UTC, 3 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40058: [SPARK-39859][SQL] Support v2 DESCRIBE TABLE EXTENDED for columns - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/16 22:45:29 UTC, 1 replies.
- [GitHub] [spark] tgravescs commented on a diff in pull request #39127: [SPARK-41585][YARN] The Spark exclude node functionality for YARN should work independently of dynamic allocation - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2023/02/16 23:01:00 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #40058: [SPARK-39859][SQL] Support v2 DESCRIBE TABLE EXTENDED for columns - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/16 23:15:21 UTC, 1 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40060: [SPARK-42002][CONNECT][PYTHON][FOLLOWUP] Enable tests in ReadwriterV2ParityTests - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/16 23:17:25 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40057: [SPARK-42468][CONNECT] Implement agg by (String, String)* - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/16 23:36:44 UTC, 9 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40061: [WIP][CONNECT] Scala Client Write API - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/16 23:54:41 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40061: [WIP][CONNECT] Scala Client Write API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/17 00:09:37 UTC, 8 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37362: [SPARK-39950][SQL] It's unnecessary to materialize BroadcastQueryStage firstly, because the BroadcastQueryStage does not timeout in AQE. - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/17 00:22:05 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40057: [SPARK-42468][CONNECT] Implement agg by (String, String)* - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/17 00:31:44 UTC, 7 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #40049: [SPARK-42398][SQL] Refine default column value DS v2 interface - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/17 00:45:41 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40035: [SPARK-41151][FOLLOW-UP][SQL] Improve the doc of `_metadata` generated columns nullability implementation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/17 00:54:02 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40043: [SPARK-39904][SQL][FOLLOW-UP] Rename CSV option `prefersDate` as `preferDate` - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/17 01:23:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40060: [SPARK-42002][CONNECT][PYTHON][FOLLOWUP] Enable tests in ReadwriterV2ParityTests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/17 01:26:32 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40060: [SPARK-42002][CONNECT][PYTHON][FOLLOWUP] Enable tests in ReadwriterV2ParityTests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/17 01:27:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40057: [SPARK-42468][CONNECT] Implement agg by (String, String)* - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/17 01:35:45 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40054: pyspark: accept user_agent in spark connect's connection string - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/17 01:46:01 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40057: [SPARK-42468][CONNECT] Implement agg by (String, String)* - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/17 03:03:01 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40059: [SPARK-42469][SQL] Update MSSQL Dialect to use parentheses for TOP and add tests for Limit clause - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 04:39:01 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40059: [SPARK-42469][SQL] Update MSSQL Dialect to use parentheses for TOP and add tests for Limit clause - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 04:39:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40058: [SPARK-39859][SQL] Support v2 DESCRIBE TABLE EXTENDED for columns - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 04:41:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40058: [SPARK-39859][SQL] Support v2 DESCRIBE TABLE EXTENDED for columns - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 04:42:53 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #40058: [SPARK-39859][SQL] Support v2 DESCRIBE TABLE EXTENDED for columns - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/17 04:43:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39950: [SPARK-42388][SQL] Avoid parquet footer reads twice when no filters in vectorized reader - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 04:45:36 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39923: [SPARK-39851][SQL] Improve join stats estimation if one side can keep uniqueness - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 04:48:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39923: [SPARK-39851][SQL] Improve join stats estimation if one side can keep uniqueness - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 04:50:58 UTC, 0 replies.
- [GitHub] [spark] Yikf commented on a diff in pull request #38980: [SPARK-41448] Make consistent MR job IDs in FileBatchWriter and FileFormatWriter - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/02/17 05:46:06 UTC, 0 replies.
- [GitHub] [spark] haoyanzhang commented on pull request #40033: [SPARK-38324][SQL] The second range is not [0, 59] in the day time ANSI interval - posted by "haoyanzhang (via GitHub)" <gi...@apache.org> on 2023/02/17 06:11:40 UTC, 1 replies.
- [GitHub] [spark] smallzhongfeng commented on a diff in pull request #39962: [SPARK-42392][CORE] Add a new case of `TriggeredByExecutorDecommissionInfo` to remove unnecessary param - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/02/17 06:25:05 UTC, 4 replies.
- [GitHub] [spark] MaxGekk closed pull request #40033: [SPARK-38324][SQL] The second range is not [0, 59] in the day time ANSI interval - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/17 06:29:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40062: [SPARK-42474][CORE][K8S] Add extraJVMOptions JVM GC option K8s test cases - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 08:22:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40062: [SPARK-42474][CORE][K8S] Add extraJVMOptions JVM GC option K8s test cases - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 08:25:08 UTC, 2 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #40063: [CONNECT] Eager Execution of DF.sql() - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/17 08:34:42 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #37588: [SPARK-33393][SQL] Support SHOW TABLE EXTENDED in v2 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/17 09:08:42 UTC, 9 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40062: [SPARK-42474][CORE][K8S] Add extraJVMOptions JVM GC option K8s test cases - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 09:11:26 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39991: [SPARK-42419][CONNECT][PYTHON] Migrate into error framework for Spark Connect Column API. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/17 09:14:57 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40062: [SPARK-42474][CORE][K8S] Add extraJVMOptions JVM GC option K8s test cases - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/17 11:12:43 UTC, 0 replies.
- [GitHub] [spark] Yikf opened a new pull request, #40064: [SPARK-42478] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/02/17 12:05:39 UTC, 0 replies.
- [GitHub] [spark] Yikf commented on pull request #40064: [SPARK-42478] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/02/17 12:07:02 UTC, 5 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40063: [CONNECT] Eager Execution of DF.sql() - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/17 12:09:09 UTC, 0 replies.
- [GitHub] [spark] nija-at commented on pull request #40054: [SPARK-42477] [CONNECT] [PYTHON]: accept user_agent in spark connect's connection string - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/02/17 12:09:26 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40050: [SPARK-42461][CONNECT] Scala Client implement first batch of functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/17 12:19:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40063: [CONNECT] Eager Execution of DF.sql() - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/17 12:32:52 UTC, 0 replies.
- [GitHub] [spark] shrprasa commented on pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/02/17 13:05:21 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/17 14:00:29 UTC, 0 replies.
- [GitHub] [spark] steveloughran commented on a diff in pull request #38980: [SPARK-41448] Make consistent MR job IDs in FileBatchWriter and FileFormatWriter - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2023/02/17 14:52:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40065: [SPARK-42382][BUILD] Upgrade `cyclonedx-maven-plugin` to 2.7.5 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/17 15:15:40 UTC, 0 replies.
- [GitHub] [spark] nija-at opened a new pull request, #40066: [CONNECT] [PYTHON] reduce spark connect service retries - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/02/17 15:26:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40065: [SPARK-42382][BUILD] Upgrade `cyclonedx-maven-plugin` to 2.7.5 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/17 15:45:58 UTC, 6 replies.
- [GitHub] [spark] itholic opened a new pull request, #40067: [WIP][SPARK-42476][CONNECT][DOCS] Spark Connect API reference - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/17 16:36:48 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40068: [SPARK-42380][BUILD] Upgrade maven to 3.9.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/17 17:20:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40068: [SPARK-42380][BUILD] Upgrade maven to 3.9.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/17 17:23:36 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40065: [SPARK-42382][BUILD] Upgrade `cyclonedx-maven-plugin` to 2.7.5 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 17:38:35 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40049: [SPARK-42398][SQL] Refine default column value DS v2 interface - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/17 17:42:16 UTC, 2 replies.
- [GitHub] [spark] wecharyu opened a new pull request, #40069: [SPARK-42480][SQL] Improve the performance of drop partitions - posted by "wecharyu (via GitHub)" <gi...@apache.org> on 2023/02/17 18:24:48 UTC, 0 replies.
- [GitHub] [spark] rehevkor5 commented on a diff in pull request #34225: [SPARK-36885][PYTHON] Inline type hints for pyspark.sql.dataframe - posted by "rehevkor5 (via GitHub)" <gi...@apache.org> on 2023/02/17 19:03:59 UTC, 0 replies.
- [GitHub] [spark] rehevkor5 commented on a diff in pull request #29591: [SPARK-32714][PYTHON] Initial pyspark-stubs port. - posted by "rehevkor5 (via GitHub)" <gi...@apache.org> on 2023/02/17 19:07:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40069: [SPARK-42480][SQL] Improve the performance of drop partitions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/17 19:17:36 UTC, 4 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40054: [SPARK-42477] [CONNECT] [PYTHON]: accept user_agent in spark connect's connection string - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/17 19:49:09 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40070: [SPARK-42481][CONNECT] Implement agg.{max,min,mean,count,avg,sum} - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/17 20:37:38 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40070: [SPARK-42481][CONNECT] Implement agg.{max,min,mean,count,avg,sum} - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/17 20:37:51 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40070: [SPARK-42481][CONNECT] Implement agg.{max,min,mean,count,avg,sum} - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/17 20:53:49 UTC, 11 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40071: [SPARK-41818][CONNECT][PYTHON][FOLLOWUP][TEST] Enable a doctest for DataFrame.write - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/17 20:54:53 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40070: [SPARK-42481][CONNECT] Implement agg.{max,min,mean,count,avg,sum} - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/17 20:56:29 UTC, 10 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40072: [SPARK-42483][TESTS] Regenerate benchmark results - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/17 22:53:35 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #40073: [SPARK-42484] UnsafeRowUtils better error message - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/02/18 00:02:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40072: [SPARK-42483][TESTS] Regenerate benchmark results - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/18 00:20:14 UTC, 12 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38505: [SPARK-40622][WIP]do not merge(try to fix build error) - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/18 00:20:41 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38434: [SPARK-40946][SQL] Add a new DataSource V2 interface SupportsPushDownClusterKeys - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/18 00:20:43 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38263: [SPARK-40692][SQL] Support data masking built-in function 'mask_hash' - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/18 00:20:44 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37362: [SPARK-39950][SQL] It's unnecessary to materialize BroadcastQueryStage firstly, because the BroadcastQueryStage does not timeout in AQE. - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/18 00:20:47 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #40074: [SPARK-42430][DOC][FOLLOW-UP] Revise the java doc for TimestampNTZ & ANSI interval types - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/18 00:41:21 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40070: [SPARK-42481][CONNECT] Implement agg.{max,min,mean,count,avg,sum} - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/18 00:48:58 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40070: [SPARK-42481][CONNECT] Implement agg.{max,min,mean,count,avg,sum} - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/18 00:49:50 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40075: [WIP] [CONNECT] Scala Client DataFrameWriterV2 - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/18 00:50:46 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40076: [SPARK-42048][PYTHON][CONNECT] Fix the alias name for numpy literals - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/18 00:52:24 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39967: [SPARK-42395][K8S]The code logic of the configmap max size validation lacks extra content - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/18 00:55:06 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40075: [WIP] [CONNECT] Scala Client DataFrameWriterV2 - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/18 00:57:31 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40061: [SPARK-42482][CONNECT] Scala Client Write API V1 - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/18 01:04:41 UTC, 0 replies.
- [GitHub] [spark] wecharyu commented on a diff in pull request #40069: [SPARK-42480][SQL] Improve the performance of drop partitions - posted by "wecharyu (via GitHub)" <gi...@apache.org> on 2023/02/18 03:56:01 UTC, 0 replies.
- [GitHub] [spark] ozhembr opened a new pull request, #40077: [SPIP][POC] Driver scaling: parallel schedulers - posted by "ozhembr (via GitHub)" <gi...@apache.org> on 2023/02/18 04:02:31 UTC, 0 replies.
- [GitHub] [spark] huaxingao closed pull request #40053: [SPARK-42470][SQL] Remove unused declarations from Hive module - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/18 04:51:37 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #40053: [SPARK-42470][SQL] Remove unused declarations from Hive module - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/18 04:52:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40072: [SPARK-42483][TESTS] Regenerate benchmark results - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/18 05:10:48 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40053: [SPARK-42470][SQL] Remove unused declarations from Hive module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/18 05:12:24 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40072: [SPARK-42483][TESTS] Regenerate benchmark results - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/18 06:19:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40072: [SPARK-42483][TESTS] Regenerate benchmark results - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/18 06:52:12 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39910: [SPARK-42337][SQL] Add error class INVALID_TEMP_OBJ_REFERENCE - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/18 07:14:12 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #39910: [SPARK-42337][SQL] Add error class INVALID_TEMP_OBJ_REFERENCE - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/18 07:17:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40074: [SPARK-42430][DOC][FOLLOW-UP] Revise the java doc for TimestampNTZ & ANSI interval types - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/18 07:28:10 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #40074: [SPARK-42430][DOC][FOLLOW-UP] Revise the java doc for TimestampNTZ & ANSI interval types - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/18 07:30:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #40065: [SPARK-42382][BUILD] Upgrade `cyclonedx-maven-plugin` to 2.7.5 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/18 08:25:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40053: [SPARK-42470][SQL] Remove unused declarations from Hive module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/18 08:26:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #40068: [SPARK-42380][BUILD] Upgrade maven to 3.9.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/18 08:28:55 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on pull request #40015: [SPARK-42437][PySpark][Connect] PySpark catalog.cacheTable will allow to specify storage level - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/02/18 10:38:43 UTC, 3 replies.
- [GitHub] [spark] yannfinnhsu opened a new pull request, #40078: Update scalastyle-config.xml - posted by "yannfinnhsu (via GitHub)" <gi...@apache.org> on 2023/02/18 15:11:24 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #40079: [SPARK-42486][BUILD] Upgrade `ZooKeeper` to 3.6.4 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/18 18:07:28 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40079: [SPARK-42486][BUILD] Upgrade `ZooKeeper` to 3.6.4 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/18 18:10:02 UTC, 3 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40063: [CONNECT] Eager Execution of DF.sql() - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/18 20:40:19 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38592: [SPARK-41088][SQL] Add PartialAggregate and FinalAggregate logic operators - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/19 00:20:59 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38505: [SPARK-40622][WIP]do not merge(try to fix build error) - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/19 00:21:01 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38434: [SPARK-40946][SQL] Add a new DataSource V2 interface SupportsPushDownClusterKeys - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/19 00:21:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38263: [SPARK-40692][SQL] Support data masking built-in function 'mask_hash' - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/19 00:21:03 UTC, 0 replies.
- [GitHub] [spark] rangadi opened a new pull request, #40080: [SPARK-42406] Fix check for missing required fields. - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/19 05:14:35 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #40080: [SPARK-42406] Fix check for missing required fields. - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/19 05:15:01 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #40080: [SPARK-42406] Fix check for missing required fields. - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/19 05:18:53 UTC, 0 replies.
- [GitHub] [spark] SandishKumarHN commented on pull request #40080: [SPARK-42406] Fix check for missing required fields. - posted by "SandishKumarHN (via GitHub)" <gi...@apache.org> on 2023/02/19 05:19:21 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39977: [SPARK-42323][SQL] Assign name to `_LEGACY_ERROR_TEMP_2332` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/19 10:25:04 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39977: [SPARK-42323][SQL] Assign name to `_LEGACY_ERROR_TEMP_2332` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/19 10:26:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40081: [SPARK-42487][BUILD] Upgrade Netty to 4.1.89 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/19 11:37:57 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40081: [SPARK-42487][BUILD] Upgrade Netty to 4.1.89 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/19 11:38:48 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40082: [SPARK-42488][BUILD] Upgrade commons-crypto from 1.1.0 to 1.2.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/19 11:43:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40082: [SPARK-42488][BUILD] Upgrade commons-crypto from 1.1.0 to 1.2.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/19 11:44:05 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40083: [SPARK-42489][BUILD] Upgrade scala-parser-combinators from 2.1.1 to 2.2.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/19 11:49:53 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40083: [SPARK-42489][BUILD] Upgrade scala-parser-combinators from 2.1.1 to 2.2.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/19 11:50:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40084: [SPARK-42490][BUILD] Upgrade protobuf-java from 3.21.12 to 3.22.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/19 11:52:58 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40078: Update scalastyle-config.xml - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/19 13:40:43 UTC, 0 replies.
- [GitHub] [spark] Kimahriman opened a new pull request, #40085: [SPARK-42492][SQL] Add new function filter_value - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/02/19 13:54:39 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40084: [SPARK-42490][BUILD] Upgrade protobuf-java from 3.21.12 to 3.22.0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/19 14:34:36 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #40061: [SPARK-42482][CONNECT] Scala Client Write API V1 - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/19 17:05:09 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40077: [SPIP][POC] Driver scaling: parallel schedulers - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/19 17:13:36 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40013: [WIP][SPARK-42367][CONNECT][PYTHON] `DataFrame.drop` should handle duplicated columns properly - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/19 17:14:47 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40013: [WIP][SPARK-42367][CONNECT][PYTHON] `DataFrame.drop` should handle duplicated columns properly - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/19 17:15:29 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on a diff in pull request #40067: [WIP][SPARK-42476][CONNECT][DOCS] Spark Connect API reference - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/02/19 17:55:56 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on a diff in pull request #40073: [SPARK-42484] [SQL] UnsafeRowUtils better error message - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/02/19 18:15:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40079: [SPARK-42486][BUILD] Upgrade `ZooKeeper` to 3.6.4 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/19 21:45:14 UTC, 3 replies.
- [GitHub] [spark] AndreyBozhko opened a new pull request, #40086: [MINOR][SQL] Fix typo and whitespaces in SQLConf - posted by "AndreyBozhko (via GitHub)" <gi...@apache.org> on 2023/02/19 23:53:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40071: [SPARK-41818][CONNECT][PYTHON][FOLLOWUP][TEST] Enable a doctest for DataFrame.write - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 00:06:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40071: [SPARK-41818][CONNECT][PYTHON][FOLLOWUP][TEST] Enable a doctest for DataFrame.write - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 00:07:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40076: [SPARK-42048][PYTHON][CONNECT] Fix the alias name for numpy literals - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 00:08:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40076: [SPARK-42048][PYTHON][CONNECT] Fix the alias name for numpy literals - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 00:08:22 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38592: [SPARK-41088][SQL] Add PartialAggregate and FinalAggregate logic operators - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/20 00:22:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40086: [MINOR][SQL] Fix typo and whitespaces in SQLConf - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 00:27:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40086: [MINOR][SQL] Fix typo and whitespaces in SQLConf - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 00:28:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40066: [CONNECT] [PYTHON] reduce spark connect service retries - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 00:29:45 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40066: [CONNECT][PYTHON] Reduce spark connect service retries - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 00:32:33 UTC, 0 replies.
- [GitHub] [spark] allanf-db opened a new pull request, #40087: [WIP][SPARK-42493][DOCS][PYTHON] Make Python the first tab for code examples - Spark SQL, DataFrames and Datasets Guide - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/02/20 00:44:45 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40081: [SPARK-42487][BUILD] Upgrade Netty to 4.1.89 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 00:54:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40087: [SPARK-42493][DOCS][PYTHON] Make Python the first tab for code examples - Spark SQL, DataFrames and Datasets Guide - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 00:59:21 UTC, 1 replies.
- [GitHub] [spark] allanf-db commented on pull request #40087: [SPARK-42493][DOCS][PYTHON] Make Python the first tab for code examples - Spark SQL, DataFrames and Datasets Guide - posted by "allanf-db (via GitHub)" <gi...@apache.org> on 2023/02/20 01:05:43 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40087: [SPARK-42493][DOCS][PYTHON] Make Python the first tab for code examples - Spark SQL, DataFrames and Datasets Guide - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 01:06:29 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40088: [SPARK-42427][SQL][TESTS] Remove duplicate overflow test for conv - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 01:43:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40088: [SPARK-42427][SQL][TESTS] Remove duplicate overflow test for conv - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 01:43:17 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39475: [SPARK-41959][SQL] Improve v1 writes with empty2null - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/20 02:20:12 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun opened a new pull request, #30: [SPARK-42494] Add official image Dockerfile for Spark v3.3.2 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/02/20 03:29:10 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun closed pull request #29: Test on 3.3.2-rc1 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/02/20 03:29:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40084: [SPARK-42490][BUILD] Upgrade protobuf-java from 3.21.12 to 3.22.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/20 03:46:25 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40089: [SPARK-42495][CONNECT] Scala Client add Misc, String, and Date/Time functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/20 03:50:36 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40067: [SPARK-42476][CONNECT][DOCS] Complete Spark Connect API reference - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/20 03:52:20 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40088: [SPARK-42427][SQL][TESTS][FOLLOW-UP] Remove duplicate overflow test for conv - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 04:24:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40088: [SPARK-42427][SQL][TESTS][FOLLOW-UP] Remove duplicate overflow test for conv - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 04:25:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40067: [SPARK-42476][CONNECT][DOCS] Complete Spark Connect API reference - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 04:26:38 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40076: [SPARK-42048][PYTHON][CONNECT] Fix the alias name for numpy literals - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/20 04:55:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40082: [SPARK-42488][BUILD] Upgrade commons-crypto to 1.2.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 05:00:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40082: [SPARK-42488][BUILD] Upgrade commons-crypto to 1.2.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/20 05:03:37 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40082: [SPARK-42488][BUILD] Upgrade commons-crypto to 1.2.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 05:06:03 UTC, 6 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40090: [SPARK-41741][SQL] Encode the string using the UTF_8 charset in ParquetFilters - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/20 05:25:52 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #39855: [SPARK-42286][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/20 05:28:53 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40036: [SPARK-42448][SQL] Fix spark sql shell prompt for current db - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/20 05:35:37 UTC, 1 replies.
- [GitHub] [spark] rangadi commented on pull request #40080: [SPARK-42406]Fix check for missing required fields. - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/20 05:36:09 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40080: [SPARK-42406]Fix check for missing required fields. - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 05:38:20 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40089: [SPARK-42495][CONNECT] Scala Client add Misc, String, and Date/Time functions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 05:46:56 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #40091: [SPARK-41952] Fix Parquet zstd off-heap memory leak as a workaround for PARQUET-2160 - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/02/20 05:48:08 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #40091: [SPARK-41952][SQL] Fix Parquet zstd off-heap memory leak as a workaround for PARQUET-2160 - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/02/20 05:57:59 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40089: [SPARK-42495][CONNECT] Scala Client add Misc, String, and Date/Time functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 06:02:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40090: [SPARK-41741][SQL] Encode the string using the UTF_8 charset in ParquetFilters - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 06:05:38 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40092: [SPARK-42475][CONNECT][DOCS] Getting Started: Live Notebook for Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/20 06:08:32 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on pull request #30: [SPARK-42494] Add official image Dockerfile for Spark v3.3.2 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/02/20 06:09:25 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40091: [SPARK-41952][SQL] Fix Parquet zstd off-heap memory leak as a workaround for PARQUET-2160 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/20 06:13:15 UTC, 1 replies.
- [GitHub] [spark] nija-at commented on a diff in pull request #40066: [CONNECT][PYTHON] Reduce spark connect service retries - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/02/20 07:35:03 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40091: [SPARK-41952][SQL] Fix Parquet zstd off-heap memory leak as a workaround for PARQUET-2160 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/20 07:38:18 UTC, 1 replies.
- [GitHub] [spark] nija-at commented on a diff in pull request #40066: [SPARK-42498] [CONNECT][PYTHON] Reduce spark connect service retries - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/02/20 07:40:54 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #40091: [SPARK-41952][SQL] Fix Parquet zstd off-heap memory leak as a workaround for PARQUET-2160 - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/02/20 08:06:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40049: [SPARK-42398][SQL] Refine default column value DS v2 interface - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/20 08:30:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40049: [SPARK-42398][SQL] Refine default column value DS v2 interface - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/20 08:31:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39475: [SPARK-41959][SQL] Improve v1 writes with empty2null - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/20 08:41:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39475: [SPARK-41959][SQL] Improve v1 writes with empty2null - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/20 08:41:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39996: [SPARK-42423][SQL] Add metadata column file block start and length - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/20 09:08:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40035: [SPARK-41151][FOLLOW-UP][SQL] Improve the doc of `_metadata` generated columns nullability implementation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/20 09:13:34 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40058: [SPARK-39859][SQL] Support v2 DESCRIBE TABLE EXTENDED for columns - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/20 09:16:30 UTC, 3 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40093: [SPARK-42500][SQL] ConstantPropagation supports more cases - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/20 11:10:54 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #40090: [SPARK-41741][SQL] Encode the string using the UTF_8 charset in ParquetFilters - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/20 11:15:47 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40089: [SPARK-42495][CONNECT] Scala Client add Misc, String, and Date/Time functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/20 11:19:43 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #40090: [SPARK-41741][SQL] Encode the string using the UTF_8 charset in ParquetFilters - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/20 11:33:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40089: [SPARK-42495][CONNECT] Scala Client add Misc, String, and Date/Time functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 11:50:57 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40054: [SPARK-42477][CONNECT][PYTHON] accept user_agent in spark connect's connection string - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 14:02:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40054: [SPARK-42477][CONNECT][PYTHON] accept user_agent in spark connect's connection string - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 14:02:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40066: [SPARK-42498] [CONNECT][PYTHON] Reduce spark connect service retries - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 14:04:30 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #40083: [SPARK-42489][BUILD] Upgrade scala-parser-combinators from 2.1.1 to 2.2.0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/20 14:16:12 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40083: [SPARK-42489][BUILD] Upgrade scala-parser-combinators from 2.1.1 to 2.2.0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/20 14:16:13 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #40084: [SPARK-42490][BUILD] Upgrade protobuf-java from 3.21.12 to 3.22.0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/20 14:17:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40092: [SPARK-42475][CONNECT][DOCS] Getting Started: Live Notebook for Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 14:17:56 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40092: [SPARK-42475][CONNECT][DOCS] Getting Started: Live Notebook for Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 14:20:36 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40067: [SPARK-42476][CONNECT][DOCS] Complete Spark Connect API reference - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 14:23:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40067: [SPARK-42476][CONNECT][DOCS] Complete Spark Connect API reference - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/20 14:24:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39996: [SPARK-42423][SQL] Add metadata column file block start and length - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/20 14:33:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39996: [SPARK-42423][SQL] Add metadata column file block start and length - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/20 14:34:20 UTC, 0 replies.
- [GitHub] [spark] olaky commented on a diff in pull request #40035: [SPARK-41151][FOLLOW-UP][SQL] Improve the doc of `_metadata` generated columns nullability implementation - posted by "olaky (via GitHub)" <gi...@apache.org> on 2023/02/20 14:50:34 UTC, 3 replies.
- [GitHub] [spark] nija-at commented on pull request #40066: [SPARK-42498] [CONNECT][PYTHON] Reduce spark connect service retries - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/02/20 15:23:59 UTC, 0 replies.
- [GitHub] [spark] nija-at closed pull request #40066: [SPARK-42498] [CONNECT][PYTHON] Reduce spark connect service retries - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/02/20 15:24:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40075: [WIP] [CONNECT] Scala Client DataFrameWriterV2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/20 15:44:54 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40075: [WIP] [CONNECT] Scala Client DataFrameWriterV2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/20 15:48:47 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40058: [SPARK-39859][SQL] Support v2 DESCRIBE TABLE EXTENDED for columns - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 17:09:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40091: [SPARK-41952][SQL] Fix Parquet zstd off-heap memory leak as a workaround for PARQUET-2160 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 17:31:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40089: [SPARK-42495][CONNECT] Scala Client add Misc, String, and Date/Time functions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 17:34:48 UTC, 1 replies.
- [GitHub] [spark] sunchao closed pull request #40091: [SPARK-41952][SQL] Fix Parquet zstd off-heap memory leak as a workaround for PARQUET-2160 - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/20 17:40:54 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #40091: [SPARK-41952][SQL] Fix Parquet zstd off-heap memory leak as a workaround for PARQUET-2160 - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/20 17:42:52 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40089: [SPARK-42495][CONNECT] Scala Client add Misc, String, and Date/Time functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/20 17:51:31 UTC, 1 replies.
- [GitHub] [spark-docker] viirya commented on a diff in pull request #30: [SPARK-42494] Add official image Dockerfile for Spark v3.3.2 - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/20 18:20:51 UTC, 1 replies.
- [GitHub] [spark] LucaCanali commented on pull request #39127: [SPARK-41585][YARN] The Spark exclude node functionality for YARN should work independently of dynamic allocation - posted by "LucaCanali (via GitHub)" <gi...@apache.org> on 2023/02/20 19:00:38 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #40094: [SPARK-41812][SPARK-41823][CONNECT][SQL][SCALA] Add PlanId to Scala Client - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/20 20:17:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40049: [SPARK-42398][SQL] Refine default column value DS v2 interface - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 20:50:30 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40049: [SPARK-42398][SQL] Refine default column value DS v2 interface - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 20:55:23 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40095: WIP - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 22:31:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40096: [SPARK-XXX][SQL][TESTS] Reduce the degree of concurrency during ORC schema merge conflict tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/20 23:27:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40079: [SPARK-42486][BUILD] Upgrade `ZooKeeper` to 3.6.4 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/21 00:25:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40095: [SPARK-XXX][SQL][TESTS][3.4] Reduce the degree of concurrency during ORC schema merge conflict tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/21 00:39:21 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40095: [SPARK-XXX][SQL][TESTS][3.4] Reduce the degree of concurrency during ORC schema merge conflict tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/21 00:39:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40096: [SPARK-XXX][SQL][TESTS] Reduce the degree of concurrency during ORC schema merge conflict tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/21 00:39:32 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40097: [WIP][CONNECT][ML] Extract the common classes to mllib-common - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/21 01:03:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40094: [SPARK-41812][SPARK-41823][CONNECT][SQL][SCALA] Add PlanId to Scala Client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/21 01:28:05 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on a diff in pull request #30: [SPARK-42494] Add official image Dockerfile for Spark v3.3.2 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/02/21 01:33:07 UTC, 4 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40093: [SPARK-42500][SQL] ConstantPropagation supports more cases - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/21 01:55:58 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40097: [WIP][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/21 02:06:44 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40075: [WIP] [CONNECT] Scala Client DataFrameWriterV2 - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/21 02:22:34 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40093: [SPARK-42500][SQL] ConstantPropagation supports more cases - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/21 02:42:46 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40092: [SPARK-42475][CONNECT][DOCS] Getting Started: Live Notebook for Spark Connect - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/21 02:44:02 UTC, 0 replies.
- [GitHub] [spark] yabola commented on pull request #39950: [SPARK-42388][SQL] Avoid parquet footer reads twice when no filters in vectorized reader - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/02/21 03:02:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40097: [WIP][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/21 03:23:41 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40098: [SPARK-42504][SQL] NestedColumnAliasing support pruning adjacent projects - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/21 03:42:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40064: [SPARK-42478] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/21 04:17:52 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40079: [SPARK-42486][BUILD] Upgrade `ZooKeeper` to 3.6.4 - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/21 04:22:26 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40095: [SPARK-XXX][SQL][TESTS][3.4] Reduce the degree of concurrency during ORC schema merge conflict tests - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/21 04:35:04 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40076: [SPARK-42048][PYTHON][CONNECT] Fix the alias name for numpy literals - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/21 05:32:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40064: [SPARK-42478] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/21 06:05:38 UTC, 1 replies.
- [GitHub] [spark] Yikf commented on a diff in pull request #40064: [SPARK-42478] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/02/21 06:20:54 UTC, 2 replies.
- [GitHub] [spark-docker] Yikun closed pull request #30: [SPARK-42494] Add official image Dockerfile for Spark v3.3.2 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/02/21 06:22:31 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun opened a new pull request, #31: [SPARK-42505] Apply entrypoint template change to 3.3.0/3.3.1 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/02/21 06:30:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40099: [WIP][CONNECT] Scala client collection functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/21 06:37:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40099: [WIP][CONNECT] Scala client collection functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/21 06:39:01 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40100: [SPARK-42506][SQL] Fix Sort's maxRowsPerPartition if maxRows does not exist - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/21 06:58:05 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40100: [SPARK-42506][SQL] Fix Sort's maxRowsPerPartition if maxRows does not exist - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/21 06:58:21 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #40092: [SPARK-42475][CONNECT][DOCS] Getting Started: Live Notebook for Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/21 07:03:42 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/21 07:09:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40101: [SPARK-42507][SQL][TESTS] Simplify ORC schema merging conflict error check - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/21 07:11:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39954: [SPARK-42289][SQL] DS V2 pushdown could let JDBC dialect decide to push down offset and limit - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/21 07:12:02 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40101: [SPARK-42507][SQL][TESTS] Simplify ORC schema merging conflict error check - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/21 07:14:09 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #40099: [WIP][CONNECT] Scala client add collection functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/21 07:15:48 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40102: [MINOR][TESTS] Avoid NPE in an anonym SparkListener in DataFrameReaderWriterSuite - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/21 07:17:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40100: [SPARK-42506][SQL] Fix Sort's maxRowsPerPartition if maxRows does not exist - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/21 07:28:30 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on pull request #31: [SPARK-42505] Apply entrypoint template change to 3.3.0/3.3.1 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/02/21 08:17:44 UTC, 1 replies.
- [GitHub] [spark] wangyum closed pull request #40100: [SPARK-42506][SQL] Fix Sort's maxRowsPerPartition if maxRows does not exist - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/21 08:19:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40101: [SPARK-42507][SQL][TESTS] Simplify ORC schema merging conflict error check - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/21 08:26:38 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39428: [SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/21 08:59:32 UTC, 4 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #40103: [WIP][SQL] Always capture the session time zone config while creating views - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/21 09:00:36 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun closed pull request #31: [SPARK-42505] Apply entrypoint template change to 3.3.0/3.3.1 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/02/21 09:02:57 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39428: [SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/21 09:36:10 UTC, 4 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40101: [SPARK-42507][SQL][TESTS] Simplify ORC schema merging conflict error check - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/21 09:46:39 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng closed pull request #40101: [SPARK-42507][SQL][TESTS] Simplify ORC schema merging conflict error check - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/21 09:48:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40067: [SPARK-42476][CONNECT][DOCS] Complete Spark Connect API reference - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/21 11:31:37 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40104: [WIP][SPARK-42510][CONNECT][PYTHON] Implement `DataFrame.mapInPandas` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/21 12:04:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40102: [MINOR][TESTS] Avoid NPE in an anonym SparkListener in DataFrameReaderWriterSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/21 12:08:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40102: [MINOR][TESTS] Avoid NPE in an anonym SparkListener in DataFrameReaderWriterSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/21 12:08:49 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39428: [SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/21 13:47:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39428: [SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/21 13:48:29 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40089: [SPARK-42495][CONNECT] Scala Client add Misc, String, and Date/Time functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/21 14:00:37 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #39127: [SPARK-41585][YARN] The Spark exclude node functionality for YARN should work independently of dynamic allocation - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2023/02/21 16:18:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40105: [SPARK-42514][CONNECT] Scala Client add partition transforms functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/21 16:20:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40105: [SPARK-42514][CONNECT] Scala Client add partition transforms functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/21 16:28:18 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40106: [SPARK-42002][CONNECT][FOLLOW-UP] Add Required/Optional notions to writer v2 proto - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/21 17:33:32 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40106: [SPARK-42002][CONNECT][FOLLOW-UP] Add Required/Optional notions to writer v2 proto - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/21 17:34:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40105: [SPARK-42514][CONNECT] Scala Client add partition transforms functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/21 17:50:49 UTC, 2 replies.
- [GitHub] [spark] mateiz commented on pull request #40087: [SPARK-42493][DOCS][PYTHON] Make Python the first tab for code examples - Spark SQL, DataFrames and Datasets Guide - posted by "mateiz (via GitHub)" <gi...@apache.org> on 2023/02/21 18:04:25 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40087: [SPARK-42493][DOCS][PYTHON] Make Python the first tab for code examples - Spark SQL, DataFrames and Datasets Guide - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/21 18:22:26 UTC, 0 replies.
- [GitHub] [spark] ozhembr commented on pull request #40077: [SPIP][POC] Driver scaling: parallel schedulers - posted by "ozhembr (via GitHub)" <gi...@apache.org> on 2023/02/21 18:49:17 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #40080: [SPARK-42406]Fix check for missing required fields. - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/21 18:54:24 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40080: [SPARK-42406]Fix check for missing required fields. - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/21 18:55:09 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #40080: [SPARK-42406]Fix check for missing required fields. - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/21 19:20:51 UTC, 1 replies.
- [GitHub] [spark] WweiL commented on pull request #40073: [SPARK-42484] [SQL] UnsafeRowUtils better error message - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/02/21 19:39:23 UTC, 3 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40107: [SPARK-42520][CONNECT] Support basic Window API in Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/21 20:08:42 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40107: [SPARK-42520][CONNECT] Support basic Window API in Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/21 20:08:55 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40107: [SPARK-42520][CONNECT] Support basic Window API in Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/21 20:12:00 UTC, 4 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40107: [SPARK-42520][CONNECT] Support basic Window API in Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/21 20:42:34 UTC, 2 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #40108: [SPARK-42521][SQL] Add NULLs for INSERTs with user-specified lists of fewer columns than the target table - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/21 20:58:46 UTC, 1 replies.
- [GitHub] [spark] dtenedor closed pull request #40108: [SPARK-42521][SQL] Add NULLs for INSERTs with user-specified lists of fewer columns than the target table - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/21 21:17:37 UTC, 1 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40109: [SPARK-42522][CONNECT] Fix DataFrameWriterV2 to find the default source - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/21 21:48:09 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #36506: [SPARK-25050][SQL] Avro: writing complex unions - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/02/21 22:07:07 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #36506: [SPARK-25050][SQL] Avro: writing complex unions - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/21 23:13:37 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40106: [SPARK-42002][CONNECT][FOLLOW-UP] Add Required/Optional notions to writer v2 proto - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/21 23:39:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40106: [SPARK-42002][CONNECT][FOLLOW-UP] Add Required/Optional notions to writer v2 proto - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/21 23:40:01 UTC, 0 replies.
- [GitHub] [spark] rithwik-db opened a new pull request, #40110: [SPARK-41775][PYTHON][FOLLOW-UP] Updating docs for readability - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/02/22 00:22:00 UTC, 0 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #40110: [SPARK-41775][PYTHON][FOLLOW-UP] Updating docs for readability - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/02/22 00:24:13 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on a diff in pull request #39796: [SPARK-39800][SQL][WIP] DataSourceV2: View Support - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/02/22 00:40:40 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40105: [SPARK-42514][CONNECT] Scala Client add partition transforms functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 00:47:55 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40073: [SPARK-42484] [SQL] UnsafeRowUtils better error message - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/22 01:22:14 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #39954: [SPARK-42289][SQL] DS V2 pushdown could let JDBC dialect decide to push down offset and limit - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/22 01:22:54 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40080: [SPARK-42406][SQL] Fix check for missing required fields of to_protobuf - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/22 01:33:43 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #40080: [SPARK-42406][SQL] Fix check for missing required fields of to_protobuf - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/22 01:34:26 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #40111: [WIP] Upgrade numpy and pandas in the release Dockerfile - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/22 01:44:49 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #40075: [SPARK-42518] [CONNECT] Scala Client DataFrameWriterV2 - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/22 01:46:02 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40075: [SPARK-42518] [CONNECT] Scala Client DataFrameWriterV2 - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/22 01:50:15 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on pull request #39991: [SPARK-42419][CONNECT][PYTHON] Migrate into error framework for Spark Connect Column API. - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/22 01:59:36 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39954: [SPARK-42289][SQL] DS V2 pushdown could let JDBC dialect decide to push down offset and limit - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/22 02:01:26 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on pull request #39062: [SPARK-41516] [SQL] Allow jdbc dialects to override the query used to create a table - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/22 02:04:38 UTC, 2 replies.
- [GitHub] [spark] itholic opened a new pull request, #40112: [SPARK-41933][FOLLOWUP][CONNECT] Correct an error message - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/22 02:10:51 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40113: [WIP][SPARK-42509][SQL] WindowGroupLimitExec supports codegen - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/22 02:11:29 UTC, 0 replies.
- [GitHub] [spark] huangxiaopingRD commented on pull request #39062: [SPARK-41516] [SQL] Allow jdbc dialects to override the query used to create a table - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2023/02/22 02:18:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40112: [SPARK-41933][FOLLOWUP][CONNECT] Correct an error message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 02:29:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40110: [SPARK-41775][PYTHON][FOLLOW-UP] Updating docs for readability - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 02:30:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40110: [SPARK-41775][PYTHON][FOLLOW-UP] Updating docs for readability - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 02:30:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40111: [SPARK-42524][BUILD] Upgrade numpy and pandas in the release Dockerfile - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 02:38:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40111: [SPARK-42524][BUILD] Upgrade numpy and pandas in the release Dockerfile - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 02:38:32 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40114: [SPARK-42513][SQL] Push down topK through join - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/22 03:10:59 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #40109: [SPARK-42522][CONNECT] Fix DataFrameWriterV2 to find the default source - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/22 03:13:41 UTC, 0 replies.
- [GitHub] [spark] zml1206 opened a new pull request, #40115: collapse two adjacent windows with the same partition/order in subquery - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/02/22 03:18:58 UTC, 0 replies.
- [GitHub] [spark] beliefer closed pull request #39930: [Do not merged][SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/22 03:25:23 UTC, 0 replies.
- [GitHub] [spark] ritikam2 opened a new pull request, #40116: [WIP]SPARK-41391 Fix - posted by "ritikam2 (via GitHub)" <gi...@apache.org> on 2023/02/22 05:09:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39759: [SPARK-36124][SQL] Support subqueries with correlation through INTERSECT/EXCEPT - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/22 05:23:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39759: [SPARK-36124][SQL] Support subqueries with correlation through INTERSECT/EXCEPT - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/22 05:23:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39062: [SPARK-41516] [SQL] Allow jdbc dialects to override the query used to create a table - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/22 05:26:06 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #40114: [SPARK-42513][SQL] Push down topK through join - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/22 05:35:09 UTC, 0 replies.
- [GitHub] [spark] wecharyu commented on pull request #40069: [SPARK-42480][SQL] Improve the performance of drop partitions - posted by "wecharyu (via GitHub)" <gi...@apache.org> on 2023/02/22 06:08:39 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40117: [SPARK-42427][SQL][TESTS][FOLLOW-UP] Disable ANSI for several conv test cases in MathFunctionsSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 06:33:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40117: [SPARK-42427][SQL][TESTS][FOLLOW-UP] Disable ANSI for several conv test cases in MathFunctionsSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 06:34:09 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40112: [SPARK-41933][FOLLOWUP][CONNECT] Correct an error message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 06:35:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40112: [SPARK-41933][FOLLOWUP][CONNECT] Correct an error message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 06:36:04 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #40097: [WIP][SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/02/22 06:49:00 UTC, 1 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40097: [WIP][SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/02/22 06:50:54 UTC, 7 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40092: [WIP][SPARK-42475][CONNECT][DOCS] Getting Started: Live Notebook for Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 06:55:04 UTC, 6 replies.
- [GitHub] [spark] zwangsheng opened a new pull request, #40118: [SPARK-26365] In kuberentes + cluster deploy-mode, spark submit should pass driver exit code - posted by "zwangsheng (via GitHub)" <gi...@apache.org> on 2023/02/22 07:04:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40109: [SPARK-42522][CONNECT] Fix DataFrameWriterV2 to find the default source - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 07:15:17 UTC, 2 replies.
- [GitHub] [spark] cloud-fan closed pull request #40107: [SPARK-42520][CONNECT] Support basic Window API in Scala client - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/22 07:19:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40107: [SPARK-42520][CONNECT] Support basic Window API in Scala client - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/22 07:19:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40107: [SPARK-42520][CONNECT] Support basic Window API in Scala client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/22 07:33:59 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40103: [SPARK-42516][SQL] Always capture the session time zone config while creating views - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/22 07:47:40 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40097: [WIP][SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/22 07:50:09 UTC, 7 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40103: [SPARK-42516][SQL] Always capture the session time zone config while creating views - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/22 07:51:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40119: [SPARK-42526][ML] Add Classifier.getNumClasses back - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/22 08:22:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40119: [SPARK-42526][ML] Add Classifier.getNumClasses back - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/22 08:22:41 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40098: [SPARK-42504][SQL] NestedColumnAliasing support pruning adjacent projects - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/22 08:24:59 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun opened a new pull request, #32: Test on 3.4.0-rc1 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/02/22 08:29:25 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40120: [SPARK-42527][CONNECT] Scala Client add Window functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/22 08:40:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40120: [SPARK-42527][CONNECT] Scala Client add Window functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/22 08:41:29 UTC, 5 replies.
- [GitHub] [spark] gatorsmile commented on pull request #39691: [SPARK-31561][SQL] Add QUALIFY clause - posted by "gatorsmile (via GitHub)" <gi...@apache.org> on 2023/02/22 08:47:26 UTC, 0 replies.
- [GitHub] [spark] zml1206 commented on pull request #40115: [SPARK-42525][CORE]collapse two adjacent windows with the same partition/order in subquery - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/02/22 08:56:14 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39992: [SPARK-42418][DOCS][PYTHON] PySpark documentation updates to improve discoverability and add more guidance - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/22 09:52:18 UTC, 0 replies.
- [GitHub] [spark] alkis opened a new pull request, #40121: [SPARK-42528] Optimize PercentileHeap - posted by "alkis (via GitHub)" <gi...@apache.org> on 2023/02/22 10:32:36 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #40119: [SPARK-42526][ML] Add Classifier.getNumClasses back - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/02/22 11:00:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40119: [SPARK-42526][ML] Add Classifier.getNumClasses back - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/22 11:02:19 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #40103: [SPARK-42516][SQL] Always capture the session time zone config while creating views - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/22 11:03:31 UTC, 0 replies.
- [GitHub] [spark] santosh-d3vpl3x opened a new pull request, #40122: [DRAFT][SPARK-42349][PYTHON] Support pandas cogroup with multiple df - posted by "santosh-d3vpl3x (via GitHub)" <gi...@apache.org> on 2023/02/22 11:07:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40121: [SPARK-42528] Optimize PercentileHeap - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/22 11:25:10 UTC, 7 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40123: [SPARK-42272][CONNEC][TESTS][FOLLOW-UP] Do not cache local port in SparkConnectService - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 11:36:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40123: [SPARK-42272][CONNEC][TESTS][FOLLOW-UP] Do not cache local port in SparkConnectService - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 11:37:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40117: [SPARK-42427][SQL][TESTS][FOLLOW-UP] Disable ANSI for several conv test cases in MathFunctionsSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/22 11:38:16 UTC, 0 replies.
- [GitHub] [spark] alkis commented on a diff in pull request #40121: [SPARK-42528] Optimize PercentileHeap - posted by "alkis (via GitHub)" <gi...@apache.org> on 2023/02/22 11:43:37 UTC, 10 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40035: [SPARK-41151][FOLLOW-UP][SQL] Improve the doc of `_metadata` generated columns nullability implementation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/22 11:44:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40035: [SPARK-41151][FOLLOW-UP][SQL] Improve the doc of `_metadata` generated columns nullability implementation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/22 11:54:04 UTC, 0 replies.
- [GitHub] [spark] olaky opened a new pull request, #40124: [SPARK-37980] Access row_index via _metadata if possible in tests - posted by "olaky (via GitHub)" <gi...@apache.org> on 2023/02/22 12:31:20 UTC, 0 replies.
- [GitHub] [spark] joaoleveiga commented on pull request #37817: [SPARK-40376][PYTHON] Avoid Numpy deprecation warning - posted by "joaoleveiga (via GitHub)" <gi...@apache.org> on 2023/02/22 12:32:51 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40097: [SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/22 13:03:43 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #37817: [SPARK-40376][PYTHON] Avoid Numpy deprecation warning - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/22 14:36:14 UTC, 4 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40120: [SPARK-42527][CONNECT] Scala Client add Window functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 14:52:33 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40120: [SPARK-42527][CONNECT] Scala Client add Window functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 14:54:05 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40123: [SPARK-42272][CONNEC][TESTS][FOLLOW-UP] Do not cache local port in SparkConnectService - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 15:03:00 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40109: [SPARK-42522][CONNECT] Fix DataFrameWriterV2 to find the default source - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 15:04:01 UTC, 1 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #40122: [SPARK-42349][PYTHON] Support pandas cogroup with multiple df - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/02/22 15:53:29 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40120: [SPARK-42527][CONNECT] Scala Client add Window functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 16:50:34 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #39950: [SPARK-42388][SQL] Avoid parquet footer reads twice when no filters in vectorized reader - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/22 17:47:12 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40125: [SPARK-42468][CONNECT][FOLLOW-UP] Add agg in Dataset - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/22 18:10:49 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40125: [SPARK-42468][CONNECT][FOLLOW-UP] Add agg in Dataset - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/22 18:10:59 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40123: [SPARK-42272][CONNEC][TESTS][FOLLOW-UP] Do not cache local port in SparkConnectService - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/22 18:12:23 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #40126: [WIP][SPARK-40822][SQL] Stable derived column aliases - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/22 18:13:11 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40109: [SPARK-42522][CONNECT] Fix DataFrameWriterV2 to find the default source - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/22 18:13:37 UTC, 2 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40109: [SPARK-42522][CONNECT] Fix DataFrameWriterV2 to find the default source - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/22 18:19:33 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40127: [SPARK-42530][PYSPARK][DOCS] Remove Hadoop 2 from PySpark installation guide - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/22 19:27:01 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40109: [SPARK-42522][CONNECT] Fix DataFrameWriterV2 to find the default source - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/22 19:28:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40127: [SPARK-42530][PYSPARK][DOCS] Remove Hadoop 2 from PySpark installation guide - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/22 19:30:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40127: [SPARK-42530][PYSPARK][DOCS] Remove Hadoop 2 from PySpark installation guide - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/22 19:30:25 UTC, 1 replies.
- [GitHub] [spark] shrprasa opened a new pull request, #40128: [SPARK-42466][Core][K8S]: Cleanup k8s upload directory when job terminates - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/02/22 19:40:53 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/02/22 19:51:08 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40125: [SPARK-42468][CONNECT][FOLLOW-UP] Add .agg variants in Dataset - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 19:53:23 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on pull request #40125: [SPARK-42468][CONNECT][FOLLOW-UP] Add .agg variants in Dataset - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/22 19:56:09 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40129: [SPARK-42529][CONNECT] Support Cube and Rollup in Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/22 20:05:25 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40121: [SPARK-42528] Optimize PercentileHeap - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/22 20:15:33 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #40121: [SPARK-42528] Optimize PercentileHeap - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/22 20:29:10 UTC, 3 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40130: [SPARK-42531][CONNECT] Scala Client Add Collections Functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 20:47:50 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40130: [SPARK-42531][CONNECT] Scala Client Add Collections Functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 20:48:11 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40075: [SPARK-42518] [CONNECT] Scala Client DataFrameWriterV2 - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 20:50:48 UTC, 0 replies.
- [GitHub] [spark] grundprinzip closed pull request #40094: [SPARK-41812][SPARK-41823][CONNECT][SQL][SCALA] Add PlanId to Scala Client - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/22 20:51:05 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40075: [SPARK-42518] [CONNECT] Scala Client DataFrameWriterV2 - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 20:51:37 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40109: [SPARK-42522][CONNECT] Fix DataFrameWriterV2 to find the default source - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 20:53:17 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39995: [WIP][CONNECT] Initial runtime SQL configuration implementation - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/22 20:53:21 UTC, 3 replies.
- [GitHub] [spark] hvanhovell closed pull request #40125: [SPARK-42468][CONNECT][FOLLOW-UP] Add .agg variants in Dataset - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/22 20:55:09 UTC, 0 replies.
- [GitHub] [spark] santosh-d3vpl3x commented on a diff in pull request #40122: [SPARK-42349][PYTHON] Support pandas cogroup with multiple df - posted by "santosh-d3vpl3x (via GitHub)" <gi...@apache.org> on 2023/02/22 21:10:47 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #40129: [SPARK-42529][CONNECT] Support Cube and Rollup in Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/22 21:44:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40131: [SPARK-42150][K8S][DOCS][FOLLOWUP] Use v1.7.0 in docs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/22 21:58:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40131: [SPARK-42150][K8S][DOCS][FOLLOWUP] Use v1.7.0 in docs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/22 21:58:44 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40131: [SPARK-42150][K8S][DOCS][FOLLOWUP] Use v1.7.0 in docs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/22 22:14:35 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40132: [SPARK-42532][K8S][DOCS] Update YuniKorn documentation with v1.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/22 22:33:10 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40132: [SPARK-42532][K8S][DOCS] Update YuniKorn documentation with v1.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/22 22:35:41 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #40132: [SPARK-42532][K8S][DOCS] Update YuniKorn docs with v1.2 - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/22 22:43:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40132: [SPARK-42532][K8S][DOCS] Update YuniKorn docs with v1.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/22 23:01:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40132: [SPARK-42532][K8S][DOCS] Update YuniKorn docs with v1.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/22 23:02:29 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40115: [SPARK-42525][CORE]collapse two adjacent windows with the same partition/order in subquery - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/22 23:51:29 UTC, 2 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40133: [SPARK-42533][CONNECT][Scala] Add ssl for Scala client - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/22 23:54:56 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #40133: [SPARK-42533][CONNECT][Scala] Add ssl for Scala client - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/22 23:55:32 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37634: [SPARK-40199][SQL] Provide useful error when projecting a non-null column encounters null value - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/23 00:20:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39995: [WIP][CONNECT] Initial runtime SQL configuration implementation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/23 00:55:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40127: [SPARK-42530][PYSPARK][DOCS] Remove Hadoop 2 from PySpark installation guide - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/23 01:09:25 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40129: [SPARK-42529][CONNECT] Support Cube and Rollup in Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/23 01:26:52 UTC, 1 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #40134: [SPARK-42534] Fix DB2Dialect Limit clause - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/23 01:50:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40120: [SPARK-42527][CONNECT] Scala Client add Window functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/23 02:09:26 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40129: [SPARK-42529][CONNECT] Support Cube and Rollup in Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/23 02:16:17 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40115: [SPARK-42525][CORE]collapse two adjacent windows with the same partition/order in subquery - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/23 02:34:27 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #40134: [SPARK-42534][SQL] Fix DB2Dialect Limit clause - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/23 02:36:08 UTC, 4 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40097: [SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/02/23 02:51:56 UTC, 3 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #40097: [SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/02/23 02:57:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40135: [SPARK-42444][PYTHON] `DataFrame.drop` should handle multi columns properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/23 03:05:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40135: [SPARK-42444][PYTHON] `DataFrame.drop` should handle multi columns properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/23 03:05:53 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40134: [SPARK-42534][SQL] Fix DB2Dialect Limit clause - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/23 03:12:23 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40136: [SPARK-42515][BUILD][TESTS] Make `write table` in `ClientE2ETestSuite` local test pass - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/23 03:19:27 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40064: [SPARK-42478] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/23 03:22:27 UTC, 1 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #40134: [SPARK-42534][SQL] Fix DB2Dialect Limit clause - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/23 03:37:45 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40136: [SPARK-42515][BUILD][CONNECT][TESTS] Make `write table` in `ClientE2ETestSuite` sbt local test pass - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/23 03:51:17 UTC, 4 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #40137: [SPARK-42049][SQL][FOLLOWUP] Always filter away invalid ordering/partitioning - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/23 03:55:04 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40137: [SPARK-42049][SQL][FOLLOWUP] Always filter away invalid ordering/partitioning - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/23 03:55:14 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40136: [SPARK-42515][BUILD][CONNECT][TESTS] Make `write table` in `ClientE2ETestSuite` sbt local test pass - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/23 03:55:23 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40129: [SPARK-42529][CONNECT] Support Cube and Rollup in Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/23 03:55:32 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40129: [SPARK-42529][CONNECT] Support Cube and Rollup in Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/23 03:56:47 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40138: [SPARK-41793][SQL] Incorrect result for window frames defined by a range clause on large decimals - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/23 03:59:08 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40133: [SPARK-42533][CONNECT][Scala] Add ssl for Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/23 03:59:13 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40134: [SPARK-42534][SQL] Fix DB2Dialect Limit clause - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/23 04:03:23 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40136: [SPARK-42515][BUILD][CONNECT][TESTS] Make `write table` in `ClientE2ETestSuite` sbt local test pass - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/23 05:24:51 UTC, 18 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40135: [SPARK-42444][PYTHON] `DataFrame.drop` should handle multi columns properly - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/23 05:38:02 UTC, 0 replies.
- [GitHub] [spark] huaxingao opened a new pull request, #40139: [SPARK-39859][SQL][FOLLOWUP] Support v2 DESCRIBE TABLE EXTENDED for columns - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/23 05:47:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40073: [SPARK-42484] [SQL] UnsafeRowUtils better error message - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/23 06:08:38 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40073: [SPARK-42484] [SQL] UnsafeRowUtils better error message - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/23 06:08:54 UTC, 0 replies.
- [GitHub] [spark] RunyaoChen opened a new pull request, #40140: [SPARK-42286][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "RunyaoChen (via GitHub)" <gi...@apache.org> on 2023/02/23 06:22:23 UTC, 0 replies.
- [GitHub] [spark] alkis commented on pull request #40121: [SPARK-42528] Optimize PercentileHeap - posted by "alkis (via GitHub)" <gi...@apache.org> on 2023/02/23 06:22:57 UTC, 3 replies.
- [GitHub] [spark] wangyum commented on pull request #40140: [SPARK-42286][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/23 06:31:52 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #40140: [SPARK-42286][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/23 06:45:04 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40138: [SPARK-41793][SQL] Incorrect result for window frames defined by a range clause on large decimals - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/23 06:53:31 UTC, 1 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #40134: [SPARK-42534][SQL] Fix DB2Dialect Limit clause - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/23 07:14:51 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40138: [SPARK-41793][SQL] Incorrect result for window frames defined by a range clause on large decimals - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/23 07:17:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40135: [SPARK-42444][PYTHON] `DataFrame.drop` should handle multi columns properly - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/23 07:19:33 UTC, 0 replies.
- [GitHub] [spark] amaliujia closed pull request #38588: [SPARK-41086][SQL] Consolidate SecondArgumentXXX error to INVALID_PARAMETER_VALUE - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/23 07:44:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40135: [SPARK-42444][PYTHON] `DataFrame.drop` should handle multi columns properly - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/23 07:45:04 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40135: [SPARK-42444][PYTHON] `DataFrame.drop` should handle multi columns properly - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/23 07:54:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40138: [SPARK-41793][SQL] Incorrect result for window frames defined by a range clause on large decimals - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/23 08:12:17 UTC, 1 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #40137: [SPARK-42049][SQL][FOLLOWUP] Always filter away invalid ordering/partitioning - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/23 08:27:13 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40135: [SPARK-42444][PYTHON] `DataFrame.drop` should handle multi columns properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/23 08:27:38 UTC, 1 replies.
- [GitHub] [spark] yaooqinn closed pull request #40036: [SPARK-42448][SQL] Fix spark sql shell prompt for current db - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/23 08:46:48 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/23 08:48:26 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40130: [SPARK-42531][CONNECT] Scala Client Add Collections Functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/23 09:09:30 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #40064: [SPARK-42478] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/23 09:17:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40130: [SPARK-42531][CONNECT] Scala Client Add Collections Functions - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/23 09:25:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40116: [WIP]SPARK-41391 Fix - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/23 09:26:08 UTC, 0 replies.
- [GitHub] [spark] rangadi opened a new pull request, #40141: [SPARK-42406] Terminate Protobuf recursive fields by dropping the field - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/23 09:40:23 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #40141: [SPARK-42406] Terminate Protobuf recursive fields by dropping the field - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/23 09:41:01 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #40141: [SPARK-42406] Terminate Protobuf recursive fields by dropping the field - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/02/23 09:52:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40097: [SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/23 10:40:59 UTC, 0 replies.
- [GitHub] [spark] steveloughran commented on pull request #40065: [SPARK-42382][BUILD] Upgrade `cyclonedx-maven-plugin` to 2.7.5 - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2023/02/23 11:08:37 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40142: [SPARK-41171][SQL] Infer window limit and push down it through window when partitionSpec is empty - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/23 11:13:47 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #38689: [SPARK-41171][SQL] Push down filter through window when partitionSpec is empty - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/23 11:20:28 UTC, 0 replies.
- [GitHub] [spark] beliefer closed pull request #38689: [SPARK-41171][SQL] Push down filter through window when partitionSpec is empty - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/23 11:20:29 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #29643: [SPARK-32638][SQL][FOLLOWUP] Move the plan rewriting methods to QueryPlan - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/23 11:30:52 UTC, 2 replies.
- [GitHub] [spark] khalidmammadov closed pull request #40015: [SPARK-42437][PySpark][Connect] PySpark catalog.cacheTable will allow to specify storage level - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/02/23 12:03:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #29643: [SPARK-32638][SQL][FOLLOWUP] Move the plan rewriting methods to QueryPlan - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/23 12:13:40 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #40138: [SPARK-41793][SQL] Incorrect result for window frames defined by a range clause on large decimals - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/23 12:36:33 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40130: [SPARK-42531][CONNECT] Scala Client Add Collections Functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/23 13:10:41 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40137: [SPARK-42049][SQL][FOLLOWUP] Always filter away invalid ordering/partitioning - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/23 13:19:17 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40092: [SPARK-42475][CONNECT][DOCS] Getting Started: Live Notebook for Spark Connect - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/23 13:35:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40143: [SPARK-42538][CONNECT] Make `sql.functions#lit` function support more types - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/23 14:10:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40143: [SPARK-42538][CONNECT] Make `sql.functions#lit` function support more types - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/23 14:26:48 UTC, 6 replies.
- [GitHub] [spark] hvanhovell closed pull request #40130: [SPARK-42531][CONNECT] Scala Client Add Collections Functions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/23 15:45:54 UTC, 0 replies.
- [GitHub] [spark] ritikam2 commented on pull request #40116: [WIP]SPARK-41391 Fix - posted by "ritikam2 (via GitHub)" <gi...@apache.org> on 2023/02/23 15:46:24 UTC, 2 replies.
- [GitHub] [spark] bersprockets commented on pull request #40140: [SPARK-42286][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/02/23 16:15:22 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40143: [SPARK-42538][CONNECT] Make `sql.functions#lit` function support more types - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/23 16:20:30 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40143: [SPARK-42538][CONNECT] Make `sql.functions#lit` function support more types - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/23 16:22:55 UTC, 8 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40143: [SPARK-42538][CONNECT] Make `sql.functions#lit` function support more types - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/23 16:24:20 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #40137: [SPARK-42049][SQL][FOLLOWUP] Always filter away invalid ordering/partitioning - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/23 17:04:06 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40137: [SPARK-42049][SQL][FOLLOWUP] Always filter away invalid ordering/partitioning - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/23 17:15:44 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/23 17:22:40 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40139: [SPARK-39859][SQL][FOLLOWUP] Only get ColStats when isExtended is true in Describe Column - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/23 17:40:07 UTC, 0 replies.
- [GitHub] [spark] xkrogen opened a new pull request, #40144: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/02/23 18:03:16 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40135: [SPARK-42444][PYTHON] `DataFrame.drop` should handle multi columns properly - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/23 18:22:18 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #40144: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/02/23 18:22:25 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40144: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/23 18:37:08 UTC, 0 replies.
- [GitHub] [spark] RunyaoChen commented on a diff in pull request #40140: [SPARK-42286][SPARK-41991][SPARK-42473][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "RunyaoChen (via GitHub)" <gi...@apache.org> on 2023/02/23 19:15:19 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40136: [SPARK-42515][BUILD][CONNECT][TESTS] Make `write table` in `ClientE2ETestSuite` sbt local test pass - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/23 19:31:05 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40133: [SPARK-42533][CONNECT][Scala] Add ssl for Scala client - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/23 20:48:50 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40145: [SPARK-42541][CONNECT] Support Pivot with provided pivot column values - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/23 20:57:31 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40145: [SPARK-42541][CONNECT] Support Pivot with provided pivot column values - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/23 20:58:31 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #36506: [SPARK-25050][SQL] Avro: writing complex unions - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/23 21:23:40 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #40146: [SPARK-42120][SQL] Add built-in table-valued function json_tuple - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/23 21:25:17 UTC, 0 replies.
- [GitHub] [spark] vicennial opened a new pull request, #40147: [SPARK-42543][CONNECT] Specify protocol for UDF artifact transfer in JVM/Scala client - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/02/23 21:31:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #36506: [SPARK-25050][SQL] Avro: writing complex unions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/23 21:34:45 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on a diff in pull request #40147: [SPARK-42543][CONNECT] Specify protocol for UDF artifact transfer in JVM/Scala client - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/02/23 21:59:06 UTC, 5 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40148: [SPARK-42544][CONNECT] Spark Connect Scala Client: support parameterized SQL - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/23 22:16:15 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40148: [SPARK-42544][CONNECT] Spark Connect Scala Client: support parameterized SQL - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/23 22:16:37 UTC, 0 replies.
- [GitHub] [spark] santosh-d3vpl3x commented on pull request #40135: [SPARK-42444][PYTHON] `DataFrame.drop` should handle multi columns properly - posted by "santosh-d3vpl3x (via GitHub)" <gi...@apache.org> on 2023/02/23 22:39:38 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40133: [SPARK-42533][CONNECT][Scala] Add ssl for Scala client - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/23 23:00:07 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #40149: [SPARK-42122][SQL] Add built-in table-valued function stack - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/23 23:06:12 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40150: [SPARK-41834][CONNECT] Implement SparkSession.conf - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/23 23:09:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40135: [SPARK-42444][PYTHON] `DataFrame.drop` should handle duplicated columns properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/24 00:03:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40135: [SPARK-42444][PYTHON] `DataFrame.drop` should handle duplicated columns properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/24 00:04:29 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37634: [SPARK-40199][SQL] Provide useful error when projecting a non-null column encounters null value - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/24 00:19:56 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #40151: [SPARK-42121][SQL] Add built-in table-valued functions posexplode and posexplode_outer - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/24 00:39:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40147: [SPARK-42543][CONNECT] Specify protocol for UDF artifact transfer in JVM/Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/24 00:44:11 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40147: [SPARK-42543][CONNECT] Specify protocol for UDF artifact transfer in JVM/Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/24 00:51:38 UTC, 2 replies.
- [GitHub] [spark] wangyum commented on pull request #40140: [SPARK-42286][SPARK-41991][SPARK-42473][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/24 01:04:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40152: [SPARK-42545][K8S][DOCS] Remove `experimental` from `Volcano` docs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/24 01:32:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40152: [SPARK-42545][K8S][DOCS] Remove `experimental` from `Volcano` docs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/24 01:47:13 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #40148: [SPARK-42544][CONNECT] Spark Connect Scala Client: support parameterized SQL - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/24 01:53:01 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40097: [WIP][SPARK-42508][CONNECT][ML] Extract the common .ml classes to `mllib-common` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/24 01:56:50 UTC, 0 replies.
- [GitHub] [spark] william-wang commented on pull request #40152: [SPARK-42545][K8S][DOCS] Remove `experimental` from `Volcano` docs - posted by "william-wang (via GitHub)" <gi...@apache.org> on 2023/02/24 01:57:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40152: [SPARK-42545][K8S][DOCS] Remove `experimental` from `Volcano` docs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/24 02:04:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40134: [SPARK-42534][SQL] Fix DB2Dialect Limit clause - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 02:19:46 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #40134: [SPARK-42534][SQL] Fix DB2Dialect Limit clause - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 02:19:59 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #40092: [SPARK-42475][CONNECT][DOCS] Getting Started: Live Notebook for Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/24 02:28:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40092: [SPARK-42475][CONNECT][DOCS] Getting Started: Live Notebook for Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/24 02:33:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40144: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 02:37:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39954: [SPARK-42289][SQL] DS V2 pushdown could let JDBC dialect decide to push down offset and limit - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 03:07:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39954: [SPARK-42289][SQL] DS V2 pushdown could let JDBC dialect decide to push down offset and limit - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 03:08:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40153: [SPARK-42547][PYTHON] Make PySpark working with Python 3.7 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/24 03:14:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40153: [SPARK-42547][PYTHON] Make PySpark working with Python 3.7 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/24 03:14:32 UTC, 5 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40122: [SPARK-42349][PYTHON] Support pandas cogroup with multiple df - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/24 03:24:47 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40153: [SPARK-42547][PYTHON] Make PySpark working with Python 3.7 - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/24 03:25:55 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #40104: [SPARK-42510][CONNECT][PYTHON] Implement `DataFrame.mapInPandas` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/24 03:26:52 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40145: [SPARK-42541][CONNECT] Support Pivot with provided pivot column values - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/24 03:27:03 UTC, 4 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #40154: [SPARK-42548][SQL] Add PlainReferences to skip rewriting attributes - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/24 03:28:17 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #40154: [SPARK-42548][SQL] Add PlainReferences to skip rewriting attributes - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/24 03:33:44 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40145: [SPARK-42541][CONNECT] Support Pivot with provided pivot column values - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/24 03:39:19 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40140: [SPARK-42286][SPARK-41991][SPARK-42473][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/24 03:43:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40153: [SPARK-42547][PYTHON] Make PySpark working with Python 3.7 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/24 03:44:48 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40015: [SPARK-42437][PySpark][Connect] PySpark catalog.cacheTable will allow to specify storage level - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/24 03:55:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40013: [SPARK-42367][CONNECT][PYTHON] `DataFrame.drop` should handle duplicated columns properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/24 04:01:12 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on pull request #40142: [SPARK-41171][SQL] Infer and push down window limit through window if partitionSpec is empty - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/24 04:11:02 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 04:13:50 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/24 04:15:10 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/24 04:16:48 UTC, 17 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40154: [SPARK-42548][SQL] Add PlainReferences to skip rewriting attributes - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 04:21:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39170: [SPARK-41674][SQL] Runtime filter should supports multi level shuffle join side as filter creation side - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 04:25:49 UTC, 0 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #40155: [SPARK-42534][SQL][3.4] Fix DB2Dialect Limit clause - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/24 04:32:36 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #40155: [SPARK-42534][SQL][3.4] Fix DB2Dialect Limit clause - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/24 04:33:52 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #40146: [SPARK-42120][SQL] Add built-in table-valued function json_tuple - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/24 04:48:48 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40156: [SPARK-41823][CONNECT] Scala Client resolve ambiguous columns in Join - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/24 04:58:16 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40156: [SPARK-41823][CONNECT] Scala Client resolve ambiguous columns in Join - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/24 04:58:26 UTC, 2 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #40154: [SPARK-42548][SQL] Add ReferenceAllColumns to skip rewriting attributes - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/24 04:59:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40153: [SPARK-42547][PYTHON] Make PySpark working with Python 3.7 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/24 05:15:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40137: [SPARK-42049][SQL][FOLLOWUP] Always filter away invalid ordering/partitioning - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/24 05:54:09 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/24 06:29:40 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/24 06:34:36 UTC, 0 replies.
- [GitHub] [spark] jerrypeng commented on a diff in pull request #39931: [SPARK-42376][SS] Introduce watermark propagation among operators - posted by "jerrypeng (via GitHub)" <gi...@apache.org> on 2023/02/24 06:53:18 UTC, 10 replies.
- [GitHub] [spark] wankunde opened a new pull request, #40157: [SPARK-42551] Support subexpression elimination in FilterExec - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/02/24 07:04:55 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40150: [SPARK-41834][CONNECT] Implement SparkSession.conf - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/24 07:54:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40150: [SPARK-41834][CONNECT] Implement SparkSession.conf - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/24 08:30:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40013: [SPARK-42367][CONNECT][PYTHON] `DataFrame.drop` should handle duplicated columns properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/24 08:46:07 UTC, 3 replies.
- [GitHub] [spark] wangyum commented on pull request #39691: [SPARK-31561][SQL] Add QUALIFY clause - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/24 09:28:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #40136: [SPARK-42515][BUILD][CONNECT][TESTS] Make `write table` in `ClientE2ETestSuite` sbt local test pass - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/24 09:43:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40136: [SPARK-42515][BUILD][CONNECT][TESTS] Make `write table` in `ClientE2ETestSuite` sbt local test pass - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/24 09:43:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #39807: [WIP][SPARK-42240][INFRA][CONNECT][TESTS] Move `ClientE2ETestSuite` into a separate module and add new GA task to test shaded jvm client with maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/24 09:43:52 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #40121: [SPARK-42528][CORE] Optimize PercentileHeap - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/24 09:50:18 UTC, 1 replies.
- [GitHub] [spark] AlanBateman commented on pull request #39909: [SPARK-42369][CORE] Fix constructor for java.nio.DirectByteBuffer - posted by "AlanBateman (via GitHub)" <gi...@apache.org> on 2023/02/24 10:07:29 UTC, 3 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39170: [SPARK-41674][SQL] Runtime filter should supports multi level shuffle join side as filter creation side - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/24 10:15:05 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40158: [MINOR][CONNECT] Typo fixes - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/24 11:19:47 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40159: [WIP][SPARK-42509][SQL] WindowGroupLimitExec supports codegen - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/24 11:52:26 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #40113: [WIP][SPARK-42509][SQL] WindowGroupLimitExec supports codegen - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/24 11:52:48 UTC, 0 replies.
- [GitHub] [spark] beliefer closed pull request #40113: [WIP][SPARK-42509][SQL] WindowGroupLimitExec supports codegen - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/24 11:52:50 UTC, 0 replies.
- [GitHub] [spark] alkis commented on a diff in pull request #40121: [SPARK-42528][CORE] Optimize PercentileHeap - posted by "alkis (via GitHub)" <gi...@apache.org> on 2023/02/24 12:17:47 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40121: [SPARK-42528][CORE] Optimize PercentileHeap - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 12:22:28 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40150: [SPARK-41834][CONNECT] Implement SparkSession.conf - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/24 12:37:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40155: [SPARK-42534][SQL][3.4] Fix DB2Dialect Limit clause - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 12:43:47 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40133: [SPARK-42533][CONNECT][Scala] Add ssl for Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/24 12:44:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40155: [SPARK-42534][SQL][3.4] Fix DB2Dialect Limit clause - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 12:44:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40133: [SPARK-42533][CONNECT][Scala] Add ssl for Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/24 12:44:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39062: [SPARK-41516] [SQL] Allow jdbc dialects to override the query used to create a table - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/24 12:45:40 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39909: [SPARK-42369][CORE] Fix constructor for java.nio.DirectByteBuffer - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/24 13:28:16 UTC, 2 replies.
- [GitHub] [spark] grundprinzip closed pull request #40063: [CONNECT] Eager Execution of DF.sql() - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/24 14:58:59 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #40160: [CONNECT] Eager Execution of DF.sql() - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/24 14:59:35 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #40147: [SPARK-42543][CONNECT] Specify protocol for UDF artifact transfer in JVM/Scala client - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/02/24 15:50:09 UTC, 6 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40150: [SPARK-41834][CONNECT] Implement SparkSession.conf - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/24 16:48:35 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #40156: [SPARK-41823][CONNECT] Scala Client resolve ambiguous columns in Join - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/24 17:05:23 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #40147: [SPARK-42543][CONNECT] Specify protocol for UDF artifact transfer in JVM/Scala client - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/02/24 17:10:53 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40140: [3.3][SPARK-42286][SPARK-41991][SPARK-42473][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/24 18:56:07 UTC, 0 replies.
- [GitHub] [spark] huanliwang-db opened a new pull request, #40161: [SPARK-42565][SS] Error log improvement for the lock acquisition of RocksDB state store instance - posted by "huanliwang-db (via GitHub)" <gi...@apache.org> on 2023/02/24 18:58:05 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40143: [SPARK-42538][CONNECT] Make `sql.functions#lit` function support more types - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/24 19:01:49 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40150: [SPARK-41834][CONNECT] Implement SparkSession.conf - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/24 19:03:25 UTC, 0 replies.
- [GitHub] [spark] huanliwang-db opened a new pull request, #40162: [SPARK-42566][SS] RocksDB StateStore lock acquisition should happen after getting input iterator from inputRDD - posted by "huanliwang-db (via GitHub)" <gi...@apache.org> on 2023/02/24 19:34:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40140: [3.3][SPARK-42286][SPARK-41991][SPARK-42473][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/24 20:05:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40140: [3.3][SPARK-42286][SPARK-41991][SPARK-42473][SQL] Fallback to previous codegen code path for complex expr with CAST - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/24 20:05:52 UTC, 0 replies.
- [GitHub] [spark] aimtsou commented on pull request #37817: [SPARK-40376][PYTHON] Avoid Numpy deprecation warning - posted by "aimtsou (via GitHub)" <gi...@apache.org> on 2023/02/24 21:21:34 UTC, 3 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #40163: [SPARK-42567] Track load time for state store provider and log warning if it exceeds threshold - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/02/24 21:26:09 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #40163: [SPARK-42567][SS][SQL] Track load time for state store provider and log warning if it exceeds threshold - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/02/24 21:27:45 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on pull request #39449: [SPARK-40688][SQL] Support data masking built-in function 'mask_first_n' - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/24 21:32:32 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40164: [SPARK-42569][CONNECT] Throw unsupported exceptions for non-supported API - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/24 22:11:35 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40165: [SPARK-42568][CONNECT] Fix SparkConnectStreamHandler to handle configs properly while planning - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/24 22:11:57 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40164: [SPARK-42569][CONNECT] Throw unsupported exceptions for non-supported API - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/24 22:12:20 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40013: [SPARK-42367][CONNECT][PYTHON] `DataFrame.drop` should handle duplicated columns properly - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/24 22:20:13 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #40163: [SPARK-42567][SS][SQL] Track load time for state store provider and log warning if it exceeds threshold - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/24 22:49:55 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40161: [SPARK-42565][SS] Error log improvement for the lock acquisition of RocksDB state store instance - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/24 22:54:41 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40161: [SPARK-42565][SS] Error log improvement for the lock acquisition of RocksDB state store instance - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/24 22:54:56 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on a diff in pull request #40163: [SPARK-42567][SS][SQL] Track load time for state store provider and log warning if it exceeds threshold - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/02/24 23:03:21 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #40104: [SPARK-42510][CONNECT][PYTHON] Implement `DataFrame.mapInPandas` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/02/24 23:40:11 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40166: [SPARK-42570][CONNECT][PYTHON] Fix DataFrameReader to use the default source - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/25 00:14:23 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40166: [SPARK-42570][CONNECT][PYTHON] Fix DataFrameReader to use the default source - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/25 00:15:55 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38464: [SPARK-32628][SQL] Use bloom filter to improve dynamic partition pruning - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/25 00:20:50 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40167: [SPARK-42561][CONNECT] Add temp view API to Dataset - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/25 00:34:36 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40167: [SPARK-42561][CONNECT] Add temp view API to Dataset - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/25 00:35:07 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40166: [SPARK-42570][CONNECT][PYTHON] Fix DataFrameReader to use the default source - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/25 00:36:31 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40166: [SPARK-42570][CONNECT][PYTHON] Fix DataFrameReader to use the default source - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/25 00:37:27 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on pull request #40166: [SPARK-42570][CONNECT][PYTHON] Fix DataFrameReader to use the default source - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/25 00:49:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40158: [MINOR][CONNECT] Typo fixes & update comment - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 00:54:41 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40158: [MINOR][CONNECT] Typo fixes & update comment - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 00:55:13 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40164: [SPARK-42569][CONNECT] Throw unsupported exceptions for non-supported API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 00:59:28 UTC, 4 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40167: [SPARK-42561][CONNECT] Add temp view API to Dataset - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 01:01:43 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40168: [SPARK-42573] Enable binary compatibility tests on all major client APIs - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/25 01:24:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40150: [SPARK-41834][CONNECT] Implement SparkSession.conf - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/25 01:32:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40150: [SPARK-41834][CONNECT] Implement SparkSession.conf - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/25 01:32:53 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40168: [SPARK-42573][Connect][Scala] Enable binary compatibility tests on all major client APIs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 01:38:06 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40167: [SPARK-42561][CONNECT] Add temp view API to Dataset - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/25 01:38:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40165: [SPARK-42568][CONNECT] Fix SparkConnectStreamHandler to handle configs properly while planning - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/25 01:38:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40165: [SPARK-42568][CONNECT] Fix SparkConnectStreamHandler to handle configs properly while planning - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/25 01:39:02 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40164: [SPARK-42569][CONNECT] Throw unsupported exceptions for non-supported API - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/25 01:40:32 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40167: [SPARK-42561][CONNECT] Add temp view API to Dataset - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/25 01:41:57 UTC, 2 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40168: [SPARK-42573][Connect][Scala] Enable binary compatibility tests on all major client APIs - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/25 01:48:22 UTC, 3 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #40169: [SPARK-42575][Connect][Scala] Make all client tests to extend from ConnectFunSuite - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/25 01:53:37 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #40169: [SPARK-42575][Connect][Scala] Make all client tests to extend from ConnectFunSuite - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/25 01:55:35 UTC, 1 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40170: [SPARK-42574][CONNECT][PYTHON] Fix toPandas to handle duplicated column names - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/25 01:59:50 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40145: [SPARK-42541][CONNECT] Support Pivot with provided pivot column values - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 02:29:44 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40145: [SPARK-42541][CONNECT] Support Pivot with provided pivot column values - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 02:30:30 UTC, 0 replies.
- [GitHub] [spark] shrprasa commented on pull request #40128: [SPARK-42466][Core][K8S]: Cleanup k8s upload directory when job terminates - posted by "shrprasa (via GitHub)" <gi...@apache.org> on 2023/02/25 03:01:43 UTC, 0 replies.
- [GitHub] [spark] Surbhi-Vijay opened a new pull request, #40171: [Tests] Refactor TPCH schema to separate file similar to TPCDS for code reuse - posted by "Surbhi-Vijay (via GitHub)" <gi...@apache.org> on 2023/02/25 09:33:29 UTC, 0 replies.
- [GitHub] [spark] Surbhi-Vijay commented on pull request #40171: [Tests] Refactor TPCH schema to separate file similar to TPCDS for code reuse - posted by "Surbhi-Vijay (via GitHub)" <gi...@apache.org> on 2023/02/25 09:37:57 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40162: [SPARK-42566][SS] RocksDB StateStore lock acquisition should happen after getting input iterator from inputRDD - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/25 09:38:08 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40162: [SPARK-42566][SS] RocksDB StateStore lock acquisition should happen after getting input iterator from inputRDD - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/25 09:38:42 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40163: [SPARK-42567][SS][SQL] Track load time for state store provider and log warning if it exceeds threshold - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/25 09:40:23 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40163: [SPARK-42567][SS][SQL] Track load time for state store provider and log warning if it exceeds threshold - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/25 09:41:02 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40116: [WIP]SPARK-41391 Fix - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/25 14:36:26 UTC, 2 replies.
- [GitHub] [spark] Kilo59 commented on pull request #29591: [SPARK-32714][PYTHON] Initial pyspark-stubs port. - posted by "Kilo59 (via GitHub)" <gi...@apache.org> on 2023/02/25 15:57:39 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40171: [Tests] Refactor TPCH schema to separate file similar to TPCDS for code reuse - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/25 17:51:35 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40168: [SPARK-42573][Connect][Scala] Enable binary compatibility tests on all major client APIs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 17:58:59 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40168: [SPARK-42573][Connect][Scala] Enable binary compatibility tests on all major client APIs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 17:59:38 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40169: [SPARK-42575][Connect][Scala] Make all client tests to extend from ConnectFunSuite - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 18:02:28 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40169: [SPARK-42575][Connect][Scala] Make all client tests to extend from ConnectFunSuite - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 18:02:59 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40167: [SPARK-42561][CONNECT] Add temp view API to Dataset - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 18:03:59 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40167: [SPARK-42561][CONNECT] Add temp view API to Dataset - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 18:04:35 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40164: [SPARK-42569][CONNECT] Throw unsupported exceptions for non-supported API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 18:06:36 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40164: [SPARK-42569][CONNECT] Throw unsupported exceptions for non-supported API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 18:07:20 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40143: [SPARK-42538][CONNECT] Make `sql.functions#lit` function support more types - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 18:09:05 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40166: [SPARK-42570][CONNECT][PYTHON] Fix DataFrameReader to use the default source - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 18:13:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40166: [SPARK-42570][CONNECT][PYTHON] Fix DataFrameReader to use the default source - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/25 18:14:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40169: [SPARK-42575][Connect][Scala] Make all client tests to extend from ConnectFunSuite - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/25 19:40:18 UTC, 0 replies.
- [GitHub] [spark] santosh-d3vpl3x commented on pull request #40122: [SPARK-42349][PYTHON] Support pandas cogroup with multiple df - posted by "santosh-d3vpl3x (via GitHub)" <gi...@apache.org> on 2023/02/25 21:16:29 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40172: [SPARK-42569][CONNECT][FOLLOW-UP] Throw unsupported exceptions for persist - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/25 23:40:36 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40172: [SPARK-42569][CONNECT][FOLLOW-UP] Throw unsupported exceptions for persist - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/25 23:40:45 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40173: [SPARK-42576][CONNECT] Add 2nd groupBy method to Dataset - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/26 00:11:42 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40173: [SPARK-42576][CONNECT] Add 2nd groupBy method to Dataset - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/26 00:11:47 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38464: [SPARK-32628][SQL] Use bloom filter to improve dynamic partition pruning - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/02/26 00:23:39 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40173: [SPARK-42576][CONNECT] Add 2nd groupBy method to Dataset - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/26 01:22:21 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40174: [SPARK-42573][CONNECT][FOLLOW-UP] fix broken build after variable rename - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/26 01:53:22 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40174: [SPARK-42573][CONNECT][FOLLOW-UP] fix broken build after variable rename - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/26 01:53:29 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40173: [SPARK-42576][CONNECT] Add 2nd groupBy method to Dataset - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/26 01:53:54 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40173: [SPARK-42576][CONNECT] Add 2nd groupBy method to Dataset - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/26 02:35:14 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40173: [SPARK-42576][CONNECT] Add 2nd groupBy method to Dataset - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/26 02:36:15 UTC, 0 replies.
- [GitHub] [spark] amaliujia closed pull request #40174: [SPARK-42573][CONNECT][FOLLOW-UP] fix broken build after variable rename - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/26 02:39:12 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40175: [SPARK-42580][CONNECT] Scala client add client side typed APIs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/26 03:10:36 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #40115: [SPARK-42525][SQL] Collapse two adjacent windows with the same partition/order in subquery - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/26 03:21:52 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40115: [SPARK-42525][SQL] Collapse two adjacent windows with the same partition/order in subquery - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/26 03:26:48 UTC, 0 replies.
- [GitHub] [spark] ritikam2 commented on pull request #40116: SPARK-41391[SQL][WIP] - posted by "ritikam2 (via GitHub)" <gi...@apache.org> on 2023/02/26 08:27:18 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40176: [SPARK-42564][CONNECT] Implement SparkSession.version and SparkSession.time - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/26 13:16:43 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40177: [SPARK-42583][SQL] Remove the outer join if they are all distinct aggregate functions - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/26 13:46:25 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40116: SPARK-41391[SQL][WIP] - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/26 15:26:59 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on pull request #40040: [SPARK-42399] [SQL] Support big numbers for conv function (get rid of overflow) - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/02/26 17:06:09 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on pull request #39678: [SPARK-16484][SQL] Add HyperLogLogPlusPlus sketch generator/evaluator/aggregator - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/02/26 17:24:48 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #40178: [MINOR][FOLLOWUP] Remove Jenkins from web page. - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/26 17:47:56 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #40178: [MINOR][DOCS][FOLLOWUP] Remove `Jenkins` from web page. - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/02/26 17:49:36 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40175: [SPARK-42580][CONNECT] Scala client add client side typed APIs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/26 20:44:35 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #40177: [SPARK-42583][SQL] Remove the outer join if they are all distinct aggregate functions - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/26 23:10:53 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40178: [MINOR][DOCS][FOLLOWUP] Remove `Jenkins` from web page. - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/26 23:39:05 UTC, 2 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40179: [SPARK-42560][CONNECT] Add ColumnName class - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 00:16:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40178: [MINOR][DOCS][FOLLOWUP] Remove `Jenkins` from web page. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/27 00:22:20 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40170: [SPARK-42574][CONNECT][PYTHON] Fix toPandas to handle duplicated column names - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/27 00:22:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40170: [SPARK-42574][CONNECT][PYTHON] Fix toPandas to handle duplicated column names - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/27 00:23:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40179: [SPARK-42560][CONNECT] Add ColumnName class - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/27 00:27:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40172: [SPARK-42569][CONNECT][FOLLOW-UP] Throw unsupported exceptions for persist - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/27 00:28:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40172: [SPARK-42569][CONNECT][FOLLOW-UP] Throw unsupported exceptions for persist - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/27 00:28:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39991: [SPARK-42419][CONNECT][PYTHON] Migrate into error framework for Spark Connect Column API. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/27 00:31:03 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40179: [SPARK-42560][CONNECT] Add ColumnName class - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 00:37:57 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40176: [SPARK-42564][CONNECT] Implement SparkSession.version and SparkSession.time - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 00:38:42 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #40176: [SPARK-42564][CONNECT] Implement SparkSession.version and SparkSession.time - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 00:39:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39995: [WIP][CONNECT] Initial runtime SQL configuration implementation - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/27 00:47:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40170: [SPARK-42574][CONNECT][PYTHON] Fix toPandas to handle duplicated column names - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/27 01:02:06 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40178: [MINOR][DOCS][FOLLOWUP] Remove `Jenkins` from web page. - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/27 01:54:20 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #40177: [SPARK-42583][SQL] Remove the outer join if they are all distinct aggregate functions - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/27 01:54:25 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40180: [SPARK-42587][CONNECT][TESTS] Use wrapper versions for SBT and Maven in `connect` module tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 01:57:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40180: [SPARK-42587][CONNECT][TESTS] Use wrapper versions for SBT and Maven in `connect` module tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 02:00:58 UTC, 1 replies.
- [GitHub] [spark] wankunde commented on pull request #40157: [SPARK-42551][SQL] Support subexpression elimination in FilterExec - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/02/27 02:30:45 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40176: [SPARK-42564][CONNECT] Implement SparkSession.version and SparkSession.time - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 02:33:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40180: [SPARK-42587][CONNECT][TESTS] Use wrapper versions for SBT and Maven in `connect` module tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 02:34:54 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40179: [SPARK-42560][CONNECT] Add ColumnName class - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 02:37:48 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40179: [SPARK-42560][CONNECT] Add ColumnName class - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 02:38:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40181: [SPARK-42589][CONNECT][TESTS] Exclude `RelationalGroupedDataset.apply` from `CompatibilitySuite` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 02:45:52 UTC, 0 replies.
- [GitHub] [spark] zml1206 opened a new pull request, #40182: [SPARK-42588][SQL] Collapse two adjacent windows with the equivalent partition/order expressions - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/02/27 02:48:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40181: [SPARK-42589][CONNECT][TESTS] Exclude `RelationalGroupedDataset.apply` from `CompatibilitySuite` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 02:59:12 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40181: [SPARK-42589][CONNECT][TESTS] Exclude `RelationalGroupedDataset.apply` from `CompatibilitySuite` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 03:00:31 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40181: [SPARK-42589][CONNECT][TESTS] Exclude `RelationalGroupedDataset.apply` from `CompatibilitySuite` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 03:11:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40183: [SPARK-42587][CONNECT][TESTS][FOLLOWUP] Fix `scalafmt` failure - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 03:13:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40183: [SPARK-42587][CONNECT][TESTS][FOLLOWUP] Fix `scalafmt` failure - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 03:15:43 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40183: [SPARK-42587][CONNECT][TESTS][FOLLOWUP] Fix `scalafmt` failure - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 03:50:41 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40184: [SPARK-42569][CONNECT] Throw exceptions for unsupported session API - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/27 03:55:54 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40184: [SPARK-42569][CONNECT] Throw exceptions for unsupported session API - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/27 03:57:25 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on pull request #39990: [SPARK-42415][SQL] The built-in dialects support OFFSET and paging query. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/27 04:08:04 UTC, 0 replies.
- [GitHub] [spark] gatorsmile commented on pull request #39558: [SPARK-41982][SQL] Partitions of type string should not be treated as numeric types - posted by "gatorsmile (via GitHub)" <gi...@apache.org> on 2023/02/27 04:10:17 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40185: [SPARK-42586][CONNECT] Add RuntimeConfig for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 04:11:24 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40184: [SPARK-42569][CONNECT] Throw exceptions for unsupported session API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 04:18:14 UTC, 8 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40184: [SPARK-42569][CONNECT] Throw exceptions for unsupported session API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 04:18:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40179: [SPARK-42560][CONNECT] Add ColumnName class - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 04:32:30 UTC, 0 replies.
- [GitHub] [spark] navinvishy commented on a diff in pull request #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "navinvishy (via GitHub)" <gi...@apache.org> on 2023/02/27 04:40:01 UTC, 3 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40186: [SPARK-42581][CONNECT] Add SQLImplicits. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 04:40:35 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #40149: [SPARK-42122][SQL] Add built-in table-valued function stack - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/27 05:13:18 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db closed pull request #40149: [SPARK-42122][SQL] Add built-in table-valued function stack - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/27 05:13:25 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db closed pull request #40146: [SPARK-42120][SQL] Add built-in table-valued function json_tuple - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/27 05:13:36 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #40183: [SPARK-42587][CONNECT][TESTS][FOLLOWUP] Fix `scalafmt` failure - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/27 05:18:28 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40115: [SPARK-42525][SQL] Collapse two adjacent windows with the same partition/order in subquery - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/27 06:24:13 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40121: [SPARK-42528][CORE] Optimize PercentileHeap - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/27 06:26:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40121: [SPARK-42528][CORE] Optimize PercentileHeap - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/27 06:27:27 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #40187: [SPARK-42572] [SQL] [SS] Fix behavior for StateStoreProvider.validateStateRowFormat - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/02/27 07:46:08 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on pull request #38699: [SPARK-41188][CORE][ML] Set executorEnv OMP_NUM_THREADS to be spark.task.cpus by default for spark executor JVM processes - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/02/27 07:54:31 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40073: [SPARK-42484] [SQL] UnsafeRowUtils better error message - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/27 08:07:12 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #40188: [SPARK-42592][SS][DOCS] Document how to perform chained time window aggregations - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/27 08:14:52 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40188: [SPARK-42592][SS][DOCS] Document how to perform chained time window aggregations - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/27 08:16:00 UTC, 2 replies.
- [GitHub] [spark] zml1206 commented on pull request #40115: [SPARK-42525][SQL] Collapse two adjacent windows with the same partition/order in subquery - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/02/27 08:16:00 UTC, 0 replies.
- [GitHub] [spark] zml1206 commented on pull request #40182: [SPARK-42588][SQL] Collapse two adjacent windows with the equivalent partition/order expressions - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/02/27 08:19:36 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #40187: [SPARK-42572][SQL][SS] Fix behavior for StateStoreProvider.validateStateRowFormat - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/27 08:30:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/27 08:40:53 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #40189: [SPARK-27483] Put fallback logic for StreamingRelation(V2) to analyzer rule - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/02/27 08:42:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40144: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/27 08:54:39 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #40064: [SPARK-42478] Make a serializable jobTrackerId instead of a non-serializable JobID in FileWriterFactory - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/27 08:56:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40188: [SPARK-42592][SS][DOCS] Document how to perform chained time window aggregations - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/27 09:04:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40188: [SPARK-42592][SS][DOCS] Document how to perform chained time window aggregations - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/27 09:04:56 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40190: [SPARK-42597][SQL] UnwrapCastInBinaryComparison support unwrap timestamp type - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/27 09:30:19 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40191: [SPARK-42554][CONNECT] Add `dev/connect-jvm-client-mima-check` tool to instead of `CompatibilitySuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/27 09:33:34 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #39558: [SPARK-41982][SQL] Partitions of type string should not be treated as numeric types - posted by "smallzhongfeng (via GitHub)" <gi...@apache.org> on 2023/02/27 09:41:26 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40192: [SPARK-42600][SQL] CatalogImpl.currentDatabase shall use NamespaceHelper instead of MultipartIdentifierHelper - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/27 09:54:20 UTC, 0 replies.
- [GitHub] [spark] alkis opened a new pull request, #40193: [SPARK-42528] Optimize PercentileHeap further by removing averaging - posted by "alkis (via GitHub)" <gi...@apache.org> on 2023/02/27 10:06:14 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #40188: [SPARK-42592][SS][DOCS] Document how to perform chained time window aggregations - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/27 11:16:38 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40192: [SPARK-42600][SQL] CatalogImpl.currentDatabase shall use NamespaceHelper instead of MultipartIdentifierHelper - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/27 11:26:34 UTC, 1 replies.
- [GitHub] [spark] bozhang2820 opened a new pull request, #40194: [SPARK-42602][CORE] Add reason argument to TaskScheduler.cancelTasks - posted by "bozhang2820 (via GitHub)" <gi...@apache.org> on 2023/02/27 11:52:50 UTC, 0 replies.
- [GitHub] [spark] jiang13021 opened a new pull request, #40195: [SPARK-42553][SQL] ensure at least one time unit after "interval" - posted by "jiang13021 (via GitHub)" <gi...@apache.org> on 2023/02/27 12:42:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40191: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/27 12:57:06 UTC, 1 replies.
- [GitHub] [spark] huangxiaopingRD opened a new pull request, #40196: [SPARK-42603][SQL] Set spark.sql.legacy.createHiveTableByDefault to f… - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2023/02/27 13:00:44 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40185: [SPARK-42586][CONNECT] Add RuntimeConfig for Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 13:05:12 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40186: [SPARK-42581][CONNECT] Add SQLImplicits. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 13:11:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40186: [SPARK-42581][CONNECT] Add SQLImplicits. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 13:11:49 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40175: [SPARK-42580][CONNECT] Scala client add client side typed APIs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 13:15:19 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40190: [SPARK-42597][SQL] UnwrapCastInBinaryComparison support unwrap timestamp type - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/27 13:44:40 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40160: [CONNECT] Eager Execution of DF.sql() - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 14:02:49 UTC, 8 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40193: [SPARK-42528] Optimize PercentileHeap further by removing averaging - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/27 14:37:26 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40191: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/27 15:07:09 UTC, 3 replies.
- [GitHub] [spark] srowen commented on pull request #40193: [SPARK-42528] Optimize PercentileHeap further by removing averaging - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/27 16:26:03 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40197: [SPARK-42605][CONNECT] Add TypedColumn - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 16:48:58 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40188: [SPARK-42592][SS][DOCS] Document how to perform chained time window aggregations - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/27 17:22:45 UTC, 2 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #40144: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/27 17:44:42 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40126: [WIP][SPARK-40822][SQL] Stable derived column aliases - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/27 18:14:10 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40184: [SPARK-42569][CONNECT] Throw exceptions for unsupported session API - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/27 18:18:25 UTC, 9 replies.
- [GitHub] [spark] amaliujia commented on pull request #40185: [SPARK-42586][CONNECT] Add RuntimeConfig for Scala Client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/27 18:31:08 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40186: [SPARK-42581][CONNECT] Add SQLImplicits. - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/27 18:34:58 UTC, 0 replies.
- [GitHub] [spark] ritikam2 commented on pull request #40116: [SPARK-41391][SQL] The output column name of groupBy.agg(count_distinct) is incorrect - posted by "ritikam2 (via GitHub)" <gi...@apache.org> on 2023/02/27 18:46:41 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #40116: [SPARK-41391][SQL] The output column name of groupBy.agg(count_distinct) is incorrect - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/27 18:56:00 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40197: [SPARK-42605][CONNECT] Add TypedColumn - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 19:13:10 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40197: [SPARK-42605][CONNECT] Add TypedColumn - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 19:13:46 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #40198: [SPARK-42337][SQL][FOLLOWUP] Update the error message for INVALID_TEMP_OBJ_REFERENCE - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/27 19:17:08 UTC, 0 replies.
- [GitHub] [spark] jzhuge opened a new pull request, #40199: [SPARK-42596][CORE] OMP_NUM_THREADS not set to number of executor cor… - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/02/27 19:22:24 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40191: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/27 19:28:20 UTC, 1 replies.
- [GitHub] [spark] huaxingao commented on pull request #40139: [SPARK-39859][SQL][FOLLOWUP] Only get ColStats when isExtended is true in Describe Column - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/27 19:37:16 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40200: [SPARK-42542][CONNECT] Support Pivot without providing pivot column values - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/27 19:42:33 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov opened a new pull request, #40015: [SPARK-42437][PySpark][Connect] PySpark catalog.cacheTable will allow to specify storage level - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/02/27 19:42:34 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40200: [SPARK-42542][CONNECT] Support Pivot without providing pivot column values - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/27 19:42:46 UTC, 1 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40201: [SPARK-42510][CONNECT][PYTHON][TEST] Enable `DataFrame.mapInPandas` parity tests - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/27 20:16:18 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #39990: [SPARK-42415][SQL] The built-in dialects support OFFSET and paging query. - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/02/27 21:44:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 21:53:07 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40160: [CONNECT] Eager Execution of DF.sql() - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/27 22:11:24 UTC, 2 replies.
- [GitHub] [spark] hvanhovell closed pull request #40200: [SPARK-42542][CONNECT] Support Pivot without providing pivot column values - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/27 22:40:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40199: [SPARK-42596][CORE] OMP_NUM_THREADS not set to number of executor cor… - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/27 22:42:49 UTC, 0 replies.
- [GitHub] [spark] sunchao closed pull request #40144: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/27 22:58:11 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #40144: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/02/27 22:59:13 UTC, 2 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40160: [SPARK-41725][CONNECT] Eager Execution of DF.sql() - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/27 23:10:10 UTC, 1 replies.
- [GitHub] [spark] xkrogen commented on pull request #40144: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/02/27 23:19:05 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40188: [SPARK-42592][SS][DOCS] Document how to perform chained time window aggregations - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/27 23:38:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40201: [SPARK-42510][CONNECT][PYTHON][TEST] Enable more `DataFrame.mapInPandas` parity tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 00:16:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40201: [SPARK-42510][CONNECT][PYTHON][TEST] Enable more `DataFrame.mapInPandas` parity tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 00:16:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40199: [SPARK-42596][CORE] OMP_NUM_THREADS not set to number of executor cor… - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 00:31:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40199: [SPARK-42596][CORE][YARN] OMP_NUM_THREADS not set to number of executor cores by default - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 00:32:24 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38699: [SPARK-41188][CORE][ML] Set executorEnv OMP_NUM_THREADS to be spark.task.cpus by default for spark executor JVM processes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 00:33:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40151: [SPARK-42121][SQL] Add built-in table-valued functions posexplode, posexplode_outer, json_tuple and stack - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 00:37:30 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40151: [SPARK-42121][SQL] Add built-in table-valued functions posexplode, posexplode_outer, json_tuple and stack - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 00:39:11 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #40202: [SPARK-42608][SQL] Use full inner field names in resolution errors - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/02/28 00:40:59 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 commented on pull request #40194: [SPARK-42602][CORE] Add reason argument to TaskScheduler.cancelTasks - posted by "bozhang2820 (via GitHub)" <gi...@apache.org> on 2023/02/28 00:41:54 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40151: [SPARK-42121][SQL] Add built-in table-valued functions posexplode, posexplode_outer, json_tuple and stack - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/28 00:43:12 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40203: [SPARK-42612][CONNECT][PYTHON][TEST] Enable more parity tests related to functions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/28 00:54:56 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40203: [SPARK-42612][CONNECT][PYTHON][TEST] Enable more parity tests related to functions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/28 01:01:06 UTC, 0 replies.
- [GitHub] [spark] jiangxb1987 commented on a diff in pull request #40194: [SPARK-42602][CORE] Add reason argument to TaskScheduler.cancelTasks - posted by "jiangxb1987 (via GitHub)" <gi...@apache.org> on 2023/02/28 01:04:46 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40139: [SPARK-39859][SQL][FOLLOWUP] Only get ColStats when isExtended is true in Describe Column - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 01:34:49 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40139: [SPARK-39859][SQL][FOLLOWUP] Only get ColStats when isExtended is true in Describe Column - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 01:35:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40203: [SPARK-42612][CONNECT][PYTHON][TEST] Enable more parity tests related to functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 01:38:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40199: [SPARK-42596][CORE][YARN] OMP_NUM_THREADS not set to number of executor cores by default - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 01:39:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40139: [SPARK-39859][SQL][FOLLOWUP] Only get ColStats when isExtended is true in Describe Column - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 01:42:05 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #40192: [SPARK-42600][SQL] CatalogImpl.currentDatabase shall use NamespaceHelper instead of MultipartIdentifierHelper - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/28 01:45:15 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on a diff in pull request #40199: [SPARK-42596][CORE][YARN] OMP_NUM_THREADS not set to number of executor cores by default - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/02/28 01:47:15 UTC, 3 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #40203: [SPARK-42612][CONNECT][PYTHON][TESTS] Enable more parity tests related to functions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/28 01:48:35 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40203: [SPARK-42612][CONNECT][PYTHON][TESTS] Enable more parity tests related to functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 01:52:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40192: [SPARK-42600][SQL] CatalogImpl.currentDatabase shall use NamespaceHelper instead of MultipartIdentifierHelper - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 01:57:51 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40192: [SPARK-42600][SQL] CatalogImpl.currentDatabase shall use NamespaceHelper instead of MultipartIdentifierHelper - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 01:58:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40194: [SPARK-42602][CORE] Add reason argument to TaskScheduler.cancelTasks - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 02:00:26 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40190: [SPARK-42597][SQL] UnwrapCastInBinaryComparison support unwrap timestamp type - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 02:02:51 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #40204: [SPARK-42601][SQL] New physical type Decimal128 for DecimalType - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/02/28 02:03:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40157: [SPARK-42551][SQL] Support subexpression elimination in FilterExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 02:05:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40175: [SPARK-42580][CONNECT] Scala client add client side typed APIs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 02:07:30 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on pull request #40199: [SPARK-42596][CORE][YARN] OMP_NUM_THREADS not set to number of executor cores by default - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/02/28 02:10:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40205: [SPARK-42610][CONNECT] Add encoders to SQLImplicits - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/28 02:15:42 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/02/28 02:18:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40199: [SPARK-42596][CORE][YARN] OMP_NUM_THREADS not set to number of executor cores by default - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 02:19:32 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/02/28 02:19:33 UTC, 3 replies.
- [GitHub] [spark] huangxiaopingRD commented on pull request #40196: [SPARK-42603][SQL] Set spark.sql.legacy.createHiveTableByDefault to false. - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2023/02/28 02:24:45 UTC, 1 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/02/28 02:24:56 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40093: [SPARK-42500][SQL] ConstantPropagation supports more cases - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 02:28:34 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40191: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/28 02:34:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40015: [SPARK-42437][PySpark][Connect] PySpark catalog.cacheTable will allow to specify storage level - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/28 02:37:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 02:42:23 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40202: [SPARK-42608][SQL] Use full inner field names in resolution errors - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 02:44:13 UTC, 2 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #40207: [SPARK-42614][CONNECT] Make constructors private[sql] - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/28 02:54:38 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #40208: [SPARK-42592][SS][DOCS][FOLLOWUP] Add missed commit on reflecting review comment - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/28 03:10:20 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40208: [SPARK-42592][SS][DOCS][FOLLOWUP] Add missed commit on reflecting review comment - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/28 03:10:54 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40187: [SPARK-42572][SQL][SS] Fix behavior for StateStoreProvider.validateStateRowFormat - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/28 03:12:39 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40187: [SPARK-42572][SQL][SS] Fix behavior for StateStoreProvider.validateStateRowFormat - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/28 03:13:30 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40198: [SPARK-42337][SQL][FOLLOWUP] Update the error message for INVALID_TEMP_OBJ_REFERENCE - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/28 03:23:09 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #40013: [SPARK-42367][CONNECT][PYTHON] `DataFrame.drop` should handle duplicated columns properly - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/28 03:26:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40126: [SPARK-40822][SQL] Stable derived column aliases - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 03:39:45 UTC, 4 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #40208: [SPARK-42592][SS][DOCS][FOLLOWUP] Add missed commit on reflecting review comment - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/28 03:54:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40136: [SPARK-42515][BUILD][CONNECT][TESTS] Make `write table` in `ClientE2ETestSuite` sbt local test pass - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 03:58:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40136: [SPARK-42515][BUILD][CONNECT][TESTS] Make `write table` in `ClientE2ETestSuite` sbt local test pass - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 03:58:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39039: [SPARK-40776][SQL][PROTOBUF][DOCS] Spark-Protobuf docs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 03:59:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39039: [SPARK-40776][SQL][PROTOBUF][DOCS] Spark-Protobuf docs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 03:59:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40203: [SPARK-42612][CONNECT][PYTHON][TESTS] Enable more parity tests related to functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 04:01:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40203: [SPARK-42612][CONNECT][PYTHON][TESTS] Enable more parity tests related to functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 04:01:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40144: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 04:04:34 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #40202: [SPARK-42608][SQL] Use full inner field names in resolution errors - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/02/28 04:15:27 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #40209: [SPARK-42427][SQL][TESTS][FOLLOW-UP] Disable ANSI for one more conv test case in MathFunctionsSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 04:19:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40209: [SPARK-42427][SQL][TESTS][FOLLOW-UP] Disable ANSI for one more conv test case in MathFunctionsSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 04:19:42 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #40208: [SPARK-42592][SS][DOCS][FOLLOWUP] Add missed commit on reflecting review comment - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/02/28 04:38:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40144: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 04:40:07 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40141: [SPARK-42406] Terminate Protobuf recursive fields by dropping the field - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/28 04:43:48 UTC, 1 replies.
- [GitHub] [spark] gengliangwang closed pull request #40141: [SPARK-42406] Terminate Protobuf recursive fields by dropping the field - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/28 04:44:33 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #40160: [SPARK-41725][CONNECT] Eager Execution of DF.sql() - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/02/28 05:00:18 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #40210: [SPARK-42615][CONNECT][PYTHON] Refactor the AnalyzePlan RPC and add `session.version` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/28 05:16:48 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #40209: [SPARK-42427][SQL][TESTS][FOLLOW-UP] Disable ANSI for one more conv test case in MathFunctionsSuite - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/02/28 05:25:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40207: [SPARK-42614][CONNECT] Make constructors private[sql] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 05:29:02 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40194: [SPARK-42602][CORE] Add reason argument to TaskScheduler.cancelTasks - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 05:31:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 05:31:29 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40205: [SPARK-42610][CONNECT] Add encoders to SQLImplicits - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 05:32:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40177: [SPARK-42583][SQL] Remove the outer join if they are all distinct aggregate functions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 05:36:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40210: [SPARK-42615][CONNECT][PYTHON] Refactor the AnalyzePlan RPC and add `session.version` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/28 05:38:17 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #40211: [SPARK-42616][SQL] SparkSQLCLIDriver shall only close started hive sessionState - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/02/28 05:43:23 UTC, 0 replies.
- [GitHub] [spark] jzhuge opened a new pull request, #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THR… - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/02/28 05:43:58 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THR… - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/02/28 05:48:58 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40142: [SPARK-41171][SQL] Infer and push down window limit through window if partitionSpec is empty - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 05:53:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40142: [SPARK-41171][SQL] Infer and push down window limit through window if partitionSpec is empty - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 05:53:45 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40193: [SPARK-42528] Optimize PercentileHeap further by removing averaging - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/02/28 06:20:51 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THR… - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/02/28 06:24:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/28 07:29:44 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40154: [SPARK-42548][SQL] Add ReferenceAllColumns to skip rewriting attributes - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/02/28 07:46:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40193: [SPARK-42528] Optimize PercentileHeap further by removing averaging - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 07:46:50 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #40189: [SPARK-27483] [SS] [SQL] Put fallback logic for StreamingRelation(V2) to analyzer rule - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/02/28 07:47:14 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40154: [SPARK-42548][SQL] Add ReferenceAllColumns to skip rewriting attributes - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 07:52:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40154: [SPARK-42548][SQL] Add ReferenceAllColumns to skip rewriting attributes - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/02/28 07:53:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40214: [SPARK-42491][BUILD] Upgrade jetty to 9.4.51.v20230217 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/28 07:54:51 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #40215: [SPARK-42591][SS][DOCS] Add examples of unblocked workloads after SPARK-42376 - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/28 07:56:52 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40215: [SPARK-42591][SS][DOCS] Add examples of unblocked workloads after SPARK-42376 - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/02/28 07:57:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THR… - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 08:17:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/28 08:19:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40209: [SPARK-42427][SQL][TESTS][FOLLOW-UP] Disable ANSI for one more conv test case in MathFunctionsSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 08:20:37 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #38035: [SPARK-42438][SQL] Improve constraint propagation using multiTransform - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/02/28 08:23:05 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #40190: [SPARK-42597][SQL] UnwrapCastInBinaryComparison support unwrap timestamp type - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/02/28 08:40:20 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40202: [SPARK-42608][SQL] Use full inner field names in resolution errors - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 09:00:52 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THREADS to task cpus instead of executor cores by default - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/02/28 09:18:13 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #40216: [WIP][SPARK-42593][PS] Deprecate & remove the APIs that will be removed in pandas 2.0. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/02/28 09:48:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THREADS to task cpus instead of executor cores by default - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/02/28 09:55:11 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #40126: [SPARK-40822][SQL] Stable derived column aliases - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/28 10:18:49 UTC, 2 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #40217: [SPARK-42559][CONNECT] Implement DataFrameNaFunctions - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/28 12:03:01 UTC, 0 replies.
- [GitHub] [spark] Surbhi-Vijay commented on pull request #40171: [SPARK-42598][TEST] Refactor TPCH schema to separate file similar to TPCDS for code reuse - posted by "Surbhi-Vijay (via GitHub)" <gi...@apache.org> on 2023/02/28 12:50:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #40218: [SPARK-42579][WIP] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/28 13:09:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40218: [SPARK-42579][WIP] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/02/28 13:10:48 UTC, 0 replies.
- [GitHub] [spark] jelmerk opened a new pull request, #40219: Disable substitution in values - posted by "jelmerk (via GitHub)" <gi...@apache.org> on 2023/02/28 13:23:43 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40217: [WIP][SPARK-42559][CONNECT] Implement DataFrameNaFunctions - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/28 13:56:43 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #40217: [WIP][SPARK-42559][CONNECT] Implement DataFrameNaFunctions - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/02/28 14:00:03 UTC, 1 replies.
- [GitHub] [spark] olaky commented on pull request #40124: [SPARK-37980][SQL] Access row_index via _metadata if possible in tests - posted by "olaky (via GitHub)" <gi...@apache.org> on 2023/02/28 14:41:27 UTC, 0 replies.
- [GitHub] [spark] sandeep-katta0102 commented on pull request #40219: Disable substitution in values - posted by "sandeep-katta0102 (via GitHub)" <gi...@apache.org> on 2023/02/28 14:43:02 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THREADS to task cpus instead of executor cores by default - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/28 14:55:48 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #40219: Disable substitution in values - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/28 14:57:21 UTC, 4 replies.
- [GitHub] [spark] srowen commented on pull request #40214: [SPARK-42491][BUILD] Upgrade jetty to 9.4.51.v20230217 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/02/28 14:58:04 UTC, 0 replies.
- [GitHub] [spark] jelmerk commented on pull request #40219: Disable substitution in values - posted by "jelmerk (via GitHub)" <gi...@apache.org> on 2023/02/28 15:15:40 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39937: [SPARK-42309][SQL] Introduce `INCOMPATIBLE_DATA_TO_TABLE` and sub classes. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/02/28 15:33:02 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40160: [SPARK-41725][CONNECT] Eager Execution of DF.sql() - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/28 15:35:12 UTC, 3 replies.
- [GitHub] [spark] jelmerk commented on pull request #40219: [SPARK-42622][CORE] Disable substitution in values - posted by "jelmerk (via GitHub)" <gi...@apache.org> on 2023/02/28 15:52:22 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on a diff in pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THREADS to task cpus instead of executor cores by default - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2023/02/28 16:05:35 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/28 16:08:56 UTC, 0 replies.
- [GitHub] [spark] aimtsou opened a new pull request, #40220: [PYTHON] Change alias for numpy deprecated and removed types - posted by "aimtsou (via GitHub)" <gi...@apache.org> on 2023/02/28 16:12:07 UTC, 0 replies.
- [GitHub] [spark] steveloughran opened a new pull request, #40221: [SPARK-41551][SQL] Dynamic/absolute path support in PathOutputCommitters - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2023/02/28 16:13:36 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40210: [SPARK-42615][CONNECT][PYTHON] Refactor the AnalyzePlan RPC and add `session.version` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/28 16:37:13 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40210: [SPARK-42615][CONNECT][PYTHON] Refactor the AnalyzePlan RPC and add `session.version` - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/28 16:38:43 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40210: [SPARK-42615][CONNECT][PYTHON] Refactor the AnalyzePlan RPC and add `session.version` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/28 16:39:13 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40210: [SPARK-42615][CONNECT][PYTHON] Refactor the AnalyzePlan RPC and add `session.version` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/28 16:40:10 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THREADS to task cpus instead of executor cores by default - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/02/28 17:10:26 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on pull request #40147: [SPARK-42543][CONNECT] Specify protocol for UDF artifact transfer in JVM/Scala client - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/02/28 17:14:53 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on a diff in pull request #40212: [SPARK-42613][CORE][PYTHON][YARN] PythonRunner should set OMP_NUM_THREADS to task cpus instead of executor cores by default - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/02/28 17:16:47 UTC, 6 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37468: [SPARK-40034][SQL] PathOutputCommitters to support dynamic partitions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 17:23:57 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #40222: [SPARK-42615][CONNECT][FOLLOW-UP] Implement correct version API in SparkSession for Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/28 17:28:35 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #40222: [SPARK-42615][CONNECT][FOLLOW-UP] Implement correct version API in SparkSession for Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/28 17:28:42 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40222: [SPARK-42615][CONNECT][FOLLOW-UP] Implement correct version API in SparkSession for Scala client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/02/28 17:31:24 UTC, 0 replies.
- [GitHub] [spark] shardulm94 commented on pull request #39673: [SPARK-42132][SQL] Deduplicate attributes in groupByKey.cogroup - posted by "shardulm94 (via GitHub)" <gi...@apache.org> on 2023/02/28 17:32:49 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #40222: [SPARK-42615][CONNECT][FOLLOW-UP] Implement correct version API in SparkSession for Scala client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/02/28 17:33:24 UTC, 1 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #40126: [SPARK-40822][SQL] Stable derived column aliases - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/02/28 17:55:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40207: [SPARK-42614][CONNECT] Make constructors private[sql] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 18:12:15 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40223: [SPARK-42624][PYTHON][TESTS] Reorganize imports in test_functions - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/28 20:01:14 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #40213: [SPARK-42599][CONNECT][INFRA] Introduce `dev/connect-jvm-client-mima-check` instead of `CompatibilitySuite` - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/02/28 20:59:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #40222: [SPARK-42615][CONNECT][FOLLOW-UP] Implement correct version API in SparkSession for Scala client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 21:15:32 UTC, 0 replies.
- [GitHub] [spark] xkrogen opened a new pull request, #40224: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/02/28 21:46:35 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #40224: [SPARK-42539][SQL][HIVE] Eliminate separate classloader when using 'builtin' Hive version for metadata client - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/02/28 21:47:12 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #40222: [SPARK-42615][CONNECT][FOLLOW-UP] Implement correct version API in SparkSession for Scala client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 21:48:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #40222: [SPARK-42615][CONNECT][FOLLOW-UP] Implement correct version API in SparkSession for Scala client - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 21:48:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #40225: [SPARK-42625][BUILD] Upgrade zstd-jni to 1.5.4-2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/02/28 21:54:48 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #40198: [SPARK-42337][SQL][FOLLOWUP] Update the error message for INVALID_TEMP_OBJ_REFERENCE - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/02/28 22:25:35 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #40189: [SPARK-27483] [SS] [SQL] Move fallback logic for StreamingRelation(V2) to an analyzer rule - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/02/28 23:07:32 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #40206: [SPARK-42611][SQL] Insert char/varchar length checks for inner fields during resolution - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/02/28 23:25:08 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #40226: [SPARK-41868][CONNECT][PYTHON] Fix createDataFrame to support durations - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/02/28 23:31:17 UTC, 0 replies.