- [GitHub] [spark] bersprockets commented on a diff in pull request #41809: [SPARK-44251][SQL] Set nullable correctly on coalesced join key in full outer USING join - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/07/01 00:12:17 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #41802: [SPARK-44256][BUILD] Upgrade rocksdbjni to 8.3.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/01 00:26:39 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40468: [SPARK-42838][SQL] changed error class name _LEGACY_ERROR_TEMP_2000 - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/01 00:26:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40341: [WIP][SPARK-42715][SQL] Tips for Optimizing NegativeArraySizeException - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/01 00:26:51 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40297: [SPARK-42412][WIP] Initial PR of Spark connect ML - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/01 00:26:52 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #39754: [SPARK-42199][SQL] Fix issues around Dataset.groupByKey - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/01 00:26:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38518: [SPARK-33349][K8S] Reset the executor pods watcher when we receive a version changed from k8s - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/01 00:26:55 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38296: [SPARK-40830][SQL] Provide groupByKey shortcuts for groupBy.as - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/01 00:26:56 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37211: [SPARK-39644][SQL] Add RangePartitioning reporting for V2 DataSources - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/01 00:26:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #41802: [SPARK-44256][BUILD] Upgrade rocksdbjni to 8.3.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/01 00:27:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #41808: [SPARK-44162][CORE] Support G1GC in spark metrics - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/01 00:28:43 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #41805: [SPARK-44259][CONNECT][TESTS] Make `connect-client-jvm` pass on Java 21 except `RemoteSparkSession`-based tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/01 00:30:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #41805: [SPARK-44259][CONNECT][TESTS] Make `connect-client-jvm` pass on Java 21 except `RemoteSparkSession`-based tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/01 00:31:01 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #41746: [SPARK-44198][CORE] Fix inconsistent Log Level Setting between Spark Driver and Executors - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/01 00:38:30 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #41810: [MINOR] Fix Typo in `build/mvn` script - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/01 01:03:36 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41812: [SPARK-44267][PS][INFRA] Upgrade `pandas` to 2.0.3 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/01 03:08:41 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #41723: [SPARK-44179][CORE]Fix the number of executors is calculated incorrctly when the task fails and it is speculated that the task is still executing - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/01 03:31:28 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #41806: [SPARK-44242] Improve Max Heap not set check - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/01 03:34:38 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #41785: [SPARK-44241][Core] Mistakenly set io.connectionTimeout/connectionCreationTimeout to zero or negative will cause incessant executor cons/destructions - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/01 03:37:08 UTC, 7 replies.
- [GitHub] [spark] mridulm commented on pull request #40883: [WIP][SPARK-43221][CORE] the BlockManager with the persisted block is preferred - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/01 03:40:54 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39011: [SPARK-41469][CORE] Avoid unnecessary task rerun on decommissioned executor lost if shuffle data migrated - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/01 03:45:44 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #40412: [SPARK-42784] should still create subDir when the number of subDir in merge dir is less than conf - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/01 03:48:13 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #40412: [SPARK-42784] should still create subDir when the number of subDir in merge dir is less than conf - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/01 03:50:26 UTC, 0 replies.
- [GitHub] [spark] huaxingao closed pull request #41614: [SPARK-44060][SQL] Code-gen for build side outer shuffled hash join - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/07/01 05:04:36 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #41614: [SPARK-44060][SQL] Code-gen for build side outer shuffled hash join - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/07/01 05:05:16 UTC, 0 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #41801: [Spark Ticket Here]SSH Environment Manager - posted by "lu-wang-dl (via GitHub)" <gi...@apache.org> on 2023/07/01 05:08:13 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41804: [SPARK-43851][SQL] Support LCA in grouping expressions - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/01 05:47:43 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41804: [SPARK-43851][SQL] Support LCA in grouping expressions - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/01 05:48:31 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41578: [SPARK-44044][SS] Improve Error message for Window functions with streaming - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/01 05:51:04 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41578: [SPARK-44044][SS] Improve Error message for Window functions with streaming - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/01 05:51:34 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41804: [SPARK-43851][SQL] Support LCA in grouping expressions - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/01 05:53:37 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41808: [SPARK-44162][CORE] Support G1GC in spark metrics - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/01 06:52:43 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41788: [SPARK-44244][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2305-2309] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/01 09:03:02 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41788: [SPARK-44244][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2305-2309] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/01 09:03:55 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41794: [SPARK-44254][SQL] Move QueryExecutionErrors to sql/api - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/01 09:07:17 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41797: [SPARK-44255][SQL] Relocate StorageLevel to common/utils - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/01 09:10:02 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41797: [SPARK-44255][SQL] Relocate StorageLevel to common/utils - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/01 09:10:40 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41111: [SPARK-39420][SQL] Support `ANALYZE TABLE` on Datasource V2 tables - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/01 09:27:01 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #41812: [SPARK-44267][PS][INFRA] Upgrade `pandas` to 2.0.3 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/01 09:42:48 UTC, 4 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #41813: [SPARK-44268][CORE][TEST] Add tests to ensure error-classes.json and docs are in sync - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/01 15:36:46 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41813: [SPARK-44268][CORE][TEST] Add tests to ensure error-classes.json and docs are in sync - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/01 15:37:38 UTC, 2 replies.
- [GitHub] [spark] ashangit commented on a diff in pull request #41806: [SPARK-44242] Improve Max Heap not set check - posted by "ashangit (via GitHub)" <gi...@apache.org> on 2023/07/01 20:00:18 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40468: [SPARK-42838][SQL] changed error class name _LEGACY_ERROR_TEMP_2000 - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/02 00:24:21 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40439: [SPARK-42807][CORE] Apply custom log URL pattern for yarn-client AM log URL in SHS - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/02 00:24:22 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40297: [SPARK-42412][WIP] Initial PR of Spark connect ML - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/02 00:24:24 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #41076: [SPARK-43396][CORE] Add config to control max ratio of decommissioning executors - posted by "warrenzhu25 (via GitHub)" <gi...@apache.org> on 2023/07/02 00:32:43 UTC, 1 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #41761: [SPARK-43828][CORE] Add config to control whether close idle connections - posted by "warrenzhu25 (via GitHub)" <gi...@apache.org> on 2023/07/02 00:36:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41804: [SPARK-43851][SQL] Support LCA in grouping expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/02 00:39:09 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41804: [SPARK-43851][SQL] Support LCA in grouping expressions - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/02 01:17:23 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41111: [SPARK-39420][SQL] Support `ANALYZE TABLE` on Datasource V2 tables - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/02 01:29:04 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #41761: [SPARK-43828][CORE] Add config to control whether close idle connections - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/02 02:20:58 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #41794: [SPARK-44254][SQL] Move QueryExecutionErrors that used by DataType to sql/api as DataTypeErrors - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/02 03:12:09 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #41762: [SPARK-44215][SHUFFLE] If num chunks are 0, then server should throw a RuntimeException - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/02 03:34:40 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #41746: [SPARK-44198][CORE] Fix inconsistent Log Level Setting between Spark Driver and Executors - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/02 03:36:42 UTC, 4 replies.
- [GitHub] [spark] mridulm commented on pull request #41746: [SPARK-44198][CORE] Fix inconsistent Log Level Setting between Spark Driver and Executors - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/02 03:45:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41806: [SPARK-44242] Improve Max Heap not set check - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/02 04:30:48 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #41788: [SPARK-44244][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2305-2309] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/02 05:01:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41814: [SPARK-44259][CONNECT][TESTS][FOLLOWUP] No longer initializing `Ammonite` for Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/02 05:01:48 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41814: [SPARK-44259][CONNECT][TESTS][FOLLOWUP] No longer initializing `Ammonite` for Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/02 05:02:17 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41814: [SPARK-44259][CONNECT][TESTS][FOLLOWUP] No longer initializing `Ammonite` for Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/02 05:04:59 UTC, 8 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41815: [SPARK-40731][] Make `streaming` pass on Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/02 07:14:03 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #41816: [SPARK-44269][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2310-2314] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/02 07:14:14 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41794: [SPARK-44254][SQL] Move QueryExecutionErrors that used by DataType to sql/api as DataTypeErrors - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/02 07:18:18 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #41817: [SPARK-43851][SQL][FOLLOWUP] Move resolve LCA in grouping expressions - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/02 07:18:55 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41794: [SPARK-44254][SQL] Move QueryExecutionErrors that used by DataType to sql/api as DataTypeErrors - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/02 07:18:55 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41817: [SPARK-43851][SQL][FOLLOWUP] Move resolve LCA in grouping expressions - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/02 07:19:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41815: [SPARK-40731][DSTREAM] Make `streaming` pass on Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/02 07:25:40 UTC, 1 replies.
- [GitHub] [spark] holdenk commented on pull request #41076: [SPARK-43396][CORE] Add config to control max ratio of decommissioning executors - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/07/02 07:58:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41815: [SPARK-40731][DSTREAM] Make `streaming` pass on Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/02 08:01:41 UTC, 3 replies.
- [GitHub] [spark] NarekDW commented on pull request #39719: [SPARK-42169] [SQL] Implement code generation for to_csv function (StructsToCsv) - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/07/02 08:09:25 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41812: [SPARK-44267][PS][INFRA] Upgrade `pandas` to 2.0.3 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/02 12:29:15 UTC, 3 replies.
- [GitHub] [spark] junyuc25 opened a new pull request, #41818: [WIP] Upgrade AWS SDK - posted by "junyuc25 (via GitHub)" <gi...@apache.org> on 2023/07/02 13:14:53 UTC, 0 replies.
- [GitHub] [spark] junyuc25 closed pull request #41818: [WIP] Upgrade AWS SDK - posted by "junyuc25 (via GitHub)" <gi...@apache.org> on 2023/07/02 13:18:36 UTC, 0 replies.
- [GitHub] [spark] junyuc25 opened a new pull request, #41819: Upgrade codes related to IAM, DDB and Kinesis clients - posted by "junyuc25 (via GitHub)" <gi...@apache.org> on 2023/07/02 13:19:00 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #41785: [SPARK-44241][Core] Mistakenly set io.connectionTimeout/connectionCreationTimeout to zero or negative will cause incessant executor cons/destructions - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/02 13:50:43 UTC, 2 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #41820: [SPARK-44271][SQL] Move util functions from DataType to ResolveDefaultColumns - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/02 14:19:02 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #41820: [SPARK-44271][SQL] Move util functions from DataType to ResolveDefaultColumns - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/02 14:23:18 UTC, 0 replies.
- [GitHub] [spark] shuwang21 opened a new pull request, #41821: SPARK-44272: Path Inconsistency when Operating statCache within Yarn Client - posted by "shuwang21 (via GitHub)" <gi...@apache.org> on 2023/07/02 15:33:22 UTC, 0 replies.
- [GitHub] [spark] shuwang21 commented on pull request #41821: SPARK-44272: Path Inconsistency when Operating statCache within Yarn Client - posted by "shuwang21 (via GitHub)" <gi...@apache.org> on 2023/07/02 15:49:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41813: [SPARK-44268][CORE][TEST] Add tests to ensure error-classes.json and docs are in sync - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/02 15:50:40 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41813: [SPARK-44268][CORE][TEST] Add tests to ensure error-classes.json and docs are in sync - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/02 15:51:20 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #41746: [SPARK-44198][CORE] Fix inconsistent Log Level Setting between Spark Driver and Executors - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/02 18:48:07 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #41746: [SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/02 21:53:17 UTC, 14 replies.
- [GitHub] [spark] itholic commented on pull request #41812: [SPARK-44267][PS][INFRA] Upgrade `pandas` to 2.0.3 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/02 22:30:31 UTC, 8 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41760: [SPARK-44211][PYTHON][CONNECT] Implement SparkSession.is_stopped - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:18:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41760: [SPARK-44211][PYTHON][CONNECT] Implement SparkSession.is_stopped - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:19:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41787: [SPARK-44245][PYTHON] pyspark.sql.dataframe doctests behave differently - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:21:01 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40551: [SPARK] Project implements ExposesMetadataColumns - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/03 00:23:22 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40550: [SPARK] LogicalPlan.metadataOutput always contains AttributeReference - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/03 00:23:24 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40439: [SPARK-42807][CORE] Apply custom log URL pattern for yarn-client AM log URL in SHS - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/03 00:23:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #39160: [SPARK-41667][K8S] Expose env var SPARK_DRIVER_POD_NAME in Driver Pod - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/03 00:23:27 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38357: [SPARK-40887][K8S] Allow Spark on K8s to integrate w/ Log Service - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/03 00:23:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41787: [SPARK-44245][PYTHON] pyspark.sql.dataframe doctests behave differently - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:23:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41807: [SPARK-44263][CONNECT] Channel Builder support - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:28:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41789: [CONNECT][SPARK-44246] Follow-ups for Spark Connect Jar/Classfile Isolation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:32:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41792: [SPARK-44249][SQL][PYTHON] Refactor PythonUDTFRunner to send its return type separately - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:33:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41792: [SPARK-44249][SQL][PYTHON] Refactor PythonUDTFRunner to send its return type separately - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:33:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41789: [SPARK-44246][CONNECT][FOLLOW-UP] Miscellaneous cleanups for Spark Connect Jar/Classfile Isolation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:35:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41789: [SPARK-44246][CONNECT][FOLLOW-UP] Miscellaneous cleanups for Spark Connect Jar/Classfile Isolation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:35:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41776: [SPARK-41822][CONNECT][TESTS][FOLLOW-UP] Remove the need of a fixed port to allow parallel running of tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:36:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41776: [SPARK-41822][CONNECT][TESTS][FOLLOW-UP] Remove the need of a fixed port to allow parallel running of tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:36:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41812: [SPARK-44267][PS][INFRA] Upgrade `pandas` to 2.0.3 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:47:57 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41742: [SPARK-44195][R] Add JobTag APIs to SparkR SparkContext - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:53:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41742: [SPARK-44195][R] Add JobTag APIs to SparkR SparkContext - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:53:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41748: [SPARK-44145][SQL] Callback when ready for execution - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 00:58:42 UTC, 10 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41815: [SPARK-40731][DSTREAM] Make `streaming` pass on Java 21 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 01:08:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41815: [SPARK-40731][DSTREAM] Make `streaming` pass on Java 21 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 01:08:44 UTC, 0 replies.
- [GitHub] [spark] junyuc25 closed pull request #41819: Upgrade codes related to IAM, DDB and Kinesis clients - posted by "junyuc25 (via GitHub)" <gi...@apache.org> on 2023/07/03 01:11:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #41822: [MINOR][SQL][TESTS] Use SystemUtils.isJavaVersionAtMost for java version check - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 01:23:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41822: [MINOR][SQL][TESTS] Use SystemUtils.isJavaVersionAtMost for java version check - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 01:23:21 UTC, 2 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41823: [SPARK-43476][PYTHON][TESTS] Enable SeriesStringTests.test_string_replace for pandas 2.0.0. - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/03 01:28:04 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41823: [SPARK-43476][PYTHON][TESTS] Enable SeriesStringTests.test_string_replace for pandas 2.0.0. - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/03 01:28:22 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41822: [MINOR][SQL][TESTS] Use SystemUtils.isJavaVersionAtMost for java version check - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/03 01:38:44 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #41725: [SPARK-44180][SQL] DistributionAndOrderingUtils should apply ResolveTimeZone - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/03 01:44:38 UTC, 1 replies.
- [GitHub] [spark] yaooqinn closed pull request #41733: [SPARK-44185][SQL] Fix inconsistent path qualifying between catalog and data operations - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/03 01:45:32 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41733: [SPARK-44185][SQL] Fix inconsistent path qualifying between catalog and data operations - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/03 01:46:10 UTC, 0 replies.
- [GitHub] [spark] lyy-pineapple commented on pull request #41723: [SPARK-44179][CORE]Fix the number of executors is calculated incorrctly when the task fails and it is speculated that the task is still executing - posted by "lyy-pineapple (via GitHub)" <gi...@apache.org> on 2023/07/03 02:09:19 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41606: [WIP] [SPARK-44061] Add assertDFEquality util function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 02:15:10 UTC, 11 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41824: [SPARK-43570][SPARK-43571] Enable DateOpsTests.[test_rsub|test_sub] for pandas 2.0.0. - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/03 02:30:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41806: [SPARK-44242] Improve Max Heap not set check - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/03 02:47:12 UTC, 2 replies.
- [GitHub] [spark] cxzl25 commented on pull request #40972: [SPARK-43301][CORE][SHUFFLE] BlockStoreClient getHostLocalDirs RPC supports IOException retry - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/03 03:14:38 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on a diff in pull request #40972: [SPARK-43301][CORE][SHUFFLE] BlockStoreClient getHostLocalDirs RPC supports IOException retry - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/03 03:16:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41749: [SPARK-44199] CacheManager refreshes the fileIndex unnecessarily - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 03:46:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41822: [MINOR][SQL][TESTS] Use SystemUtils.isJavaVersionAtMost for java version check - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 04:01:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41823: [SPARK-43476][PYTHON][TESTS] Enable SeriesStringTests.test_string_replace for pandas 2.0.0. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/03 05:48:18 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41746: [SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/03 06:11:57 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #41825: [SPARK-44274][CONNECT] Move out util functions used by ArtifactManager to common/utils - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/03 06:22:38 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #41825: [SPARK-44274][CONNECT] Move out util functions used by ArtifactManager to common/utils - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/03 06:22:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38285: [SPARK-40820][PYTHON] Creating StructType from Json - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 06:26:49 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41823: [SPARK-43476][PYTHON][TESTS] Enable SeriesStringTests.test_string_replace for pandas 2.0.0. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 06:29:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41823: [SPARK-43476][PYTHON][TESTS] Enable SeriesStringTests.test_string_replace for pandas 2.0.0. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 06:30:23 UTC, 0 replies.
- [GitHub] [spark] bzhaoopenstack opened a new pull request, #37234: [SPARK-39822][PYTHON][PS] Provide a good feedback to users - posted by "bzhaoopenstack (via GitHub)" <gi...@apache.org> on 2023/07/03 06:31:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37234: [SPARK-39822][PYTHON][PS] Provide a good feedback to users - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 06:32:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40460: [SPARK-42828][PYTHON][SQL] More explicit Python type annotations for GroupedData - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 06:37:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #40460: [SPARK-42828][PYTHON][SQL] More explicit Python type annotations for GroupedData - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 06:38:02 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39719: [SPARK-42169] [SQL] Implement code generation for to_csv function (StructsToCsv) - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/03 07:12:32 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39719: [SPARK-42169] [SQL] Implement code generation for to_csv function (StructsToCsv) - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/03 07:13:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41750: [SPARK-44200][SQL] Support TABLE argument parser rule for TableValuedFunction - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 07:25:55 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41750: [SPARK-44200][SQL] Support TABLE argument parser rule for TableValuedFunction - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 07:26:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41769: [WIP] [SPARK-44216] Assert equality test message formatting - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 07:27:16 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41826: skip on jdk21 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/03 07:40:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39952: [SPARK-40770][PYTHON][FOLLOW-UP] Improved error messages for mapInPandas for schema mismatch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 07:45:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #40528: [SPARK-42584][CONNECT] Improve output of Column.explain - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 07:58:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41342: [SPARK-43829][CONNECT] Improve SparkConnectPlanner by reuse Dataset and avoid construct new Dataset - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 08:01:04 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #41748: [SPARK-44145][SQL] Callback when ready for execution - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/03 08:18:31 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41823: [SPARK-43476][PYTHON][TESTS] Enable SeriesStringTests.test_string_replace for pandas 2.0.0. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 08:19:36 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41824: [SPARK-43570][SPARK-43571][PYTHON][TESTS] Enable DateOpsTests.[test_rsub|test_sub] for pandas 2.0.0. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 08:19:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41809: [SPARK-44251][SQL] Set nullable correctly on coalesced join key in full outer USING join - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 08:27:56 UTC, 0 replies.
- [GitHub] [spark] kunalgoyal98 commented on pull request #38285: [SPARK-40820][PYTHON] Creating StructType from Json - posted by "kunalgoyal98 (via GitHub)" <gi...@apache.org> on 2023/07/03 08:43:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41786: [WIP][SPARK-44243][CORE] Add a parameter to determine the locality of local shuffle reader - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 08:53:10 UTC, 0 replies.
- [GitHub] [spark] maryannxue commented on pull request #41786: [WIP][SPARK-44243][CORE] Add a parameter to determine the locality of local shuffle reader - posted by "maryannxue (via GitHub)" <gi...@apache.org> on 2023/07/03 08:55:40 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #41824: [SPARK-43570][SPARK-43571][PYTHON][TESTS] Enable DateOpsTests.[test_rsub|test_sub] for pandas 2.0.0. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/03 09:04:05 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #41823: [SPARK-43476][PYTHON][TESTS] Enable SeriesStringTests.test_string_replace for pandas 2.0.0. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/03 09:13:31 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #41824: [SPARK-43570][SPARK-43571][PYTHON][TESTS] Enable DateOpsTests.[test_rsub|test_sub] for pandas 2.0.0. - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/03 09:19:07 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #41606: [WIP] [SPARK-44061] Add assertDFEquality util function - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/03 09:21:07 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #40528: [SPARK-42584][CONNECT] Improve output of Column.explain - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/03 09:53:40 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #41813: [SPARK-44268][CORE][TEST] Add tests to ensure error-classes.json and docs are in sync - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/03 10:09:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41440: [SPARK-43952][CORE][CONNECT][SQL] Add SparkContext APIs for query cancellation by tag - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 10:33:54 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #41827: [SPARK-44200][SQL][FOLLOWUP] Add `TABLE_VALUED_FUNCTION_TOO_MANY_TABLE_ARGUMENTS` error into doc - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/03 10:34:11 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #41750: [SPARK-44200][SQL] Support TABLE argument parser rule for TableValuedFunction - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/03 10:35:03 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41750: [SPARK-44200][SQL] Support TABLE argument parser rule for TableValuedFunction - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/03 10:35:20 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #41827: [SPARK-44200][SQL][FOLLOWUP] Add `TABLE_VALUED_FUNCTION_TOO_MANY_TABLE_ARGUMENTS` error into doc - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/03 10:35:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41827: [SPARK-44200][SQL][FOLLOWUP] Add `TABLE_VALUED_FUNCTION_TOO_MANY_TABLE_ARGUMENTS` error into doc - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 10:40:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41827: [SPARK-44200][SQL][FOLLOWUP] Add `TABLE_VALUED_FUNCTION_TOO_MANY_TABLE_ARGUMENTS` error into doc - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 10:41:09 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #41440: [SPARK-43952][CORE][CONNECT][SQL] Add SparkContext APIs for query cancellation by tag - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/03 11:06:07 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41828: upgrade github action - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/03 11:16:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41016: [SPARK-43341][SQL] Patch StructType.toDDL not picking up on non-nullability of nested column - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 11:33:20 UTC, 0 replies.
- [GitHub] [spark] dillitz opened a new pull request, #41829: [SPARK-44275][CONNECT] Add configurable retry mechanism to Scala Spark Connect - posted by "dillitz (via GitHub)" <gi...@apache.org> on 2023/07/03 11:44:17 UTC, 0 replies.
- [GitHub] [spark] heyihong commented on a diff in pull request #41829: [SPARK-44275][CONNECT] Add configurable retry mechanism to Scala Spark Connect - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/07/03 11:59:27 UTC, 1 replies.
- [GitHub] [spark] dillitz commented on a diff in pull request #41829: [SPARK-44275][CONNECT] Add configurable retry mechanism to Scala Spark Connect - posted by "dillitz (via GitHub)" <gi...@apache.org> on 2023/07/03 12:07:18 UTC, 15 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #41315: [SPARK-43755][CONNECT] Move execution out of SparkExecutePlanStreamHandler and to a different thread - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/03 12:48:27 UTC, 3 replies.
- [GitHub] [spark] BramBoog commented on pull request #41016: [SPARK-43341][SQL] Patch StructType.toDDL not picking up on non-nullability of nested column - posted by "BramBoog (via GitHub)" <gi...@apache.org> on 2023/07/03 13:06:29 UTC, 0 replies.
- [GitHub] [spark] iemejia opened a new pull request, #41830: Upgrade to Avro 1.11.2 - posted by "iemejia (via GitHub)" <gi...@apache.org> on 2023/07/03 13:25:48 UTC, 0 replies.
- [GitHub] [spark] iemejia commented on pull request #41830: [SPARK-44277][BUILD] Upgrade to Avro 1.11.2 - posted by "iemejia (via GitHub)" <gi...@apache.org> on 2023/07/03 13:27:53 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #41606: [WIP] [SPARK-44061] Add assertDFEquality util function - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/03 13:46:39 UTC, 1 replies.
- [GitHub] [spark] nija-at commented on a diff in pull request #41829: [SPARK-44275][CONNECT] Add configurable retry mechanism to Scala Spark Connect - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/07/03 14:25:08 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #41748: [SPARK-44145][SQL] Callback when ready for execution - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/03 14:49:15 UTC, 1 replies.
- [GitHub] [spark] nija-at commented on a diff in pull request #41807: [SPARK-44263][CONNECT] Channel Builder support - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/07/03 15:27:39 UTC, 2 replies.
- [GitHub] [spark] wankunde commented on pull request #41786: [WIP][SPARK-44243][CORE] Add a parameter to determine the locality of local shuffle reader - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/07/03 15:29:48 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #41746: [SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/03 15:33:45 UTC, 4 replies.
- [GitHub] [spark] heyihong opened a new pull request, #41831: [SPARK-44278] Implement a GRPC server interceptor that cleans up thread local properties - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/07/03 16:04:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41687: [SPARK-44131][SQL] Add call_function and deprecate call_udf for Scala API - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/03 16:37:15 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41811: [SPARK-44266][SQL] Move Util.truncatedString to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/03 17:06:28 UTC, 0 replies.
- [GitHub] [spark] sandip-db opened a new pull request, #41832: [SPARK-44265][SQL] Built-in XML data source support - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2023/07/03 17:25:51 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41792: [SPARK-44249][SQL][PYTHON] Refactor PythonUDTFRunner to send its return type separately - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/03 18:02:10 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #41750: [SPARK-44200][SQL] Support TABLE argument parser rule for TableValuedFunction - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/03 18:27:59 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #41791: [WIP][SC-134831] POC for MSK IAM Support - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/03 18:34:03 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #41833: [WIP] Check approximate PySpark DF Equality - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/03 18:36:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41817: [SPARK-43851][SQL][FOLLOWUP] Move resolve LCA in grouping expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/03 19:03:15 UTC, 1 replies.
- [GitHub] [spark] anishshri-db commented on a diff in pull request #41791: [WIP][SC-134831] POC for MSK IAM Support - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/07/03 19:19:02 UTC, 6 replies.
- [GitHub] [spark] panchalhp-db commented on a diff in pull request #41801: [Spark Ticket Here]SSH Environment Manager - posted by "panchalhp-db (via GitHub)" <gi...@apache.org> on 2023/07/03 20:49:43 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #41683: [SPARK-36680][SQL] Supports Dynamic Table Options for Spark SQL - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/07/03 21:02:44 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41606: [WIP] [SPARK-44061] Add assertDFEqual util function - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/03 21:05:09 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #41834: [WIP] Make assertDFEqual to call pandas or PySpark util - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/03 21:05:59 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #41606: [WIP] [SPARK-44061] Add assertDFEqual util function - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/03 21:27:42 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #41835: [SPARK-44281][SQL] Move QueryCompilation error that used by DataType to sql/api as DataTypeErrors - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/03 21:49:55 UTC, 0 replies.
- [GitHub] [spark] nihalpot commented on a diff in pull request #41791: [WIP][SC-134831] POC for MSK IAM Support - posted by "nihalpot (via GitHub)" <gi...@apache.org> on 2023/07/03 22:01:32 UTC, 8 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #41836: [SPARK-44282][CONNECT] Prepare DataType parsing for use in Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/03 22:04:58 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41836: [SPARK-44282][CONNECT] Prepare DataType parsing for use in Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/03 22:06:41 UTC, 1 replies.
- [GitHub] [spark] sandip-db commented on pull request #41832: [SPARK-44265][SQL] Built-in XML data source support - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2023/07/03 22:11:47 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #41835: [SPARK-44281][SQL] Move QueryCompilation error that used by DataType to sql/api as DataTypeErrors - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/03 22:16:26 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #41837: [SPARK-44283][Connect] Move Origin to SQL/API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/03 22:33:27 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41837: [SPARK-44283][Connect] Move Origin to SQL/API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/03 22:33:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41835: [SPARK-44281][SQL] Move QueryCompilation error that used by DataType to sql/api as DataTypeErrors - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/03 22:35:29 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #41837: [SPARK-44283][Connect] Move Origin to SQL/API - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/03 22:44:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41749: [SPARK-44199] CacheManager refreshes the fileIndex unnecessarily - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/03 23:06:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41749: [SPARK-44199] CacheManager refreshes the fileIndex unnecessarily - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/03 23:07:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41800: [SPARK-44150][PYTHON][CONNECT] Explicit Arrow casting for mismatched return type in Arrow Python UDF - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 23:13:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41800: [SPARK-44150][PYTHON][CONNECT] Explicit Arrow casting for mismatched return type in Arrow Python UDF - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 23:14:14 UTC, 0 replies.
- [GitHub] [spark-connect-go] HyukjinKwon commented on a diff in pull request #12: [SPARK-44141] Removed need to have buf preinstalled - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 23:16:19 UTC, 0 replies.
- [GitHub] [spark-connect-go] arnarpall commented on a diff in pull request #12: [SPARK-44141] Removed need to have buf preinstalled - posted by "arnarpall (via GitHub)" <gi...@apache.org> on 2023/07/03 23:18:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41829: [SPARK-44275][CONNECT] Add configurable retry mechanism to Scala Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 23:26:10 UTC, 7 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41832: [SPARK-44265][SQL] Built-in XML data source support - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/03 23:39:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #41838: [SPARK-44284][CONNECT] Create simple conf system for sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/03 23:40:33 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #41838: [SPARK-44284][CONNECT] Create simple conf system for sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/03 23:41:11 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41838: [SPARK-44284][CONNECT] Create simple conf system for sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/03 23:43:16 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41830: [SPARK-44277][BUILD] Upgrade to Avro 1.11.2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 00:05:18 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41838: [SPARK-44284][CONNECT] Create simple conf system for sql/api - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 00:14:45 UTC, 4 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40551: [SPARK] Project implements ExposesMetadataColumns - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/04 00:24:24 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40550: [SPARK] LogicalPlan.metadataOutput always contains AttributeReference - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/04 00:24:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #39160: [SPARK-41667][K8S] Expose env var SPARK_DRIVER_POD_NAME in Driver Pod - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/04 00:24:27 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38357: [SPARK-40887][K8S] Allow Spark on K8s to integrate w/ Log Service - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/04 00:24:28 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37234: [SPARK-39822][PYTHON][PS] Provide a good feedback to users - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/04 00:24:29 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #41826: Skip some pyspark test on jdk21 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/04 00:48:34 UTC, 2 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #41839: [SPARK-44287][SQL] Define PartitionEvaluator API for RowToColumnarExec & ColumnarToRowExec SQL operators. - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/04 00:54:34 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on pull request #41606: [WIP] [SPARK-44061] Add assertDFEqual util function - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/04 00:54:59 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41826: Skip some pyspark test on jdk21 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/04 00:57:16 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #41839: [SPARK-44287][SQL] Define PartitionEvaluator API for RowToColumnarExec & ColumnarToRowExec SQL operators. - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/04 00:57:51 UTC, 0 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #41840: [SPARK-44288] Set the column family options before passing to DBOptions in RocksDB state store provider - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/07/04 01:02:58 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #41840: [SPARK-44288] Set the column family options before passing to DBOptions in RocksDB state store provider - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/07/04 01:03:22 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #41724: [SPARK-44210][CONNECT][SQL][PYTHON] Strengthen type checking and better comply with Connect specifications for `levenshtein` function - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/04 01:08:11 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #41840: [SPARK-44288] Set the column family options before passing to DBOptions in RocksDB state store provider - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/04 01:14:40 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #40981: [SPARK-43311][SS] Add RocksDB state store provider memory management enhancements - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/07/04 01:15:36 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #40981: [SPARK-43311][SS] Add RocksDB state store provider memory management enhancements - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/04 01:23:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #41841: [SPARK-44194][PYTHON][CORE] Add JobTag APIs to PySpark SparkContext - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 01:31:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41841: [SPARK-44194][PYTHON][CORE] Add JobTag APIs to PySpark SparkContext - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 01:31:16 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #41606: [WIP] [SPARK-44061] Add assertDFEqual util function - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/04 02:03:42 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41842: [MINOR] Eliminate maven build warnings: Using platform locale (en actually) to format date/time, i.e. build is platform dependent! - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/04 02:08:13 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41842: [MINOR] Eliminate maven build warnings: Using platform locale (en actually) to format date/time, i.e. build is platform dependent! - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/04 02:15:52 UTC, 1 replies.
- [GitHub] [spark] panbingkun closed pull request #41812: [SPARK-44267][PS][INFRA] Upgrade `pandas` to 2.0.3 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/04 02:18:01 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #41842: [MINOR] Eliminate maven build warnings: Using platform locale (en actually) to format date/time, i.e. build is platform dependent! - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/04 02:32:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41701: [SPARK-44146][CONNECT] Isolate Spark Connect Session jars and classfiles - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 02:49:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41625: [SPARK-44078][CONNECT][CORE] Add support for classloader/resource isolation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 03:01:13 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41811: [SPARK-44266][SQL] Move Util.truncatedString to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/04 03:03:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41811: [SPARK-44266][SQL] Move Util.truncatedString to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/04 03:04:26 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41825: [SPARK-44274][CONNECT] Move out util functions used by ArtifactManager to common/utils - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/04 03:04:49 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41825: [SPARK-44274][CONNECT] Move out util functions used by ArtifactManager to common/utils - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/04 03:05:11 UTC, 0 replies.
- [GitHub] [spark] wForget commented on pull request #41609: [SPARK-44065][SQL] Optimize BroadcastHashJoin skew when localShuffleReader is disabled - posted by "wForget (via GitHub)" <gi...@apache.org> on 2023/07/04 03:46:25 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41841: [SPARK-44194][PYTHON][CORE] Add JobTag APIs to PySpark SparkContext - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 03:47:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41838: [SPARK-44284][CONNECT] Create simple conf system for sql/api - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 03:50:38 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #41840: [SPARK-44288][SS] Set the column family options before passing to DBOptions in RocksDB state store provider - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/04 04:06:05 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #41840: [SPARK-44288][SS] Set the column family options before passing to DBOptions in RocksDB state store provider - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/04 04:06:47 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #41835: [SPARK-44281][SQL] Move QueryCompilation error that used by DataType to sql/api as DataTypeErrors - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/04 04:29:52 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41839: [SPARK-44287][SQL] Use PartitionEvaluator API for RowToColumnarExec & ColumnarToRowExec SQL operators. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/04 04:41:58 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on pull request #41816: [SPARK-44269][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2310-2314] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/04 04:58:03 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #41836: [SPARK-44282][CONNECT] Prepare DataType parsing for use in Spark Connect Scala Client - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/04 05:02:47 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #41838: [SPARK-44284][CONNECT] Create simple conf system for sql/api - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/04 05:04:49 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41816: [SPARK-44269][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2310-2314] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/04 05:07:23 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41816: [SPARK-44269][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2310-2314] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/04 05:08:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41606: [WIP] [SPARK-44061] Add assertDFEqual util function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 05:24:10 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41606: [WIP] [SPARK-44061] Add assertDFEqual util function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 05:27:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41606: [WIP] [SPARK-44061][PYTHON] Add assertDFEqual util function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 05:37:26 UTC, 6 replies.
- [GitHub] [spark] mingkangli-db opened a new pull request, #41843: [SPARK-44280][CORE] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "mingkangli-db (via GitHub)" <gi...@apache.org> on 2023/07/04 05:52:51 UTC, 0 replies.
- [GitHub] [spark] mingkangli-db commented on pull request #41843: [SPARK-44280][CORE] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "mingkangli-db (via GitHub)" <gi...@apache.org> on 2023/07/04 05:56:16 UTC, 1 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #41793: [SPARK-44250][ML][PYTHON][CONNECT] Implement classification evaluator - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/07/04 06:02:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41843: [SPARK-44280][CORE] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/04 06:03:08 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #41840: [SPARK-44288][SS] Set the column family options before passing to DBOptions in RocksDB state store provider - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/07/04 06:25:10 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #41625: [SPARK-44078][CONNECT][CORE] Add support for classloader/resource isolation - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/04 06:36:21 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #41609: [SPARK-44065][SQL] Optimize BroadcastHashJoin skew when localShuffleReader is disabled - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/04 06:41:21 UTC, 1 replies.
- [GitHub] [spark] vicennial opened a new pull request, #41844: [SPARK-44293][CONNECT] Fix invalid URI for custom JARs in Spark Connect - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/04 06:55:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41826: Skip some pyspark test on jdk21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 07:15:25 UTC, 3 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #41743: [SPARK-42554][CONNECT] Implement GRPC exceptions interception for conversion - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/04 07:29:21 UTC, 5 replies.
- [GitHub] [spark] nija-at opened a new pull request, #41845: [SPARK-44291update default schema for agnostic encoder - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/07/04 07:35:32 UTC, 0 replies.
- [GitHub] [spark] nija-at closed pull request #41845: [SPARK-44291update default schema for agnostic encoder - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/07/04 07:35:37 UTC, 0 replies.
- [GitHub] [spark] nija-at opened a new pull request, #41846: [SPARK-44291][CONNECT] Fix incorrect default schema for agnostic encoder - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/07/04 07:41:27 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #41847: [SPARK-44294][UI] Fix HeapHistogram column shows unexpectedly w/ select-all-box - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/04 07:57:18 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #41847: [SPARK-44294][UI] Fix HeapHistogram column shows unexpectedly w/ select-all-box - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/04 07:59:45 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41847: [SPARK-44294][UI] Fix HeapHistogram column shows unexpectedly w/ select-all-box - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/04 08:00:03 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41848: [SPARK-44295][BUILD] Upgrade `scala-parser-combinators` to 2.3.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 08:07:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41849: [SPARK-44296][BUILD] Upgrade dropwizard metrics 4.2.19 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 08:20:53 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41789: [SPARK-44246][CONNECT][FOLLOW-UP] Miscellaneous cleanups for Spark Connect Jar/Classfile Isolation - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 08:31:07 UTC, 4 replies.
- [GitHub] [spark] lxtwhu commented on pull request #33520: [SPARK-36289][SQL] Rewrite distinct count case when expressions without Expand node - posted by "lxtwhu (via GitHub)" <gi...@apache.org> on 2023/07/04 08:40:17 UTC, 0 replies.
- [GitHub] [spark] heyihong commented on a diff in pull request #41743: [SPARK-42554][CONNECT] Implement GRPC exceptions interception for conversion - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/07/04 08:40:42 UTC, 4 replies.
- [GitHub] [spark] beliefer opened a new pull request, #41850: [SPARK-44292][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2315-2319] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/04 08:56:12 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41851: Move SparkThrowableSuite to spark-common-utils - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/04 09:01:51 UTC, 0 replies.
- [GitHub] [spark] cdkrot commented on a diff in pull request #41807: [SPARK-44263][CONNECT] Channel Builder support - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/07/04 09:02:14 UTC, 4 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41817: [SPARK-43851][SQL][FOLLOWUP] Move resolve LCA in grouping expressions - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/04 09:06:05 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #41807: [SPARK-44263][CONNECT] Channel Builder support - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/04 09:12:15 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #41527: [SPARK-43879][CONNECT] Decouple handle command and send response on server side - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/04 09:12:26 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41687: [SPARK-44131][SQL] Add call_function and deprecate call_udf for Scala API - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/04 09:14:20 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41852: [SPARK-44297][CORE][TESTS] Make `ClassLoaderIsolationSuite` test pass with Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 09:20:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41852: [SPARK-44297][CORE][TESTS] Make `ClassLoaderIsolationSuite` test pass with Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 09:29:22 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41849: [SPARK-44296][BUILD] Upgrade dropwizard metrics 4.2.19 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 09:30:40 UTC, 2 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #41844: [SPARK-44293][CONNECT] Fix invalid URI for custom JARs in Spark Connect - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/04 09:45:27 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #41829: [SPARK-44275][CONNECT] Add configurable retry mechanism to Scala Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/04 10:13:05 UTC, 7 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41826: Skip some pyspark test on jdk21 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 10:16:30 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41789: [SPARK-44246][CONNECT][FOLLOW-UP] Miscellaneous cleanups for Spark Connect Jar/Classfile Isolation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 10:17:55 UTC, 0 replies.
- [GitHub] [spark] eejbyfeldt opened a new pull request, #41853: [SPARK-34612] Make outputDeterministicLevel a public API - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/07/04 10:46:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41848: [SPARK-44295][BUILD] Upgrade `scala-parser-combinators` to 2.3.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 11:06:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41848: [SPARK-44295][BUILD] Upgrade `scala-parser-combinators` to 2.3.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 11:07:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41844: [SPARK-44293][CONNECT] Fix invalid URI for custom JARs in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 11:08:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41844: [SPARK-44293][CONNECT] Fix invalid URI for custom JARs in Spark Connect - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 11:10:11 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41848: [SPARK-44295][BUILD] Upgrade `scala-parser-combinators` to 2.3.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 11:10:32 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #41826: [SPARK-44298][BUILD] Disable PySpark test on the daily test of Java 21 before the new arrow version release - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/04 11:21:12 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41852: [SPARK-44297][CORE][TESTS] Make `ClassLoaderIsolationSuite` test pass with Scala 2.13 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/04 12:24:40 UTC, 0 replies.
- [GitHub] [spark] ashangit commented on pull request #41806: [SPARK-44242] Improve Max Heap not set check - posted by "ashangit (via GitHub)" <gi...@apache.org> on 2023/07/04 12:40:47 UTC, 0 replies.
- [GitHub] [spark] vicennial opened a new pull request, #41854: [SPARK-44300][CONNECT][BUG-FIX] Fix artifact cleanup to limit deletion scope to session specific artifacts - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/04 13:43:47 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #41855: [SPARK-44262][SQL] Add `DropTable` to JdbcDialect - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/04 14:04:54 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41855: [SPARK-44262][SQL] Add `DropTable` to JdbcDialect - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/04 14:05:30 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #41606: [WIP] [SPARK-44061][PYTHON] Add assertDFEqual util function - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/04 14:28:36 UTC, 8 replies.
- [GitHub] [spark] LuciferYang closed pull request #41852: [SPARK-44297][CORE][TESTS] Make `ClassLoaderIsolationSuite` test pass with Scala 2.13 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 14:37:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #41849: [SPARK-44296][BUILD] Upgrade dropwizard metrics 4.2.19 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/04 14:38:58 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #41743: [SPARK-42554][CONNECT] Implement GRPC exceptions interception for conversion - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/04 16:05:17 UTC, 2 replies.
- [GitHub] [spark] oss-maker opened a new pull request, #41856: [SPARK-44301][SQL] Add Benchmark Suite for TPCH - posted by "oss-maker (via GitHub)" <gi...@apache.org> on 2023/07/04 16:54:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41850: [SPARK-44292][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2315-2319] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/04 17:08:09 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41850: [SPARK-44292][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2315-2319] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/04 17:08:58 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41156: [SPARK-40129][SQL] Fix Decimal multiply can produce the wrong answer - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/04 17:23:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41779: [SPARK-44236][SQL] Disable WholeStageCodegen when set `spark.sql.codegen.factoryMode` to NO_CODEGEN - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/04 17:26:10 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #41829: [SPARK-44275][CONNECT] Add configurable retry mechanism to Scala Spark Connect - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/04 19:39:42 UTC, 2 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #41743: [SPARK-42554][CONNECT] Implement GRPC exceptions interception for conversion - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/04 20:03:43 UTC, 1 replies.
- [GitHub] [spark] learningchess2003 opened a new pull request, #41857: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/07/04 21:40:07 UTC, 0 replies.
- [GitHub] [spark] learningchess2003 closed pull request #41857: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/07/04 21:50:09 UTC, 0 replies.
- [GitHub] [spark] heyihong commented on pull request #41743: [SPARK-42554][CONNECT] Implement GRPC exceptions interception for conversion - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/07/04 22:24:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41844: [SPARK-44293][CONNECT] Fix invalid URI for custom JARs in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 23:41:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41844: [SPARK-44293][CONNECT] Fix invalid URI for custom JARs in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 23:41:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41855: [SPARK-44262][SQL] Add `DropTable` to JdbcDialect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 23:44:40 UTC, 0 replies.
- [GitHub] [spark] cdkrot commented on pull request #41787: [SPARK-44245][PYTHON] pyspark.sql.dataframe doctests behave differently - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/07/04 23:45:20 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #41762: [SPARK-44215][SHUFFLE] If num chunks are 0, then server should throw a RuntimeException - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/04 23:46:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41787: [SPARK-44245][PYTHON] pyspark.sql.dataframe doctests behave differently - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/04 23:49:11 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #41711: [SPARK-44155] Adding a dev utility to improve error messages based on LLM - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/05 00:06:36 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #41606: [WIP] [SPARK-44061][PYTHON] Add assertDFEqual util function - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/05 01:30:41 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41156: [SPARK-40129][SQL] Fix Decimal multiply can produce the wrong answer - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/05 01:39:07 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #41826: [SPARK-44298][BUILD] Disable PySpark test on the daily test of Java 21 before the new arrow version release - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/05 02:01:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41826: [SPARK-44298][BUILD] Disable PySpark test on the daily test of Java 21 before the new arrow version release - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/05 02:02:19 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41855: [SPARK-44262][SQL] Add `DropTable` and `getInsertStatement` to JdbcDialect - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/05 02:39:40 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #41855: [SPARK-44262][SQL] Add `DropTable` and `getInsertStatement` to JdbcDialect - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/05 02:45:49 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41855: [SPARK-44262][SQL] Add `DropTable` and `getInsertStatement` to JdbcDialect - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/05 02:51:44 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41724: [SPARK-44210][CONNECT][SQL][PYTHON] Strengthen type checking and better comply with Connect specifications for `levenshtein` function - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/05 02:52:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41854: [SPARK-44300][CONNECT][BUG-FIX] Fix artifact cleanup to limit deletion scope to session specific artifacts - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 03:06:12 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41858: [SPARK-44299][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_227[4-6,8] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/05 03:06:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41854: [SPARK-44300][CONNECT][BUG-FIX] Fix artifact cleanup to limit deletion scope to session specific artifacts - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 03:06:44 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #41850: [SPARK-44292][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2315-2319] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/05 03:07:53 UTC, 0 replies.
- [GitHub] [spark] wankunde closed pull request #41786: [WIP][SPARK-44243][CORE] Add a parameter to determine the locality of local shuffle reader - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/07/05 03:44:32 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #40972: [SPARK-43301][CORE][SHUFFLE] BlockStoreClient getHostLocalDirs RPC supports IOException retry - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/05 03:52:42 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #41782: [SPARK-44239][SQL] Free memory allocated by large vectors when vectors are reset - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/07/05 03:53:01 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #41838: [SPARK-44284][CONNECT] Create simple conf system for sql/api - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/05 03:54:11 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #41821: [SPARK-44272][YARN] Path Inconsistency when Operating statCache within Yarn Client - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/05 03:58:48 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #41821: [SPARK-44272][YARN] Path Inconsistency when Operating statCache within Yarn Client - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/05 04:08:47 UTC, 10 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39937: [SPARK-42309][SQL] Introduce `INCOMPATIBLE_DATA_TO_TABLE` and sub classes. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/05 04:41:46 UTC, 1 replies.
- [GitHub] [spark] otterc opened a new pull request, #41859: [SPARK-44215][SHUFFLE] If num chunks are 0, then server should throw a RuntimeException - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/07/05 04:58:58 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #41839: [SPARK-44287][SQL] Use PartitionEvaluator API for RowToColumnarExec & ColumnarToRowExec SQL operators. - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/07/05 05:15:58 UTC, 2 replies.
- [GitHub] [spark] shuwang21 commented on a diff in pull request #41821: [SPARK-44272][YARN] Path Inconsistency when Operating statCache within Yarn Client - posted by "shuwang21 (via GitHub)" <gi...@apache.org> on 2023/07/05 05:41:50 UTC, 4 replies.
- [GitHub] [spark] wForget commented on pull request #41609: [SPARK-44065][SQL] Optimize BroadcastHashJoin skew in OptimizeSkewedJoin - posted by "wForget (via GitHub)" <gi...@apache.org> on 2023/07/05 06:06:24 UTC, 0 replies.
- [GitHub] [spark] maheshk114 opened a new pull request, #41860: SPARK-44307 : Bloom filter is not added for left outer join if the left side table is smaller than broadcast threshold. - posted by "maheshk114 (via GitHub)" <gi...@apache.org> on 2023/07/05 06:40:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41814: [SPARK-44259][CONNECT][TESTS][FOLLOWUP] No longer initializing `Ammonite` for Java 21 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 07:07:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41830: [SPARK-44277][BUILD] Upgrade to Avro 1.11.2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 07:11:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41814: [SPARK-44259][CONNECT][TESTS][FOLLOWUP] No longer initializing `Ammonite` for Java 21 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 07:18:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41814: [SPARK-44259][CONNECT][TESTS][FOLLOWUP] No longer initializing `Ammonite` for Java 21 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 07:19:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41847: [SPARK-44294][UI] Fix HeapHistogram column shows unexpectedly w/ select-all-box - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 07:24:54 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41843: [SPARK-44280][CORE] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 07:27:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41854: [SPARK-44300][CONNECT][BUG-FIX] Fix artifact cleanup to limit deletion scope to session specific artifacts - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/05 07:29:26 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on pull request #41854: [SPARK-44300][CONNECT] Fix artifact cleanup to limit deletion scope to session specific artifacts - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/05 07:38:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41847: [SPARK-44294][UI] Fix HeapHistogram column shows unexpectedly w/ select-all-box - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 07:41:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41859: [SPARK-44215][3.3][SHUFFLE] If num chunks are 0, then server should throw a RuntimeException - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/05 07:47:53 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41858: [SPARK-44299][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_227[4-6,8] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/05 07:55:02 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #41861: [SPARK-44309][UI] Display Add/Remove Time of Executors on Executors Tab - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/05 08:14:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41743: [SPARK-42554][CONNECT] Implement GRPC exceptions interception for conversion - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 08:43:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41743: [SPARK-42554][CONNECT] Implement GRPC exceptions interception for conversion - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 08:44:19 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #41527: [SPARK-43879][CONNECT] Decouple handle command and send response on server side - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/05 08:46:13 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #41527: [SPARK-43879][CONNECT] Decouple handle command and send response on server side - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/05 08:50:06 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #41829: [SPARK-44275][CONNECT] Add configurable retry mechanism to Scala Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/05 08:58:20 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41858: [SPARK-44299][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_227[4-6,8] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/05 09:22:59 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41862: Improve connect server start log - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/05 09:27:33 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #41863: [SPARK-44303][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2320-2324] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/05 09:36:35 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41863: [SPARK-44303][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2320-2324] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/05 10:18:46 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41575: [SPARK-38477][CORE] Use error class in org.apache.spark.shuffle - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/05 10:27:56 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41863: [SPARK-44303][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2320-2324] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/05 10:29:16 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 commented on a diff in pull request #41575: [SPARK-38477][CORE] Use error class in org.apache.spark.shuffle - posted by "bozhang2820 (via GitHub)" <gi...@apache.org> on 2023/07/05 10:32:26 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41856: [SPARK-44301][SQL] Add Benchmark Suite for TPCH - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/05 11:17:24 UTC, 5 replies.
- [GitHub] [spark] oss-maker commented on pull request #41856: [SPARK-44301][SQL] Add Benchmark Suite for TPCH - posted by "oss-maker (via GitHub)" <gi...@apache.org> on 2023/07/05 13:28:07 UTC, 5 replies.
- [GitHub] [spark] panbingkun commented on pull request #41862: [SPARK-44310][CONNECT] The Connect Server startup log should display the hostname and port - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/05 14:07:12 UTC, 2 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #41839: [SPARK-44287][SQL] Use PartitionEvaluator API for RowToColumnarExec & ColumnarToRowExec SQL operators. - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/05 14:11:57 UTC, 3 replies.
- [GitHub] [spark] learningchess2003 opened a new pull request, #41864: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/07/05 15:16:57 UTC, 0 replies.
- [GitHub] [spark] learningchess2003 commented on pull request #41864: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/07/05 15:35:04 UTC, 5 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #41865: [SPARK-44268][CORE][TEST][FOLLOWUP] Add test to generate `sql-error-conditions` doc automatic - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/05 15:49:08 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41865: [SPARK-44268][CORE][TEST][FOLLOWUP] Add test to generate `sql-error-conditions` doc automatic - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/05 15:49:51 UTC, 1 replies.
- [GitHub] [spark] otterc commented on pull request #41859: [SPARK-44215][3.3][SHUFFLE] If num chunks are 0, then server should throw a RuntimeException - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/07/05 16:00:48 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #41800: [SPARK-44150][PYTHON][CONNECT] Explicit Arrow casting for mismatched return type in Arrow Python UDF - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/05 16:06:55 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #41820: [SPARK-44271][SQL] Move default values functions from StructType to ResolveDefaultColumns - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/05 16:30:56 UTC, 0 replies.
- [GitHub] [spark] srinivasst commented on pull request #41856: [SPARK-44301][SQL] Add Benchmark Suite for TPCH - posted by "srinivasst (via GitHub)" <gi...@apache.org> on 2023/07/05 16:46:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #41847: [SPARK-44294][UI] Fix HeapHistogram column shows unexpectedly w/ select-all-box - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/05 16:48:58 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on pull request #41801: [Spark Ticket Here]SSH Environment Manager - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/05 16:53:04 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 closed pull request #41801: [Spark Ticket Here]SSH Environment Manager - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/05 16:53:04 UTC, 0 replies.
- [GitHub] [spark] anchovYu commented on a diff in pull request #41817: [SPARK-43851][SQL][FOLLOWUP] Move resolve LCA in grouping expressions - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/07/05 17:20:27 UTC, 1 replies.
- [GitHub] [spark] siying commented on pull request #41578: [SPARK-44044][SS] Improve Error message for Window functions with streaming - posted by "siying (via GitHub)" <gi...@apache.org> on 2023/07/05 17:24:28 UTC, 0 replies.
- [GitHub] [spark] anchovYu commented on a diff in pull request #41864: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/07/05 17:41:54 UTC, 15 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41606: [SPARK-44061][PYTHON] Add assertDFEqual util function - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/05 17:59:46 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #41606: [SPARK-44061][PYTHON] Add assertDataFrameEqual util function - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/05 18:46:15 UTC, 8 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #41705: [SPARK-44252][SS] Define a new error class and apply for the case where loading state from DFS fails - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/07/05 18:51:40 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41846: [SPARK-44291][SPARK-43416][CONNECT] Fix incorrect schema for range query - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/05 19:12:34 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #41846: [SPARK-44291][SPARK-43416][CONNECT] Fix incorrect schema for range query - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/05 19:13:08 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #41851: Move SparkThrowableSuite to spark-common-utils - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/05 19:15:22 UTC, 4 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41865: [SPARK-44268][CORE][TEST][FOLLOWUP] Add test to generate `sql-error-conditions` doc automatic - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/05 19:19:23 UTC, 1 replies.
- [GitHub] [spark] lucyyao-db commented on a diff in pull request #41705: [SPARK-44252][SS] Define a new error class and apply for the case where loading state from DFS fails - posted by "lucyyao-db (via GitHub)" <gi...@apache.org> on 2023/07/05 19:37:59 UTC, 12 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41864: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/05 19:39:20 UTC, 2 replies.
- [GitHub] [spark] hvanhovell closed pull request #41585: [WIP] Alternative for JoinWith - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/05 20:16:10 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41585: [WIP] Alternative for JoinWith - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/05 20:16:24 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #41836: [SPARK-44282][CONNECT] Prepare DataType parsing for use in Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/05 20:22:59 UTC, 0 replies.
- [GitHub] [spark] dillitz commented on pull request #41829: [SPARK-44275][CONNECT] Add configurable retry mechanism to Scala Spark Connect - posted by "dillitz (via GitHub)" <gi...@apache.org> on 2023/07/05 20:29:15 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #41835: [SPARK-44281][SQL] Move QueryCompilation error that used by DataType to sql/api as DataTypeErrors - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/05 20:40:40 UTC, 0 replies.
- [GitHub] [spark] learningchess2003 commented on a diff in pull request #41864: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/07/05 20:55:32 UTC, 23 replies.
- [GitHub] [spark] dillitz opened a new pull request, #41866: [SPARK-44312][CONNECT][PYTHON] Allow to set a user agent with an environment variable - posted by "dillitz (via GitHub)" <gi...@apache.org> on 2023/07/05 21:08:36 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #41864: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/05 21:38:05 UTC, 8 replies.
- [GitHub] [spark] jdesjean commented on a diff in pull request #41748: [SPARK-44145][SQL] Callback when ready for execution - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/07/05 21:50:06 UTC, 3 replies.
- [GitHub] [spark] maddiedawson commented on a diff in pull request #41778: [WIP] DeepspeedDistributor Class That Will Utilize the Deepspeed Launcher Boilerplate - posted by "maddiedawson (via GitHub)" <gi...@apache.org> on 2023/07/05 22:11:39 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39952: [SPARK-40770][PYTHON][FOLLOW-UP] Improved error messages for mapInPandas for schema mismatch - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/05 22:13:54 UTC, 4 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #41778: [WIP] DeepspeedDistributor Class That Will Utilize the Deepspeed Launcher Boilerplate - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/05 22:15:04 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41817: [SPARK-43851][SQL][FOLLOWUP] Move resolve LCA in grouping expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/05 22:30:25 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #41867: [SPARK-43964][SQL][PYTHON] Support arrow-optimized Python UDTFs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/05 22:52:22 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41711: [SPARK-44155] Adding a dev utility to improve error messages based on LLM - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/05 23:02:09 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #41347: [SPARK-43838][SQL] Fix subquery on single table with having clause can't be optimized - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/05 23:07:50 UTC, 0 replies.
- [GitHub] [spark] allisonport-db opened a new pull request, #41868: [SPARK-44313] Fix generated column expression validation when there is a char/varchar column in the schema - posted by "allisonport-db (via GitHub)" <gi...@apache.org> on 2023/07/05 23:12:08 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41606: [SPARK-44061][PYTHON] Add assertDataFrameEqual util function - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/05 23:26:31 UTC, 7 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #41606: [SPARK-44061][PYTHON] Add assertDataFrameEqual util function - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/05 23:36:07 UTC, 8 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41861: [SPARK-44309][UI] Display Add/Remove Time of Executors on Executors Tab - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 23:43:49 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41866: [SPARK-44312][CONNECT][PYTHON] Allow to set a user agent with an environment variable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 23:47:38 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41862: [SPARK-44310][CONNECT] The Connect Server startup log should display the hostname and port - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 23:52:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41862: [SPARK-44310][CONNECT] The Connect Server startup log should display the hostname and port - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/05 23:52:32 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #41706: [DO NOT MERGE] Pickle vs. Arrow Type Coercion Difference - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/06 00:23:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41623: [SPARK-44154] Implement bitmap functions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/06 00:38:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41623: [SPARK-44154] Implement bitmap functions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/06 00:39:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41836: [SPARK-44282][CONNECT] Prepare DataType parsing for use in Spark Connect Scala Client - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/06 00:46:28 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #41869: Revert "[SPARK-43851][SQL] Support LCA in grouping expressions" - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/06 00:58:20 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #41859: [SPARK-44215][3.3][SHUFFLE] If num chunks are 0, then server should throw a RuntimeException - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/06 01:41:46 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #41859: [SPARK-44215][3.3][SHUFFLE] If num chunks are 0, then server should throw a RuntimeException - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/06 01:50:38 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #41859: [SPARK-44215][3.3][SHUFFLE] If num chunks are 0, then server should throw a RuntimeException - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/06 01:50:39 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #41609: [SPARK-44065][SQL] Optimize BroadcastHashJoin skew in OptimizeSkewedJoin - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/06 01:58:13 UTC, 5 replies.
- [GitHub] [spark] ShreyeshArangath commented on pull request #41861: [SPARK-44309][UI] Display Add/Remove Time of Executors on Executors Tab - posted by "ShreyeshArangath (via GitHub)" <gi...@apache.org> on 2023/07/06 01:58:16 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41865: [SPARK-44268][CORE][TEST][FOLLOWUP] Add test to generate `sql-error-conditions` doc automatic - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/06 01:59:53 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #41870: [SPARK-44154][SQL][FOLLOWUP] Add `INVALID_BITMAP_POSITION ` error into doc - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/06 02:05:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41870: [SPARK-44154][SQL][FOLLOWUP] Add `INVALID_BITMAP_POSITION ` error into doc - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/06 02:06:21 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41861: [SPARK-44309][UI] Display Add/Remove Time of Executors on Executors Tab - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/06 02:06:47 UTC, 1 replies.
- [GitHub] [spark] liukuijian8040 commented on pull request #41162: [SPARK-43491][SQL] In expression should act as same as EqualTo when elements in IN expression have same DataType. - posted by "liukuijian8040 (via GitHub)" <gi...@apache.org> on 2023/07/06 02:09:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41870: [SPARK-44154][SQL][FOLLOWUP] Add `INVALID_BITMAP_POSITION` error into doc - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/06 02:11:51 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41870: [SPARK-44154][SQL][FOLLOWUP] Add `INVALID_BITMAP_POSITION` error into doc - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/06 02:12:11 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #41871: [DO-NOT-MERGE] Test CI with protobuf 4.21.6 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/06 02:25:43 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #41868: [SPARK-44313][SQL] Fix generated column expression validation when there is a char/varchar column in the schema - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/06 02:26:10 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41868: [SPARK-44313][SQL] Fix generated column expression validation when there is a char/varchar column in the schema - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/06 02:27:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41872: [SPARK-44314] Add a new checkstyle rule to prohibit the use of `@Test(expected = SomeException.class)` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/06 02:35:11 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #41705: [SPARK-44252][SS] Define a new error class and apply for the case where loading state from DFS fails - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/06 02:50:08 UTC, 5 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #41873: [SPARK-44315][SQL][CONNECT] Move DefinedByConstructorParams to sql/api - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/06 02:57:11 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #41873: [SPARK-44315][SQL][CONNECT] Move DefinedByConstructorParams to sql/api - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/06 02:57:33 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41874: [SPARK-44316][BUILD] Upgrade Jersey to 2.40 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/06 03:10:32 UTC, 0 replies.
- [GitHub] [spark] es94129 commented on a diff in pull request #41778: [WIP] DeepspeedDistributor Class That Will Utilize the Deepspeed Launcher Boilerplate - posted by "es94129 (via GitHub)" <gi...@apache.org> on 2023/07/06 03:18:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41872: [SPARK-44314][BUILD][CORE][TESTS] Add a new checkstyle rule to prohibit the use of `@Test(expected = ExpectedException.class)` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/06 03:27:07 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41872: [SPARK-44314][BUILD][CORE][TESTS] Add a new checkstyle rule to prohibit the use of `@Test(expected = ExpectedException.class)` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/06 03:27:12 UTC, 1 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #41875: [SPARK-44317][SQL] Use PartitionEvaluator API for ShuffledHashJoinExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/06 04:04:03 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #41875: [SPARK-44317][SQL] Use PartitionEvaluator API for ShuffledHashJoinExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/06 04:06:15 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41843: [SPARK-44280][CORE] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/06 04:07:22 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #41843: [SPARK-44280][CORE] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/07/06 04:12:01 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #41843: [SPARK-44280][CORE] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/07/06 04:14:52 UTC, 1 replies.
- [GitHub] [spark] nija-at commented on pull request #41866: [SPARK-44312][CONNECT][PYTHON] Allow to set a user agent with an environment variable - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/07/06 04:53:44 UTC, 0 replies.
- [GitHub] [spark] nija-at commented on a diff in pull request #41866: [SPARK-44312][CONNECT][PYTHON] Allow to set a user agent with an environment variable - posted by "nija-at (via GitHub)" <gi...@apache.org> on 2023/07/06 05:02:36 UTC, 1 replies.
- [GitHub] [spark] oss-maker commented on a diff in pull request #41860: SPARK-44307 : Bloom filter is not added for left outer join if the left side table is smaller than broadcast threshold. - posted by "oss-maker (via GitHub)" <gi...@apache.org> on 2023/07/06 05:31:29 UTC, 1 replies.
- [GitHub] [spark] eejbyfeldt opened a new pull request, #41876: [SPARK-44311][SQL] Improved support for UDFs on value classes - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/07/06 05:48:39 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on pull request #41606: [SPARK-44061][PYTHON] Add assertDataFrameEqual util function - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/06 06:22:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41606: [SPARK-44061][PYTHON] Add assertDataFrameEqual util function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/06 06:38:06 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41606: [SPARK-44061][PYTHON] Add assertDataFrameEqual util function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/06 06:58:58 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41340: [SPARK-44318][BUILD] Remove useless dependencies - javax.ws.rs-api - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/06 07:08:02 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41858: [SPARK-44299][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_227[4-6,8] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/06 07:08:10 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41858: [SPARK-44299][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_227[4-6,8] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/06 07:08:59 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #41858: [SPARK-44299][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_227[4-6,8] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/06 07:13:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41874: [SPARK-44316][BUILD] Upgrade Jersey to 2.40 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/06 07:16:05 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #41874: [SPARK-44316][BUILD] Upgrade Jersey to 2.40 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/06 07:25:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #41340: [SPARK-44318][BUILD] Remove useless dependencies - javax.ws.rs-api - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/06 07:30:41 UTC, 0 replies.
- [GitHub] [spark] ksn06 commented on pull request #40744: [SPARK-24497][SQL] Support recursive SQL - posted by "ksn06 (via GitHub)" <gi...@apache.org> on 2023/07/06 07:51:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41856: [SPARK-44301][SQL] Add Benchmark Suite for TPCH - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/06 08:06:30 UTC, 4 replies.
- [GitHub] [spark] oss-maker commented on a diff in pull request #41856: [SPARK-44301][SQL] Add Benchmark Suite for TPCH - posted by "oss-maker (via GitHub)" <gi...@apache.org> on 2023/07/06 08:18:51 UTC, 8 replies.
- [GitHub] [spark] peter-toth commented on pull request #40744: [SPARK-24497][SQL] Support recursive SQL - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/07/06 08:21:09 UTC, 0 replies.
- [GitHub] [spark] maheshk114 commented on a diff in pull request #41860: SPARK-44307 : [SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "maheshk114 (via GitHub)" <gi...@apache.org> on 2023/07/06 08:21:12 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41869: Revert "[SPARK-43851][SQL] Support LCA in grouping expressions" - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/06 08:25:03 UTC, 2 replies.
- [GitHub] [spark] dillitz commented on a diff in pull request #41866: [SPARK-44312][CONNECT][PYTHON] Allow to set a user agent with an environment variable - posted by "dillitz (via GitHub)" <gi...@apache.org> on 2023/07/06 08:25:37 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #41677: [SPARK-35564][SQL] Improve subexpression elimination - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/07/06 09:48:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41495: [SPARK-44290][CONNECT] Session-based files and archives in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/06 09:52:11 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #41877: [SPARK-43660][CONNECT][PS] Enable `resample` with Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/06 10:40:35 UTC, 0 replies.
- [GitHub] [spark] oss-maker commented on a diff in pull request #41860: SPARK-44307 : [SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "oss-maker (via GitHub)" <gi...@apache.org> on 2023/07/06 11:23:25 UTC, 0 replies.
- [GitHub] [spark] oss-maker commented on pull request #41860: SPARK-44307 : [SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "oss-maker (via GitHub)" <gi...@apache.org> on 2023/07/06 11:25:28 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #41863: [SPARK-44303][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2320-2324] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/06 11:33:50 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #41860: SPARK-44307 : [SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/06 11:41:22 UTC, 0 replies.
- [GitHub] [spark] shuwang21 commented on pull request #41440: [SPARK-43952][CORE][CONNECT][SQL] Add SparkContext APIs for query cancellation by tag - posted by "shuwang21 (via GitHub)" <gi...@apache.org> on 2023/07/06 12:01:30 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #41440: [SPARK-43952][CORE][CONNECT][SQL] Add SparkContext APIs for query cancellation by tag - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/06 12:18:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41869: Revert "[SPARK-43851][SQL] Support LCA in grouping expressions" - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/06 12:19:56 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41869: Revert "[SPARK-43851][SQL] Support LCA in grouping expressions" - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/06 12:20:55 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #41838: [SPARK-44284][CONNECT] Create simple conf system for sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/06 12:22:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41863: [SPARK-44303][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2320-2324] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/06 12:23:53 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41837: [SPARK-44283][CONNECT] Move Origin to SQL/API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/06 12:24:14 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41863: [SPARK-44303][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2320-2324] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/06 12:24:23 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #41837: [SPARK-44283][CONNECT] Move Origin to SQL/API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/06 12:25:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41575: [SPARK-38477][CORE] Use error class in org.apache.spark.shuffle - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/06 12:36:32 UTC, 2 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41575: [SPARK-38477][CORE] Use error class in org.apache.spark.shuffle - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/06 12:39:28 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #41878: [SPARK-44268][CORE][TEST][FOLLOWUP] Only print clue when assert doc failed - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/06 13:12:00 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41878: [SPARK-44268][CORE][TEST][FOLLOWUP] Only print clue when assert doc failed - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/06 13:12:12 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #41879: [SPARK-44321][CONNECT] Decouple ParseException from AnalysisException - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/06 13:18:56 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41879: [SPARK-44321][CONNECT] Decouple ParseException from AnalysisException - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/06 13:19:04 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41878: [SPARK-44268][CORE][TEST][FOLLOWUP] Only print clue when assert doc failed - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/06 13:31:23 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41879: [SPARK-44321][CONNECT] Decouple ParseException from AnalysisException - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/06 13:40:44 UTC, 0 replies.
- [GitHub] [spark] mingkangli-db commented on a diff in pull request #41843: [SPARK-44280][SQL] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "mingkangli-db (via GitHub)" <gi...@apache.org> on 2023/07/06 13:51:30 UTC, 7 replies.
- [GitHub] [spark] panbingkun commented on pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/06 13:55:53 UTC, 3 replies.
- [GitHub] [spark] LuciferYang closed pull request #41872: [SPARK-44314][BUILD][CORE][TESTS] Add a new checkstyle rule to prohibit the use of `@Test(expected = ExpectedException.class)` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/06 14:17:55 UTC, 0 replies.
- [GitHub] [spark] cdkrot opened a new pull request, #41880: [SPARK-44263][CONNECT] Custom Interceptors Support - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/07/06 14:31:06 UTC, 0 replies.
- [GitHub] [spark] cdkrot commented on pull request #41880: [SPARK-44263][CONNECT] Custom Interceptors Support - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/07/06 14:31:33 UTC, 1 replies.
- [GitHub] [spark] cdkrot commented on a diff in pull request #41880: [SPARK-44263][CONNECT] Custom Interceptors Support - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/07/06 14:34:35 UTC, 9 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #41881: [WIP][SPARK-43983][PYTHON][ML][CONNECT] Implement cross validator estimator - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/07/06 14:43:16 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #40474: [SPARK-42849] [WIP] [SQL] Session Variables - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/07/06 15:39:16 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #41874: [SPARK-44316][BUILD] Upgrade Jersey to 2.40 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/06 16:19:00 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #41874: [SPARK-44316][BUILD] Upgrade Jersey to 2.40 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/06 16:19:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #41874: [SPARK-44316][BUILD] Upgrade Jersey to 2.40 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/06 16:19:39 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #41495: [SPARK-44290][CONNECT] Session-based files and archives in Spark Connect - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/06 16:44:02 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41750: [SPARK-44200][SQL] Support TABLE argument parser rule for TableValuedFunction - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/06 16:46:22 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #41750: [SPARK-44200][SQL] Support TABLE argument parser rule for TableValuedFunction - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/06 16:52:32 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #41831: [SPARK-44278][CONNECT] Implement a GRPC server interceptor that cleans up thread local properties - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/06 17:04:40 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #41750: [SPARK-44200][SQL] Support TABLE argument parser rule for TableValuedFunction - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/06 17:08:49 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #41882: [SPARK-44324][SQL][CONNECT] Move CaseInsensitiveMap to sql/api - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/06 18:25:32 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #41882: [SPARK-44324][SQL][CONNECT] Move CaseInsensitiveMap to sql/api - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/06 18:25:50 UTC, 7 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #41880: [SPARK-44263][CONNECT] Custom Interceptors Support - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/06 19:37:18 UTC, 3 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #41883: [SPARK-44322][CONNECT] Make parser use SqlApiConf instead of SQLConf. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/06 19:57:51 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41883: [SPARK-44322][CONNECT] Make parser use SqlApiConf instead of SQLConf. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/06 19:58:05 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #41883: [SPARK-44322][CONNECT] Make parser use SqlApiConf instead of SQLConf. - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/06 20:02:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41843: [SPARK-44280][SQL] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/06 20:54:06 UTC, 6 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41614: [SPARK-44060][SQL] Code-gen for build side outer shuffled hash join - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/06 21:02:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41614: [SPARK-44060][SQL] Code-gen for build side outer shuffled hash join - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/06 21:02:45 UTC, 0 replies.
- [GitHub] [spark] maddiedawson commented on a diff in pull request #41770: [WIP] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "maddiedawson (via GitHub)" <gi...@apache.org> on 2023/07/06 21:15:33 UTC, 4 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40997: [SPARK-43321][Connect] Dataset#Joinwith - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/06 21:16:18 UTC, 2 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #41843: [SPARK-44280][SQL] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/07/06 21:22:49 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40997: [SPARK-43321][Connect] Dataset#Joinwith - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/06 21:41:05 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #40997: [SPARK-43321][Connect] Dataset#Joinwith - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/06 21:43:18 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41876: [SPARK-44311][SQL] Improved support for UDFs on value classes - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/06 21:44:35 UTC, 0 replies.
- [GitHub] [spark] szehon-ho commented on a diff in pull request #41614: [SPARK-44060][SQL] Code-gen for build side outer shuffled hash join - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/07/06 22:20:30 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #41884: [SPARK-44325][SQL]Use PartitionEvaluator API for SortMergeJoinExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/06 22:31:39 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on pull request #41770: [WIP] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/06 23:30:17 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #41770: [WIP] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/06 23:31:42 UTC, 22 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #41770: [WIP] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "lu-wang-dl (via GitHub)" <gi...@apache.org> on 2023/07/06 23:43:53 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #41877: [SPARK-43660][CONNECT][PS] Enable `resample` with Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/07 00:05:26 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #41871: [DO-NOT-MERGE] Test CI with protobuf 4.21.6 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/07 00:09:10 UTC, 0 replies.
- [GitHub] [spark] itholic closed pull request #41871: [DO-NOT-MERGE] Test CI with protobuf 4.21.6 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/07 00:09:11 UTC, 0 replies.
- [GitHub] [spark] jdesjean commented on pull request #41748: [SPARK-44145][SQL] Callback when ready for execution - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/07/07 00:31:11 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41606: [SPARK-44061][PYTHON] Add assertDataFrameEqual util function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/07 00:31:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41873: [SPARK-44315][SQL][CONNECT] Move DefinedByConstructorParams to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 00:35:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41873: [SPARK-44315][SQL][CONNECT] Move DefinedByConstructorParams to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 00:35:06 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #41885: [SPARK-44326][SQL][CONNECT] Move utils that are used from Scala client to the common modules - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/07 02:07:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41759: [SPARK-44206][SQL] DataSet.selectExpr scope Session.active - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 02:15:53 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41759: [SPARK-44206][SQL] DataSet.selectExpr scope Session.active - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/07 02:17:51 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #41885: [SPARK-44326][SQL][CONNECT] Move utils that are used from Scala client to the common modules - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/07 02:19:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41877: [SPARK-43660][CONNECT][PS] Enable `resample` with Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/07 02:24:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41877: [SPARK-43660][CONNECT][PS] Enable `resample` with Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/07 02:25:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #41886: [SPARK-44327][SQL][CONNECT] Add functions `any` and `len` to Scala - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/07 02:28:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41829: [SPARK-44275][CONNECT] Add configurable retry mechanism to Scala Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/07 02:55:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41829: [SPARK-44275][CONNECT] Add configurable retry mechanism to Scala Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/07 02:56:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41866: [SPARK-44312][CONNECT][PYTHON] Allow to set a user agent with an environment variable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/07 02:57:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41865: [SPARK-44268][CORE][TEST][FOLLOWUP] Add test to generate `sql-error-conditions` doc automatic - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/07 02:58:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41865: [SPARK-44268][CORE][TEST][FOLLOWUP] Add test to generate `sql-error-conditions` doc automatic - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/07 02:58:34 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X closed pull request #41878: [SPARK-44268][CORE][TEST][FOLLOWUP] Only print clue when assert doc failed - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/07 02:58:50 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41887: Align executor id - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/07 03:35:16 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41887: Align executor id - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/07 03:37:14 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #41888: [SPARK-44330][SQL] Use PartitionEvaluator API in BroadcastNestedLoopJoinExec & BroadcastHashJoinExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/07 03:46:55 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #41889: [SPARK-44328][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2325-2328] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/07 04:38:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41606: [SPARK-44061][PYTHON] Add assertDataFrameEqual util function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/07 05:08:05 UTC, 1 replies.
- [GitHub] [spark] eejbyfeldt commented on pull request #41876: [SPARK-44311][SQL] Improved support for UDFs on value classes - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/07/07 06:12:25 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41887: [SPARK-44332][CORE][WEBUI] The Executor ID should start with 1 when running on `Spark cluster of [N, cores, memory] locally` mode - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/07 06:35:05 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41887: [SPARK-44332][CORE][WEBUI] The Executor ID should start with 1 when running on `Spark cluster of [N, cores, memory] locally` mode - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/07 06:43:50 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41886: [SPARK-44327][SQL][CONNECT] Add functions `any` and `len` to Scala - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/07 07:19:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41886: [SPARK-44327][SQL][CONNECT] Add functions `any` and `len` to Scala - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/07 07:20:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41883: [SPARK-44322][CONNECT] Make parser use SqlApiConf instead of SQLConf. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/07 07:26:51 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #41883: [SPARK-44322][CONNECT] Make parser use SqlApiConf instead of SQLConf. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/07 07:29:31 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #41890: [SPARK-44333][CONNECT][SQL] Move EnhancedLogicalPlan out of ParserUtils - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/07 07:42:24 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41890: [SPARK-44333][CONNECT][SQL] Move EnhancedLogicalPlan out of ParserUtils - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/07 07:42:32 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41817: [SPARK-43851][SQL] Support LCA in grouping expressions - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/07 07:46:29 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #41861: [SPARK-44309][UI] Display Add/Remove Time of Executors on Executors Tab - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/07 08:06:56 UTC, 0 replies.
- [GitHub] [spark] surnaik commented on pull request #41856: [SPARK-44301][SQL] Add Benchmark Suite for TPCH - posted by "surnaik (via GitHub)" <gi...@apache.org> on 2023/07/07 08:26:37 UTC, 5 replies.
- [GitHub] [spark] mcdull-zhang commented on pull request #40419: [SPARK-42789][SQL] Rewrite multiple GetJsonObjects to a JsonTuple if their json expressions are the same - posted by "mcdull-zhang (via GitHub)" <gi...@apache.org> on 2023/07/07 09:03:35 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #41891: [SPARK-44334][SQL][UI] Status of execution w/ error and w/o jobs shall be FAILED not COMPLETED in SqlResource - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/07 09:44:47 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41891: [SPARK-44334][SQL][UI] Status of execution w/ error and w/o jobs shall be FAILED not COMPLETED in SqlResource - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/07 09:45:20 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on pull request #41880: [SPARK-44263][CONNECT] Custom Interceptors Support - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/07 10:34:27 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41856: [SPARK-44301][SQL] Add Benchmark Suite for TPCH - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/07 10:36:08 UTC, 6 replies.
- [GitHub] [spark] beliefer commented on pull request #41856: [SPARK-44301][SQL] Add Benchmark Suite for TPCH - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/07 10:36:27 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #41687: [SPARK-44131][SQL] Add call_function and deprecate call_udf for Scala API - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/07 10:38:47 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #41889: [SPARK-44328][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2325-2328] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/07 10:39:36 UTC, 2 replies.
- [GitHub] [spark] surnaik commented on a diff in pull request #41856: [SPARK-44301][SQL] Add Benchmark Suite for TPCH - posted by "surnaik (via GitHub)" <gi...@apache.org> on 2023/07/07 11:18:27 UTC, 1 replies.
- [GitHub] [spark] oss-maker commented on pull request #41860: [SPARK-44307] : [SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "oss-maker (via GitHub)" <gi...@apache.org> on 2023/07/07 12:19:41 UTC, 1 replies.
- [GitHub] [spark] fbiville commented on pull request #41855: [SPARK-44262][SQL] Add `dropTable` and `getInsertStatement` to JdbcDialect - posted by "fbiville (via GitHub)" <gi...@apache.org> on 2023/07/07 13:00:38 UTC, 0 replies.
- [GitHub] [spark] cdkrot commented on pull request #41807: [SPARK-44263][CONNECT] Channel Builder support - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/07/07 13:06:15 UTC, 0 replies.
- [GitHub] [spark] cdkrot closed pull request #41807: [SPARK-44263][CONNECT] Channel Builder support - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/07/07 13:06:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41889: [SPARK-44328][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2325-2328] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/07 14:26:22 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #41875: [SPARK-44317][SQL] Use PartitionEvaluator API in ShuffledHashJoinExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/07 14:41:55 UTC, 1 replies.
- [GitHub] [spark] vinodkc commented on pull request #41888: [SPARK-44330][SQL] Use PartitionEvaluator API in BroadcastNestedLoopJoinExec & BroadcastHashJoinExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/07 14:42:37 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #41884: [SPARK-44325][SQL] Use PartitionEvaluator API in SortMergeJoinExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/07 14:42:53 UTC, 0 replies.
- [GitHub] [spark] maheshk114 commented on pull request #41860: [SPARK-44307] : [SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "maheshk114 (via GitHub)" <gi...@apache.org> on 2023/07/07 14:48:21 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #41315: [SPARK-43755][CONNECT] Move execution out of SparkExecutePlanStreamHandler and to a different thread - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/07 15:09:04 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #41867: [SPARK-43964][SQL][PYTHON] Support arrow-optimized Python UDTFs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/07 16:17:46 UTC, 1 replies.
- [GitHub] [spark] jameslamb opened a new pull request, #41892: [MINOR][DOCS] clarify array_position return value - posted by "jameslamb (via GitHub)" <gi...@apache.org> on 2023/07/07 16:27:28 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41867: [SPARK-43964][SQL][PYTHON] Support arrow-optimized Python UDTFs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/07 17:18:33 UTC, 9 replies.
- [GitHub] [spark] hvanhovell closed pull request #41883: [SPARK-44322][CONNECT] Make parser use SqlApiConf instead of SQLConf. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/07 17:36:20 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #41890: [SPARK-44333][CONNECT][SQL] Move EnhancedLogicalPlan out of ParserUtils - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/07 17:37:07 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41885: [SPARK-44326][SQL][CONNECT] Move utils that are used from Scala client to the common modules - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/07 17:37:48 UTC, 0 replies.
- [GitHub] [spark] nihalpot commented on a diff in pull request #41791: [SPARK-44285] POC for MSK IAM Support - posted by "nihalpot (via GitHub)" <gi...@apache.org> on 2023/07/07 17:38:20 UTC, 8 replies.
- [GitHub] [spark] hvanhovell closed pull request #41885: [SPARK-44326][SQL][CONNECT] Move utils that are used from Scala client to the common modules - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/07 17:38:49 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #41867: [SPARK-43964][SQL][PYTHON] Support arrow-optimized Python UDTFs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/07 17:57:52 UTC, 5 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #41893: [SPARK-44335][SQL] SQL parser should have a SQLConf parameter instead of relying on the … - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 19:44:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41893: [SPARK-44335][SQL] SQL parser should have a SQLConf parameter instead of relying on the … - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 19:45:26 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41893: [SPARK-44335][SQL] SQL parser should have a SQLConf parameter instead of relying on the … - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 19:47:28 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41884: [SPARK-44325][SQL] Use PartitionEvaluator API in SortMergeJoinExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 19:50:24 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41888: [SPARK-44330][SQL] Use PartitionEvaluator API in BroadcastNestedLoopJoinExec & BroadcastHashJoinExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 19:51:49 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41875: [SPARK-44317][SQL] Use PartitionEvaluator API in ShuffledHashJoinExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 19:52:14 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #40419: [SPARK-42789][SQL] Rewrite multiple GetJsonObjects to a JsonTuple if their json expressions are the same - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/07 19:53:50 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40419: [SPARK-42789][SQL] Rewrite multiple GetJsonObjects to a JsonTuple if their json expressions are the same - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 19:54:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41839: [SPARK-44287][SQL] Use PartitionEvaluator API in RowToColumnarExec & ColumnarToRowExec SQL operators. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 19:58:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41839: [SPARK-44287][SQL] Use PartitionEvaluator API in RowToColumnarExec & ColumnarToRowExec SQL operators. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 19:58:41 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #41791: [SPARK-44285] POC for MSK IAM Support - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/07 20:25:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41867: [SPARK-43964][SQL][PYTHON] Support arrow-optimized Python UDTFs - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 20:38:34 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41347: [SPARK-43838][SQL] Fix subquery on single table with having clause can't be optimized - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/07 20:43:39 UTC, 11 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41864: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/07 21:22:13 UTC, 3 replies.
- [GitHub] [spark] asl3 closed pull request #41833: [WIP] Check approximate PySpark DF Equality - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/07 21:41:08 UTC, 0 replies.
- [GitHub] [spark] asl3 closed pull request #41769: [WIP] [SPARK-44216] Assert equality test message formatting - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/07 21:41:21 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] opened a new pull request, #41894: Bump h2 from 2.1.214 to 2.2.220 in /connector/connect/server - posted by "dependabot[bot] (via GitHub)" <gi...@apache.org> on 2023/07/07 22:00:42 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] opened a new pull request, #41895: Bump h2 from 2.1.214 to 2.2.220 in /sql/core - posted by "dependabot[bot] (via GitHub)" <gi...@apache.org> on 2023/07/07 22:08:17 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #41896: Add new pyspark_testing module, update GHA - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/07 22:53:25 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40268: [SPARK-42500][SQL] ConstantPropagation support more cases - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/08 00:22:10 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40419: [SPARK-42789][SQL] Rewrite multiple GetJsonObjects to a JsonTuple if their json expressions are the same - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/08 00:22:10 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40182: [SPARK-42588][SQL] Collapse two adjacent windows with the equivalent partition/order expressions in two withColumn() - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/08 00:22:12 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40093: [SPARK-42500][SQL] ConstantPropagation supports more cases - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/08 00:22:14 UTC, 0 replies.
- [GitHub] [spark] rangadi opened a new pull request, #41897: [SPARK-44337] Any fields set to 'Any.getDefaultInstance' cause parse errors - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/08 01:31:36 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #41897: [SPARK-44337] Any fields set to 'Any.getDefaultInstance' cause parse errors - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/08 01:32:30 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #41898: [SPARK-44338][SQL] Fix view schema mismatch error message - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/08 01:38:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41898: [SPARK-44338][SQL] Fix view schema mismatch error message - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/08 01:38:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41898: [SPARK-44338][SQL] Fix view schema mismatch error message - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/08 01:39:09 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #41875: [SPARK-44317][SQL] Use PartitionEvaluator API in ShuffledHashJoinExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/08 02:53:42 UTC, 3 replies.
- [GitHub] [spark] beliefer commented on pull request #41860: [SPARK-44307] : [SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/08 04:03:34 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41875: [SPARK-44317][SQL] Use PartitionEvaluator API in ShuffledHashJoinExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/08 04:15:25 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41606: [SPARK-44061][PYTHON] Add assertDataFrameEqual util function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/08 04:55:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41495: [SPARK-44290][CONNECT] Session-based files and archives in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/08 04:59:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41495: [SPARK-44290][CONNECT] Session-based files and archives in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/08 05:00:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41895: Bump h2 from 2.1.214 to 2.2.220 in /sql/core - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/08 05:03:57 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] commented on pull request #41895: Bump h2 from 2.1.214 to 2.2.220 in /sql/core - posted by "dependabot[bot] (via GitHub)" <gi...@apache.org> on 2023/07/08 05:04:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41894: Bump h2 from 2.1.214 to 2.2.220 in /connector/connect/server - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/08 05:04:02 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] commented on pull request #41894: Bump h2 from 2.1.214 to 2.2.220 in /connector/connect/server - posted by "dependabot[bot] (via GitHub)" <gi...@apache.org> on 2023/07/08 05:04:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41892: [MINOR][DOCS] clarify array_position return value - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/08 05:05:55 UTC, 0 replies.
- [GitHub] [spark] jameslamb commented on pull request #41892: [MINOR][DOCS] clarify array_position return value - posted by "jameslamb (via GitHub)" <gi...@apache.org> on 2023/07/08 05:28:12 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41884: [SPARK-44325][SQL] Use PartitionEvaluator API in SortMergeJoinExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/08 06:40:51 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #41899: [SPARK-44340][SQL] Define the computing logic through PartitionEvaluator API and use it in WindowGroupLimitExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/08 07:40:34 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #41900: [SPARK-44342][SQL] Replace SQLContext with SparkSession for GenTPCDSData - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/08 08:19:18 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #41900: [SPARK-44342][SQL] Replace SQLContext with SparkSession for GenTPCDSData - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/08 08:21:46 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #41887: [SPARK-44332][CORE][WEBUI] Fix the sorting error of Executor ID Column on Executors UI Page - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/08 11:32:47 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #41893: [SPARK-44335][SQL] SQL parser should have a SQLConf parameter instead of relying on the … - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/08 11:50:33 UTC, 1 replies.
- [GitHub] [spark] tedyu opened a new pull request, #41901: Reuse evaluator in ColumnarToRowExec#doExecute - posted by "tedyu (via GitHub)" <gi...@apache.org> on 2023/07/08 14:24:02 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #41901: Reuse evaluator in ColumnarToRowExec#doExecute - posted by "tedyu (via GitHub)" <gi...@apache.org> on 2023/07/08 14:25:16 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #41884: [SPARK-44325][SQL] Use PartitionEvaluator API in SortMergeJoinExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/08 17:50:24 UTC, 3 replies.
- [GitHub] [spark] hvanhovell closed pull request #41879: [SPARK-44321][CONNECT] Decouple ParseException from AnalysisException - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/08 19:10:56 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41902: [SPARK-44331][CONNECT][PYTHON] Add bitmap functions to Scala and Python - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/08 21:07:36 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #40831: [SPARK-43171][K8S] Support custom Unix username in Pod - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/07/08 23:12:08 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #41812: [SPARK-44267][PS][INFRA] Upgrade `pandas` to 2.0.3 - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/07/08 23:14:41 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on a diff in pull request #40574: [SPARK-42942][SQL] Support coalesce table cache stage partitions - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/07/08 23:19:40 UTC, 2 replies.
- [GitHub] [spark] holdenk commented on pull request #38852: [SPARK-41341][CORE] Wait shuffle fetch to finish when decommission executor - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/07/08 23:27:28 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on a diff in pull request #39907: [SPARK-42359][SQL] Support row skipping when reading CSV files - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/07/08 23:42:32 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #41765: [SPARK-43203][SQL][3.4] Move all Drop Table case to DataSource V2 - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/07/08 23:43:57 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40591: [SPARK-42950][CORE] Add exit code in SparkListenerApplicationEnd - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/09 00:26:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40363: [SPARK_42744] delete uploaded file when job finish for k8s - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/09 00:26:28 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40268: [SPARK-42500][SQL] ConstantPropagation support more cases - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/09 00:26:31 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40182: [SPARK-42588][SQL] Collapse two adjacent windows with the equivalent partition/order expressions in two withColumn() - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/09 00:26:32 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40093: [SPARK-42500][SQL] ConstantPropagation supports more cases - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/09 00:26:33 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37514: [SPARK-30628][SQL] Support Subquery partition pruning and DPP for V2 file source - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/09 00:26:34 UTC, 0 replies.
- [GitHub] [spark] Kimahriman closed pull request #37514: [SPARK-30628][SQL] Support Subquery partition pruning and DPP for V2 file source - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/07/09 02:37:05 UTC, 0 replies.
- [GitHub] [spark] koertkuipers opened a new pull request, #41903: [SPARK-44323][SQL] Do not allow options inside tuples to set to null - posted by "koertkuipers (via GitHub)" <gi...@apache.org> on 2023/07/09 03:40:52 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #38852: [SPARK-41341][CORE] Wait shuffle fetch to finish when decommission executor - posted by "warrenzhu25 (via GitHub)" <gi...@apache.org> on 2023/07/09 04:04:42 UTC, 0 replies.
- [GitHub] [spark] gdhuper opened a new pull request, #41904: [SPARK-43389] [PySpark, SQL] Added a null check for lineSep option - posted by "gdhuper (via GitHub)" <gi...@apache.org> on 2023/07/09 04:11:11 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41902: [SPARK-44331][CONNECT][PYTHON] Add bitmap functions to Scala and Python - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/09 04:38:12 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 opened a new pull request, #41905: [SPARK-44126][CORE] Shuffle migration failure count should not increase when target executor decommissioned - posted by "warrenzhu25 (via GitHub)" <gi...@apache.org> on 2023/07/09 04:42:05 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #41905: [SPARK-44126][CORE] Shuffle migration failure count should not increase when target executor decommissioned - posted by "warrenzhu25 (via GitHub)" <gi...@apache.org> on 2023/07/09 04:42:29 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 opened a new pull request, #41906: [SPARK-44345][CORE] Only log unknown shuffle map output as error when shuffle migration disabled - posted by "warrenzhu25 (via GitHub)" <gi...@apache.org> on 2023/07/09 04:49:58 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #41906: [SPARK-44345][CORE] Only log unknown shuffle map output as error when shuffle migration disabled - posted by "warrenzhu25 (via GitHub)" <gi...@apache.org> on 2023/07/09 04:50:20 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41907: [SPARK-44329][CONNECT][PYTHON] Add hll_sketch_agg, hll_union_agg, to_varchar, try_aes_decrypt to Scala and Python - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/09 05:11:02 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #41884: [SPARK-44325][SQL] Use PartitionEvaluator API in SortMergeJoinExec - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/07/09 06:34:45 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #41905: [SPARK-44126][CORE] Shuffle migration failure count should not increase when target executor decommissioned - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/09 07:27:00 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #41906: [SPARK-44345][CORE] Only log unknown shuffle map output as error when shuffle migration disabled - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/09 07:38:32 UTC, 3 replies.
- [GitHub] [spark] Yikun commented on pull request #40831: [SPARK-43171][K8S] Support custom Unix username in Pod - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/07/09 10:27:26 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #41908: [WIP][PS] Pandas 1.5.3 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/09 10:29:48 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41575: [SPARK-38477][CORE] Use error class in org.apache.spark.shuffle - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/09 12:52:41 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41909: [SPARK-44320][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1067,1150,1220,1265,1277] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/09 13:12:30 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #41909: [SPARK-44320][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1067,1150,1220,1265,1277] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/09 13:15:47 UTC, 1 replies.
- [GitHub] [spark] koertkuipers commented on pull request #41903: [SPARK-44323][SQL] Do not allow options inside tuples to set to null - posted by "koertkuipers (via GitHub)" <gi...@apache.org> on 2023/07/09 15:35:12 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #41904: [SPARK-43389] [PySpark, SQL] Added a null check for lineSep option - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/09 15:55:59 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #41808: [SPARK-44162][CORE] Support G1GC in spark metrics - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/09 15:57:24 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39907: [SPARK-42359][SQL] Support row skipping when reading CSV files - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/09 15:57:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40591: [SPARK-42950][CORE] Add exit code in SparkListenerApplicationEnd - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/10 00:23:22 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40363: [SPARK_42744] delete uploaded file when job finish for k8s - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/10 00:23:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41881: [WIP][SPARK-43983][PYTHON][ML][CONNECT] Implement cross validator estimator - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 00:35:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41902: [SPARK-44331][CONNECT][PYTHON] Add bitmap functions to Scala and Python - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 00:37:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41902: [SPARK-44331][CONNECT][PYTHON] Add bitmap functions to Scala and Python - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 00:37:34 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41907: [SPARK-44329][CONNECT][PYTHON] Add hll_sketch_agg, hll_union_agg, to_varchar, try_aes_decrypt to Scala and Python - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/10 00:58:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41907: [SPARK-44329][CONNECT][PYTHON] Add hll_sketch_agg, hll_union_agg, to_varchar, try_aes_decrypt to Scala and Python - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 01:08:07 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41907: [SPARK-44329][CONNECT][PYTHON] Add hll_sketch_agg, hll_union_agg, to_varchar, try_aes_decrypt to Scala and Python - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 01:09:43 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41910: [SPARK-44347][BUILD] Upgrade janino to 3.1.10 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/10 01:10:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #41911: [SPARK-43710][PS][FOLLOWUP] Fix `date_part` invocations - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 01:17:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41892: [MINOR][DOCS] clarify array_position return value - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 01:43:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41892: [MINOR][DOCS] clarify array_position return value - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 01:43:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41910: [SPARK-44347][BUILD] Upgrade janino to 3.1.10 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 01:45:40 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #41900: [SPARK-44342][SQL] Replace SQLContext with SparkSession for GenTPCDSData - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/10 02:01:33 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41900: [SPARK-44342][SQL] Replace SQLContext with SparkSession for GenTPCDSData - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/10 02:02:08 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #41893: [SPARK-44335][SQL] SQL parser should have a SQLConf parameter instead of relying on the … - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/10 02:28:56 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39937: [SPARK-42309][SQL] Introduce `INCOMPATIBLE_DATA_TO_TABLE` and sub classes. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/10 02:39:40 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #41912: [DO-NOT-MERGE] Test pandas 1.5.3 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/10 02:46:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41902: [SPARK-44331][CONNECT][PYTHON] Add bitmap functions to Scala and Python - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 03:04:47 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41907: [SPARK-44329][CONNECT][PYTHON] Add hll_sketch_agg, hll_union_agg, to_varchar, try_aes_decrypt to Scala and Python - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 03:06:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41882: [SPARK-44324][SQL][CONNECT] Move CaseInsensitiveMap to sql/api - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 03:26:43 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #41913: [SPARK-44290][CONNECT][FOLLOW-UP] Skip flaky tests, and fix a typo in session UUID together - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 04:25:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #41914: [SPARK-44349][R] Add math functions to SparkR - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 04:39:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41910: [SPARK-44347][BUILD] Upgrade janino to 3.1.10 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 05:01:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41912: [DO-NOT-MERGE] Test pandas 1.5.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 05:02:47 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41673: [SPARK-44091][YARN][TESTS] Introduce `withResourceTypes` to `ResourceRequestTestHelper` to restore `resourceTypes` as default value after testing - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 05:06:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41911: [SPARK-43710][PS][FOLLOWUP] Fix `date_part` invocations - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 05:29:01 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41911: [SPARK-43710][PS][FOLLOWUP] Fix `date_part` invocations - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 05:29:19 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #41912: [DO-NOT-MERGE] Test pandas 1.5.3 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/10 05:36:50 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41912: [DO-NOT-MERGE] Test pandas 1.5.3 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 05:46:23 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #41913: [SPARK-44290][CONNECT][FOLLOW-UP] Skip flaky tests, and fix a typo in session UUID together - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/10 05:47:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41913: [SPARK-44290][CONNECT][FOLLOW-UP] Skip flaky tests, and fix a typo in session UUID together - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 06:01:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41913: [SPARK-44290][CONNECT][FOLLOW-UP] Skip flaky tests, and fix a typo in session UUID together - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 06:01:46 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41889: [SPARK-44328][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2325-2328] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/10 06:18:41 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41889: [SPARK-44328][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2325-2328] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/10 06:19:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41915: Make some syntactic simplification - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 06:23:23 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41916: [SPARK-44350][BUILD] Upgrade sbt to 1.9.2 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/10 06:24:25 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #41917: [SPARK-44194][DOCS][FOLLOWUP] Add missing `versionadded` annotations for JobTag APIs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 06:25:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41910: [SPARK-44347][BUILD] Upgrade janino to 3.1.10 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 06:31:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41908: [WIP][PS] Pandas 1.5.3 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 06:31:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41905: [SPARK-44126][CORE] Shuffle migration failure count should not increase when target executor decommissioned - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 06:32:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41904: [SPARK-43389][SQL] Added a null check for lineSep option - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 06:35:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41897: [SPARK-44337][PROTOBUF] Any fields set to 'Any.getDefaultInstance' cause parse errors - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 06:39:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41897: [SPARK-44337][PROTOBUF] Any fields set to 'Any.getDefaultInstance' cause parse errors - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 06:40:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41891: [SPARK-44334][SQL][UI] Status of execution w/ error and w/o jobs shall be FAILED not COMPLETED in SqlResource - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 06:40:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41812: [SPARK-44267][PS][INFRA] Upgrade `pandas` to 2.0.3 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 06:43:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #41918: [DO_NOT_MERGE][INFRA] test dockerfile - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 06:43:49 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41909: [SPARK-44320][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1067,1150,1220,1265,1277] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/10 07:04:44 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41690: [SPARK-44133][PYTHON] Upgrade MyPy from 0.920 to 0.982 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 07:17:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41919: Test RTools - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 07:48:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41690: [SPARK-44133][PYTHON] Upgrade MyPy from 0.920 to 0.982 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 08:06:10 UTC, 3 replies.
- [GitHub] [spark] itholic closed pull request #41912: [DO-NOT-MERGE] Test pandas 1.5.3 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/10 08:30:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41867: [SPARK-43964][SQL][PYTHON] Support arrow-optimized Python UDTFs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 08:30:32 UTC, 9 replies.
- [GitHub] [spark] itholic commented on pull request #41908: [WIP][PS] Pandas 1.5.3 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/10 08:59:37 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #41918: [DO_NOT_MERGE][INFRA] test dockerfile - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/07/10 09:02:05 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41917: [SPARK-44194][DOCS][FOLLOWUP] Add missing `versionadded` annotations for JobTag APIs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 09:20:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41917: [SPARK-44194][DOCS][FOLLOWUP] Add missing `versionadded` annotations for JobTag APIs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 09:20:46 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39952: [SPARK-40770][PYTHON][FOLLOW-UP] Improved error messages for mapInPandas for schema mismatch - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/07/10 09:37:31 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41918: [DO_NOT_MERGE][INFRA] test dockerfile - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 09:46:35 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #41899: [SPARK-44340][SQL] Define the computing logic through PartitionEvaluator API and use it in WindowGroupLimitExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/10 10:03:10 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41687: [SPARK-44131][SQL] Add call_function and deprecate call_udf for Scala API - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 10:11:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41687: [SPARK-44131][SQL] Add call_function and deprecate call_udf for Scala API - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/10 10:12:00 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41820: [SPARK-44271][SQL] Move default values functions from StructType to ResolveDefaultColumns - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/10 10:22:17 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #41820: [SPARK-44271][SQL] Move default values functions from StructType to ResolveDefaultColumns - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/10 10:23:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41919: Test RTools - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 11:10:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #41919: Test RTools - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 11:10:36 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #41881: [WIP][SPARK-43983][PYTHON][ML][CONNECT] Implement cross validator estimator - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/07/10 11:14:38 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #41920: [SPARK-44343][CONNECT] Prepare ScalaReflection to the move to SQL/API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/10 11:19:50 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41920: [SPARK-44343][CONNECT] Prepare ScalaReflection to the move to SQL/API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/10 11:19:58 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41887: [SPARK-44332][CORE][WEBUI] Fix the sorting error of Executor ID Column on Executors UI Page - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 11:45:03 UTC, 0 replies.
- [GitHub] [spark] ramon-garcia closed pull request #41717: [SPARK-44165] Add support for TIME columns in Parquet files - posted by "ramon-garcia (via GitHub)" <gi...@apache.org> on 2023/07/10 11:57:39 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #41921: [SPARK-44352][CONNECT] Put back sameType and friends in DataType. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/10 12:00:22 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41921: [SPARK-44352][CONNECT] Put back sameType and friends in DataType. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/10 12:00:29 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #39952: [SPARK-40770][PYTHON][FOLLOW-UP] Improved error messages for mapInPandas for schema mismatch - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/07/10 12:16:04 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #41922: [WIP][SQL] Support `WITH ... INSERT INTO` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/10 12:48:12 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #41342: [SPARK-43829][CONNECT] Improve SparkConnectPlanner by reuse Dataset and avoid construct new Dataset - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/10 13:00:31 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 opened a new pull request, #41923: [SPARK-38476][CORE] Use error class in org.apache.spark.storage - posted by "bozhang2820 (via GitHub)" <gi...@apache.org> on 2023/07/10 13:04:02 UTC, 0 replies.
- [GitHub] [spark] xy2953396112 commented on pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "xy2953396112 (via GitHub)" <gi...@apache.org> on 2023/07/10 13:20:54 UTC, 0 replies.
- [GitHub] [spark] jiangxb1987 commented on a diff in pull request #41905: [SPARK-44126][CORE] Shuffle migration failure count should not increase when target executor decommissioned - posted by "jiangxb1987 (via GitHub)" <gi...@apache.org> on 2023/07/10 13:40:15 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41347: [SPARK-43838][SQL] Fix subquery on single table with having clause can't be optimized - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/10 13:54:32 UTC, 14 replies.
- [GitHub] [spark] LuciferYang closed pull request #41915: [SPARK-44351][SQL] Make some syntactic simplification - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 15:05:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41915: [SPARK-44351][SQL] Make some syntactic simplification - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/10 15:05:42 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #41924: Add support for List[Row] data type for expected - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/10 15:43:51 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #41896: Add new pyspark_testing module, update GHA - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/10 17:34:13 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen closed pull request #41908: [WIP][PS] Pandas 1.5.3 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/10 17:44:50 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #41916: [SPARK-44350][BUILD] Upgrade sbt to 1.9.2 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/10 18:29:45 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #41916: [SPARK-44350][BUILD] Upgrade sbt to 1.9.2 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/10 18:30:07 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #41899: [SPARK-44340][SQL] Define the computing logic through PartitionEvaluator API and use it in WindowGroupLimitExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/10 18:32:20 UTC, 1 replies.
- [GitHub] [spark] hvanhovell closed pull request #41920: [SPARK-44343][CONNECT] Prepare ScalaReflection to the move to SQL/API - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/10 18:45:17 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #41921: [SPARK-44352][CONNECT] Put back sameType and friends in DataType. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/10 18:46:32 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #41746: [SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/07/10 18:57:37 UTC, 5 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #41925: [SPARK-44353][CONNECT][SQL] Remove StructType.toAttributes - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/10 19:26:23 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41925: [SPARK-44353][CONNECT][SQL] Remove StructType.toAttributes - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/10 19:26:33 UTC, 1 replies.
- [GitHub] [spark] asl3 opened a new pull request, #41926: [SPARK-44363] [PYTHON] Display percent of unequal rows in DataFrame comparison - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/10 19:51:40 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #41770: [Spark Ticket For This Component Here] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/10 19:55:55 UTC, 15 replies.
- [GitHub] [spark] warrenzhu25 commented on a diff in pull request #41906: [SPARK-44345][CORE] Only log unknown shuffle map output as error when shuffle migration disabled - posted by "warrenzhu25 (via GitHub)" <gi...@apache.org> on 2023/07/10 20:06:56 UTC, 0 replies.
- [GitHub] [spark] maddiedawson commented on a diff in pull request #41770: [Spark Ticket For This Component Here] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "maddiedawson (via GitHub)" <gi...@apache.org> on 2023/07/10 20:35:44 UTC, 3 replies.
- [GitHub] [spark] asl3 opened a new pull request, #41927: [SPARK-44216] [PYTHON] Make assertSchemaEqual API with ignore_nullable optional flag - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/10 20:47:55 UTC, 0 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #41770: [Spark Ticket For This Component Here] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "lu-wang-dl (via GitHub)" <gi...@apache.org> on 2023/07/10 22:05:02 UTC, 3 replies.
- [GitHub] [spark] rangadi commented on pull request #41897: [SPARK-44337][PROTOBUF] Any fields set to 'Any.getDefaultInstance' cause parse errors - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/10 22:39:35 UTC, 0 replies.
- [GitHub] [spark] agubichev commented on a diff in pull request #41439: [SPARK-43778][SQL]Reassign expression IDs in the scalar subquery input - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2023/07/10 22:41:00 UTC, 5 replies.
- [GitHub] [spark] rangadi commented on pull request #41791: [SPARK-44285] MSK IAM Support - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/10 22:42:42 UTC, 0 replies.
- [GitHub] [spark] nihalpot commented on pull request #41791: [SPARK-44285] MSK IAM Support - posted by "nihalpot (via GitHub)" <gi...@apache.org> on 2023/07/10 22:52:26 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #41881: [SPARK-43983][PYTHON][ML][CONNECT] Implement cross validator estimator - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/07/10 23:00:33 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #41928: [WIP] Move parser and data type to sql/api - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/10 23:05:27 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on pull request #41927: [SPARK-44216] [PYTHON] Make assertSchemaEqual API with ignore_nullable optional flag - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/10 23:42:25 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #41867: [SPARK-43964][SQL][PYTHON] Support arrow-optimized Python UDTFs - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/10 23:48:26 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41927: [SPARK-44216] [PYTHON] Make assertSchemaEqual API with ignore_nullable optional flag - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/10 23:49:31 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41864: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/11 00:06:52 UTC, 13 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #41929: [DO-NOT-MERGE] Check CI failures - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 00:08:14 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #41887: [SPARK-44332][CORE][WEBUI] Fix the sorting error of Executor ID Column on Executors UI Page - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/11 00:16:06 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #41887: [SPARK-44332][CORE][WEBUI] Fix the sorting error of Executor ID Column on Executors UI Page - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/11 00:16:16 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on a diff in pull request #41752: [SPARK-44201][CONNECT][SS]Add support for Streaming Listener in Scala for Spark Connect - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/11 00:17:48 UTC, 6 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #41930: [SPARK-44360][SQL] Support schema pruning in delta-based MERGE operations - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/07/11 00:20:09 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #41439: [SPARK-43778][SQL]Reassign expression IDs in the scalar subquery input - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/11 00:26:09 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41469: [SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/11 00:50:13 UTC, 4 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #41770: [SPARK-44264] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "lu-wang-dl (via GitHub)" <gi...@apache.org> on 2023/07/11 01:00:08 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41884: [SPARK-44325][SQL] Use PartitionEvaluator API in SortMergeJoinExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/11 01:01:08 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41687: [SPARK-44131][SQL] Add call_function and deprecate call_udf for Scala API - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/11 01:03:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41929: [DO-NOT-MERGE] Check CI failure - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 01:07:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41893: [SPARK-44335][SQL] SQL parser should have a SQLConf parameter instead of relying on the active conf - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/11 01:07:48 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41901: Reuse evaluator in ColumnarToRowExec#doExecute - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/11 01:08:15 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #41770: [SPARK-44264] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/11 01:25:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41469: [SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/11 01:46:53 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41469: [SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/11 01:47:25 UTC, 2 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #41893: [SPARK-44335][SQL] SQL parser should have a SQLConf parameter instead of relying on the active conf - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/11 01:55:46 UTC, 0 replies.
- [GitHub] [spark] zml1206 commented on a diff in pull request #41893: [SPARK-44335][SQL] SQL parser should have a SQLConf parameter instead of relying on the active conf - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/07/11 02:11:55 UTC, 2 replies.
- [GitHub] [spark] tedyu commented on pull request #41901: [SPARK-44287] Reuse evaluator in ColumnarToRowExec#doExecute - posted by "tedyu (via GitHub)" <gi...@apache.org> on 2023/07/11 02:13:48 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41927: [SPARK-44216] [PYTHON] Make assertSchemaEqual API with ignore_nullable optional flag - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/11 02:36:00 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41469: [SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/11 02:37:40 UTC, 1 replies.
- [GitHub] [spark] wangyum closed pull request #41809: [SPARK-44251][SQL] Set nullable correctly on coalesced join key in full outer USING join - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/11 03:19:16 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #41809: [SPARK-44251][SQL] Set nullable correctly on coalesced join key in full outer USING join - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/11 03:21:54 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41899: [SPARK-44340][SQL] Define the computing logic through PartitionEvaluator API and use it in WindowGroupLimitExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/11 03:34:37 UTC, 0 replies.
- [GitHub] [spark-connect-go] hiboyang opened a new pull request, #13: [SPARK-44368] Support Repartition and RepartitionByRange in Spark Connect Go Client - posted by "hiboyang (via GitHub)" <gi...@apache.org> on 2023/07/11 03:36:23 UTC, 0 replies.
- [GitHub] [spark-connect-go] hiboyang commented on pull request #13: [SPARK-44368] Support Repartition and RepartitionByRange in Spark Connect Go Client - posted by "hiboyang (via GitHub)" <gi...@apache.org> on 2023/07/11 03:37:55 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41926: [SPARK-44363] [PYTHON] Display percent of unequal rows in DataFrame comparison - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 03:48:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41926: [SPARK-44363] [PYTHON] Display percent of unequal rows in DataFrame comparison - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 03:49:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41896: [SPARK-44357] [PYTHON] Add pyspark_testing module for GHA tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 03:51:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41896: [SPARK-44357] [PYTHON] Add pyspark_testing module for GHA tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 03:51:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 03:57:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41860: [SPARK-44307][SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 03:58:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41901: [SPARK-44287][SQL] Reuse evaluator in ColumnarToRowExec#doExecute - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 03:58:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41770: [SPARK-44264][ML][PYTHON] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 04:04:15 UTC, 7 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/11 04:09:52 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41554: [SPARK-43781][SQL] Fix IllegalStateException when cogrouping two datasets derived from the same source - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/11 04:11:22 UTC, 0 replies.
- [GitHub] [spark] datavisortedyu commented on pull request #41901: [SPARK-44287][SQL] Reuse evaluator in ColumnarToRowExec#doExecute - posted by "datavisortedyu (via GitHub)" <gi...@apache.org> on 2023/07/11 04:14:24 UTC, 2 replies.
- [GitHub] [spark] tedyu commented on pull request #41901: [SPARK-44287][SQL] Reuse evaluator in ColumnarToRowExec#doExecute - posted by "tedyu (via GitHub)" <gi...@apache.org> on 2023/07/11 04:42:50 UTC, 0 replies.
- [GitHub] [spark] tedyu closed pull request #41901: [SPARK-44287][SQL] Reuse evaluator in ColumnarToRowExec#doExecute - posted by "tedyu (via GitHub)" <gi...@apache.org> on 2023/07/11 04:43:00 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #41931: [SPARK-43665][CONNECT][PS] Enable PandasSQLStringFormatter.vformat to work with Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/11 04:48:28 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #41931: [SPARK-43665][CONNECT][PS] Enable PandasSQLStringFormatter.vformat to work with Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/11 04:48:39 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41914: [SPARK-44349][R] Add math functions to SparkR - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/11 04:52:12 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41931: [SPARK-43665][CONNECT][PS] Enable PandasSQLStringFormatter.vformat to work with Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/11 04:56:41 UTC, 4 replies.
- [GitHub] [spark] beliefer opened a new pull request, #41932: [SPARK-44131][SQL][FOLLOWUP] Support qualified function name for call_function - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/11 05:24:58 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #41860: [SPARK-44307][SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/11 05:31:37 UTC, 4 replies.
- [GitHub] [spark] beliefer commented on pull request #41932: [SPARK-44131][SQL][FOLLOWUP] Support qualified function name for call_function - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/11 05:33:46 UTC, 3 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #41931: [SPARK-43665][CONNECT][PS] Enable PandasSQLStringFormatter.vformat to work with Spark Connect - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/11 05:34:04 UTC, 5 replies.
- [GitHub] [spark] viirya commented on pull request #41899: [SPARK-44340][SQL] Define the computing logic through PartitionEvaluator API and use it in WindowGroupLimitExec - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/07/11 05:58:52 UTC, 0 replies.
- [GitHub] [spark] maheshk114 commented on pull request #41860: [SPARK-44307][SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "maheshk114 (via GitHub)" <gi...@apache.org> on 2023/07/11 06:09:27 UTC, 6 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #41933: [SPARK-44370][CONNECT] Migrate Buf remote generation alpha to remote plugins - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/11 06:20:44 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41933: [SPARK-44370][CONNECT] Migrate Buf remote generation alpha to remote plugins - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/11 06:21:54 UTC, 10 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41933: [SPARK-44370][CONNECT] Migrate Buf remote generation alpha to remote plugins - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/11 06:26:35 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41934: [SPARK-43974][CONNECT][BUILD][FOLLOWUP] Upgrade buf to v1.23.1 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/11 06:45:24 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41934: [SPARK-43974][CONNECT][BUILD][FOLLOWUP] Upgrade buf to v1.23.1 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/11 06:47:24 UTC, 2 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #41935: [WIP] Inject parser and active session to Dataset APIs - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/11 07:04:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41934: [SPARK-43974][CONNECT][BUILD][FOLLOWUP] Upgrade buf to v1.23.1 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/11 07:07:04 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41934: [SPARK-43974][CONNECT][BUILD][FOLLOWUP] Upgrade buf to v1.23.1 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 07:16:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41931: [SPARK-43665][CONNECT][PS] Enable PandasSQLStringFormatter.vformat to work with Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 07:18:55 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41933: [SPARK-44370][CONNECT] Migrate Buf remote generation alpha to remote plugins - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/11 07:22:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41933: [SPARK-44370][CONNECT] Migrate Buf remote generation alpha to remote plugins - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/11 07:25:06 UTC, 6 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #41933: [SPARK-44370][CONNECT] Migrate Buf remote generation alpha to remote plugins - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/11 07:29:06 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41933: [SPARK-44370][CONNECT] Migrate Buf remote generation alpha to remote plugins - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/11 07:29:33 UTC, 2 replies.
- [GitHub] [spark] panbingkun commented on pull request #41933: [SPARK-44370][CONNECT] Migrate Buf remote generation alpha to remote plugins - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/11 07:41:10 UTC, 4 replies.
- [GitHub] [spark] beliefer commented on pull request #41884: [SPARK-44325][SQL] Use PartitionEvaluator API in SortMergeJoinExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/11 07:42:31 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41936: Revert "[SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1" - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/11 07:43:25 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41936: Revert "[SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1" - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/11 07:45:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41936: Revert "[SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1" - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/11 07:45:37 UTC, 1 replies.
- [GitHub] [spark] gdhuper commented on a diff in pull request #41904: [SPARK-43389][SQL] Added a null check for lineSep option - posted by "gdhuper (via GitHub)" <gi...@apache.org> on 2023/07/11 07:58:03 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41909: [SPARK-44320][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1067,1150,1220,1265,1277] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/11 08:15:21 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41909: [SPARK-44320][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1067,1150,1220,1265,1277] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/11 08:16:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41880: [SPARK-44263][CONNECT] Custom Interceptors Support - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 08:33:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41880: [SPARK-44263][CONNECT] Custom Interceptors Support - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 08:34:15 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41936: Revert "[SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1" - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/11 09:30:57 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41923: [SPARK-38476][CORE] Use error class in org.apache.spark.storage - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/11 10:06:29 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41923: [SPARK-38476][CORE] Use error class in org.apache.spark.storage - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/11 10:07:04 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #41935: [WIP] Inject parser and active session to Dataset APIs - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/11 10:08:10 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41937: [SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/11 10:48:15 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #41938: [SPARK-44373][SQL] Wrap withActive for Dataset API w/ parse logic to make parser related configuration work - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/11 10:49:29 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41937: [SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/11 10:50:03 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41938: [SPARK-44373][SQL] Wrap withActive for Dataset API w/ parse logic to make parser related configuration work - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/11 10:57:18 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #41909: [SPARK-44320][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1067,1150,1220,1265,1277] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/11 11:00:51 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #41933: [SPARK-44370][CONNECT] Migrate Buf remote generation alpha to remote plugins - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/11 11:17:30 UTC, 4 replies.
- [GitHub] [spark] beliefer opened a new pull request, #41939: [SPARK-44341][SQL][PYTHON] Define the computing logic through PartitionEvaluator API and use it in WindowExec and WindowInPandasExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/11 11:24:13 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #41940: [SPARK-44374][PYTHON][ML] Add example code for distributed ML for spark connect - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/07/11 12:03:32 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #41904: [SPARK-43389][SQL] Added a null check for lineSep option - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/11 12:17:39 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41941: Test pb 3.23.4 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/11 12:18:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #41942: [WIP][SPARK-44348][CORE][CONNECT][TESTS] Reenable test_artifact - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 12:24:23 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/11 12:55:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41942: [SPARK-44348][CORE][CONNECT][TESTS] Reenable test_artifact with relevant changes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 13:06:47 UTC, 0 replies.
- [GitHub] [spark] pm-nuance commented on pull request #37417: [SPARK-33782][K8S][CORE]Place spark.files, spark.jars and spark.files under the current working directory on the driver in K8S cluster mode - posted by "pm-nuance (via GitHub)" <gi...@apache.org> on 2023/07/11 14:38:37 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #41927: [SPARK-44216] [PYTHON] Make assertSchemaEqual API with ignore_nullable optional flag - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/11 14:44:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39937: [SPARK-42309][SQL] Introduce `INCOMPATIBLE_DATA_TO_TABLE` and sub classes. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/11 14:58:33 UTC, 0 replies.
- [GitHub] [spark] eejbyfeldt opened a new pull request, #41943: [SPARK-44376][BUILD] Fix maven build using scala 2.13 and Java 11 or later - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/07/11 15:07:52 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on pull request #41770: [SPARK-44264][ML][PYTHON] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/11 15:36:55 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41944: [SPARK-44377][BUILD] Exclude Junit5 dependencies from `jersey-test-framework-provider-simple` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/11 15:44:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41944: [SPARK-44377][BUILD] Exclude Junit5 dependencies from `jersey-test-framework-provider-simple` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/11 16:00:43 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #41930: [SPARK-44360][SQL] Support schema pruning in delta-based MERGE operations - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/11 16:08:30 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #41930: [SPARK-44360][SQL] Support schema pruning in delta-based MERGE operations - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/11 16:08:43 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #41791: [SPARK-44285] MSK IAM Support - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/11 16:55:55 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on pull request #41943: [SPARK-44376][BUILD] Fix maven build using scala 2.13 and Java 11 or later - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/11 17:07:16 UTC, 1 replies.
- [GitHub] [spark] ramon-garcia commented on a diff in pull request #41717: Withdrawn - posted by "ramon-garcia (via GitHub)" <gi...@apache.org> on 2023/07/11 17:15:21 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #41942: [SPARK-44348][CORE][CONNECT][PYTHON] Reenable test_artifact with relevant changes - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/11 17:33:48 UTC, 1 replies.
- [GitHub] [spark] WweiL commented on pull request #41752: [SPARK-44201][CONNECT][SS]Add support for Streaming Listener in Scala for Spark Connect - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/11 17:37:48 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on pull request #41752: [SPARK-44201][CONNECT][SS]Add support for Streaming Listener in Scala for Spark Connect - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/11 17:39:35 UTC, 4 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #41770: [SPARK-44264][ML][PYTHON] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/11 18:30:34 UTC, 12 replies.
- [GitHub] [spark] srowen closed pull request #41602: SPARK-44058: Removed createPartition deprecated method for HiveShim.scala - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/11 18:35:24 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #41752: [SPARK-44201][CONNECT][SS]Add support for Streaming Listener in Scala for Spark Connect - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/11 18:37:45 UTC, 4 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #41930: [SPARK-44360][SQL] Support schema pruning in delta-based MERGE operations - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/07/11 18:37:55 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #41924: [SPARK-44364] [PYTHON] Add support for List[Row] data type for expected - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/11 18:39:25 UTC, 1 replies.
- [GitHub] [spark] sarutak commented on pull request #41891: [SPARK-44334][SQL][UI] Status of execution w/ error and w/o jobs shall be FAILED not COMPLETED in SqlResource - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/07/11 18:40:14 UTC, 1 replies.
- [GitHub] [spark] sarutak commented on a diff in pull request #41891: [SPARK-44334][SQL][UI] Status of execution w/ error and w/o jobs shall be FAILED not COMPLETED in SqlResource - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/07/11 18:45:55 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41924: [SPARK-44364] [PYTHON] Add support for List[Row] data type for expected - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/11 20:19:25 UTC, 3 replies.
- [GitHub] [spark] rangadi opened a new pull request, #41945: [TEMP] Feb impl - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/11 20:23:36 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/07/11 21:00:59 UTC, 4 replies.
- [GitHub] [spark] mathewjacob1002 opened a new pull request, #41946: [WIP] FunctionPickler Class - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/11 22:08:18 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #41746: [SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/07/11 22:13:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41748: [SPARK-44145][SQL] Callback when ready for execution - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 22:33:37 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41770: [SPARK-44264][ML][PYTHON] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 22:37:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41770: [SPARK-44264][ML][PYTHON] Write a Deepspeed Distributed Learning Class DeepspeedTorchDistributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 22:37:31 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on pull request #41778: [WIP] DeepspeedDistributor Class That Will Utilize the Deepspeed Launcher Boilerplate - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/11 22:55:36 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 closed pull request #41778: [WIP] DeepspeedDistributor Class That Will Utilize the Deepspeed Launcher Boilerplate - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/11 22:55:40 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #41947: [SPARK-44217] [PYTHON] Allow custom precision for fp approx equality - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/11 23:05:45 UTC, 0 replies.
- [GitHub] [spark] maddiedawson commented on a diff in pull request #41946: [WIP] FunctionPickler Class - posted by "maddiedawson (via GitHub)" <gi...@apache.org> on 2023/07/11 23:16:17 UTC, 1 replies.
- [GitHub] [spark] ueshin opened a new pull request, #41948: [SPARK-44380][PYTHON] Support for UDTF to analyze in Python - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/11 23:34:04 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41944: [SPARK-44377][BUILD] Exclude Junit5 dependencies from `jersey-test-framework-provider-simple` - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/11 23:40:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41867: [SPARK-43964][SQL][PYTHON] Support arrow-optimized Python UDTFs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 23:51:26 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41940: [SPARK-44374][PYTHON][ML] Add example code for distributed ML for spark connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/11 23:57:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41947: [SPARK-44217] [PYTHON] Allow custom precision for fp approx equality - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/12 00:20:16 UTC, 2 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #41949: [SPARK-44375][SQL] Use PartitionEvaluator API in DebugExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/12 00:47:19 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41949: [SPARK-44375][SQL] Use PartitionEvaluator API in DebugExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/12 00:47:30 UTC, 1 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #41946: [WIP] FunctionPickler Class - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/12 01:26:36 UTC, 11 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41949: [SPARK-44375][SQL] Use PartitionEvaluator API in DebugExec - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/12 01:29:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41901: [SPARK-44287][SQL] Reuse evaluator in ColumnarToRowExec#doExecute - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/12 01:34:17 UTC, 1 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #41940: [SPARK-44374][PYTHON][ML] Add example code for distributed ML for spark connect - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/07/12 01:55:25 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #41949: [SPARK-44375][SQL] Use PartitionEvaluator API in DebugExec - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/07/12 02:06:10 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41949: [SPARK-44375][SQL] Use PartitionEvaluator API in DebugExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/12 02:08:20 UTC, 4 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #41711: [SPARK-44155] Adding a dev utility to improve error messages based on LLM - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/12 02:14:15 UTC, 2 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #41891: [SPARK-44334][SQL][UI] Status in the REST API response for a failed DDL/DML with no jobs should be FAILED rather than COMPLETED - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/12 02:28:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #41944: [SPARK-44377][BUILD] Exclude Junit5 dependencies from `jersey-test-framework-provider-simple` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/12 02:37:56 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #41884: [SPARK-44325][SQL] Use PartitionEvaluator API in SortMergeJoinExec - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/12 03:02:20 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41884: [SPARK-44325][SQL] Use PartitionEvaluator API in SortMergeJoinExec - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/12 03:03:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41947: [SPARK-44217][PYTHON] Allow custom precision for fp approx equality - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/12 03:06:41 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41943: [SPARK-44376][BUILD] Fix maven build using scala 2.13 and Java 11 or later - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/12 03:14:37 UTC, 6 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41931: [SPARK-43665][CONNECT][PS] Enable PandasSQLStringFormatter.vformat to work with Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/12 03:27:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41931: [SPARK-43665][CONNECT][PS] Enable PandasSQLStringFormatter.vformat to work with Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/12 03:27:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41899: [SPARK-44340][SQL] Define the computing logic through PartitionEvaluator API and use it in WindowGroupLimitExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/12 03:28:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41899: [SPARK-44340][SQL] Define the computing logic through PartitionEvaluator API and use it in WindowGroupLimitExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/12 03:29:57 UTC, 0 replies.
- [GitHub] [spark] rxin commented on pull request #41687: [SPARK-44131][SQL] Add call_function and deprecate call_udf for Scala API - posted by "rxin (via GitHub)" <gi...@apache.org> on 2023/07/12 03:46:23 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #40506: [SPARK-42881][SQL] Codegen Support for get_json_object - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/12 03:48:29 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41932: [SPARK-44131][SQL][FOLLOWUP] Support qualified function name for call_function - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/12 04:12:26 UTC, 9 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41943: [SPARK-44376][BUILD] Fix maven build using scala 2.13 and Java 11 or later - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/12 04:12:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #41950: [SPARK-44131][SQL][FOLLOWUP] Revert the deprecation message - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/12 04:29:11 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41932: [SPARK-44131][SQL][FOLLOWUP] Support qualified function name for call_function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/12 04:31:32 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41933: [SPARK-44370][CONNECT] Migrate Buf remote generation alpha to remote plugins - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/12 04:44:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41933: [SPARK-44370][CONNECT] Migrate Buf remote generation alpha to remote plugins - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/12 04:45:08 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #41891: [SPARK-44334][SQL][UI] Status in the REST API response for a failed DDL/DML with no jobs should be FAILED rather than COMPLETED - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/12 05:35:23 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41891: [SPARK-44334][SQL][UI] Status in the REST API response for a failed DDL/DML with no jobs should be FAILED rather than COMPLETED - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/12 05:35:34 UTC, 0 replies.
- [GitHub] [spark] eejbyfeldt commented on a diff in pull request #41943: [SPARK-44376][BUILD] Fix maven build using scala 2.13 and Java 11 or later - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/07/12 05:38:50 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #41938: [SPARK-44373][SQL] Wrap withActive for Dataset API w/ parse logic to make parser related configuration work - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/12 05:42:34 UTC, 0 replies.
- [GitHub] [spark] eejbyfeldt commented on pull request #41943: [SPARK-44376][BUILD] Fix maven build using scala 2.13 and Java 11 or later - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/07/12 05:53:10 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41867: [SPARK-43964][SQL][PYTHON] Support arrow-optimized Python UDTFs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/12 05:55:07 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41932: [SPARK-44131][SQL][FOLLOWUP] Support qualified function name for call_function - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/12 06:01:28 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on pull request #41939: [SPARK-44341][SQL][PYTHON] Define the computing logic through PartitionEvaluator API and use it in WindowExec and WindowInPandasExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/12 06:02:42 UTC, 4 replies.
- [GitHub] [spark] hvanhovell closed pull request #41925: [SPARK-44353][CONNECT][SQL] Remove StructType.toAttributes - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/12 06:27:29 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #41875: [SPARK-44317][SQL] Use PartitionEvaluator API in ShuffledHashJoinExec - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/07/12 06:29:56 UTC, 5 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #40611: [SPARK-42981][CONNECT] Add direct arrow serialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/12 06:41:53 UTC, 13 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #41951: [SPARK-44367][SQL][UI] Show error message on UI for each failed query - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/12 06:54:20 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #40611: [SPARK-42981][CONNECT] Add direct arrow serialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/12 07:31:55 UTC, 2 replies.
- [GitHub] [spark] haiyangsun-db commented on a diff in pull request #40611: [SPARK-42981][CONNECT] Add direct arrow serialization - posted by "haiyangsun-db (via GitHub)" <gi...@apache.org> on 2023/07/12 07:44:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41950: [SPARK-44131][SQL][FOLLOWUP] Revert the deprecation message - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/12 08:32:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41950: [SPARK-44131][SQL][FOLLOWUP] Revert the deprecation message - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/12 08:32:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41914: [SPARK-44349][R] Add math functions to SparkR - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/12 08:37:45 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41111: [SPARK-39420][SQL] Support `ANALYZE TABLE` on Datasource V2 tables - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/12 08:40:33 UTC, 1 replies.
- [GitHub] [spark] Kwafoor commented on pull request #41535: [SPARK-32559][SQL] Fix the trim logic did't handle ASCII control characters correctly - posted by "Kwafoor (via GitHub)" <gi...@apache.org> on 2023/07/12 09:21:59 UTC, 2 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41535: [SPARK-32559][SQL] Fix the trim logic did't handle ASCII control characters correctly - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/12 09:26:35 UTC, 2 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41952: CheckError for *View*Suite, *Namespace*Suite, *DataSource*Suite - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/12 09:31:43 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41705: [SPARK-44252][SS] Define a new error class and apply for the case where loading state from DFS fails - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/12 09:38:23 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40611: [SPARK-42981][CONNECT] Add direct arrow serialization - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/12 09:44:18 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #41315: [SPARK-43755][CONNECT] Move execution out of SparkExecutePlanStreamHandler and to a different thread - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/12 09:51:49 UTC, 15 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #41752: [SPARK-44201][CONNECT][SS]Add support for Streaming Listener in Scala for Spark Connect - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/12 10:01:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41942: [SPARK-44348][CORE][CONNECT][PYTHON] Reenable test_artifact with relevant changes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/12 10:15:26 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41951: [SPARK-44367][SQL][UI] Show error message on UI for each failed query - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/12 10:24:33 UTC, 6 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41937: [SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/12 10:32:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41937: [SPARK-43974][CONNECT][BUILD] Upgrade buf to v1.23.1 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/12 10:33:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41951: [SPARK-44367][SQL][UI] Show error message on UI for each failed query - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/12 10:45:48 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41725: [SPARK-44180][SQL] DistributionAndOrderingUtils should apply ResolveTimeZone - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/12 10:48:50 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40611: [SPARK-42981][CONNECT] Add direct arrow serialization - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/12 11:02:24 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #41725: [SPARK-44180][SQL] DistributionAndOrderingUtils should apply ResolveTimeZone - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/12 11:14:18 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41898: [SPARK-44338][SQL] Fix view schema mismatch error message - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/12 12:02:53 UTC, 0 replies.
- [GitHub] [spark] vicennial opened a new pull request, #41953: [WIP][SPARK-43995][CONNECT] Add support for UDFRegistration for the Connect Scala Client - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/12 12:06:41 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #41954: [WIP][SQL] Check the number of input types in `InvokeLike` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/12 12:20:15 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #41955: [SPARK-44279][BUILD] Upgrade `word-wrap` - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/12 12:24:23 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #41943: [SPARK-44376][BUILD] Fix maven build using scala 2.13 and Java 11 or later - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/12 12:26:07 UTC, 4 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #41955: [SPARK-44279][BUILD] Upgrade `word-wrap` - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/12 12:27:34 UTC, 4 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41956: Configure a custom `RetryPolicy` when `Test encryption(SparkConnectClientSuite)` initializing `SparkConnectClient` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/12 12:39:56 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #41957: [SPARK-44385][SQL] Use PartitionEvaluator API in MergingSessionsExec & UpdatingSessionsExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/12 13:16:22 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #41958: [SPARK-44386] Use PartitionEvaluator API in HashAggregateExec, ObjectHashAggregateExec, SortAggregateExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/12 14:03:53 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41952: [SPARK-44384][SQL][TESTS] Use checkError() to check Exception in *View*Suite, *Namespace*Suite, *DataSource*Suite - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/12 14:12:17 UTC, 0 replies.
- [GitHub] [spark] sarutak commented on pull request #41955: [SPARK-44279][BUILD] Upgrade `word-wrap` - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/07/12 15:13:13 UTC, 2 replies.
- [GitHub] [spark] vicennial opened a new pull request, #41959: [SPARK-44388][CONNECT] Fix protobuf cast issue when UDF instance is updated - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/12 15:28:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41960: [SPARK-44390][CORE][SQL] Rename `SparkSerDerseUtils` to `SparkSerDeUtils` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/12 15:42:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41960: [SPARK-44390][CORE][SQL] Rename `SparkSerDerseUtils` to `SparkSerDeUtils` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/12 15:43:29 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41961: [MINOR][CONNECT] Fix compilation warning related to `Top-level wildcard is not allowed and will error under -Xsource:3` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/12 15:46:01 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #41962: [SPARK-44392][SQL][TESTS] Add tests for schema pruning in delta-based UPDATEs - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/07/12 16:33:57 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #41962: [SPARK-44392][SQL][TESTS] Add tests for schema pruning in delta-based UPDATEs - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/07/12 16:34:33 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41956: [SPARK-44387][CONNECT][TESTS] Configure a custom `RetryPolicy` when `Test encryption` initializing `SparkConnectClient` - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/12 16:51:06 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #41955: [SPARK-44279][BUILD] Upgrade `Eslint` to v8.44.0 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/12 16:58:41 UTC, 2 replies.
- [GitHub] [spark] sarutak commented on pull request #41955: [SPARK-44279][BUILD] Upgrade `Eslint` to v8.44.0 - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/07/12 16:59:00 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #41963: [SPARK-44393][BUILD] Upgrade `H2` from 2.1.214 to 2.2.220 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/12 17:03:01 UTC, 0 replies.
- [GitHub] [spark] dillitz commented on pull request #41956: [SPARK-44387][CONNECT][TESTS] Configure a custom `RetryPolicy` when `Test encryption` initializing `SparkConnectClient` - posted by "dillitz (via GitHub)" <gi...@apache.org> on 2023/07/12 17:04:23 UTC, 0 replies.
- [GitHub] [spark] jasonli-db opened a new pull request, #41964: [WIP][SPARK-44394] Add a Spark UI page for Spark Connect - posted by "jasonli-db (via GitHub)" <gi...@apache.org> on 2023/07/12 17:21:19 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #41965: [SPARK-44395][SQL] Update TVF arguments to require parentheses around identifier after TABLE keyword - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/12 17:31:16 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #41965: [SPARK-44395][SQL] Update TVF arguments to require parentheses around identifier after TABLE keyword - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/12 17:35:20 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41947: [SPARK-44217][PYTHON] Allow custom precision for fp approx equality - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/12 18:05:38 UTC, 0 replies.
- [GitHub] [spark] jdesjean opened a new pull request, #41966: [Do not review] [SPARK-43923-2] - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/07/12 18:30:15 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #41959: [SPARK-44388][CONNECT] Fix protobuf cast issue when UDF instance is updated - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/12 18:35:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #41962: [SPARK-44392][SQL][TESTS] Add tests for schema pruning in delta-based UPDATEs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/12 19:22:23 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #41962: [SPARK-44392][SQL][TESTS] Add tests for schema pruning in delta-based UPDATEs - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/07/12 19:23:59 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #41967: [SPARK-44397][PYTHON] Expose assertDataFrameEqual in pyspark.testing.utils - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/12 19:31:42 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 opened a new pull request, #41968: [Spark Ticket Here][WIP] Clean Up Deepspeed Code - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/12 19:54:15 UTC, 0 replies.
- [GitHub] [spark] sarutak commented on a diff in pull request #41955: [SPARK-44279][BUILD] Upgrade `optionator` to ^0.9.3 - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/07/12 20:04:01 UTC, 2 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #41955: [SPARK-44279][BUILD] Upgrade `optionator` to ^0.9.3 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/12 20:09:36 UTC, 3 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41967: [SPARK-44397][PYTHON] Expose assertDataFrameEqual in pyspark.testing.utils - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/12 20:12:51 UTC, 1 replies.
- [GitHub] [spark] rangadi opened a new pull request, #41969: [SPARK-44398][CONNECT] Scala foreachBatch API - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/12 20:31:26 UTC, 0 replies.
- [GitHub] [spark] maddiedawson commented on a diff in pull request #41946: [SPARK-44264] FunctionPickler Class - posted by "maddiedawson (via GitHub)" <gi...@apache.org> on 2023/07/12 20:43:05 UTC, 2 replies.
- [GitHub] [spark] rangadi commented on pull request #41969: [SPARK-44398][CONNECT] Scala foreachBatch API - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/12 20:46:13 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #41970: [SPARK-44399][PYHTON] Import SparkSession in Python UDF only when useArrow is None - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/12 20:58:28 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #41946: [SPARK-44264] FunctionPickler Class - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/12 21:19:34 UTC, 4 replies.
- [GitHub] [spark] hvanhovell closed pull request #40611: [SPARK-42981][CONNECT] Add direct arrow serialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/12 21:24:21 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #41971: [SPARK-43321][Connect][Followup] Better names for APIs used in Scala Client joinWith - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/12 21:41:22 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #41971: [SPARK-43321][Connect][Followup] Better names for APIs used in Scala Client joinWith - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/12 21:44:18 UTC, 1 replies.
- [GitHub] [spark] bogao007 commented on a diff in pull request #41969: [SPARK-44398][CONNECT] Scala foreachBatch API - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/12 21:47:37 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on pull request #41968: [Spark Ticket Here][WIP] Clean Up Deepspeed Code - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/12 21:52:31 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #41748: [SPARK-44145][SQL] Callback when ready for execution - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/12 21:54:40 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #41959: [SPARK-44388][CONNECT] Fix protobuf cast issue when UDF instance is updated - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/12 22:00:14 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #41972: Fix PySpark error class get_error_message - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/12 22:10:01 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #41971: [SPARK-43321][Connect][Followup] Better names for APIs used in Scala Client joinWith - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/12 22:39:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41968: [SPARK-44264]Clean Up Deepspeed Code - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/12 23:23:42 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41965: [SPARK-44395][SQL] Update TVF arguments to require parentheses around identifier after TABLE keyword - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/12 23:33:39 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #41928: [WIP] Move parser and data type to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/12 23:40:19 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41942: [SPARK-44348][CORE][CONNECT][PYTHON] Reenable test_artifact with relevant changes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/12 23:48:33 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #41969: [SPARK-44398][CONNECT] Scala foreachBatch API - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/12 23:48:40 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41942: [SPARK-44348][CORE][CONNECT][PYTHON] Reenable test_artifact with relevant changes - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/12 23:49:04 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 opened a new pull request, #41973: [DO NOT MERGE/REVIEW] PROTOTYPING: refactoring the TorchDistributor code to take in a run_t… - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/12 23:56:30 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #41969: [SPARK-44398][CONNECT] Scala foreachBatch API - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/13 00:01:27 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #41965: [SPARK-44395][SQL] Update TVF arguments to require parentheses around identifier after TABLE keyword - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/13 00:01:52 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #41974: [WIP][SPARK-44401][PYTHON][DOCS] Arrow Python UDF Use Guide - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/13 00:16:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40635: [SPARK-42860][SQL] Add analysed logical mode in org.apache.spark.sql.execution.ExplainMode - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/13 00:23:05 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40567: [SPARK-42935] [SQL] Add union required distribution push down - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/13 00:23:07 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40477: [SPARK-42805]`DeduplicateRelations` rule show process `LOGICAL_RDD` - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/13 00:23:09 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #41975: [SPARK-44402][SQL][TESTS] Add tests for schema pruning in delta-based DELETEs - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/07/13 00:43:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41954: [SPARK-44391][SQL] Check the number of argument types in `InvokeLike` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/13 00:57:38 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41954: [SPARK-44391][SQL] Check the number of argument types in `InvokeLike` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/13 00:58:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41967: [SPARK-44397][PYTHON] Expose assertDataFrameEqual in pyspark.testing.utils - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/13 01:05:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41967: [SPARK-44397][PYTHON] Expose assertDataFrameEqual in pyspark.testing.utils - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/13 01:06:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41970: [SPARK-44399][PYHTON][CONNECT] Import SparkSession in Python UDF only when useArrow is None - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 01:26:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41970: [SPARK-44399][PYHTON][CONNECT] Import SparkSession in Python UDF only when useArrow is None - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 01:27:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41968: [SPARK-44264][PYTHON][ML][FOLLOW-UP] Clean Up Deepspeed Code - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 01:30:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41963: [SPARK-44393][BUILD] Upgrade `H2` from 2.1.214 to 2.2.220 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 01:31:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41963: [SPARK-44393][BUILD] Upgrade `H2` from 2.1.214 to 2.2.220 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 01:31:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41961: [MINOR][CONNECT] Fix compilation warning related to `Top-level wildcard is not allowed and will error under -Xsource:3` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 01:32:41 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #41968: [SPARK-44264][PYTHON][ML][FOLLOW-UP] Clean Up Deepspeed Code - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/13 01:36:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41961: [MINOR][CONNECT] Fix compilation warning related to `Top-level wildcard is not allowed and will error under -Xsource:3` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 01:40:09 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41961: [MINOR][CONNECT] Fix compilation warning related to `Top-level wildcard is not allowed and will error under -Xsource:3` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/13 02:40:13 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #40419: [SPARK-42789][SQL] Rewrite multiple GetJsonObjects to a JsonTuple if their json expressions are the same - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/13 02:43:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #41956: [SPARK-44387][CONNECT][TESTS] Configure a custom `RetryPolicy` when `Test encryption` initializing `SparkConnectClient` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/13 02:46:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41956: [SPARK-44387][CONNECT][TESTS] Configure a custom `RetryPolicy` when `Test encryption` initializing `SparkConnectClient` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/13 02:46:58 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #41529: [SPARK-43988][INFRA] Add a daily maven testing GitHub Action job - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/13 02:52:24 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #41976: [SPARK-44021][SQL][FOLLOW-UP] Fix log messages when the number of partition exceeds maxPartitionNum - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/13 03:11:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41972: [MINOR][PYTHON] Fix PySpark error class get_error_message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 03:20:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41972: [MINOR][PYTHON] Fix PySpark error class get_error_message - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 03:20:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #41977: [SPARK-44348][TESTS][PYTHON][FOLLOW-UP] Reduces the memory used in local-cluster tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 03:30:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41977: [SPARK-44348][TESTS][PYTHON][FOLLOW-UP] Reduces the memory used in local-cluster tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 03:30:42 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41947: [SPARK-44217][PYTHON] Allow custom precision for fp approx equality - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 03:34:59 UTC, 5 replies.
- [GitHub] [spark] zml1206 opened a new pull request, #41978: [SPARK-32268][SQL][FOLLOWUP] Filter creation side size threshold judgment should prun column in injectBloomFilter - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/07/13 03:56:42 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #41925: [SPARK-44353][CONNECT][SQL] Remove StructType.toAttributes - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/13 04:02:29 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #41979: [SPARK-43952][SQL][FOLLOWUP] Correct AQE cancel broadcast job tag - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/13 04:05:03 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #41979: [SPARK-43952][SQL][FOLLOWUP] Correct AQE cancel broadcast job tag - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/13 04:06:30 UTC, 3 replies.
- [GitHub] [spark] vinodkc commented on pull request #41746: [WIP][SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/13 04:12:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #41980: [SPARK-41811][PYTHON][CONNECT] Implement SparkSession.sql's string formatter - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/13 04:13:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41980: [SPARK-41811][PYTHON][CONNECT] Implement SparkSession.sql's string formatter - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/13 04:14:53 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41980: [SPARK-41811][PYTHON][CONNECT] Implement SparkSession.sql's string formatter - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/13 04:15:55 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41958: [SPARK-44386][SQL] Use PartitionEvaluator API in HashAggregateExec, ObjectHashAggregateExec, SortAggregateExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/13 04:27:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41980: [SPARK-41811][PYTHON][CONNECT] Implement SparkSession.sql's string formatter - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 04:47:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41977: [SPARK-44348][TESTS][PYTHON][FOLLOW-UP] Reduces the memory used in local-cluster tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 05:04:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41867: [SPARK-43964][SQL][PYTHON] Support arrow-optimized Python UDTFs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 05:26:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41981: [SPARK-44407][BUILD] Add a new Scala checkstyle rule to prohibit using `enum` as a variable or function name - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/13 05:31:09 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #41981: [SPARK-44407][BUILD] Add a new Scala checkstyle rule to prohibit using `enum` as a variable or function name - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/13 05:31:25 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #41982: [SPARK-44407][BUILD] Add a new Scala checkstyle rule to prohibit using enum as a variable or function name - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/13 05:33:53 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #41980: [SPARK-41811][PYTHON][CONNECT] Implement SparkSession.sql's string formatter - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/13 05:42:18 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #41960: [SPARK-44390][CORE][SQL] Rename `SparkSerDerseUtils` to `SparkSerDeUtils` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/13 05:44:40 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #41979: [SPARK-43952][SQL][FOLLOWUP] Correct AQE cancel broadcast job tag - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/13 05:53:48 UTC, 0 replies.
- [GitHub] [spark] jiaoqingbo opened a new pull request, #41983: [SPARK-44203][SQL][HIVE] Return nextRenewalDate instead of None for obtainDelegationTokens method - posted by "jiaoqingbo (via GitHub)" <gi...@apache.org> on 2023/07/13 06:15:49 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #41971: [SPARK-43321][Connect][Followup] Better names for APIs used in Scala Client joinWith - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/13 06:21:06 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41971: [SPARK-43321][Connect][Followup] Better names for APIs used in Scala Client joinWith - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/13 06:21:23 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #41979: [SPARK-43952][SQL][FOLLOWUP] Correct AQE cancel broadcast job tag - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/13 06:32:05 UTC, 2 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #41947: [SPARK-44217][PYTHON] Allow custom precision for fp approx equality - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/13 06:33:31 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41957: [SPARK-44385][SQL] Use PartitionEvaluator API in MergingSessionsExec & UpdatingSessionsExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/13 07:02:44 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #41984: [MINOR] Removing redundant parentheses from SQL function docs - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/13 07:07:42 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41984: [MINOR] Removing redundant parentheses from SQL function docs - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/13 07:11:57 UTC, 5 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #41954: [SPARK-44391][SQL] Check the number of argument types in `InvokeLike` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/13 07:43:18 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on pull request #41953: [SPARK-43995][SPARK-43996][CONNECT] Add support for UDFRegistration to the Connect Scala Client - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/13 07:44:57 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41954: [SPARK-44391][SQL] Check the number of argument types in `InvokeLike` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/13 07:45:37 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41984: [MINOR] Removing redundant parentheses from SQL function docs - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/13 08:38:13 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on pull request #41959: [SPARK-44388][CONNECT] Fix protobuf cast issue when UDF instance is updated - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/13 09:01:18 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #41979: [SPARK-43952][SQL][FOLLOWUP] Correct AQE cancel broadcast job tag - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/13 09:01:38 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #41954: [SPARK-44391][SQL] Check the number of argument types in `InvokeLike` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/13 09:17:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41959: [SPARK-44388][CONNECT] Fix protobuf cast issue when UDF instance is updated - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 09:54:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41959: [SPARK-44388][CONNECT] Fix protobuf cast issue when UDF instance is updated - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 09:55:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41980: [SPARK-41811][PYTHON][CONNECT] Implement SparkSession.sql's string formatter - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/13 09:58:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #41985: [SPARK-44391][SQL][3.4] Check the number of argument types in `InvokeLike` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/13 09:58:57 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X closed pull request #40741: [SPARK-41811][CONNECT][CLIENT] Support sql with dataframes and columns - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/13 10:05:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #40741: [SPARK-41811][CONNECT][CLIENT] Support sql with dataframes and columns - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 10:10:33 UTC, 2 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #40741: [SPARK-41811][CONNECT][CLIENT] Support sql with dataframes and columns - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/13 10:14:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41748: [SPARK-44145][SQL] Callback when ready for execution - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 10:21:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41893: [SPARK-44335][SQL] SQL parser should have a SQLConf parameter instead of relying on the active conf - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/13 10:33:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41893: [SPARK-44335][SQL] SQL parser should have a SQLConf parameter instead of relying on the active conf - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/13 10:33:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #41986: [SPARK-44406][CONNECT] Make `SparkSession.sql` work properly with dropped temp view - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/13 10:36:31 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41986: [SPARK-44406][CONNECT] Make `SparkSession.sql` work properly with dropped temp view - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/13 10:38:38 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41924: [SPARK-44364] [PYTHON] Add support for List[Row] data type for expected - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 11:05:04 UTC, 2 replies.
- [GitHub] [spark] cdkrot opened a new pull request, #41987: [SPARK-44410][PYTHON][Connect] Set active session in create, not just getOrCreate - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/07/13 12:42:11 UTC, 0 replies.
- [GitHub] [spark] bartosz25 opened a new pull request, #41988: [MINOR][SS][DOCS] Fix typos in the Scaladoc and make the semantic of getCurrentWatermarkMs explicit - posted by "bartosz25 (via GitHub)" <gi...@apache.org> on 2023/07/13 12:48:59 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41985: [SPARK-44391][SQL][3.4] Check the number of argument types in `InvokeLike` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/13 13:20:47 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #41952: [SPARK-44384][SQL][TESTS] Use checkError() to check Exception in *View*Suite, *Namespace*Suite, *DataSource*Suite - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/13 13:25:46 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41952: [SPARK-44384][SQL][TESTS] Use checkError() to check Exception in *View*Suite, *Namespace*Suite, *DataSource*Suite - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/13 13:26:46 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39937: [SPARK-42309][SQL] Introduce `INCOMPATIBLE_DATA_TO_TABLE` and sub classes. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/13 14:20:44 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39937: [SPARK-42309][SQL] Introduce `INCOMPATIBLE_DATA_TO_TABLE` and sub classes. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/13 14:21:50 UTC, 0 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #41946: [SPARK-44264] FunctionPickler Class - posted by "lu-wang-dl (via GitHub)" <gi...@apache.org> on 2023/07/13 16:23:34 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #41975: [SPARK-44402][SQL][TESTS] Add tests for schema pruning in delta-based DELETEs - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/13 17:19:02 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #41975: [SPARK-44402][SQL][TESTS] Add tests for schema pruning in delta-based DELETEs - posted by "aokolnychyi (via GitHub)" <gi...@apache.org> on 2023/07/13 17:23:42 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #41898: [SPARK-44338][SQL] Fix view schema mismatch error message - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/13 17:32:17 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #41898: [SPARK-44338][SQL] Fix view schema mismatch error message - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/13 17:32:54 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #41969: [SPARK-44398][CONNECT] Scala foreachBatch API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/13 17:48:02 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #41969: [SPARK-44398][CONNECT] Scala foreachBatch API - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/13 17:48:29 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #41752: [SPARK-44201][CONNECT][SS]Add support for Streaming Listener in Scala for Spark Connect - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/13 18:04:56 UTC, 0 replies.
- [GitHub] [spark] sarutak commented on pull request #41955: [SPARK-44279][BUILD] Upgrade `optionator` to ^0.9.3 - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/07/13 18:24:59 UTC, 1 replies.
- [GitHub] [spark] sarutak closed pull request #41955: [SPARK-44279][BUILD] Upgrade `optionator` to ^0.9.3 - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/07/13 18:27:25 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #41315: [SPARK-43755][CONNECT] Move execution out of SparkExecutePlanStreamHandler and to a different thread - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/13 18:38:35 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #41989: [SPARK-43965][PYTHON][CONNECT] Support Python UDTF in Spark Connect - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/13 19:09:07 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #41948: [SPARK-44380][SQL][PYTHON] Support for Python UDTF to analyze in Python - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/13 19:34:05 UTC, 3 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41986: [SPARK-44406][CONNECT] Make `SparkSession.sql` work properly with dropped temp view - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/13 19:34:13 UTC, 0 replies.
- [GitHub] [spark] szehon-ho opened a new pull request, #41990: [SPARK-42454][SQL] SPJ: encapsulate all SPJ related parameters in BatchScanExec - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/07/13 19:50:48 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #41752: [SPARK-44201][CONNECT][SS]Add support for Streaming Listener in Scala for Spark Connect - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/13 20:59:03 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #41965: [SPARK-44395][SQL] Update TVF arguments to require parentheses around identifier after TABLE keyword - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/13 21:03:38 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #41965: [SPARK-44395][SQL] Update TVF arguments to require parentheses around identifier after TABLE keyword - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/13 21:04:21 UTC, 0 replies.
- [GitHub] [spark] nihalpot closed pull request #41791: [SPARK-44285] MSK IAM Support - posted by "nihalpot (via GitHub)" <gi...@apache.org> on 2023/07/13 21:10:27 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #41904: [SPARK-43389][SQL] Added a null check for lineSep option - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/13 23:17:53 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #41904: [SPARK-43389][SQL] Added a null check for lineSep option - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/13 23:17:55 UTC, 0 replies.
- [GitHub] [spark] gdhuper commented on pull request #41904: [SPARK-43389][SQL] Added a null check for lineSep option - posted by "gdhuper (via GitHub)" <gi...@apache.org> on 2023/07/13 23:22:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41924: [SPARK-44364] [PYTHON] Add support for List[Row] data type for expected - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 23:27:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41924: [SPARK-44364] [PYTHON] Add support for List[Row] data type for expected - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 23:27:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41984: [MINOR] Removing redundant parentheses from SQL function docs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 23:40:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41984: [MINOR] Removing redundant parentheses from SQL function docs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 23:40:31 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #41948: [SPARK-44380][SQL][PYTHON] Support for Python UDTF to analyze in Python - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/13 23:43:09 UTC, 5 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41986: [SPARK-44406][CONNECT] Make `SparkSession.sql` work properly with dropped temp view - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 23:44:03 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41927: [SPARK-44216] [PYTHON] Make assertSchemaEqual API public - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 23:49:05 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #41864: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/07/13 23:51:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41946: [SPARK-44264][PYTHON][ML] FunctionPickler Class - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/13 23:52:40 UTC, 5 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #41946: [SPARK-44264][PYTHON][ML] FunctionPickler Class - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/07/14 00:18:42 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40667: Improve IDE build experience against jdk11 - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/14 00:24:29 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40635: [SPARK-42860][SQL] Add analysed logical mode in org.apache.spark.sql.execution.ExplainMode - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/14 00:24:31 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40633: [SPARK-43000][SQL] Do not cast to double type in `PromoteStrings` - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/14 00:24:32 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40567: [SPARK-42935] [SQL] Add union required distribution push down - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/14 00:24:35 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40477: [SPARK-42805]`DeduplicateRelations` rule show process `LOGICAL_RDD` - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/14 00:24:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41986: [SPARK-44406][CONNECT] Make `SparkSession.sql` work properly with dropped temp view - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 00:27:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41752: [SPARK-44201][CONNECT][SS]Add support for Streaming Listener in Scala for Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 00:35:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41752: [SPARK-44201][CONNECT][SS]Add support for Streaming Listener in Scala for Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 00:36:06 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #41948: [SPARK-44380][SQL][PYTHON] Support for Python UDTF to analyze in Python - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/14 00:39:49 UTC, 20 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41989: [SPARK-43965][PYTHON][CONNECT] Support Python UDTF in Spark Connect - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/14 00:55:55 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #41989: [SPARK-43965][PYTHON][CONNECT] Support Python UDTF in Spark Connect - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/14 00:56:12 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #41946: [SPARK-44264][PYTHON][ML] FunctionPickler Class - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/14 01:17:57 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41989: [SPARK-43965][PYTHON][CONNECT] Support Python UDTF in Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/14 01:18:07 UTC, 2 replies.
- [GitHub] [spark] jiaoqingbo commented on pull request #41983: [SPARK-44203][SQL][HIVE] Return nextRenewalDate instead of None for obtainDelegationTokens method - posted by "jiaoqingbo (via GitHub)" <gi...@apache.org> on 2023/07/14 01:22:50 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41725: [SPARK-44180][SQL] DistributionAndOrderingUtils should apply ResolveTimeZone - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/14 01:24:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41725: [SPARK-44180][SQL] DistributionAndOrderingUtils should apply ResolveTimeZone - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/14 01:25:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41949: [SPARK-44375][SQL] Use PartitionEvaluator API in DebugExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/14 01:37:44 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41953: [SPARK-43995][SPARK-43996][CONNECT] Add support for UDFRegistration to the Connect Scala Client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 01:51:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41953: [SPARK-43995][SPARK-43996][CONNECT] Add support for UDFRegistration to the Connect Scala Client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 01:52:28 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #41991: [SPARK-44413] Clarify error for unsupported arg data type in assertDataFrameEqual - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/14 01:56:27 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #37011: [SPARK-39625][SPARK-38904][SQL] Add Dataset.as(StructType) - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/14 02:12:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41939: [SPARK-44341][SQL][PYTHON] Define the computing logic through PartitionEvaluator API and use it in WindowExec and WindowInPandasExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/14 02:18:32 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41939: [SPARK-44341][SQL][PYTHON] Define the computing logic through PartitionEvaluator API and use it in WindowExec and WindowInPandasExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/14 02:20:52 UTC, 1 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #41992: [SPARK-44409][SQL] Handle char/varchar in Dataset.to to keep consistent with others - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/14 02:45:14 UTC, 0 replies.
- [GitHub] [spark] caican00 opened a new pull request, #41993: [SPARK-44414][SQL] Fixed matching check for CharType/VarcharType - posted by "caican00 (via GitHub)" <gi...@apache.org> on 2023/07/14 02:52:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41987: [SPARK-44410][PYTHON][Connect] Set active session in create, not just getOrCreate - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/14 03:02:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41987: [SPARK-44410][PYTHON][Connect] Set active session in create, not just getOrCreate - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 03:06:36 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #41994: [SPARK-44415][BUILD] Upgrade snappy-java to 1.1.10.2 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/14 03:18:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #41995: [SPARK-44416][CONNECT][BUILD] Upgrade buf to v1.24.0 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/14 03:51:10 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41995: [SPARK-44416][CONNECT][BUILD] Upgrade buf to v1.24.0 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/14 03:51:35 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #41993: [SPARK-44414][SQL] Fixed matching check for CharType/VarcharType - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/14 04:13:36 UTC, 3 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41993: [SPARK-44414][SQL] Fixed matching check for CharType/VarcharType - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/14 04:13:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41767: [SPARK-44222][BUILD][PYTHON] Upgrade `grpc` to 1.56.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 04:23:08 UTC, 3 replies.
- [GitHub] [spark] wangyum commented on pull request #41630: [SPARK-44080][SQL] Update Spark SQL config default value for thriftserver - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/14 04:29:49 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] opened a new pull request, #41996: Bump grpcio from 1.48.1 to 1.53.0 in /dev - posted by "dependabot[bot] (via GitHub)" <gi...@apache.org> on 2023/07/14 04:30:08 UTC, 0 replies.
- [GitHub] [spark] caican00 commented on a diff in pull request #41993: [SPARK-44414][SQL] Fixed matching check for CharType/VarcharType - posted by "caican00 (via GitHub)" <gi...@apache.org> on 2023/07/14 04:33:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #41997: [SPARK-44222][BUILD][PYTHON] Upgrade grpc to 1.56.0 with lower/upperbound - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 04:50:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41991: [SPARK-44413][PYTHON] Clarify error for unsupported arg data type in assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 04:52:45 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #41939: [SPARK-44341][SQL][PYTHON] Define the computing logic through PartitionEvaluator API and use it in WindowExec and WindowInPandasExec - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/07/14 04:56:50 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #41998: [SPARK-44411][SQL] Use PartitionEvaluator API in ArrowEvalPythonExec and BatchEvalPythonExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/14 04:58:59 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41939: [SPARK-44341][SQL][PYTHON] Define the computing logic through PartitionEvaluator API and use it in WindowExec and WindowInPandasExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/14 05:05:01 UTC, 2 replies.
- [GitHub] [spark] viirya commented on pull request #41939: [SPARK-44341][SQL][PYTHON] Define the computing logic through PartitionEvaluator API and use it in WindowExec and WindowInPandasExec - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/07/14 05:05:14 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41997: [SPARK-44222][BUILD][PYTHON] Upgrade grpc to 1.56.0 with lower/upperbound - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 05:06:39 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41997: [SPARK-44222][BUILD][PYTHON] Upgrade grpc to 1.56.0 with lower/upperbound - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 05:06:59 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] commented on pull request #41996: Bump grpcio from 1.48.1 to 1.53.0 in /dev - posted by "dependabot[bot] (via GitHub)" <gi...@apache.org> on 2023/07/14 05:07:49 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] closed pull request #41996: Bump grpcio from 1.48.1 to 1.53.0 in /dev - posted by "dependabot[bot] (via GitHub)" <gi...@apache.org> on 2023/07/14 05:07:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #41999: [SPARK-44418][PYTHON][CONNECT] Upgrade protobuf from 3.19.5 to 3.20.3 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 05:26:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #41997: [SPARK-44222][BUILD][PYTHON] Upgrade grpc to 1.56.0 with lower/upperbound - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/14 05:32:29 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #41985: [SPARK-44391][SQL][3.4] Check the number of argument types in `InvokeLike` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/14 05:39:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41947: [SPARK-44217][PYTHON] Allow custom precision for fp approx equality - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 06:10:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41947: [SPARK-44217][PYTHON] Allow custom precision for fp approx equality - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 06:11:36 UTC, 0 replies.
- [GitHub] [spark] caican00 opened a new pull request, #42000: [SPARK-44419][SQL] Support to extract partial filters of datasource v2 table and push them down - posted by "caican00 (via GitHub)" <gi...@apache.org> on 2023/07/14 06:16:41 UTC, 0 replies.
- [GitHub] [spark] caican00 commented on pull request #42000: [SPARK-44419][SQL] Support to extract partial filters of datasource v2 table and push them down - posted by "caican00 (via GitHub)" <gi...@apache.org> on 2023/07/14 06:31:06 UTC, 3 replies.
- [GitHub] [spark] yaooqinn commented on pull request #41992: [SPARK-44409][SQL] Handle char/varchar in Dataset.to to keep consistent with others - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/14 06:36:46 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41941: [SPARK-44382][BUILD] Upgrade protobuf-java to 3.23.4 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/14 06:37:25 UTC, 1 replies.
- [GitHub] [spark] Yikf commented on pull request #42000: [SPARK-44419][SQL] Support to extract partial filters of datasource v2 table and push them down - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2023/07/14 07:15:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #41941: [SPARK-44382][BUILD] Upgrade protobuf-java to 3.23.4 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/14 07:19:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41999: [SPARK-44418][PYTHON][CONNECT] Upgrade protobuf from 3.19.5 to 3.20.3 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 07:34:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41999: [SPARK-44418][PYTHON][CONNECT] Upgrade protobuf from 3.19.5 to 3.20.3 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 07:35:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41994: [SPARK-44415][BUILD] Upgrade snappy-java to 1.1.10.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/14 07:38:53 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41995: [SPARK-44416][CONNECT][BUILD] Upgrade buf to v1.24.0 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/14 08:00:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41864: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/14 08:08:54 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42001: Test bc-java 1.75 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/14 08:22:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42002: Test use LinkedList instead of ArrayList in `TaskMemoryManager#acquireExecutionMemory` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/14 08:25:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42002: [CORE]Test use LinkedList instead of ArrayList in `TaskMemoryManager#acquireExecutionMemory` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/14 08:50:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41315: [SPARK-43755][CONNECT] Move execution out of SparkExecutePlanStreamHandler and to a different thread - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/14 09:12:57 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41860: [SPARK-44307][SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/14 10:14:52 UTC, 5 replies.
- [GitHub] [spark] caican00 opened a new pull request, #42003: [SPARK-44426][SQL] Optimize adaptive skew join for ExistenceJoin - posted by "caican00 (via GitHub)" <gi...@apache.org> on 2023/07/14 10:31:49 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41958: [SPARK-44386][SQL] Use PartitionEvaluator API in HashAggregateExec, ObjectHashAggregateExec, SortAggregateExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/14 10:39:14 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42004: [SPARK-44427][SQL] Use PartitionEvaluator API in MapElementsExec, MapGroupsExec, MapPartitionsExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/14 12:57:01 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #41443: [SPARK-43923][CONNECT] Post listenerBus events during ExecutePlanRequest - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/14 13:19:16 UTC, 22 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42005: [SPARK-44428][SQL][TEST] Add test case for all PartitionEvaluator API - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/14 13:32:01 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42005: [SPARK-44428][SQL][TEST] Add test case for all PartitionEvaluator API - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/14 13:32:51 UTC, 1 replies.
- [GitHub] [spark] jdesjean commented on a diff in pull request #41443: [SPARK-43923][CONNECT] Post listenerBus events during ExecutePlanRequest - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/07/14 15:31:48 UTC, 11 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42006: [SPARK-44429][SQL][TESTS] Make `MsSqlServerIntegrationSuite` robust in ANSI mode - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/14 15:44:30 UTC, 0 replies.
- [GitHub] [spark] cdkrot commented on pull request #41987: [SPARK-44410][PYTHON][CONNECT] Set active session in create, not just getOrCreate - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/07/14 16:12:38 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #41443: [SPARK-43923][CONNECT] Post listenerBus events during ExecutePlanRequest - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/14 16:23:10 UTC, 3 replies.
- [GitHub] [spark] jchen5 opened a new pull request, #42007: [SPARK-44431][SQL] Fix behavior of null IN (empty list) in optimization rules - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/14 16:33:08 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41927: [SPARK-44216] [PYTHON] Make assertSchemaEqual API public - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/14 16:34:24 UTC, 1 replies.
- [GitHub] [spark] jchen5 commented on pull request #42007: [SPARK-44431][SQL] Fix behavior of null IN (empty list) in optimization rules - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/14 16:34:51 UTC, 1 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42008: [SPARK-44430][SQL] Add cause to `AnalysisException` when option is invalid - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/14 16:36:43 UTC, 0 replies.
- [GitHub] [spark] xuanyuanking closed pull request #41315: [SPARK-43755][CONNECT] Move execution out of SparkExecutePlanStreamHandler and to a different thread - posted by "xuanyuanking (via GitHub)" <gi...@apache.org> on 2023/07/14 16:57:05 UTC, 0 replies.
- [GitHub] [spark] xuanyuanking commented on pull request #41315: [SPARK-43755][CONNECT] Move execution out of SparkExecutePlanStreamHandler and to a different thread - posted by "xuanyuanking (via GitHub)" <gi...@apache.org> on 2023/07/14 16:57:26 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #41946: [SPARK-44264][PYTHON][ML] FunctionPickler Class - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/14 18:05:08 UTC, 1 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #41973: [SPARK-44264] Refactoring TorchDistributor To Allow for Custom "run_training_on_file" Function Pointer - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/07/14 18:17:04 UTC, 11 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #41973: [SPARK-44264] Refactoring TorchDistributor To Allow for Custom "run_training_on_file" Function Pointer - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/14 18:24:15 UTC, 10 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #41989: [SPARK-43965][PYTHON][CONNECT] Support Python UDTF in Spark Connect - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/14 18:24:55 UTC, 1 replies.
- [GitHub] [spark] szehon-ho commented on a diff in pull request #41990: [SPARK-42454][SQL] SPJ: encapsulate all SPJ related parameters in BatchScanExec - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2023/07/14 19:49:12 UTC, 5 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #41974: [SPARK-44401][PYTHON][DOCS] Arrow Python UDF Use Guide - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/14 19:52:58 UTC, 1 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #41990: [SPARK-42454][SQL] SPJ: encapsulate all SPJ related parameters in BatchScanExec - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/07/14 19:56:03 UTC, 2 replies.
- [GitHub] [spark] rangadi commented on pull request #41443: [SPARK-43923][CONNECT] Post listenerBus events during ExecutePlanRequest - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/14 20:05:44 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42009: [SPARK-44422] Spark Connect fine grained interrupt - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/14 20:15:00 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42009: [SPARK-44422] Spark Connect fine grained interrupt - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/14 20:15:17 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #41443: [SPARK-43923][CONNECT] Post listenerBus events during ExecutePlanRequest - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/14 20:17:35 UTC, 3 replies.
- [GitHub] [spark] srielau commented on pull request #40474: [SPARK-42849] [WIP] [SQL] Session Variables - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/07/14 20:28:28 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #41964: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/14 20:55:54 UTC, 6 replies.
- [GitHub] [spark] jasonli-db commented on pull request #41964: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "jasonli-db (via GitHub)" <gi...@apache.org> on 2023/07/14 21:04:16 UTC, 5 replies.
- [GitHub] [spark] xinrong-meng closed pull request #41946: [SPARK-44264][PYTHON][ML] FunctionPickler Class - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/14 21:12:51 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #41946: [SPARK-44264][PYTHON][ML] FunctionPickler Class - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/14 21:12:59 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on pull request #41964: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/14 21:18:28 UTC, 9 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #42010: [SPARK-44438] Shutdown scheduled executor used for maintenance task if an error is reported - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/07/14 21:22:22 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #42010: [SPARK-44438] Shutdown scheduled executor used for maintenance task if an error is reported - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/07/14 21:41:44 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #41948: [SPARK-44380][SQL][PYTHON] Support for Python UDTF to analyze in Python - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/14 22:40:26 UTC, 1 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #41973: [SPARK-44264] Refactoring TorchDistributor To Allow for Custom "run_training_on_file" Function Pointer - posted by "lu-wang-dl (via GitHub)" <gi...@apache.org> on 2023/07/14 22:56:43 UTC, 2 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #40474: [SPARK-42849] [SQL] Session Variables - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/14 22:57:58 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #41964: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/14 23:07:39 UTC, 7 replies.
- [GitHub] [spark] jasonli-db commented on a diff in pull request #41964: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "jasonli-db (via GitHub)" <gi...@apache.org> on 2023/07/14 23:27:21 UTC, 9 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42011: [SPARK-44396][Connect] Direct Arrow Deserialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/14 23:31:03 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42011: [SPARK-44396][Connect] Direct Arrow Deserialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/14 23:40:51 UTC, 1 replies.
- [GitHub] [spark] ericm-db opened a new pull request, #42012: [SPARK-44440][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2023/07/14 23:41:40 UTC, 0 replies.
- [GitHub] [spark] bogao007 opened a new pull request, #42013: [SPARK-44439][CONNECT][SS]Fixed listListeners to only send ids back to client - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/14 23:52:34 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on a diff in pull request #42013: [SPARK-44439][CONNECT][SS]Fixed listListeners to only send ids back to client - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/14 23:53:53 UTC, 1 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42013: [SPARK-44439][CONNECT][SS]Fixed listListeners to only send ids back to client - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/14 23:59:58 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40667: Improve IDE build experience against jdk11 - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/15 00:22:43 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40633: [SPARK-43000][SQL] Do not cast to double type in `PromoteStrings` - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/15 00:22:44 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40553: [SPARK-39722] [SQL] getString API for Dataset - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/15 00:22:45 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #41989: [SPARK-43965][PYTHON][CONNECT] Support Python UDTF in Spark Connect - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/15 00:32:17 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #42014: [SPARK-44412][SQL] Use PartitionEvaluator API in ArrowEvalPythonUDTFExec & BatchEvalPythonUDTFExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/15 00:56:38 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #41976: [SPARK-44021][SQL][FOLLOW-UP] Fix log messages when the number of partition exceeds maxPartitionNum - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/15 02:22:14 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #41976: [SPARK-44021][SQL][FOLLOW-UP] Fix log messages when the number of partition exceeds maxPartitionNum - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/15 02:22:41 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #41994: [SPARK-44415][BUILD] Upgrade snappy-java to 1.1.10.2 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/15 02:23:42 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #41994: [SPARK-44415][BUILD] Upgrade snappy-java to 1.1.10.2 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/15 02:30:05 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42014: [SPARK-44412][SQL] Use PartitionEvaluator API in ArrowEvalPythonUDTFExec & BatchEvalPythonUDTFExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/15 04:13:19 UTC, 0 replies.
- [GitHub] [spark] sunchao closed pull request #41990: [SPARK-42454][SQL] SPJ: encapsulate all SPJ related parameters in BatchScanExec - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/07/15 04:45:59 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #41990: [SPARK-42454][SQL] SPJ: encapsulate all SPJ related parameters in BatchScanExec - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/07/15 04:46:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42001: Test bc-java 1.75 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/15 05:03:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42015: [SPARK-44441][BUILD] Upgrade `bcprov-jdk15on` and `bcpkix-jdk15on` to 1.70 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/15 05:08:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42015: [SPARK-44441][BUILD] Upgrade `bcprov-jdk15on` and `bcpkix-jdk15on` to 1.70 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/15 05:22:57 UTC, 2 replies.
- [GitHub] [spark] mathewjacob1002 commented on pull request #41968: [SPARK-44264][PYTHON][ML][FOLLOW-UP] Clean Up Deepspeed Code - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/15 05:24:57 UTC, 0 replies.
- [GitHub] [spark] naveentnj opened a new pull request, #42016: Update BindingParquetOutputCommitter.scala - posted by "naveentnj (via GitHub)" <gi...@apache.org> on 2023/07/15 08:19:56 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42017: [SPARK-44443][SQL] Use PartitionEvaluator API in CoGroupExec, DeserializeToObjectExec, ExternalRDDScanExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/15 08:55:44 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #42015: [SPARK-44441][BUILD] Upgrade `bcprov-jdk15on` and `bcpkix-jdk15on` to 1.70 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/15 11:14:40 UTC, 1 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42018: [SPARK-42321][SQL] Assign name to _LEGACY_ERROR_TEMP_2133 - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/15 11:52:50 UTC, 0 replies.
- [GitHub] [spark] Kimahriman commented on pull request #41569: [SPARK-39979][SQL][FOLLOW-UP] Support large variable types in pandas UDF, createDataFrame and toPandas with Arrow - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/07/15 12:26:01 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #41988: [MINOR][SS][DOCS] Fix typos in the Scaladoc and make the semantic of getCurrentWatermarkMs explicit - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/15 13:31:46 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #41988: [MINOR][SS][DOCS] Fix typos in the Scaladoc and make the semantic of getCurrentWatermarkMs explicit - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/15 13:31:54 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42010: [SPARK-44438][SS] Shutdown scheduled executor used for maintenance task if an error is reported - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/15 13:32:43 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #42010: [SPARK-44438][SS] Shutdown scheduled executor used for maintenance task if an error is reported - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/15 13:33:41 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42012: [SPARK-44440][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/15 13:36:19 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #42015: [SPARK-44441][BUILD] Upgrade `bcprov-jdk15on` and `bcpkix-jdk15on` to 1.70 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/15 15:40:48 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42015: [SPARK-44441][BUILD] Upgrade `bcprov-jdk15on` and `bcpkix-jdk15on` to 1.70 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/15 16:40:46 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #42015: [SPARK-44441][BUILD] Upgrade `bcprov-jdk15on` and `bcpkix-jdk15on` to 1.70 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/15 17:17:17 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42015: [SPARK-44441][BUILD] Upgrade `bcprov-jdk15on` and `bcpkix-jdk15on` to 1.70 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/15 17:17:38 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #42019: [SPARK-44445][BUILD] Upgrade `htmlunit` to 3.3.0 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/15 18:32:00 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #42019: [SPARK-44445][BUILD] Upgrade `htmlunit` to 3.3.0 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/15 19:30:15 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen closed pull request #42019: [SPARK-44445][BUILD] Upgrade `htmlunit` to 3.3.0 - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/15 19:30:16 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #42014: [SPARK-44412][SQL] Use PartitionEvaluator API in ArrowEvalPythonUDTFExec & BatchEvalPythonUDTFExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/15 20:37:16 UTC, 1 replies.
- [GitHub] [spark] learningchess2003 opened a new pull request, #42020: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/07/15 23:33:12 UTC, 0 replies.
- [GitHub] [spark] learningchess2003 closed pull request #41864: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/07/15 23:33:38 UTC, 0 replies.
- [GitHub] [spark] learningchess2003 commented on pull request #42020: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/07/15 23:36:47 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40621: Fix ExecutorAllocationManager cannot allocate new instances when all … - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/16 00:26:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40553: [SPARK-39722] [SQL] getString API for Dataset - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/16 00:26:04 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42012: [SPARK-44440][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/16 00:53:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #41989: [SPARK-43965][PYTHON][CONNECT] Support Python UDTF in Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/16 02:22:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42021: [SPARK-43418][CONNECT][FOLLOWUP] Remove the deprecation warning in `SparkSession.Builder.build` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/16 02:29:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42021: [SPARK-43418][CONNECT][FOLLOWUP] Remove the deprecation warning in `SparkSession.Builder.build` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/16 03:24:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42021: [SPARK-43418][CONNECT][FOLLOWUP] Remove the deprecation warning in `SparkSession.Builder.build` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/16 03:43:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42021: [SPARK-43418][CONNECT][FOLLOWUP] Remove the deprecation warning in `SparkSession.Builder.build` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/16 03:48:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42021: [SPARK-43418][CONNECT][FOLLOWUP] Remove the deprecation warning in `SparkSession.Builder.build` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/16 03:48:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41989: [SPARK-43965][PYTHON][CONNECT] Support Python UDTF in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/16 07:23:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41989: [SPARK-43965][PYTHON][CONNECT] Support Python UDTF in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/16 07:23:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41987: [SPARK-44410][PYTHON][CONNECT] Set active session in create, not just getOrCreate - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/16 07:31:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41987: [SPARK-44410][PYTHON][CONNECT] Set active session in create, not just getOrCreate - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/16 07:31:36 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #42012: [SPARK-44440][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/16 08:35:53 UTC, 1 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #42022: [WIP][SQL] Move `WithCTE` into command queries - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/07/16 12:55:41 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42023: [SPARK-44446][PYTHON] Add checks for expected list type special cases - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/16 19:44:57 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #42024: [SPARK-44361][SQL] Use PartitionEvaluator API in MapInBatchExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/16 21:47:47 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #42025: [SPARK-44447][SQL] Use PartitionEvaluator API in FlatMapGroupsInPandasExec, FlatMapCoGroupsInPandasExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/16 21:54:17 UTC, 0 replies.
- [GitHub] [spark] jchen5 opened a new pull request, #42026: [SPARK-44448][SQL] Fix wrong results bug from DenseRankLimitIterator and InferWindowGroupLimit - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/16 22:49:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41927: [SPARK-44216] [PYTHON] Make assertSchemaEqual API public - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/16 23:46:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41927: [SPARK-44216] [PYTHON] Make assertSchemaEqual API public - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/16 23:46:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41968: [SPARK-44264][PYTHON][ML][FOLLOW-UP] Clean Up Deepspeed Code - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/16 23:54:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41968: [SPARK-44264][PYTHON][ML][FOLLOW-UP] Clean Up Deepspeed Code - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/16 23:55:15 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40672: [SPARK-43035][CONNECT] Add error class in Spark Connect server's ErrorInfo - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/17 00:24:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40621: Fix ExecutorAllocationManager cannot allocate new instances when all … - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/17 00:24:51 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40523: [SPARK-42897][SQL] Avoid evaluate variables multiple times for SMJ and SHJ fullOuter join - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/17 00:24:52 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40461: [SPARK-42831][SQL] Show result expressions in AggregateExec - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/17 00:24:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40415: [Do not merge] Add JDBC to DataFrameWriter - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/17 00:24:56 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40315: [SPARK-42699][CONNECT] SparkConnectServer should make client and AM same exit code - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/17 00:24:57 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40314: [SPARK-42698][CORE] SparkSubmit should also stop SparkContext when exit program in yarn mode and pass exitCode to AM side - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/17 00:24:59 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40114: [SPARK-42513][SQL] Push down topK through join - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/17 00:25:01 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42027: [SPARK-44413][PYTHON] Clarify error for unsupported arg data type in … - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/17 00:47:21 UTC, 0 replies.
- [GitHub] [spark] asl3 closed pull request #41991: [DO-NOT-MERGE] Clarify error for unsupported arg data type in assertDataFrameEqual - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/17 00:51:37 UTC, 0 replies.
- [GitHub] [spark] ericm-db commented on a diff in pull request #42012: [SPARK-44440][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2023/07/17 01:04:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42028: [SPARK-44451][BUILD] Make built document downloadable - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/17 01:05:39 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42028: [SPARK-44451][BUILD] Make built document downloadable - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/17 01:10:43 UTC, 2 replies.
- [GitHub] [spark] jchen5 commented on pull request #42026: [SPARK-44448][SQL] Fix wrong results bug from DenseRankLimitIterator and InferWindowGroupLimit - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/17 01:20:01 UTC, 2 replies.
- [GitHub] [spark] caican00 commented on pull request #42003: [SPARK-44426][SQL] Optimize adaptive skew join for ExistenceJoin - posted by "caican00 (via GitHub)" <gi...@apache.org> on 2023/07/17 01:57:42 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37011: [SPARK-39625][SPARK-38904][SQL] Add Dataset.as(StructType) - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/17 01:59:58 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #42029: [SPARK-44362][SQL] Use PartitionEvaluator API in AggregateInPandasExec and AttachDistributedSequenceExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/17 02:03:25 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #42028: [SPARK-44451][BUILD] Make built document downloadable - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/07/17 02:05:38 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42028: [SPARK-44451][BUILD] Make built document downloadable - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/17 02:07:23 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42028: [SPARK-44451][BUILD] Make built document downloadable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 02:08:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42026: [SPARK-44448][SQL] Fix wrong results bug from DenseRankLimitIterator and InferWindowGroupLimit - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/17 02:53:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42016: Update BindingParquetOutputCommitter.scala - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 02:58:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42008: [SPARK-44430][SQL] Add cause to `AnalysisException` when option is invalid - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 03:01:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42008: [SPARK-44430][SQL] Add cause to `AnalysisException` when option is invalid - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 03:01:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42006: [SPARK-44429][SQL][TESTS] Make `MsSqlServerIntegrationSuite` robust in ANSI mode - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 03:05:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42006: [SPARK-44429][SQL][TESTS] Make `MsSqlServerIntegrationSuite` robust in ANSI mode - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 03:05:30 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42007: [SPARK-44431][SQL] Fix behavior of null IN (empty list) in optimization rules - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/17 03:14:50 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on a diff in pull request #42007: [SPARK-44431][SQL] Fix behavior of null IN (empty list) in optimization rules - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/17 03:17:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41993: [SPARK-44414][SQL] Fixed matching check for CharType/VarcharType - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/17 03:17:35 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #41974: [SPARK-44401][PYTHON][DOCS] Arrow Python UDF Use Guide - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/17 03:33:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42030: [SPARK-44452][CONNECT][TESTS] Move `override test` function from `RemoteSparkSession` to `ConnectFunSuite` and ignore `ArrowEncoderSuite` for Java 21 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/17 03:35:19 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42031: [SPARK-44453][PYTHON] Use difflib to display errors in assertDataFrameEqual - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/17 03:41:00 UTC, 0 replies.
- [GitHub] [spark] maheshk114 commented on a diff in pull request #41860: [SPARK-44307][SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "maheshk114 (via GitHub)" <gi...@apache.org> on 2023/07/17 03:45:27 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42026: [SPARK-44448][SQL] Fix wrong results bug from DenseRankLimitIterator and InferWindowGroupLimit - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/17 03:50:07 UTC, 8 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42028: [SPARK-44451][BUILD] Make built document downloadable - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/17 04:00:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42013: [SPARK-44439][CONNECT][SS]Fixed listListeners to only send ids back to client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 04:35:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42013: [SPARK-44439][CONNECT][SS]Fixed listListeners to only send ids back to client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 04:36:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42032: [MINOR][DOCS] Fix a typo in dev/merge_spark_pr.py - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/17 04:36:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42032: [MINOR][DOCS] Fix a typo in dev/merge_spark_pr.py - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/17 04:36:26 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42032: [MINOR][DOCS] Fix a typo in dev/merge_spark_pr.py - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/17 04:39:38 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #42033: [SPARK-44454][SQL][HIVE] HiveShim getTablesByType support fallback - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/17 05:02:58 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41990: [SPARK-42454][SQL] SPJ: encapsulate all SPJ related parameters in BatchScanExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/17 05:21:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42005: [SPARK-44428][SQL][TEST] Add test case for all PartitionEvaluator API - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/17 05:27:28 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #41992: [SPARK-44409][SQL] Handle char/varchar in Dataset.to to keep consistent with others - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/17 05:35:32 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X closed pull request #42005: [SPARK-44428][SQL][TEST] Add test case for all PartitionEvaluator API - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/17 05:35:49 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41939: [SPARK-44341][SQL][PYTHON] Define the computing logic through PartitionEvaluator API and use it in WindowExec and WindowInPandasExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/17 06:03:31 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #39268: [SPARK-41752][SQL][UI] Group nested executions under the root execution - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/17 06:13:26 UTC, 0 replies.
- [GitHub] [spark] RunyaoChen opened a new pull request, #42034: [SPARK-44455][SQL] Quote identifiers with backticks in SHOW CREATE TABLE result - posted by "RunyaoChen (via GitHub)" <gi...@apache.org> on 2023/07/17 06:38:21 UTC, 0 replies.
- [GitHub] [spark] rangadi opened a new pull request, #42035: [SPARK-42944] Streaming ForeachBatch - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/17 07:00:50 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42035: [SPARK-42944] Streaming ForeachBatch - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/17 07:06:17 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #42035: [SPARK-42944] Streaming ForeachBatch - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/17 07:06:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42036: [SPARK-44355][SQL] Move WithCTE into command queries - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/17 07:40:36 UTC, 0 replies.
- [GitHub] [spark] 7mming7 opened a new pull request, #42037: SPARK-44305 Dynamically choose whether to broadcast hadoop conf - posted by "7mming7 (via GitHub)" <gi...@apache.org> on 2023/07/17 07:57:51 UTC, 0 replies.
- [GitHub] [spark] TongWei1105 opened a new pull request, #42038: SPARK-42500: ConstantPropagation support more case - posted by "TongWei1105 (via GitHub)" <gi...@apache.org> on 2023/07/17 08:47:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42039: Test ArrowEncoderSuite with Java 17 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/17 09:01:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42039: Test ArrowEncoderSuite with Java 17 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/17 09:01:53 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #42038: [SPARK-42500][SQL] ConstantPropagation support more case - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/17 09:09:26 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #42038: [SPARK-42500][SQL] ConstantPropagation support more case - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/07/17 09:48:19 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42039: [SPARK-44457][CONNECT][TESTS] Add `instantNow/localDateTimeNow` to `ConnectFunSuite` and make `ArrowEncoderSuite` pass Java 17 daily test - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/17 09:59:13 UTC, 0 replies.
- [GitHub] [spark] somani commented on pull request #41860: [SPARK-44307][SQL] Add Bloom filter for left outer join even if the left side table is smaller than broadcast threshold. - posted by "somani (via GitHub)" <gi...@apache.org> on 2023/07/17 10:09:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42040: [WIP][SPARK-43611][SQL][PS][CONNCECT] Fix unexpected `AnalysisException` from Spark Connect client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/17 10:13:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42040: [WIP][SPARK-43611][SQL][PS][CONNCECT] Fix unexpected `AnalysisException` from Spark Connect client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/17 10:25:49 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42040: [WIP][SPARK-43611][SQL][PS][CONNCECT] Fix unexpected `AnalysisException` from Spark Connect client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/17 10:27:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42039: [SPARK-44457][CONNECT][TESTS] Add `truncatedTo(ChronoUnit.MICROS)` to make `ArrowEncoderSuite` in Java 17 daily test GA task pass - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/17 10:35:06 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #42041: [DO-NOT-MERGE][TESTS] Enable pandas API on Spark tests related to SPARK-43611 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/17 10:49:25 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42040: [WIP][SPARK-43611][SQL][PS][CONNCECT] Fix unexpected `AnalysisException` from Spark Connect client - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/17 11:02:53 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41251: [SPARK-43521][SQL] Add `CREATE TABLE LIKE FILE` statement - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/17 12:35:18 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on a diff in pull request #42026: [SPARK-44448][SQL] Fix wrong results bug from DenseRankLimitIterator and InferWindowGroupLimit - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/17 12:54:31 UTC, 6 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42020: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/17 13:57:23 UTC, 1 replies.
- [GitHub] [spark] jchen5 opened a new pull request, #42042: [SPARK-44448][SQL] Add test cases for DenseRankLimitIterator InferWindowGroupLimit bug - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/17 14:40:55 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42035: [SPARK-42944] Streaming ForeachBatch in Python - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/17 16:33:27 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42020: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/17 16:47:13 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42035: [SPARK-42944] Streaming ForeachBatch in Python - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/17 16:56:14 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on a diff in pull request #42035: [SPARK-42944] Streaming ForeachBatch in Python - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/17 17:32:14 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #41974: [SPARK-44401][PYTHON][DOCS] Arrow Python UDF Use Guide - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/17 17:50:15 UTC, 3 replies.
- [GitHub] [spark] anchovYu commented on a diff in pull request #42020: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/07/17 17:59:01 UTC, 1 replies.
- [GitHub] [spark] learningchess2003 commented on a diff in pull request #42020: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/07/17 18:31:30 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #42034: [SPARK-44455][SQL] Quote identifiers with backticks in SHOW CREATE TABLE result - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/17 18:33:45 UTC, 2 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #42023: [SPARK-44446][PYTHON] Add checks for expected list type special cases - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/17 18:43:39 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #42023: [SPARK-44446][PYTHON] Add checks for expected list type special cases - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/17 18:43:40 UTC, 0 replies.
- [GitHub] [spark] RunyaoChen commented on a diff in pull request #42034: [SPARK-44455][SQL] Quote identifiers with backticks in SHOW CREATE TABLE result - posted by "RunyaoChen (via GitHub)" <gi...@apache.org> on 2023/07/17 19:40:52 UTC, 0 replies.
- [GitHub] [spark] harshmotw-db opened a new pull request, #42043: [SPARK-44154][SQL] Added more unit tests to BitmapExpressionUtilsSuite and made minor improvements to Bitmap Aggregate Expressions - posted by "harshmotw-db (via GitHub)" <gi...@apache.org> on 2023/07/17 21:44:06 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42027: [SPARK-44413][PYTHON] Clarify error for unsupported arg data type in … - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/17 22:09:16 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42044: [SPARK-43967][PYTHON] Support regular Python UDTFs with empty return values - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/17 22:59:44 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 opened a new pull request, #42045: [DO NOT REVIEW/WIP] Refactoring TorchDistributor - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/17 23:19:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42035: [SPARK-42944][SS][PYTHON] Streaming ForeachBatch in Python - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 23:21:09 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42031: [SPARK-44453][PYTHON] Use difflib to display errors in assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 23:21:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42031: [SPARK-44453][PYTHON] Use difflib to display errors in assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 23:21:48 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42035: [SPARK-42944][SS][PYTHON] Streaming ForeachBatch in Python - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/17 23:39:01 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42027: [SPARK-44413][PYTHON] Clarify error for unsupported arg data type in assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 23:50:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42027: [SPARK-44413][PYTHON] Clarify error for unsupported arg data type in assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/17 23:51:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41443: [SPARK-43923][CONNECT] Post listenerBus events during ExecutePlanRequest - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 00:04:18 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41443: [SPARK-43923][CONNECT] Post listenerBus events during ExecutePlanRequest - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 00:04:48 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #39952: [SPARK-40770][PYTHON][FOLLOW-UP] Improved error messages for mapInPandas for schema mismatch - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/18 00:08:51 UTC, 0 replies.
- [GitHub] [spark] siying opened a new pull request, #42046: [SPARK-40434][SS] Implement applyInPandasWithState in PySpark - posted by "siying (via GitHub)" <gi...@apache.org> on 2023/07/18 00:31:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40672: [SPARK-43035][CONNECT] Add error class in Spark Connect server's ErrorInfo - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/18 00:36:13 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40523: [SPARK-42897][SQL] Avoid evaluate variables multiple times for SMJ and SHJ fullOuter join - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/18 00:36:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40461: [SPARK-42831][SQL] Show result expressions in AggregateExec - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/18 00:36:15 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40415: [Do not merge] Add JDBC to DataFrameWriter - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/18 00:36:16 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40315: [SPARK-42699][CONNECT] SparkConnectServer should make client and AM same exit code - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/18 00:36:17 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40314: [SPARK-42698][CORE] SparkSubmit should also stop SparkContext when exit program in yarn mode and pass exitCode to AM side - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/18 00:36:18 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40114: [SPARK-42513][SQL] Push down topK through join - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/18 00:36:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42020: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/18 00:57:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42020: [SPARK-44059] Add analyzer support of named arguments for built-in functions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/18 00:58:09 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42047: [SPARK-44465][BUILD] Upgrade zstd-jni to 1.5.5-5 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/18 01:02:13 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42047: [SPARK-44465][BUILD] Upgrade zstd-jni to 1.5.5-5 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/18 01:03:55 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42035: [SPARK-42944][SS][PYTHON] Streaming ForeachBatch in Python - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/18 01:33:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42026: [SPARK-44448][SQL] Fix wrong results bug from DenseRankLimitIterator and InferWindowGroupLimit - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/18 02:16:17 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41932: [SPARK-44131][SQL][FOLLOWUP] Support qualified function name for call_function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/18 02:35:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42048: [SPARK-44467][BUILD] Setting version to 4.0.0-SNAPSHOT - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/18 02:44:53 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42048: [SPARK-44467][BUILD] Setting version to 4.0.0-SNAPSHOT - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/18 02:46:01 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41932: [SPARK-44131][SQL][FOLLOWUP] Support qualified function name for call_function - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/18 02:47:50 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42049: [SPARK-44466][SQL] Update initialSessionOptions to the value after supplementation - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/18 02:48:32 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42049: [SPARK-44466][SQL] Update initialSessionOptions to the value after supplementation - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/18 02:49:40 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41993: [SPARK-44414][SQL] Fixed matching check for CharType/VarcharType - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/18 02:51:11 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42050: [SPARK-44468][BUILD] Add daily test GA task for branch3.5 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/18 03:00:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42043: [SPARK-44154][SQL] Added more unit tests to BitmapExpressionUtilsSuite and made minor improvements to Bitmap Aggregate Expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/18 03:02:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42043: [SPARK-44154][SQL] Added more unit tests to BitmapExpressionUtilsSuite and made minor improvements to Bitmap Aggregate Expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/18 03:03:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42050: [SPARK-44468][BUILD] Add daily test GA task for branch3.5 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/18 03:03:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42050: [SPARK-44468][BUILD] Add daily test GA task for branch3.5 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/18 03:10:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42050: [SPARK-44468][BUILD] Add daily test GA task for branch3.5 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 03:13:04 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41932: [SPARK-44131][SQL][PYTHON][CONNECT][FOLLOWUP] Support qualified function name for call_function - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/18 03:22:13 UTC, 12 replies.
- [GitHub] [spark] beliefer commented on pull request #42042: [SPARK-44448][SQL] Add test cases for DenseRankLimitIterator InferWindowGroupLimit bug - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/18 03:28:43 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41932: [SPARK-44131][SQL][PYTHON][CONNECT][FOLLOWUP] Support qualified function name for call_function - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/18 03:37:25 UTC, 7 replies.
- [GitHub] [spark] cloud-fan closed pull request #41939: [SPARK-44341][SQL][PYTHON] Define the computing logic through PartitionEvaluator API and use it in WindowExec and WindowInPandasExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/18 03:37:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42051: [SPARK-44348][CONNECT][FOLLOW-UP] Avoid double slashes in the URI - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 03:42:05 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42044: [SPARK-43967][PYTHON] Support regular Python UDTFs with empty return values - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/18 03:55:24 UTC, 0 replies.
- [GitHub] [spark] shrutiverma29 opened a new pull request, #42052: [SPARK-43035][Connect] Add error class in Spark Connect server's ErrorInfo - posted by "shrutiverma29 (via GitHub)" <gi...@apache.org> on 2023/07/18 04:20:14 UTC, 0 replies.
- [GitHub] [spark] shrutiverma29 closed pull request #42052: [SPARK-43035][Connect] Add error class in Spark Connect server's ErrorInfo - posted by "shrutiverma29 (via GitHub)" <gi...@apache.org> on 2023/07/18 04:21:18 UTC, 0 replies.
- [GitHub] [spark] shrutiverma29 opened a new pull request, #42053: [SPARK-43035][Connect] Add error class in Spark Connect server's ErrorInfo - posted by "shrutiverma29 (via GitHub)" <gi...@apache.org> on 2023/07/18 04:24:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41932: [SPARK-44131][SQL][PYTHON][CONNECT][FOLLOWUP] Support qualified function name for call_function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/18 05:40:27 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42051: [SPARK-44348][CONNECT][FOLLOW-UP] Avoid double slashes in the URI - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 05:50:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42051: [SPARK-44348][CONNECT][FOLLOW-UP] Avoid double slashes in the URI - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 05:50:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42044: [SPARK-43967][PYTHON] Support regular Python UDTFs with empty return values - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 05:58:48 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #42049: [SPARK-44466][SQL] Update initialSessionOptions to the value after supplementation - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/18 05:58:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42044: [SPARK-43967][PYTHON] Support regular Python UDTFs with empty return values - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 05:59:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41974: [SPARK-44401][PYTHON][DOCS] Arrow Python UDF Use Guide - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 06:01:29 UTC, 1 replies.
- [GitHub] [spark] xuanyuanking opened a new pull request, #42054: [SPARK-44470][BUILD] Setting version to 4.0.0-SNAPSHOT - posted by "xuanyuanking (via GitHub)" <gi...@apache.org> on 2023/07/18 06:28:32 UTC, 0 replies.
- [GitHub] [spark] xuanyuanking opened a new pull request, #42055: [SPARK-44471][INFRA] Add Github action test job for branch-3.5 - posted by "xuanyuanking (via GitHub)" <gi...@apache.org> on 2023/07/18 06:47:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42056: [SPARK-43203][SQL][FOLLOWUP] V2SessionCatalog.dropTable should handle null table - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/18 06:51:00 UTC, 0 replies.
- [GitHub] [spark] xuanyuanking opened a new pull request, #42057: [SPARK-44471][INFRA][BRANCH-3.5] Add Github action test job for branch-3.5 - posted by "xuanyuanking (via GitHub)" <gi...@apache.org> on 2023/07/18 06:57:49 UTC, 0 replies.
- [GitHub] [spark] zml1206 commented on pull request #41978: [SPARK-32268][SQL][FOLLOWUP] Filter creation side size threshold judgment should prun column in injectBloomFilter - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2023/07/18 07:12:18 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #40524: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/18 07:34:09 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42028: [SPARK-44451][BUILD] Make built document downloadable - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/18 07:40:55 UTC, 0 replies.
- [GitHub] [spark] liangyu-1 opened a new pull request, #42058: SPARK-42972. ExecutorAllocationManager cannot allocate new instances … - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/07/18 07:48:25 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #41934: [SPARK-43974][CONNECT][BUILD][FOLLOWUP] Upgrade buf to v1.23.1 - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/18 07:48:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #41948: [SPARK-44380][SQL][PYTHON] Support for Python UDTF to analyze in Python - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 07:55:41 UTC, 0 replies.
- [GitHub] [spark] liangyu-1 commented on pull request #42058: [SPARK-42972][spark-structured-streaming]. ExecutorAllocationManager cannot allocate new instances … - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/07/18 08:04:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42009: [SPARK-44422][CONNECT] Spark Connect fine grained interrupt - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 08:12:45 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42057: [SPARK-44471][INFRA][BRANCH-3.5] Add Github action test job for branch-3.5 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 08:14:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42057: [SPARK-44471][INFRA][BRANCH-3.5] Add Github action test job for branch-3.5 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 08:15:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42055: [SPARK-44471][INFRA] Change branches in build_and_test.yml for master branch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 08:16:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42055: [SPARK-44471][INFRA] Change branches in build_and_test.yml for master branch - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 08:17:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42050: [SPARK-44468][BUILD] Add daily test GA task for branch3.5 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 08:17:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42059: [SPARK-43923][CONNECT][FOLLOW-UP][TESTS] Skip "Test observe response" at SparkConnectServiceSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 08:29:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42059: [SPARK-43923][CONNECT][FOLLOW-UP][TESTS] Skip "Test observe response" at SparkConnectServiceSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 08:31:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42059: [SPARK-43923][CONNECT][FOLLOW-UP][TESTS] Skip "Test observe response" at SparkConnectServiceSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 08:32:02 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #42009: [SPARK-44422][CONNECT] Spark Connect fine grained interrupt - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/18 08:43:09 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42009: [SPARK-44422][CONNECT] Spark Connect fine grained interrupt - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/18 09:31:30 UTC, 1 replies.
- [GitHub] [spark] yaooqinn closed pull request #42056: [SPARK-43203][SQL][FOLLOWUP] V2SessionCatalog.dropTable should handle null table - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/18 09:40:15 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42056: [SPARK-43203][SQL][FOLLOWUP] V2SessionCatalog.dropTable should handle null table - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/18 09:40:44 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42048: [SPARK-44467][BUILD] Setting version to 4.0.0-SNAPSHOT - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/18 09:41:41 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42048: [SPARK-44467][BUILD] Setting version to 4.0.0-SNAPSHOT - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/18 09:41:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42042: [SPARK-44448][SQL] Add test cases for DenseRankLimitIterator InferWindowGroupLimit bug - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 09:57:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42039: [SPARK-44457][CONNECT][TESTS] Add `truncatedTo(ChronoUnit.MICROS)` to make `ArrowEncoderSuite` in Java 17 daily test GA task pass - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 10:00:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42037: [SPARK-44305][SQL] Dynamically choose whether to broadcast hadoop conf - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/18 10:02:37 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42060: [SPARK-43755][FOLLOWUP] Open `AdaptiveSparkPlanHelper.allChildren` instead of copying - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/18 10:23:08 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42060: [SPARK-43755][FOLLOWUP] Open `AdaptiveSparkPlanHelper.allChildren` instead of copying - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/18 10:28:35 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on pull request #42042: [SPARK-44448][SQL] Add test cases for DenseRankLimitIterator InferWindowGroupLimit bug - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/18 10:46:13 UTC, 1 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #42061: [MINOR] Move `spark.stage.maxConsecutiveAttempts` to config - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/18 10:56:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #41948: [SPARK-44380][SQL][PYTHON] Support for Python UDTF to analyze in Python - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/18 11:54:39 UTC, 3 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/07/18 12:04:08 UTC, 1 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #41518: [SPARK-19335][SPARK-38200][SQL] Add upserts for writing to JDBC - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/07/18 12:13:01 UTC, 1 replies.
- [GitHub] [spark] 7mming7 commented on a diff in pull request #42037: [SPARK-44305][SQL] Dynamically choose whether to broadcast hadoop conf - posted by "7mming7 (via GitHub)" <gi...@apache.org> on 2023/07/18 13:03:11 UTC, 0 replies.
- [GitHub] [spark] jchen5 closed pull request #42042: [SPARK-44448][SQL] Add test cases for DenseRankLimitIterator InferWindowGroupLimit bug - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/18 13:25:51 UTC, 0 replies.
- [GitHub] [spark] Kimahriman commented on pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/07/18 13:39:45 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #42029: [SPARK-44362][SQL] Use PartitionEvaluator API in AggregateInPandasExec and AttachDistributedSequenceExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/18 14:35:55 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #42025: [SPARK-44447][SQL] Use PartitionEvaluator API in FlatMapGroupsInPandasExec, FlatMapCoGroupsInPandasExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/18 14:36:11 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #42024: [SPARK-44361][SQL] Use PartitionEvaluator API in MapInBatchExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/18 14:36:26 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #41998: [SPARK-44411][SQL] Use PartitionEvaluator API in ArrowEvalPythonExec and BatchEvalPythonExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/18 14:36:38 UTC, 0 replies.
- [GitHub] [spark] vicennial opened a new pull request, #42062: [SPARK-44476][CORE][CONNECT] Fix population of artifacts for a JobArtifactState with no associated artifacts - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/18 14:45:38 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on pull request #42062: [SPARK-44476][CORE][CONNECT] Fix population of artifacts for a JobArtifactState with no associated artifacts - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/18 14:54:19 UTC, 0 replies.
- [GitHub] [spark] touchida commented on pull request #41628: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions - posted by "touchida (via GitHub)" <gi...@apache.org> on 2023/07/18 15:44:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42047: [SPARK-44465][BUILD] Upgrade zstd-jni to 1.5.5-5 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/18 15:55:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42058: [SPARK-42972][DSTREAM] ExecutorAllocationManager cannot allocate new instances when all executors down - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/18 15:56:17 UTC, 0 replies.
- [GitHub] [spark] agubichev commented on a diff in pull request #41301: [SPARK-43780][SQL] Support correlated references in join predicates - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2023/07/18 16:38:51 UTC, 2 replies.
- [GitHub] [spark] jdesjean opened a new pull request, #42063: [SPARK-44474] Reenable "Test observe response" at SparkConnectServiceSuite - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/07/18 17:33:31 UTC, 0 replies.
- [GitHub] [spark] jdesjean commented on pull request #41443: [SPARK-43923][CONNECT] Post listenerBus events during ExecutePlanRequest - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/07/18 17:44:55 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #42064: [SPARK-44477][SQL] Treat TYPE_CHECK_FAILURE_WITH_HINT as an error subclass - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/07/18 17:55:48 UTC, 0 replies.
- [GitHub] [spark] revans2 commented on pull request #40524: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "revans2 (via GitHub)" <gi...@apache.org> on 2023/07/18 18:24:24 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #42011: [SPARK-44396][Connect] Direct Arrow Deserialization - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/18 18:40:07 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42011: [SPARK-44396][Connect] Direct Arrow Deserialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/18 21:17:05 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42065: [SPARK-43965][FOLLOW-UP] Include test_parity_udtf in spark test module - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/18 21:45:28 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42065: [SPARK-43965][PYTHON][CONNECT][FOLLOWUP] Include test_parity_udtf in spark test module - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/18 21:47:01 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42065: [SPARK-43965][PYTHON][CONNECT][FOLLOWUP] Include test_parity_udtf in spark test module - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/18 21:51:23 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42046: [SPARK-44464][SS] Implement applyInPandasWithState in PySpark - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/18 22:31:02 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #42046: [SPARK-44464][SS] Implement applyInPandasWithState in PySpark - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/18 22:39:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42035: [SPARK-42944][SS][PYTHON] Streaming ForeachBatch in Python - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 00:03:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42035: [SPARK-42944][SS][PYTHON] Streaming ForeachBatch in Python - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 00:04:24 UTC, 0 replies.
- [GitHub] [spark] ericm-db opened a new pull request, #42066: Maintenance thread pool optional - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2023/07/19 00:25:57 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #42045: [SPARK-44264][ML][PYTHON] Incorporating FunctionPickler Into TorchDistributor - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2023/07/19 00:29:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42062: [SPARK-44476][CORE][CONNECT] Fix population of artifacts for a JobArtifactState with no associated artifacts - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 00:36:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42062: [SPARK-44476][CORE][CONNECT] Fix population of artifacts for a JobArtifactState with no associated artifacts - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 00:37:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42060: [SPARK-43755][CONNECT][MINOR] Open `AdaptiveSparkPlanHelper.allChildren` instead of using copy in `MetricGenerator` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 00:51:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42060: [SPARK-43755][CONNECT][MINOR] Open `AdaptiveSparkPlanHelper.allChildren` instead of using copy in `MetricGenerator` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 00:51:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41882: [SPARK-44324][SQL][CONNECT] Move CaseInsensitiveMap to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 01:02:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41882: [SPARK-44324][SQL][CONNECT] Move CaseInsensitiveMap to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 01:02:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42026: [SPARK-44448][SQL] Fix wrong results bug from DenseRankLimitIterator and InferWindowGroupLimit - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 01:12:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42026: [SPARK-44448][SQL] Fix wrong results bug from DenseRankLimitIterator and InferWindowGroupLimit - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 01:13:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41974: [SPARK-44401][PYTHON][DOCS] Arrow Python UDF Use Guide - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 01:21:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41974: [SPARK-44401][PYTHON][DOCS] Arrow Python UDF Use Guide - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 01:21:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41973: [SPARK-44264][ML][PYTHON] Refactoring TorchDistributor To Allow for Custom "run_training_on_file" Function Pointer - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 01:50:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41973: [SPARK-44264][ML][PYTHON] Refactoring TorchDistributor To Allow for Custom "run_training_on_file" Function Pointer - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 01:50:25 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 opened a new pull request, #42067: [SPARK-44264] Support Distributed Training of Functions Using Deepspeed - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/19 02:15:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42054: [SPARK-44470][BUILD] Setting version to 4.0.0-SNAPSHOT - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 02:18:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42063: [SPARK-44474][CONNECT] Reenable "Test observe response" at SparkConnectServiceSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 02:25:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42063: [SPARK-44474][CONNECT] Reenable "Test observe response" at SparkConnectServiceSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 02:25:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42061: [MINOR] Move `spark.stage.maxConsecutiveAttempts` to config - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 02:27:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42061: [MINOR] Move `spark.stage.maxConsecutiveAttempts` to config - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 02:27:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41932: [SPARK-44131][SQL][PYTHON][CONNECT][FOLLOWUP] Support qualified function name for call_function - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 03:47:29 UTC, 5 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41949: [SPARK-44375][SQL] Use PartitionEvaluator API in DebugExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 03:51:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41949: [SPARK-44375][SQL] Use PartitionEvaluator API in DebugExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 03:52:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41998: [SPARK-44411][SQL] Use PartitionEvaluator API in ArrowEvalPythonExec and BatchEvalPythonExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 03:57:13 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41998: [SPARK-44411][SQL] Use PartitionEvaluator API in ArrowEvalPythonExec and BatchEvalPythonExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 03:58:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42024: [SPARK-44361][SQL] Use PartitionEvaluator API in MapInBatchExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 04:00:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42065: [SPARK-43965][PYTHON][CONNECT][FOLLOWUP] Include test_parity_udtf in spark test module - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/19 04:00:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42065: [SPARK-43965][PYTHON][CONNECT][FOLLOWUP] Include test_parity_udtf in spark test module - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/19 04:01:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42024: [SPARK-44361][SQL] Use PartitionEvaluator API in MapInBatchExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 04:02:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42024: [SPARK-44361][SQL] Use PartitionEvaluator API in MapInBatchExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 04:03:20 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42058: [SPARK-42972][DSTREAM] ExecutorAllocationManager cannot allocate new instances when all executors down - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/19 04:46:31 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #42061: [MINOR] Move `spark.stage.maxConsecutiveAttempts` to config - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/19 04:48:40 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42058: [SPARK-42972][DSTREAM] ExecutorAllocationManager cannot allocate new instances when all executors down - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/19 04:50:36 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on a diff in pull request #42061: [MINOR] Move `spark.stage.maxConsecutiveAttempts` to config - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/19 04:52:46 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42066: [SPARK-44480][SS] Add option for thread pool to perform maintenance for RocksDB/HDFS State Store Providers - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/19 04:56:12 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #42037: [SPARK-44305][SQL] Dynamically choose whether to broadcast hadoop conf - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/19 04:59:31 UTC, 0 replies.
- [GitHub] [spark] ericm-db commented on pull request #42066: [SPARK-44480][SS] Add option for thread pool to perform maintenance for RocksDB/HDFS State Store Providers - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2023/07/19 05:03:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42068: [SPARK-44361][SQL][FOLLOW-UP] Remove unused variables and fix import statements - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 05:09:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42068: [SPARK-44361][SQL][FOLLOW-UP] Remove unused variables and fix import statements - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 05:09:12 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42067: [SPARK-44264][ML][PYTHON] Support Distributed Training of Functions Using Deepspeed - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/19 05:21:21 UTC, 2 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #42069: [WIP] Fix class loading problem caused by stub user classes not found on the server classpath - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/19 05:23:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42003: [SPARK-44426][SQL] Optimize adaptive skew join for ExistenceJoin - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/19 05:28:11 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #42066: [SPARK-44480][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/19 05:38:50 UTC, 0 replies.
- [GitHub] [spark] 7mming7 commented on pull request #42037: [SPARK-44305][SQL] Dynamically choose whether to broadcast hadoop conf - posted by "7mming7 (via GitHub)" <gi...@apache.org> on 2023/07/19 06:01:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42070: [MINOR][INFRA] Update the labeler for CORE and CONNECT - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/19 06:01:38 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41928: [SPARK-44475][SQL][CONNECT] Relocate DataType and Parser to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 06:29:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42068: [SPARK-44361][SQL][FOLLOW-UP] Remove unused variables and fix import statements - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 06:53:22 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #42003: [SPARK-44426][SQL] Optimize adaptive skew join for ExistenceJoin - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/19 06:54:47 UTC, 2 replies.
- [GitHub] [spark] mridulm closed pull request #41821: [SPARK-44272][YARN] Path Inconsistency when Operating statCache within Yarn Client - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/19 07:19:21 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42066: [SPARK-44480][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/19 07:44:41 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #41928: [SPARK-44475][SQL][CONNECT] Relocate DataType and Parser to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/19 07:48:23 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #42069: [WIP] Fix class loading problem caused by stub user classes not found on the server classpath - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/19 08:18:06 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42067: [SPARK-44264][ML][PYTHON] Support Distributed Training of Functions Using Deepspeed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 08:29:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42067: [SPARK-44264][ML][PYTHON] Support Distributed Training of Functions Using Deepspeed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 08:29:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42068: [SPARK-44361][SQL][FOLLOW-UP] Remove unused variables and fix import statements - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 09:02:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42070: [MINOR][INFRA] Update the labeler for CORE and CONNECT - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 09:03:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42070: [MINOR][INFRA] Update the labeler for CORE and CONNECT - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 09:03:47 UTC, 0 replies.
- [GitHub] [spark] Deependra-Patel opened a new pull request, #42071: [SPARK-44209] Expose amount of shuffle data available on the node - posted by "Deependra-Patel (via GitHub)" <gi...@apache.org> on 2023/07/19 09:20:31 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42054: [SPARK-44470][BUILD] Setting version to 4.0.0-SNAPSHOT - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/19 10:02:03 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42054: [SPARK-44470][BUILD] Setting version to 4.0.0-SNAPSHOT - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/19 10:02:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42072: [SPARK-44481][CONNECT][PYTHON] Make pyspark.sql.is_remote an API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 10:25:50 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42073: [SPARK-44482][CONNECT] Connect server should can specify the bind address - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/19 11:08:35 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42073: [SPARK-44482][CONNECT] Connect server should can specify the bind address - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/19 11:28:07 UTC, 1 replies.
- [GitHub] [spark] harupy commented on a diff in pull request #42072: [SPARK-44481][CONNECT][PYTHON] Make pyspark.sql.is_remote an API - posted by "harupy (via GitHub)" <gi...@apache.org> on 2023/07/19 12:08:16 UTC, 0 replies.
- [GitHub] [spark] heyihong commented on pull request #41831: [SPARK-44278][CONNECT] Implement a GRPC server interceptor that cleans up thread local properties - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/07/19 13:04:08 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42011: [SPARK-44396][Connect] Direct Arrow Deserialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/19 13:27:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41850: [SPARK-44292][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2315-2319] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 14:12:58 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41349: [SPARK-43839][SQL] Convert `_LEGACY_ERROR_TEMP_1337` to `UNSUPPORTED_FEATURE.TIME_TRAVEL` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/19 14:50:37 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #41928: [SPARK-44475][SQL][CONNECT] Relocate DataType and Parser to sql/api - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/19 15:20:25 UTC, 1 replies.
- [GitHub] [spark] ericm-db commented on pull request #42066: [SPARK-44480][SS] Use thread pool to perform maintenance activity for hdfs/rocksdb state store providers - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2023/07/19 17:22:43 UTC, 0 replies.
- [GitHub] [spark] siying commented on pull request #42046: [SPARK-44464][SS] Implement applyInPandasWithState in PySpark - posted by "siying (via GitHub)" <gi...@apache.org> on 2023/07/19 18:24:57 UTC, 1 replies.
- [GitHub] [spark] siying opened a new pull request, #42074: [SPARK-44464][SS] Fix applyInPandasWithStatePythonRunner to output rows that have Null as first column value - posted by "siying (via GitHub)" <gi...@apache.org> on 2023/07/19 18:30:39 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42075: [SPARK-43966][SQL][PYTHON] Support non-deterministic table-valued functions - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/19 18:37:34 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42009: [SPARK-44422][CONNECT] Spark Connect fine grained interrupt - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/19 18:59:36 UTC, 4 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42076: [SPARK-44449][CONNECT] Upcasting for direct Arrow Deserialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/19 19:20:03 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42077: [SPARK-44484][SS] Add batchDuration to StreamingQueryProgress json method - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/19 20:41:03 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 opened a new pull request, #42078: [WIP][DO NOT REVIEW] Testing stuff - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/19 21:37:51 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #41711: [SPARK-44155] Adding a dev utility to improve error messages based on LLM - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/19 22:03:39 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42072: [SPARK-44481][CONNECT][PYTHON] Make pyspark.sql.is_remote an API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 23:55:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41831: [SPARK-44278][CONNECT] Implement a GRPC server interceptor that cleans up thread local properties - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 23:58:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41831: [SPARK-44278][CONNECT] Implement a GRPC server interceptor that cleans up thread local properties - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/19 23:59:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42072: [SPARK-44481][CONNECT][PYTHON] Make pyspark.sql.is_remote an API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/20 00:02:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42072: [SPARK-44481][CONNECT][PYTHON] Make pyspark.sql.is_remote an API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/20 00:02:48 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #42079: [WIP][SPARK-44486][PYTHON][CONNECT] Implement PyArrow `self_destruct` feature for `toPandas` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/20 00:10:06 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40728: [WIP][SPARK-39634][SQL] Allow file splitting in combination with row index generation - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/20 00:21:08 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40608: [SPARK-35198][CONNECT][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/20 00:21:11 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42075: [SPARK-43966][SQL][PYTHON] Support non-deterministic table-valued functions - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/20 00:21:23 UTC, 0 replies.
- [GitHub] [spark] ericm-db opened a new pull request, #42080: Statestoresuite threadpool - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2023/07/20 00:54:57 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #42009: [SPARK-44422][CONNECT] Spark Connect fine grained interrupt - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/20 01:07:03 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #41349: [SPARK-43839][SQL] Convert `_LEGACY_ERROR_TEMP_1337` to `UNSUPPORTED_FEATURE.TIME_TRAVEL` - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/20 01:45:42 UTC, 3 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42081: [SPARK-44487][TEST] Fix KubernetesSuite report NPE when not set spark.kubernetes.test.unpackSparkDir - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/20 02:04:03 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42081: [SPARK-44487][TEST] Fix KubernetesSuite report NPE when not set spark.kubernetes.test.unpackSparkDir - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/20 02:04:23 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41347: [SPARK-43838][SQL] Fix subquery on single table with having clause can't be optimized - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/20 02:15:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41347: [SPARK-43838][SQL] Fix subquery on single table with having clause can't be optimized - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/20 02:16:37 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41347: [SPARK-43838][SQL] Fix subquery on single table with having clause can't be optimized - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/20 02:18:46 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #41850: [SPARK-44292][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[2315-2319] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/20 02:28:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42081: [SPARK-44487][TEST] Fix KubernetesSuite report NPE when not set spark.kubernetes.test.unpackSparkDir - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/20 02:32:04 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42082: [SPARK-43839][SQL][FOLLOWUP] Convert _LEGACY_ERROR_TEMP_1337 to UNSUPPORTED_FEATURE.TIME_TRAVEL - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/20 02:42:17 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42082: [SPARK-43839][SQL][FOLLOWUP] Convert _LEGACY_ERROR_TEMP_1337 to UNSUPPORTED_FEATURE.TIME_TRAVEL - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/20 02:48:54 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42082: [SPARK-43839][SQL][FOLLOWUP] Convert _LEGACY_ERROR_TEMP_1337 to UNSUPPORTED_FEATURE.TIME_TRAVEL - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/20 02:49:16 UTC, 1 replies.
- [GitHub] [spark] richardc-db opened a new pull request, #42083: Support deserializing long types when creating `Metadata` object from JObject - posted by "richardc-db (via GitHub)" <gi...@apache.org> on 2023/07/20 03:08:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42007: [SPARK-44431][SQL] Fix behavior of null IN (empty list) in optimization rules - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/20 03:09:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42007: [SPARK-44431][SQL] Fix behavior of null IN (empty list) in optimization rules - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/20 03:10:04 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42084: [SPARK-44292][SQL][FOLLOWUP] Make TYPE_CHECK_FAILURE_WITH_HINT use correct name - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/20 03:32:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41948: [SPARK-44380][SQL][PYTHON] Support for Python UDTF to analyze in Python - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/20 03:45:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #41948: [SPARK-44380][SQL][PYTHON] Support for Python UDTF to analyze in Python - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/20 03:46:07 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #42085: [SPARK-44490][WEBUI] Remove `TaskPagedTable` in StagePage - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/20 04:27:55 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42086: [SPARK-43611][SQL][PS][CONNCECT] Make `ExtractWindowExpressions` retain the `PLAN_ID_TAG ` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/20 04:52:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42086: [SPARK-43611][SQL][PS][CONNCECT] Make `ExtractWindowExpressions` retain the `PLAN_ID_TAG ` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/20 04:58:36 UTC, 3 replies.
- [GitHub] [spark] beliefer commented on pull request #41932: [SPARK-44131][SQL][PYTHON][CONNECT][FOLLOWUP] Support qualified function name for call_function - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/20 05:00:05 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42040: [WIP][SPARK-43611][SQL][PS][CONNCECT] Fix unexpected `AnalysisException` from Spark Connect client - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/20 05:09:12 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 opened a new pull request, #42087: [SPARK-44264] Added Example to Deepspeed Distributor - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/20 05:17:20 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42088: [SPARK-44491][INFRA] Add `branch-3.5` to `publish_snapshot` GitHub Action job - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/20 05:45:46 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42088: [SPARK-44491][INFRA] Add `branch-3.5` to `publish_snapshot` GitHub Action job - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/20 05:47:05 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #41951: [SPARK-44367][SQL][UI] Show error message on UI for each failed query - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/20 05:51:51 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42086: [SPARK-43611][SQL][PS][CONNCECT] Make `ExtractWindowExpressions` retain the `PLAN_ID_TAG ` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/20 06:11:27 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #40524: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/20 06:11:42 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42041: [DO-NOT-MERGE][PS][TESTS] Enable pandas API on Spark tests related to SPARK-43611 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/20 06:13:36 UTC, 0 replies.
- [GitHub] [spark] itholic closed pull request #42041: [DO-NOT-MERGE][PS][TESTS] Enable pandas API on Spark tests related to SPARK-43611 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/20 06:13:38 UTC, 0 replies.
- [GitHub] [spark] liangyu-1 commented on a diff in pull request #42058: [SPARK-42972][DSTREAM]ExecutorAllocationManager cannot allocate new instances when all executors down - posted by "liangyu-1 (via GitHub)" <gi...@apache.org> on 2023/07/20 06:26:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42088: [SPARK-44491][INFRA] Add `branch-3.5` to `publish_snapshot` GitHub Action job - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/20 06:34:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42088: [SPARK-44491][INFRA] Add `branch-3.5` to `publish_snapshot` GitHub Action job - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/20 06:35:12 UTC, 0 replies.
- [GitHub] [spark] yihua commented on pull request #40728: [WIP][SPARK-39634][SQL] Allow file splitting in combination with row index generation - posted by "yihua (via GitHub)" <gi...@apache.org> on 2023/07/20 06:36:59 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #42084: [SPARK-44292][SQL][FOLLOWUP] Make TYPE_CHECK_FAILURE_WITH_HINT use correct name - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/20 06:42:18 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42086: [SPARK-43611][SQL][PS][CONNCECT] Make `ExtractWindowExpressions` retain the `PLAN_ID_TAG ` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/20 06:46:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42088: [SPARK-44491][INFRA] Add `branch-3.5` to `publish_snapshot` GitHub Action job - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/20 06:47:19 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #42089: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/20 07:15:29 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #40524: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/20 07:16:16 UTC, 0 replies.
- [GitHub] [spark] dh20 opened a new pull request, #42090: [SPARK-44483] [SQL] When using Spark to read the hive table, the number of file partitions cannot be set using Spark's configuration settings - posted by "dh20 (via GitHub)" <gi...@apache.org> on 2023/07/20 07:19:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41928: [SPARK-44475][SQL][CONNECT] Relocate DataType and Parser to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/20 07:20:46 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41928: [SPARK-44475][SQL][CONNECT] Relocate DataType and Parser to sql/api - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/20 07:26:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #41562: [DRAFT] Change StreamingQueryProgress use Jackson API instead of json4s - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/20 07:37:58 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42077: [SPARK-44484][SS]Add batchDuration to StreamingQueryProgress json method - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/20 07:43:12 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42077: [SPARK-44484][SS]Add batchDuration to StreamingQueryProgress json method - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/20 07:48:29 UTC, 3 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42091: [SPARK-44494] Test use minikube 1.30.1 to test - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/20 08:01:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42091: [SPARK-44494] Test use minikube 1.30.1 to test - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/20 08:03:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42091: [SPARK-44494] Test use minikube 1.30.1 to test - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/20 08:05:40 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42091: [SPARK-44494] Test use minikube 1.30.1 to test - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/20 08:06:43 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #42092: [SPARK-44496][SQL][CONNECT] Move Interfaces needed by SCSC to sql/api - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/20 08:41:15 UTC, 0 replies.
- [GitHub] [spark] ASiegeLion commented on pull request #33958: [SPARK-36718][SQL] Only collapse projects if we don't duplicate expensive expressions - posted by "ASiegeLion (via GitHub)" <gi...@apache.org> on 2023/07/20 08:59:31 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #42093: [SPARK-44497][WEBUI] Show task partition id in Task table - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/20 09:20:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42082: [SPARK-43839][SQL][FOLLOWUP] Convert _LEGACY_ERROR_TEMP_1337 to UNSUPPORTED_FEATURE.TIME_TRAVEL - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/20 09:43:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42082: [SPARK-43839][SQL][FOLLOWUP] Convert _LEGACY_ERROR_TEMP_1337 to UNSUPPORTED_FEATURE.TIME_TRAVEL - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/20 09:45:02 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42092: [SPARK-44496][SQL][CONNECT] Move Interfaces needed by SCSC to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/20 09:50:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42091: [SPARK-44494][INFRA] Use `minikube` v1.30.1 for `k8s-integration-tests` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/20 10:23:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42091: [SPARK-44494][INFRA] Use `minikube` v1.30.1 for `k8s-integration-tests` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/20 10:24:47 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42091: [SPARK-44494][INFRA] Use `minikube` v1.30.1 for `k8s-integration-tests` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/20 11:02:37 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42093: [SPARK-44497][WEBUI] Show task partition id in Task table - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/20 11:46:41 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42049: [SPARK-44466][SQL] Exclude configs starting with `SPARK_DRIVER_PREFIX` and `SPARK_EXECUTOR_PREFIX` from modifiedConfigs - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/20 13:33:25 UTC, 0 replies.
- [GitHub] [spark] juanvisoler commented on pull request #40608: [SPARK-35198][CONNECT][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "juanvisoler (via GitHub)" <gi...@apache.org> on 2023/07/20 16:19:29 UTC, 0 replies.
- [GitHub] [spark] vkorukanti commented on pull request #40728: [SPARK-39634][SQL] Allow file splitting in combination with row index generation - posted by "vkorukanti (via GitHub)" <gi...@apache.org> on 2023/07/20 16:42:02 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on pull request #42092: [SPARK-44496][SQL][CONNECT] Move Interfaces needed by SCSC to sql/api - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/20 17:08:32 UTC, 0 replies.
- [GitHub] [spark] maddiedawson commented on a diff in pull request #42087: [SPARK-44264] Added Example to Deepspeed Distributor - posted by "maddiedawson (via GitHub)" <gi...@apache.org> on 2023/07/20 18:08:20 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42094: [SPARK-44501][K8S] Ignore checksum files in KubernetesLocalDiskShuffleExecutorComponents - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/20 18:33:13 UTC, 0 replies.
- [GitHub] [spark] liuzqt opened a new pull request, #42095: [SPARK-44485][CORE][SQL] Optimize TreeNode.generateTreeString - posted by "liuzqt (via GitHub)" <gi...@apache.org> on 2023/07/20 18:40:10 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #41628: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/07/20 19:22:19 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42096: [SPARK-42944][SS][PYTHON] Streaming ForeachBatch in Python followups - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/20 19:43:26 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42096: [SPARK-42944][SS][PYTHON] Streaming ForeachBatch in Python followups - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/20 19:44:51 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42096: [SPARK-42944][SS][PYTHON] Streaming ForeachBatch in Python followups - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/20 19:45:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42094: [SPARK-44501][K8S] Ignore checksum files in KubernetesLocalDiskShuffleExecutorComponents - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/20 19:46:45 UTC, 2 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42094: [SPARK-44501][K8S] Ignore checksum files in KubernetesLocalDiskShuffleExecutorComponents - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/07/20 19:51:46 UTC, 2 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42097: [SPARK-44502][DOC][SS][PYTHON] Add missing versionchanged field to streaming functions - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/20 19:59:15 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42097: [SPARK-44502][DOC][SS][PYTHON] Add missing versionchanged field to streaming functions - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/20 20:00:37 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42096: [SPARK-42944][SS][PYTHON][CONNECT] Streaming ForeachBatch in Python followups - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/20 20:02:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42094: [SPARK-44501][K8S] Ignore checksum files in KubernetesLocalDiskShuffleExecutorComponents - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/20 20:02:08 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42094: [SPARK-44501][K8S] Ignore checksum files in KubernetesLocalDiskShuffleExecutorComponents - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/20 20:23:09 UTC, 0 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #42098: [SPARK-44504][SS] Unload provider thereby forcing DB instance close and releasing resources on maintenance task error - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/07/20 20:56:16 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #42098: [SPARK-44504][SS] Unload provider thereby forcing DB instance close and releasing resources on maintenance task error - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2023/07/20 20:56:49 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #42099: [SPARK-44505][DSv2] Provide override for columnar support in Scan - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/20 22:24:13 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #42099: [SPARK-44505][DSv2] Provide override for columnar support in Scan - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/20 22:26:04 UTC, 1 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #42100: [SPARK-44503][SQL] Add SQL grammar for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/20 22:30:01 UTC, 0 replies.
- [GitHub] [spark] grundprinzip closed pull request #41117: Initial go client workflow v3 - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/20 22:54:16 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #41705: [SPARK-44252][SS] Define a new error class and apply for the case where loading state from DFS fails - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/20 23:46:05 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #41705: [SPARK-44252][SS] Define a new error class and apply for the case where loading state from DFS fails - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/20 23:51:47 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42077: [SPARK-44484][SS]Add batchDuration to StreamingQueryProgress json method - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/20 23:55:15 UTC, 1 replies.
- [GitHub] [spark] yihua commented on pull request #40728: [SPARK-39634][SQL] Allow file splitting in combination with row index generation - posted by "yihua (via GitHub)" <gi...@apache.org> on 2023/07/21 00:15:53 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40719: [WIP]Speed up parquet reading with Java Vector API - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/21 00:21:08 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40728: [SPARK-39634][SQL] Allow file splitting in combination with row index generation - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/21 00:21:08 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40608: [SPARK-35198][CONNECT][CORE][PYTHON][SQL] Add support for calling debugCodegen from Python & Java - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/21 00:21:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42084: [SPARK-44292][SQL][FOLLOWUP] Make TYPE_CHECK_FAILURE_WITH_HINT use correct name - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 00:23:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42084: [SPARK-44292][SQL][FOLLOWUP] Make TYPE_CHECK_FAILURE_WITH_HINT use correct name - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 00:23:58 UTC, 0 replies.
- [GitHub] [spark] vkorukanti opened a new pull request, #40728: [SPARK-39634][SQL] Allow file splitting in combination with row index generation - posted by "vkorukanti (via GitHub)" <gi...@apache.org> on 2023/07/21 00:24:43 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42098: [SPARK-44504][SS] Unload provider thereby forcing DB instance close and releasing resources on maintenance task error - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/21 00:26:53 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42095: [SPARK-44485][SQL] Optimize TreeNode.generateTreeString - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 00:39:01 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42064: [SPARK-44477][SQL] Treat TYPE_CHECK_FAILURE_WITH_HINT as an error subclass - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 00:44:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42064: [SPARK-44477][SQL] Treat TYPE_CHECK_FAILURE_WITH_HINT as an error subclass - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 00:45:03 UTC, 0 replies.
- [GitHub] [spark] liuzqt commented on pull request #42095: [SPARK-44485][SQL] Optimize TreeNode.generateTreeString - posted by "liuzqt (via GitHub)" <gi...@apache.org> on 2023/07/21 01:06:55 UTC, 2 replies.
- [GitHub] [spark] thomasg19930417 commented on pull request #37406: [SPARK-39921][SQL] SkewJoin--Stream side skew in BroadcastHashJoin - posted by "thomasg19930417 (via GitHub)" <gi...@apache.org> on 2023/07/21 01:36:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42081: [SPARK-44487][TEST] Fix KubernetesSuite report NPE when not set spark.kubernetes.test.unpackSparkDir - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 01:47:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42081: [SPARK-44487][TEST] Fix KubernetesSuite report NPE when not set spark.kubernetes.test.unpackSparkDir - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 01:47:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42083: [SPARK-44488] Support deserializing long types when creating `Metadata` object from JObject - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 01:49:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #33958: [SPARK-36718][SQL] Only collapse projects if we don't duplicate expensive expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 01:52:20 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #42049: [SPARK-44466][SQL] Exclude configs starting with `SPARK_DRIVER_PREFIX` and `SPARK_EXECUTOR_PREFIX` from modifiedConfigs - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/21 01:52:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41979: [SPARK-43952][SQL][FOLLOWUP] Correct AQE cancel broadcast job tag - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 01:53:22 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #42089: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/21 01:53:35 UTC, 1 replies.
- [GitHub] [spark] dh20 commented on pull request #42090: [SPARK-44483] [SQL] When using Spark to read the hive table, the number of file partitions cannot be set using Spark's configuration settings - posted by "dh20 (via GitHub)" <gi...@apache.org> on 2023/07/21 01:56:23 UTC, 1 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42101: [MINOR][UI] Simplify columnDefs in stagepage.js - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/21 02:23:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42089: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 02:26:16 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42101: [MINOR][UI] Simplify columnDefs in stagepage.js - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/21 02:29:15 UTC, 1 replies.
- [GitHub] [spark] ulysses-you closed pull request #41979: [SPARK-43952][SQL][FOLLOWUP] Correct AQE cancel broadcast job tag - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/21 02:41:47 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #42101: [MINOR][UI] Simplify columnDefs in stagepage.js - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/21 03:04:54 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42101: [MINOR][UI] Simplify columnDefs in stagepage.js - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/21 03:06:24 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42102: [SPARK-44506][BUILD] Upgrade mima-core & sbt-mima-plugin to 1.1.3 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/21 03:09:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42097: [SPARK-44502][DOC][SS][PYTHON] Add missing versionchanged field to streaming functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 03:09:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42097: [SPARK-44502][DOC][SS][PYTHON] Add missing versionchanged field to streaming functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 03:10:10 UTC, 0 replies.
- [GitHub] [spark] ychris78 opened a new pull request, #42103: [SPARK-44473] Overwriting the same partition of a partitioned table multiple times with empty data yields non-idempotent results - posted by "ychris78 (via GitHub)" <gi...@apache.org> on 2023/07/21 03:24:25 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #42104: [SPARK-44507][SQL][CONNECT] Scala client does not depend on AnalysisException - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/21 03:24:33 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42090: [SPARK-44483] [SQL] When using Spark to read the hive table, the number of file partitions cannot be set using Spark's configuration settings - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/21 03:50:04 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #40524: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/21 03:57:12 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42089: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/21 03:57:13 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41948: [SPARK-44380][SQL][PYTHON] Support for Python UDTF to analyze in Python - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 03:57:26 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42089: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/21 03:59:57 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #41948: [SPARK-44380][SQL][PYTHON] Support for Python UDTF to analyze in Python - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/21 04:03:17 UTC, 0 replies.
- [GitHub] [spark] richardc-db commented on a diff in pull request #42083: [SPARK-44488][SQL] Support deserializing long types when creating `Metadata` object from JObject - posted by "richardc-db (via GitHub)" <gi...@apache.org> on 2023/07/21 04:04:45 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #42105: [SPARK-44365][SQL] Use PartitionEvaluator API in FileSourceScanExec, RowDataSourceScanExec, MergeRowsExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/21 04:15:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42099: [SPARK-44505][SQL] Provide override for columnar support in Scan for DSv2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/21 04:18:07 UTC, 6 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42075: [SPARK-43966][SQL][PYTHON] Support non-deterministic table-valued functions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 04:20:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42075: [SPARK-43966][SQL][PYTHON] Support non-deterministic table-valued functions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 04:21:31 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42074: [SPARK-44464][SS] Fix applyInPandasWithStatePythonRunner to output rows that have Null as first column value - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/21 04:25:03 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #42036: [SPARK-44355][SQL] Move WithCTE into command queries - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/21 04:38:59 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #41746: [SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/21 04:40:23 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42036: [SPARK-44355][SQL] Move WithCTE into command queries - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/21 04:44:00 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #42098: [SPARK-44504][SS] Unload provider thereby forcing DB instance close and releasing resources on maintenance task error - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/21 04:46:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42095: [SPARK-44485][SQL] Optimize TreeNode.generateTreeString - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/21 04:48:01 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42106: [SPARK-44341][SQL][PYTHON] Move the base trait WindowEvaluatorFactoryBase to a single file - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/21 04:48:17 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #42077: [SPARK-44484][SS]Add batchDuration to StreamingQueryProgress json method - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/21 04:49:16 UTC, 0 replies.
- [GitHub] [spark] wang-zhun commented on pull request #37406: [SPARK-39921][SQL] SkewJoin--Stream side skew in BroadcastHashJoin - posted by "wang-zhun (via GitHub)" <gi...@apache.org> on 2023/07/21 04:50:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42089: [SPARK-42898][SQL] Mark that string/date casts do not need time zone id - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/21 04:53:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42074: [SPARK-44464][SS] Fix applyInPandasWithStatePythonRunner to output rows that have Null as first column value - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 04:55:02 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41763: [SPARK-44219][SQL] Adds extra per-rule validations for optimization rewrites. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 05:08:36 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42074: [SPARK-44464][SS] Fix applyInPandasWithStatePythonRunner to output rows that have Null as first column value - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 05:11:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #40728: [SPARK-39634][SQL] Allow file splitting in combination with row index generation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 05:15:49 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #40728: [SPARK-39634][SQL] Allow file splitting in combination with row index generation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 05:16:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42009: [SPARK-44422][CONNECT] Spark Connect fine grained interrupt - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/21 05:23:58 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #41856: [SPARK-44301][SQL] Add Benchmark Suite for TPCH - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/21 05:43:27 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #42095: [SPARK-44485][SQL] Optimize TreeNode.generateTreeString - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/21 05:51:06 UTC, 0 replies.
- [GitHub] [spark] dh20 commented on a diff in pull request #42090: [SPARK-44483][SQL] When using Spark to read the hive table, the number of file partitions cannot be set using Spark's configuration settings - posted by "dh20 (via GitHub)" <gi...@apache.org> on 2023/07/21 05:56:10 UTC, 3 replies.
- [GitHub] [spark] ulysses-you commented on pull request #42101: [MINOR][UI] Simplify columnDefs in stagepage.js - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/21 06:36:25 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42093: [SPARK-44497][WEBUI] Show task partition id in Task table - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/21 06:54:24 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42071: [SPARK-44209] Expose amount of shuffle data available on the node - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/21 07:03:10 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42102: [SPARK-44506][BUILD] Upgrade mima-core & sbt-mima-plugin to 1.1.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/21 07:09:35 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #42099: [SPARK-44505][SQL] Provide override for columnar support in Scan for DSv2 - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/21 07:09:57 UTC, 5 replies.
- [GitHub] [spark] LuciferYang closed pull request #42102: [SPARK-44506][BUILD] Upgrade mima-core & sbt-mima-plugin to 1.1.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/21 07:11:26 UTC, 0 replies.
- [GitHub] [spark] anthonywainer commented on pull request #38285: [SPARK-40820][PYTHON] Creating StructType from Json - posted by "anthonywainer (via GitHub)" <gi...@apache.org> on 2023/07/21 07:35:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42107: [MINOR][CONNECT] Remove redundant type cast in `ArtifactManager` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/21 08:09:28 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42108: [SPARK-44510][UI] Update dataTables to 1.13.5 and remove some unreached png files - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/21 08:25:51 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #42106: [SPARK-44341][SQL][PYTHON][FOLLOWUP] Move the base trait WindowEvaluatorFactoryBase to a single file - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/21 08:47:23 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42095: [SPARK-44485][SQL] Optimize TreeNode.generateTreeString - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/21 08:53:34 UTC, 1 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42108: [SPARK-44510][UI] Update dataTables to 1.13.5 and remove some unreached png files - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/21 09:05:26 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42104: [SPARK-44507][SQL][CONNECT] Scala client does not depend on AnalysisException - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/21 10:44:33 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42109: [SPARK-44404][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1009,1010,1013,1015,1016,1278] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/21 13:14:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42086: [SPARK-43611][SQL][PS][CONNCECT] Make `ExtractWindowExpressions` retain the `PLAN_ID_TAG ` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 13:15:11 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42099: [SPARK-44505][SQL] Provide override for columnar support in Scan for DSv2 - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 13:17:57 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42106: [SPARK-44341][SQL][PYTHON][FOLLOWUP] Move the base trait WindowEvaluatorFactoryBase to a single file - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 13:20:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42106: [SPARK-44341][SQL][PYTHON][FOLLOWUP] Move the base trait WindowEvaluatorFactoryBase to a single file - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/21 13:21:31 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42110: [SPARK-42098][SQL] Fix ResolveInlineTables can not handle with RuntimeReplaceable expression - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/21 14:25:11 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42110: [SPARK-42098][SQL] Fix ResolveInlineTables can not handle with RuntimeReplaceable expression - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/21 14:29:12 UTC, 5 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #42079: [SPARK-44486][PYTHON][CONNECT] Implement PyArrow `self_destruct` feature for `toPandas` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/21 17:31:14 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on pull request #42100: [SPARK-44503][SQL] Add SQL grammar for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/21 18:57:22 UTC, 1 replies.
- [GitHub] [spark] liuzqt commented on a diff in pull request #42095: [SPARK-44485][SQL] Optimize TreeNode.generateTreeString - posted by "liuzqt (via GitHub)" <gi...@apache.org> on 2023/07/21 20:16:36 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40719: [WIP]Speed up parquet reading with Java Vector API - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/22 00:21:32 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42111: [SPARK-44380][PYTHON][FOLLOWUP] Set __doc__ for analyze static method when Arrow is enabled - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/22 00:27:28 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42111: [SPARK-44380][PYTHON][FOLLOWUP] Set __doc__ for analyze static method when Arrow is enabled - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/22 00:27:42 UTC, 2 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42110: [SPARK-42098][SQL] Fix ResolveInlineTables can not handle with RuntimeReplaceable expression - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/22 00:37:07 UTC, 3 replies.
- [GitHub] [spark] yaooqinn closed pull request #42108: [SPARK-44510][UI] Update dataTables to 1.13.5 and remove some unreached png files - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/22 05:11:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42095: [SPARK-44485][SQL] Optimize TreeNode.generateTreeString - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/22 07:43:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42111: [SPARK-44380][PYTHON][FOLLOWUP] Set __doc__ for analyze static method when Arrow is enabled - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/22 07:49:03 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42111: [SPARK-44380][PYTHON][FOLLOWUP] Set __doc__ for analyze static method when Arrow is enabled - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/22 07:49:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42107: [MINOR][CONNECT] Remove redundant type cast in `ArtifactManager` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/22 07:50:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42107: [MINOR][CONNECT] Remove redundant type cast in `ArtifactManager` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/22 07:50:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42105: [SPARK-44365][SQL] Use PartitionEvaluator API in FileSourceScanExec, RowDataSourceScanExec, MergeRowsExec - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/22 07:51:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42101: [MINOR][UI] Simplify columnDefs in stagepage.js - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/22 07:52:40 UTC, 0 replies.
- [GitHub] [spark] sarutak commented on pull request #42101: [MINOR][UI] Simplify columnDefs in stagepage.js - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/07/22 08:11:07 UTC, 0 replies.
- [GitHub] [spark] sarutak closed pull request #42101: [MINOR][UI] Simplify columnDefs in stagepage.js - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/07/22 08:12:45 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42112: [SPARK-44493][SQL] Extract pushable predicates from disjunctive predicates - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/22 09:40:19 UTC, 0 replies.
- [GitHub] [spark] caican00 commented on a diff in pull request #42003: [SPARK-44426][SQL] Optimize adaptive skew join for ExistenceJoin - posted by "caican00 (via GitHub)" <gi...@apache.org> on 2023/07/22 13:02:20 UTC, 4 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #41746: [SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/22 15:03:46 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on pull request #41712: [SPARK-44132][SQL] Materialize `Stream` of join column names to avoid codegen failure - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/07/22 20:08:23 UTC, 3 replies.
- [GitHub] [spark] steven-aerts commented on pull request #41712: [SPARK-44132][SQL] Materialize `Stream` of join column names to avoid codegen failure - posted by "steven-aerts (via GitHub)" <gi...@apache.org> on 2023/07/22 20:20:31 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42092: [SPARK-44496][SQL][CONNECT] Move Interfaces needed by SCSC to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/22 23:52:15 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42092: [SPARK-44496][SQL][CONNECT] Move Interfaces needed by SCSC to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/22 23:53:48 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40749: [SPARK-43100][CORE] Mismatch of field name in log event writer and parser for push shuffle metrics - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/23 00:22:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42107: [MINOR][CONNECT] Remove redundant type cast in `ArtifactManager` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/23 07:37:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42099: [SPARK-44505][SQL] Provide override for columnar support in Scan for DSv2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/23 07:40:19 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #42099: [SPARK-44505][SQL] Provide override for columnar support in Scan for DSv2 - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/23 07:45:15 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42113: [SPARK-44513][BUILD] Upgrade snappy-java to 1.1.10.3 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/23 13:02:02 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42105: [SPARK-44365][SQL] Use PartitionEvaluator API in FileSourceScanExec, RowDataSourceScanExec, MergeRowsExec - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/23 14:33:34 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42114: [SPARK-44514][SQL] Rewrite the join to filter if one side maximum number of rows is 1 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/23 15:11:04 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40749: [SPARK-43100][CORE] Mismatch of field name in log event writer and parser for push shuffle metrics - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/24 00:20:55 UTC, 0 replies.
- [GitHub] [spark] wForget commented on a diff in pull request #41609: [SPARK-44065][SQL] Optimize BroadcastHashJoin skew in OptimizeSkewedJoin - posted by "wForget (via GitHub)" <gi...@apache.org> on 2023/07/24 01:33:02 UTC, 2 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42115: [DO NOT Merge][TEST] free disk space - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/24 01:34:18 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #42116: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/24 01:58:51 UTC, 0 replies.
- [GitHub] [spark] WweiL closed pull request #41096: [SPARK-42941][SS][CONNECT][DRAFT][DO-NOT-REVIEW] Python StreamingQueryListener POC - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/24 02:08:50 UTC, 0 replies.
- [GitHub] [spark] CodingCat opened a new pull request, #42117: [SPARK-44517][SQL] Respect ignorenulls and child's nullability in first - posted by "CodingCat (via GitHub)" <gi...@apache.org> on 2023/07/24 02:17:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42039: [SPARK-44457][CONNECT][TESTS] Add `truncatedTo(ChronoUnit.MICROS)` to make `ArrowEncoderSuite` in Java 17 daily test GA task pass - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 02:56:14 UTC, 9 replies.
- [GitHub] [spark] yaooqinn closed pull request #42113: [SPARK-44513][BUILD] Upgrade snappy-java to 1.1.10.3 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/24 03:03:42 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42113: [SPARK-44513][BUILD] Upgrade snappy-java to 1.1.10.3 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/24 03:04:32 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 opened a new pull request, #42118: E2E Testing for Deepspeed - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/24 03:19:21 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42114: [SPARK-44514][SQL] Rewrite the join to filter if one side maximum number of rows is 1 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/24 03:32:08 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42049: [SPARK-44466][SQL] Exclude configs starting with `SPARK_DRIVER_PREFIX` and `SPARK_EXECUTOR_PREFIX` from modifiedConfigs - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/24 03:45:08 UTC, 1 replies.
- [GitHub] [spark] mathewjacob1002 commented on pull request #42118: E2E Testing for Deepspeed - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/24 03:45:42 UTC, 0 replies.
- [GitHub] [spark] xiaochen-db commented on a diff in pull request #42009: [SPARK-44422][CONNECT] Spark Connect fine grained interrupt - posted by "xiaochen-db (via GitHub)" <gi...@apache.org> on 2023/07/24 04:51:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42119: Test Scala xml 2.2.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 05:00:25 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42119: Test Scala xml 2.2.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 05:01:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42120: [SPARK-44509][PYTHON][CONNECT] Add job cancellation API set in Spark Connect Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/24 05:16:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42049: [SPARK-44466][SQL] Exclude configs starting with `SPARK_DRIVER_PREFIX` and `SPARK_EXECUTOR_PREFIX` from modifiedConfigs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/24 05:41:58 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42117: [SPARK-44517][SQL] Respect ignorenulls and child's nullability in first - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/07/24 05:44:23 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42121: [SPARK-44519][CONNECT] SparkConnectServerUtils generated incorrect parameters for jars - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/24 06:06:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42122: [SPARK-44521][CONNECT][TESTS] Use `Utils.createTempDir` to make the dirs generated by `SparkConnectServiceSuite` is cleaned up after testing - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 06:22:41 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #42121: [SPARK-44519][CONNECT] SparkConnectServerUtils generated incorrect parameters for jars - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/24 06:26:50 UTC, 1 replies.
- [GitHub] [spark] mathewjacob1002 opened a new pull request, #42123: Added Deepspeed Folder to setup.py - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/24 06:28:43 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on pull request #42123: Added Deepspeed Folder to setup.py - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/24 06:29:12 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42124: [SPARK-44520][SQL] Replace the term UNSUPPORTED_DATA_SOURCE_FOR_DIRECT_QUERY with UNSUPPORTED_DATASOURCE_FOR_DIRECT_QUERY - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/24 06:37:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42009: [SPARK-44422][CONNECT] Spark Connect fine grained interrupt - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 07:07:11 UTC, 2 replies.
- [GitHub] [spark] StardustDL opened a new pull request, #42125: [WIP][SPARK-44098][Python][Test] Introduce python breaking change detection - posted by "StardustDL (via GitHub)" <gi...@apache.org> on 2023/07/24 07:30:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42123: Added Deepspeed Folder to setup.py - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/24 07:44:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42123: Added Deepspeed Folder to setup.py - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/24 07:44:56 UTC, 0 replies.
- [GitHub] [spark] kori73 commented on a diff in pull request #40991: [Spark-42330] Assign name to _LEGACY_ERROR_TEMP_2175: RULE_ID_NOT_FOUND - posted by "kori73 (via GitHub)" <gi...@apache.org> on 2023/07/24 07:50:46 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #40679: [SPARK-43041][SQL] Restore constructors of exceptions for compatibility in connector API - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 07:54:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42119: [SPARK-44522][BUILD] Upgrade `scala-xml` to 2.2.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 08:20:53 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42009: [SPARK-44422][CONNECT] Spark Connect fine grained interrupt - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/24 08:25:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42115: [DO NOT Merge][TEST] free disk space - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 08:39:31 UTC, 1 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42126: [SPARK-44523][SQL] Filter's maxRows should be 0 if condition is FalseLiteral - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/24 09:01:59 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42126: [SPARK-44523][SQL] Filter's maxRows/maxRowsPerPartition is 0 if condition is FalseLiteral - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/24 09:08:02 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #42036: [SPARK-44355][SQL] Move WithCTE into command queries - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/07/24 09:18:04 UTC, 2 replies.
- [GitHub] [spark] LuciferYang closed pull request #42122: [SPARK-44521][CONNECT][TESTS] Use `Utils.createTempDir` to make the dirs generated by `SparkConnectServiceSuite` is cleaned up after testing - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 09:30:34 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42122: [SPARK-44521][CONNECT][TESTS] Use `Utils.createTempDir` to make the dirs generated by `SparkConnectServiceSuite` is cleaned up after testing - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 09:32:59 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #42113: [SPARK-44513][BUILD] Upgrade snappy-java to 1.1.10.3 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/24 09:54:10 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42127: [SPARK-44513][BUILD] Upgrade snappy-java to 1.1.10.3 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/24 10:32:10 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42127: [SPARK-44513][BUILD] Upgrade snappy-java to 1.1.10.3 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/24 10:32:34 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #42127: [SPARK-44513][BUILD][3.4] Upgrade snappy-java to 1.1.10.3 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/24 10:41:03 UTC, 5 replies.
- [GitHub] [spark] wangyum commented on pull request #42112: [SPARK-44493][SQL] Extract pushable predicates from disjunctive predicates - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/24 11:01:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42121: [SPARK-44519][CONNECT] SparkConnectServerUtils generated incorrect parameters for jars - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 11:39:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42121: [SPARK-44519][CONNECT] SparkConnectServerUtils generated incorrect parameters for jars - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/24 11:40:26 UTC, 0 replies.
- [GitHub] [spark] jeanlyn commented on pull request #41628: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions - posted by "jeanlyn (via GitHub)" <gi...@apache.org> on 2023/07/24 13:13:21 UTC, 1 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #42128: [SPARK-44525][SQL] Improve error message when Invoke method is not found - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/24 13:43:41 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #41966: [Do not review] [SPARK-43923-2] - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/24 14:23:27 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42039: [SPARK-44457][CONNECT][TESTS] Add `truncatedTo(ChronoUnit.MICROS)` to make `ArrowEncoderSuite` in Java 17 daily test GA task pass - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/24 14:24:21 UTC, 6 replies.
- [GitHub] [spark] srowen commented on pull request #42119: [SPARK-44522][BUILD] Upgrade `scala-xml` to 2.2.0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/24 14:25:05 UTC, 1 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42129: [SPARK-44527][SQL] Simplify predicate if its children contain ScalarSubquery with empty output - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/24 14:35:13 UTC, 0 replies.
- [GitHub] [spark] keen85 commented on pull request #39239: [SPARK-41730][PYTHON] Set tz to UTC while converting of timestamps to python's datetime - posted by "keen85 (via GitHub)" <gi...@apache.org> on 2023/07/24 14:42:57 UTC, 0 replies.
- [GitHub] [spark] CodingCat commented on a diff in pull request #42117: [SPARK-44517][SQL] Respect ignorenulls and child's nullability in first - posted by "CodingCat (via GitHub)" <gi...@apache.org> on 2023/07/24 16:10:49 UTC, 4 replies.
- [GitHub] [spark] maddiedawson commented on pull request #42123: Added Deepspeed Folder to setup.py - posted by "maddiedawson (via GitHub)" <gi...@apache.org> on 2023/07/24 16:20:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42100: [SPARK-44503][SQL] Add SQL grammar for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/24 16:34:41 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #41628: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/07/24 16:38:23 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42130: [SPARK-44507][SQL][CONNECT] Move AnalysisException to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/24 16:45:57 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42130: [SPARK-44507][SQL][CONNECT] Move AnalysisException to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/24 16:53:51 UTC, 2 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42131: [SPARK-43964][PYTHON][TESTS][FOLLOWUP] Skip a test using pandas when pandas is not available - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/24 17:56:23 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42131: [SPARK-43964][PYTHON][TESTS][FOLLOWUP] Skip a test using pandas when pandas is not available - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/24 17:56:36 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #42132: [SPARK-44528] Support proper usage of hasattr() for Connect dataframe - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/24 20:45:37 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #42132: [SPARK-44528] Support proper usage of hasattr() for Connect dataframe - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/24 20:45:47 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on a diff in pull request #42116: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/24 21:28:27 UTC, 1 replies.
- [GitHub] [spark] liuzqt commented on a diff in pull request #41782: [SPARK-44239][SQL] Free memory allocated by large vectors when vectors are reset - posted by "liuzqt (via GitHub)" <gi...@apache.org> on 2023/07/24 21:38:53 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42133: [SPARK-44530][CORE][CONNECT] Move SparkBuildInfo to common/util - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/24 22:01:01 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42133: [SPARK-44530][CORE][CONNECT] Move SparkBuildInfo to common/util - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/24 22:01:09 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42034: [SPARK-44455][SQL] Quote identifiers with backticks in SHOW CREATE TABLE result - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/24 22:19:40 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42134: [SPARK-44531][CONNECT][SQL] Move encoder inference to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/24 22:19:50 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42134: [SPARK-44531][CONNECT][SQL] Move encoder inference to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/24 22:20:12 UTC, 1 replies.
- [GitHub] [spark] gengliangwang closed pull request #42034: [SPARK-44455][SQL] Quote identifiers with backticks in SHOW CREATE TABLE result - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/24 22:20:21 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #42096: [SPARK-42944][SS][PYTHON][CONNECT][FOLLOWUP] Streaming ForeachBatch in Python followups - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/24 22:29:20 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #42096: [SPARK-42944][SS][PYTHON][CONNECT][FOLLOWUP] Streaming ForeachBatch in Python followups - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2023/07/24 22:32:07 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #42116: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/24 22:34:23 UTC, 9 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42135: [SPARK-44533][PYTHON] Add support for accumulator, broadcast, and Spark files in Python UDTF's analyze - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/24 22:37:34 UTC, 0 replies.
- [GitHub] [spark] zhouyejoe opened a new pull request, #42136: [SPARK-43100][CORE] Mismatch of field name in log event writer and parser for push shuffle metrics - posted by "zhouyejoe (via GitHub)" <gi...@apache.org> on 2023/07/24 22:37:40 UTC, 0 replies.
- [GitHub] [spark] zhouyejoe commented on pull request #42136: [SPARK-43100][CORE] Mismatch of field name in log event writer and parser for push shuffle metrics - posted by "zhouyejoe (via GitHub)" <gi...@apache.org> on 2023/07/24 22:38:09 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on pull request #42135: [SPARK-44533][PYTHON] Add support for accumulator, broadcast, and Spark files in Python UDTF's analyze - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/24 22:39:11 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42137: [SPARK-44532][CONNECT][SQL] Move ArrowUtils to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/24 22:44:27 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #42130: [SPARK-44507][SQL][CONNECT] Move AnalysisException to sql/api - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/24 22:57:29 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42130: [SPARK-44507][SQL][CONNECT] Move AnalysisException to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/24 22:58:35 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42100: [SPARK-44503][SQL] Add SQL grammar for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/24 23:07:05 UTC, 1 replies.
- [GitHub] [spark] ueshin closed pull request #42100: [SPARK-44503][SQL] Add SQL grammar for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/24 23:08:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #42138: [SPARK-44534][K8S] Handle only shuffle files in KubernetesLocalDiskShuffleExecutorComponents - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/24 23:24:28 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #42139: [SPARK-44154][SQL][FOLLOWUP] `BitmapCount` and `BitmapOrAgg` should use `DataTypeMismatch` to indicate unexpected input data type - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/07/24 23:46:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42138: [SPARK-44534][K8S] Handle only shuffle files in KubernetesLocalDiskShuffleExecutorComponents - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/24 23:51:55 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42086: [SPARK-43611][SQL][PS][CONNECT] Make `ExtractWindowExpressions` retain the `PLAN_ID_TAG` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/24 23:57:11 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #42138: [SPARK-44534][K8S] Handle only shuffle files in KubernetesLocalDiskShuffleExecutorComponents - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/07/24 23:58:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #42138: [SPARK-44534][K8S] Handle only shuffle files in KubernetesLocalDiskShuffleExecutorComponents - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/25 00:02:51 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42140: [SPARK-44535][CONNECT][SQL] Move required Streaming API to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/25 00:26:29 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42140: [SPARK-44535][CONNECT][SQL] Move required Streaming API to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/25 00:27:04 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42141: [SPARK-44536][BUILD] Upgrade sbt to 1.9.3 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/25 00:30:51 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #41630: [SPARK-44080][SQL] Support overriding SQL configurations for new connections - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/25 00:39:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42079: [SPARK-44486][PYTHON][CONNECT] Implement PyArrow `self_destruct` feature for `toPandas` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 00:43:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42079: [SPARK-44486][PYTHON][CONNECT] Implement PyArrow `self_destruct` feature for `toPandas` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 00:44:04 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42076: [SPARK-44449][CONNECT] Upcasting for direct Arrow Deserialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/25 00:45:33 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42076: [SPARK-44449][CONNECT] Upcasting for direct Arrow Deserialization - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/25 00:46:12 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42142: [SPARK-44537][BUILD] Upgrade kubernetes-client to 6.8.0 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/25 00:51:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #41932: [SPARK-44131][SQL][PYTHON][CONNECT][FOLLOWUP] Support qualified function name for call_function - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/25 00:53:41 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on pull request #42116: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2023/07/25 00:54:09 UTC, 2 replies.
- [GitHub] [spark] cloud-fan closed pull request #41932: [SPARK-44131][SQL][PYTHON][CONNECT][FOLLOWUP] Support qualified function name for call_function - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/25 00:54:15 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42115: [SPARK-44524][BUILD] Add a new test group for pyspark-pandas-slow-connect module - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/25 01:13:40 UTC, 7 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42143: [SPARK-44539][BUILD] Upgrade RoaringBitmap to 0.9.46 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/25 01:29:13 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42143: [SPARK-44539][BUILD] Upgrade RoaringBitmap to 0.9.46 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/25 01:30:48 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42124: [SPARK-44520][SQL] Replace the term UNSUPPORTED_DATA_SOURCE_FOR_DIRECT_QUERY with UNSUPPORTED_DATASOURCE_FOR_DIRECT_QUERY and disclosure root AE - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/25 01:39:32 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42124: [SPARK-44520][SQL] Replace the term UNSUPPORTED_DATA_SOURCE_FOR_DIRECT_QUERY with UNSUPPORTED_DATASOURCE_FOR_DIRECT_QUERY and disclosure root AE - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/25 01:40:46 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #42128: [SPARK-44525][SQL] Improve error message when Invoke method is not found - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/25 01:47:33 UTC, 2 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42144: [SPARK-37377][SQL][FOLLOWUP] Fix the partitioned join of one side test case not match - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/25 02:11:47 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42144: [SPARK-37377][SQL][FOLLOWUP] Fix the partitioned join of one side test case not match - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/25 02:12:08 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42120: [SPARK-44509][PYTHON][CONNECT] Add job cancellation API set in Spark Connect Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 02:33:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42120: [SPARK-44509][PYTHON][CONNECT] Add job cancellation API set in Spark Connect Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 02:33:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42135: [SPARK-44533][PYTHON] Add support for accumulator, broadcast, and Spark files in Python UDTF's analyze - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 02:34:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42120: [SPARK-44509][PYTHON][CONNECT] Add job cancellation API set in Spark Connect Python client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 02:38:37 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #42049: [SPARK-44466][SQL] Exclude configs starting with `SPARK_DRIVER_PREFIX` and `SPARK_EXECUTOR_PREFIX` from modifiedConfigs - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/25 02:45:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42126: [SPARK-44523][SQL] Filter's maxRows/maxRowsPerPartition is 0 if condition is FalseLiteral - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/25 02:48:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42139: [SPARK-44154][SQL][FOLLOWUP] `BitmapCount` and `BitmapOrAgg` should use `DataTypeMismatch` to indicate unexpected input data type - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/25 02:58:30 UTC, 5 replies.
- [GitHub] [spark] cxzl25 commented on pull request #42033: [SPARK-44454][SQL][HIVE] HiveShim getTablesByType support fallback - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/25 03:02:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42132: [SPARK-44528][CONNECT] Support proper usage of hasattr() for Connect dataframe - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 03:02:50 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42131: [SPARK-43964][PYTHON][TESTS][FOLLOWUP] Skip a test using pandas when pandas is not available - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 03:03:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42131: [SPARK-43964][PYTHON][TESTS][FOLLOWUP] Skip a test using pandas when pandas is not available - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 03:03:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42123: [SPARK-44264][PYTHON] Added Deepspeed Folder to setup.py - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 03:04:34 UTC, 1 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42145: [SPARK-44540][UI] Remove unused stylesheet and javascript files of jsonFormatter - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/25 03:04:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42126: [SPARK-44523][SQL] Filter's maxRows/maxRowsPerPartition is 0 if condition is FalseLiteral - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 03:06:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42126: [SPARK-44523][SQL] Filter's maxRows/maxRowsPerPartition is 0 if condition is FalseLiteral - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 03:06:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42145: [SPARK-44540][UI] Remove unused stylesheet and javascript files of jsonFormatter - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 03:08:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42143: [SPARK-44539][BUILD] Upgrade RoaringBitmap to 0.9.46 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 03:09:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42128: [SPARK-44525][SQL] Improve error message when Invoke method is not found - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/25 03:21:32 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42116: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 03:23:12 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42115: [SPARK-44524][BUILD] Add a new test group for pyspark-pandas-slow-connect module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/25 03:24:15 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42115: [SPARK-44524][BUILD] Add a new test group for pyspark-pandas-slow-connect module - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/25 03:26:26 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42116: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 03:27:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42036: [SPARK-44355][SQL] Move WithCTE into command queries - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/25 03:31:53 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42146: [WIP][DO_NOT_MERGE][INFRA] Disable python packaging tests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/25 03:37:26 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #42128: [SPARK-44525][SQL] Improve error message when Invoke method is not found - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/25 03:39:44 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42099: [SPARK-44505][SQL] Provide override for columnar support in Scan for DSv2 - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/25 03:40:47 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42033: [SPARK-44454][SQL][HIVE] HiveShim getTablesByType support fallback - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/25 03:45:34 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42146: [WIP][DO_NOT_MERGE][INFRA] Disable python packaging tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 03:47:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42110: [SPARK-42098][SQL] Fix ResolveInlineTables can not handle with RuntimeReplaceable expression - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/25 03:50:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42146: [WIP][DO_NOT_MERGE][INFRA] Disable python packaging tests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/25 03:54:10 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #42130: [SPARK-44507][SQL][CONNECT] Move AnalysisException to sql/api - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/25 04:44:56 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #42133: [SPARK-44530][CORE][CONNECT] Move SparkBuildInfo to common/util - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/25 04:46:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42133: [SPARK-44530][CORE][CONNECT] Move SparkBuildInfo to common/util - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/25 05:00:11 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42133: [SPARK-44530][CORE][CONNECT] Move SparkBuildInfo to common/util - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/25 05:22:29 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42132: [SPARK-44528][CONNECT] Support proper usage of hasattr() for Connect dataframe - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/25 05:34:35 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42147: [SPARK-44541][SQL] Remove useless function `hasRangeExprAgainstEventTimeCol` from `UnsupportedOperationChecker` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/25 05:53:09 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42033: [SPARK-44454][SQL][HIVE] HiveShim getTablesByType support fallback - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/25 06:13:08 UTC, 2 replies.
- [GitHub] [spark] pan3793 commented on pull request #42033: [SPARK-44454][SQL][HIVE] HiveShim getTablesByType support fallback - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/07/25 06:14:56 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42148: [SPARK-44503][SQL][FOLLOWUP] Simplify the test case for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/25 06:47:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42138: [SPARK-44534][K8S] Handle only shuffle files in KubernetesLocalDiskShuffleExecutorComponents - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/25 07:12:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42149: [SPARK-44509][CONNECT][FOLLOW-UP] Add inheritable support in new job cancellation API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 07:32:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42149: [SPARK-44509][CONNECT][FOLLOW-UP] Add inheritable support in new job cancellation API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 07:35:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42149: [SPARK-44509][CONNECT][FOLLOW-UP] Add inheritable support in new job cancellation API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/25 07:36:19 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42069: [SPARK-43744][CONNECT] Fix class loading problem caused by stub user classes not found on the server classpath - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/25 07:49:10 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42069: [SPARK-43744][CONNECT] Fix class loading problem caused by stub user classes not found on the server classpath - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/25 07:52:58 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42125: [WIP][SPARK-44098][INFRA] Introduce python breaking change detection - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/25 08:01:00 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42146: [WIP][DO_NOT_MERGE][INFRA] Move python packaging tests to a separate module - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/25 08:59:59 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42149: [SPARK-44509][CONNECT][FOLLOW-UP] Add inheritable support in new job cancellation API - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/25 09:00:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42146: [WIP][DO_NOT_MERGE][INFRA] Move python packaging tests to a separate module - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/25 09:04:15 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #41609: [SPARK-44065][SQL] Optimize BroadcastHashJoin skew in OptimizeSkewedJoin - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/25 09:29:23 UTC, 0 replies.
- [GitHub] [spark] Deependra-Patel commented on pull request #42071: [SPARK-44209] Expose amount of shuffle data available on the node - posted by "Deependra-Patel (via GitHub)" <gi...@apache.org> on 2023/07/25 10:17:47 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #42145: [SPARK-44540][UI] Remove unused stylesheet and javascript files of jsonFormatter - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/25 10:40:24 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42145: [SPARK-44540][UI] Remove unused stylesheet and javascript files of jsonFormatter - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/25 10:40:57 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #42148: [SPARK-44503][SQL][FOLLOWUP] Simplify the test case for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/25 10:54:19 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42144: [SPARK-37377][SQL][FOLLOWUP] Fix the partitioned join of one side test case not match - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/25 11:02:12 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42139: [SPARK-44154][SQL][FOLLOWUP] `BitmapCount` and `BitmapOrAgg` should use `DataTypeMismatch` to indicate unexpected input data type - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/25 11:06:33 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42144: [SPARK-37377][SQL][FOLLOWUP] Fix the partitioned join of one side test case not match - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/25 11:08:03 UTC, 3 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42117: [SPARK-44517][SQL] Respect ignorenulls and child's nullability in first - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/25 11:15:34 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42129: [SPARK-44527][SQL] Simplify predicate if its children contain ScalarSubquery with empty output - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/25 11:20:54 UTC, 2 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42140: [SPARK-44535][CONNECT][SQL] Move required Streaming API to sql/api - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/25 11:33:01 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42096: [SPARK-42944][SS][PYTHON][CONNECT][FOLLOWUP] Streaming ForeachBatch in Python followups - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/25 11:35:36 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42150: [SPARK-33325][CONNECT] Validate that user provided sessionId is an UUID - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/25 11:41:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42146: [WIP][DO_NOT_MERGE][INFRA] Move python packaging tests to a separate module - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/25 12:35:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42151: [WIP][PYTHON][DOCS] Refine the docs for `Union`, `UnionAll` and `unionByName` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/25 12:39:29 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42152: [MINOR][SQL] Fix comment in KeyGroupedPartitioning not match with parameter - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/25 12:45:49 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42152: [MINOR][SQL] Fix comment in KeyGroupedPartitioning not match with parameter - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/25 12:53:00 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42146: [SPARK-44544][INFRA] Move python packaging tests to a separate module - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/25 13:08:37 UTC, 0 replies.
- [GitHub] [spark] watfordkcf opened a new pull request, #42153: Update concat and concat_ws documentation to point out unexpected behavior - posted by "watfordkcf (via GitHub)" <gi...@apache.org> on 2023/07/25 13:40:28 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42036: [SPARK-44355][SQL] Move WithCTE into command queries - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/25 13:52:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #41922: [SPARK-44356][SQL] Resolve `WITH` on top of `INSERT INTO` via CTEDef - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/25 13:53:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42036: [SPARK-44355][SQL] Move WithCTE into command queries - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/25 13:53:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42147: [SPARK-44541][SQL] Remove useless function `hasRangeExprAgainstEventTimeCol` from `UnsupportedOperationChecker` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/25 15:05:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42147: [SPARK-44541][SQL] Remove useless function `hasRangeExprAgainstEventTimeCol` from `UnsupportedOperationChecker` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/25 15:07:00 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #42069: [SPARK-43744][CONNECT] Fix class loading problem caused by stub user classes not found on the server classpath - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/25 15:27:56 UTC, 2 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42150: [SPARK-33325][CONNECT] Validate that user provided sessionId is an UUID - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/25 16:10:46 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42135: [SPARK-44533][PYTHON] Add support for accumulator, broadcast, and Spark files in Python UDTF's analyze - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/25 16:53:00 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42141: [SPARK-44536][BUILD] Upgrade sbt to 1.9.3 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/25 17:03:44 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42148: [SPARK-44503][SQL][FOLLOWUP] Simplify the test case for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/25 17:19:31 UTC, 1 replies.
- [GitHub] [spark] jchen5 commented on a diff in pull request #41301: [SPARK-43780][SQL] Support correlated references in join predicates - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/25 17:19:43 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #42148: [SPARK-44503][SQL][FOLLOWUP] Simplify the test case for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/25 17:28:31 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42137: [SPARK-44532][CONNECT][SQL] Move ArrowUtils to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/25 17:35:07 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42137: [SPARK-44532][CONNECT][SQL] Move ArrowUtils to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/25 17:35:48 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42148: [SPARK-44503][SQL][FOLLOWUP] Simplify the test case for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/25 17:39:53 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42154: [SPARK-44546] Add a dev utility to generate PySpark tests with LLM - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/25 18:18:15 UTC, 0 replies.
- [GitHub] [spark] ukby1234 opened a new pull request, #42155: For SPARK-44547, ignore fallback storage for cached RDD migration - posted by "ukby1234 (via GitHub)" <gi...@apache.org> on 2023/07/25 18:59:24 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42135: [SPARK-44533][PYTHON] Add support for accumulator, broadcast, and Spark files in Python UDTF's analyze - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/25 19:10:57 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42156: [SPARK-44532][CONNECT][SQL] Move ArrowUtils to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/25 19:36:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42142: [SPARK-44537][BUILD] Upgrade kubernetes-client to 6.8.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/25 19:45:35 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #42127: [SPARK-44513][BUILD][3.4] Upgrade snappy-java to 1.1.10.3 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/25 19:46:34 UTC, 0 replies.
- [GitHub] [spark-docker] galacticgumshoe opened a new pull request, #52: Add Support for Scala 2.13 in Spark 3.4.1 - posted by "galacticgumshoe (via GitHub)" <gi...@apache.org> on 2023/07/25 19:59:41 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42157: [SPARK-43968][PYTHON] Improve error messages for Python UDTFs with wrong number of outputs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/25 20:58:19 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42158: [SPARK-44548] Add support for pandas DataFrame assertDataFrameEqual - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/25 21:20:55 UTC, 0 replies.
- [GitHub] [spark] asl3 closed pull request #41834: [WIP] Make assertDFEqual to call pandas or PySpark util - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/25 21:35:26 UTC, 0 replies.
- [GitHub] [spark] maddiedawson commented on pull request #42087: [SPARK-44264] Added Example to Deepspeed Distributor - posted by "maddiedawson (via GitHub)" <gi...@apache.org> on 2023/07/25 22:03:17 UTC, 0 replies.
- [GitHub] [spark] maddiedawson commented on a diff in pull request #42118: [SPARK-44264][WIP] E2E Testing for Deepspeed - posted by "maddiedawson (via GitHub)" <gi...@apache.org> on 2023/07/25 23:59:59 UTC, 3 replies.
- [GitHub] [spark] mathewjacob1002 commented on a diff in pull request #42118: [SPARK-44264][WIP] E2E Testing for Deepspeed - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/26 00:00:54 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42087: [SPARK-44264] Added Example to Deepspeed Distributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 00:06:19 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42159: [DO-NOT-MERGE] Investigate flaky test - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 00:35:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42146: [SPARK-44544][INFRA] Move python packaging tests to a separate module - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 00:36:16 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #42069: [SPARK-43744][CONNECT] Fix class loading problem caused by stub user classes not found on the server classpath - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/26 00:40:48 UTC, 9 replies.
- [GitHub] [spark] hvanhovell closed pull request #42133: [SPARK-44530][CORE][CONNECT] Move SparkBuildInfo to common/util - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/26 00:51:39 UTC, 0 replies.
- [GitHub] [spark] gatorsmile commented on a diff in pull request #42151: [WIP][PYTHON][DOCS] Refine the docs for `Union`, `UnionAll` and `unionByName` - posted by "gatorsmile (via GitHub)" <gi...@apache.org> on 2023/07/26 00:56:00 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42151: [WIP][PYTHON][DOCS] Refine the docs for `Union`, `UnionAll` and `unionByName` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/26 00:58:20 UTC, 3 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #42087: [SPARK-44264] Added Example to Deepspeed Distributor - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/07/26 01:02:30 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #42146: [SPARK-44544][INFRA] Move python packaging tests to a separate module - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/26 01:03:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #40160: [SPARK-41725][CONNECT] Eager Execution of DF.sql() - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/26 01:06:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42149: [SPARK-44509][CONNECT][FOLLOW-UP] Add inheritable support in new job cancellation API - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 01:30:34 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42160: [SPARK-43838][SQL][FOLLOWUP] Add missing aggregate in `renewDuplicatedRelations` - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/26 01:36:41 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42160: [SPARK-43838][SQL][FOLLOWUP] Add missing aggregate in `renewDuplicatedRelations` - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/26 01:37:22 UTC, 2 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42143: [SPARK-44539][BUILD] Upgrade RoaringBitmap to 0.9.46 - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/26 01:41:40 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42161: [SPARK-44479][PYTHON] Fix ArrowStreamPandasUDFSerializer to accept no-column pandas DataFrame - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/26 01:50:31 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42161: [SPARK-44479][PYTHON] Fix ArrowStreamPandasUDFSerializer to accept no-column pandas DataFrame - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/26 01:50:50 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42143: [SPARK-44539][BUILD] Upgrade RoaringBitmap to 0.9.46 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/26 01:52:35 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42153: [DOCS] Update concat and concat_ws documentation to point out unexpected behavior - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/26 01:54:00 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #42142: [SPARK-44537][BUILD] Upgrade kubernetes-client to 6.8.0 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/26 02:07:46 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42141: [SPARK-44536][BUILD] Upgrade sbt to 1.9.3 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/26 02:08:33 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #41843: [SPARK-44280][SQL] Add convertJavaTimestampToTimestamp in JDBCDialect API - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/26 02:34:39 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42114: [SPARK-44514][SQL] Optimize join if maximum number of rows on one side is 1 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/26 03:03:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42127: [SPARK-44513][BUILD][3.4] Upgrade snappy-java to 1.1.10.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/26 03:08:57 UTC, 3 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42162: [SPARK-44494][INFRA][3.4] Use `minikube` v1.30.1 for `k8s-integration-tests` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/26 03:16:51 UTC, 0 replies.
- [GitHub] [spark] jchen5 opened a new pull request, #42163: [SPARK-44551][SQL] Fix behavior of null IN (empty list) in expression execution - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/26 03:35:31 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on a diff in pull request #42163: [SPARK-44551][SQL] Fix behavior of null IN (empty list) in expression execution - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/26 03:37:11 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42164: [SPARK-44538][CONNECT][SQL] Reinstate Row.jsonValue and friends - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/26 03:37:39 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on pull request #42163: [SPARK-44551][SQL] Fix behavior of null IN (empty list) in expression execution - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/07/26 03:43:31 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42164: [SPARK-44538][CONNECT][SQL] Reinstate Row.jsonValue and friends - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/26 03:44:48 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42156: [SPARK-44532][CONNECT][SQL] Move ArrowUtils to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/26 03:50:46 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42156: [SPARK-44532][CONNECT][SQL] Move ArrowUtils to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/26 03:51:39 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42140: [SPARK-44535][CONNECT][SQL] Move required Streaming API to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/26 03:57:24 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42161: [SPARK-44479][PYTHON] Fix ArrowStreamPandasUDFSerializer to accept no-column pandas DataFrame - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/26 04:15:19 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42157: [SPARK-43968][PYTHON] Improve error messages for Python UDTFs with wrong number of outputs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/26 04:15:39 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42161: [SPARK-44479][PYTHON] Fix ArrowStreamPandasUDFSerializer to accept no-column pandas DataFrame - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/26 04:29:21 UTC, 5 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42152: [MINOR][SQL] Fix comment in KeyGroupedPartitioning not match with parameter - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 04:52:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42152: [MINOR][SQL] Fix comment in KeyGroupedPartitioning not match with parameter - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 04:52:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42165: [SPARK-44552][SQL] Remove `private object ParseState` definition from `IntervalUtils` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/26 05:05:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42162: [SPARK-44494][INFRA][3.4] Use `minikube` v1.30.1 for `k8s-integration-tests` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/26 05:11:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42162: [SPARK-44494][INFRA][3.4] Use `minikube` v1.30.1 for `k8s-integration-tests` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/26 05:12:05 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #42152: [MINOR][SQL] Fix comment in KeyGroupedPartitioning not match with parameter - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/07/26 05:17:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42127: [SPARK-44513][BUILD][3.4] Upgrade snappy-java to 1.1.10.3 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 05:33:48 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42166: [SPARK-44553][BUILD][3.4] Ignoring `connect-check-protos` logic in GA testing - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/26 06:05:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42167: Test py linter - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/26 06:17:42 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42146: [SPARK-44544][INFRA] Deduplicate `run_python_packaging_tests` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/26 06:20:45 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42123: [SPARK-44264][PYTHON] Added Deepspeed Folder to setup.py - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 07:21:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42087: [SPARK-44264][PYTHON][DOCS] Added Example to Deepspeed Distributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 07:22:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42087: [SPARK-44264][PYTHON][DOCS] Added Example to Deepspeed Distributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 07:22:55 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42146: [SPARK-44544][INFRA] Deduplicate `run_python_packaging_tests` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/26 07:53:19 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #42132: [SPARK-44528][CONNECT] Support proper usage of hasattr() for Connect dataframe - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/26 08:08:32 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42146: [SPARK-44544][INFRA] Deduplicate `run_python_packaging_tests` - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/26 08:29:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42128: [SPARK-44525][SQL] Improve error message when Invoke method is not found - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/26 08:34:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42128: [SPARK-44525][SQL] Improve error message when Invoke method is not found - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/26 08:35:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42125: [WIP][SPARK-44098][INFRA] Introduce python breaking change detection - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/26 09:34:20 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41765: [SPARK-43203][SQL][3.4] Move all Drop Table case to DataSource V2 - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/26 10:10:13 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #42168: [SPARK-44556][SQL] Reuse `OrcTail` when enable vectorizedReader - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/26 10:56:54 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42169: Use checkError() to check Exception in command Suite - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/26 11:04:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42144: [SPARK-37377][SQL][FOLLOWUP] Fix the partitioned join of one side test case not match - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/26 11:05:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42170: Test py linter 34 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/26 11:13:50 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42134: [SPARK-44531][CONNECT][SQL] Move encoder inference to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/26 11:15:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42171: Test py linter 3.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/26 11:16:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42139: [SPARK-44154][SQL][FOLLOWUP] `BitmapCount` and `BitmapOrAgg` should use `DataTypeMismatch` to indicate unexpected input data type - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/26 11:30:32 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42139: [SPARK-44154][SQL][FOLLOWUP] `BitmapCount` and `BitmapOrAgg` should use `DataTypeMismatch` to indicate unexpected input data type - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/26 11:30:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42139: [SPARK-44154][SQL][FOLLOWUP] `BitmapCount` and `BitmapOrAgg` should use `DataTypeMismatch` to indicate unexpected input data type - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/26 11:31:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42159: [SPARK-44557][INFRA] Clean up untracked/ignored files before running pip packaging test in GitHub Actions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 11:41:03 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42086: [SPARK-43611][SQL][PS][CONNCECT] Make `ExtractWindowExpressions` retain the `PLAN_ID_TAG` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/26 12:07:21 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42172: [SPARK-44544][INFRA][3.4] Deduplicate run_python_packaging_tests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/26 12:17:44 UTC, 0 replies.
- [GitHub] [spark] watfordkcf commented on a diff in pull request #42153: [DOCS] Update concat and concat_ws documentation to point out unexpected behavior - posted by "watfordkcf (via GitHub)" <gi...@apache.org> on 2023/07/26 12:54:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42173: [SPARK-44544][INFRA][FOLLOWUP] run `run_python_packaging_tests` even if `pyspark-errors` is skipped - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/26 13:03:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42173: [SPARK-44544][INFRA][FOLLOWUP] run `run_python_packaging_tests` even if `pyspark-errors` is skipped - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/26 13:06:35 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42166: [SPARK-44553][BUILD][3.4] Ignoring `connect-check-protos` logic in GA testing - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/26 13:48:07 UTC, 1 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #42164: [SPARK-44538][CONNECT][SQL] Reinstate Row.jsonValue and friends - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/26 15:37:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #42142: [SPARK-44537][BUILD] Upgrade kubernetes-client to 6.8.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/07/26 16:33:53 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #42174: [SPARK-44503][SQL] Add analysis and planning for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/26 16:43:35 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #42174: [SPARK-44503][SQL] Add analysis and planning for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/26 16:56:00 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on pull request #42154: [SPARK-44546] Add a dev utility to generate PySpark tests with LLM - posted by "zero323 (via GitHub)" <gi...@apache.org> on 2023/07/26 17:50:40 UTC, 0 replies.
- [GitHub] [spark] asl3 closed pull request #42154: [WIP][SPARK-44546] Add a dev utility to generate PySpark tests with LLM - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/26 17:54:13 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #42157: [SPARK-43968][PYTHON] Improve error messages for Python UDTFs with wrong number of outputs - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/26 18:36:26 UTC, 0 replies.
- [GitHub] [spark] mathewjacob1002 commented on pull request #42118: [SPARK-44264][WIP]E2E Testing for Deepspeed - posted by "mathewjacob1002 (via GitHub)" <gi...@apache.org> on 2023/07/26 18:38:18 UTC, 0 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #42118: [SPARK-44264][WIP]E2E Testing for Deepspeed - posted by "lu-wang-dl (via GitHub)" <gi...@apache.org> on 2023/07/26 18:43:37 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42116: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/26 20:20:31 UTC, 3 replies.
- [GitHub] [spark] cdkrot opened a new pull request, #42175: [SPARK-44558] Export Spark Log Level - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/07/26 20:52:46 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42176: [SPARK-44479][PYTHON][3.5] Fix ArrowStreamPandasUDFSerializer to accept no-column pandas DataFrame - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/26 21:51:50 UTC, 0 replies.
- [GitHub] [spark] learningchess2003 opened a new pull request, #42177: [SPARK-44059] Add better error messages - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/07/26 22:17:38 UTC, 0 replies.
- [GitHub] [spark] holdenk opened a new pull request, #39825: [SPARK-42261][SPARK-42260][K8S] Log Allocation Stalls and Trigger Allocation event without blocking on snapshot - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/07/26 22:36:38 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #42178: [SPARK-44560][PYTHON][CONNECT] Improve tests and documentation for Arrow Python UDF - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/26 22:44:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42164: [SPARK-44538][CONNECT][SQL] Reinstate Row.jsonValue and friends - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/26 23:42:42 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42132: [SPARK-44528][CONNECT] Support proper usage of hasattr() for Connect dataframe - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 23:53:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42132: [SPARK-44528][CONNECT] Support proper usage of hasattr() for Connect dataframe - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/26 23:53:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42173: [SPARK-44544][INFRA][FOLLOWUP] run `run_python_packaging_tests` even if `pyspark-errors` is skipped - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 00:02:35 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42173: [SPARK-44544][INFRA][FOLLOWUP] run `run_python_packaging_tests` even if `pyspark-errors` is skipped - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 00:05:45 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42175: [SPARK-44558] Export Spark Log Level - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 00:09:46 UTC, 2 replies.
- [GitHub] [spark] srowen closed pull request #42119: [SPARK-44522][BUILD] Upgrade `scala-xml` to 2.2.0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/27 00:11:11 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42172: [SPARK-44544][INFRA][3.4] Deduplicate `run_python_packaging_tests` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 00:14:37 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #42039: [SPARK-44457][CONNECT][TESTS] Add `truncatedTo(ChronoUnit.MICROS)` to make `ArrowEncoderSuite` in Java 17 daily test GA task pass - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/27 00:17:51 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37928: [SPARK-40485][SQL] Add partitionColValues to the partitioning options of the JDBC data source - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/27 00:18:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42125: [WIP][SPARK-44098][INFRA] Introduce python breaking change detection - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 00:20:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42172: [SPARK-44544][INFRA][3.4] Deduplicate `run_python_packaging_tests` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 00:22:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42172: [SPARK-44544][INFRA][3.4] Deduplicate `run_python_packaging_tests` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 00:23:09 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42166: [SPARK-44553][BUILD][3.4] Ignoring `connect-check-protos` logic in GA testing - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 00:33:42 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on a diff in pull request #52: Add Support for Scala 2.13 in Spark 3.4.1 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/07/27 00:43:30 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42161: [SPARK-44479][PYTHON] Fix ArrowStreamPandasUDFSerializer to accept no-column pandas DataFrame - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/27 00:53:56 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42176: [SPARK-44479][PYTHON][3.5] Fix ArrowStreamPandasUDFSerializer to accept no-column pandas DataFrame - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/27 00:54:21 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42176: [SPARK-44479][PYTHON][3.5] Fix ArrowStreamPandasUDFSerializer to accept no-column pandas DataFrame - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/27 00:57:02 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #41755: [SPARK-43999][SQL][CORE] Support force finish useless stage when AQE on - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/27 01:03:45 UTC, 0 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42179: [SPARK-44479][CONNECT][PYTHON] Fix protobuf conversion from an empty struct type - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/27 01:09:09 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42179: [SPARK-44479][CONNECT][PYTHON] Fix protobuf conversion from an empty struct type - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/27 01:09:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42179: [SPARK-44479][CONNECT][PYTHON] Fix protobuf conversion from an empty struct type - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 01:10:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42179: [SPARK-44479][CONNECT][PYTHON] Fix protobuf conversion from an empty struct type - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 01:10:51 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42169: [SPARK-44555][SQL] Use checkError() to check Exception in command Suite & assign some error class names - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/27 01:19:00 UTC, 0 replies.
- [GitHub] [spark] shuwang21 commented on pull request #41489: [SPARK-43987][Shuffle] Separate finalizeShuffleMerge Processing to Dedicated Thread Pools - posted by "shuwang21 (via GitHub)" <gi...@apache.org> on 2023/07/27 01:20:42 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42172: [SPARK-44544][INFRA][3.4] Deduplicate `run_python_packaging_tests` - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/27 01:23:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42158: [SPARK-44548][PYTHON] Add support for pandas-on-Spark DataFrame assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 01:58:20 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #41712: [SPARK-44132][SQL] Materialize `Stream` of join column names to avoid codegen failure - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 02:03:03 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42153: [DOCS] Update concat and concat_ws documentation to point out unexpected behavior - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/27 02:16:06 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42110: [SPARK-42098][SQL] Fix ResolveInlineTables can not handle with RuntimeReplaceable expression - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/27 02:19:42 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42171: Test py linter 3.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/27 02:33:58 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42171: Test py linter 3.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/27 02:36:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42139: [SPARK-44154][SQL][FOLLOWUP] `BitmapCount` and `BitmapOrAgg` should use `DataTypeMismatch` to indicate unexpected input data type - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 02:40:48 UTC, 1 replies.
- [GitHub] [spark] wangyum opened a new pull request, #42180: [SPARK-44562][SQL] Add OptimizeOneRowRelationSubquery in batch of Subquery - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/27 02:48:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42086: [SPARK-43611][SQL][PS][CONNCECT] Make `ExtractWindowExpressions` retain the `PLAN_ID_TAG` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 03:00:35 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42181: [SPARK-44563][BUILD] Upgrade Apache Arrow to 13.0.0 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/27 03:21:45 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42086: [SPARK-43611][SQL][PS][CONNCECT] Make `ExtractWindowExpressions` retain the `PLAN_ID_TAG` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/27 03:24:11 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42130: [SPARK-44507][SQL][CONNECT] Move AnalysisException to sql/api - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/27 03:25:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42182: [SPARK-43611][PS][CONNECT][TESTS] Enable more tests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 03:31:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42181: [SPARK-44563][BUILD] Upgrade Apache Arrow to 13.0.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/27 03:33:45 UTC, 3 replies.
- [GitHub] [spark] kumarn opened a new pull request, #42183: [SPARK-43871][Pandas API on Spark][PySpark] Enable SeriesDateTimeTests for pandas 2.0.0. - posted by "kumarn (via GitHub)" <gi...@apache.org> on 2023/07/27 03:40:21 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on pull request #42139: [SPARK-44154][SQL][FOLLOWUP] `BitmapCount` and `BitmapOrAgg` should use `DataTypeMismatch` to indicate unexpected input data type - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2023/07/27 03:44:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42183: [SPARK-43871][PS][PySpark] Enable SeriesDateTimeTests for pandas 2.0.0. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 03:45:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42139: [SPARK-44154][SQL][FOLLOWUP] `BitmapCount` and `BitmapOrAgg` should use `DataTypeMismatch` to indicate unexpected input data type - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/27 03:46:36 UTC, 0 replies.
- [GitHub] [spark] kumarn commented on pull request #42183: [SPARK-43871][PS][PySpark] Enable SeriesDateTimeTests for pandas 2.0.0. - posted by "kumarn (via GitHub)" <gi...@apache.org> on 2023/07/27 03:47:42 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42181: [SPARK-44563][BUILD] Upgrade Apache Arrow to 13.0.0 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/27 03:55:16 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42184: [SPARK-41400][CONNECT] Remove Connect Client Catalyst Dependency - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/27 03:58:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42184: [SPARK-41400][CONNECT] Remove Connect Client Catalyst Dependency - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/27 04:01:22 UTC, 2 replies.
- [GitHub] [spark] ueshin closed pull request #42135: [SPARK-44533][PYTHON] Add support for accumulator, broadcast, and Spark files in Python UDTF's analyze - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/27 04:03:17 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #42158: [SPARK-44548][PYTHON] Add support for pandas-on-Spark DataFrame assertDataFrameEqual - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/27 04:04:29 UTC, 2 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #42158: [SPARK-44548][PYTHON] Add support for pandas-on-Spark DataFrame assertDataFrameEqual - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/27 04:10:10 UTC, 3 replies.
- [GitHub] [spark] amaliujia closed pull request #42104: [SPARK-44507][SQL][CONNECT] Scala client does not depend on AnalysisException - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/27 04:18:14 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42183: [SPARK-43871][PS] Enable SeriesDateTimeTests for pandas 2.0.0. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/27 04:31:58 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42151: [SPARK-44565][PYTHON][DOCS] Refine the docs for `Union`, `UnionAll` and `unionByName` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 04:53:09 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42151: [SPARK-44565][PYTHON][DOCS] Refine the docs for `Union`, `UnionAll` and `unionByName` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 04:55:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42159: [SPARK-44557][INFRA] Clean up untracked/ignored files before running pip packaging test in GitHub Actions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 05:07:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42159: [SPARK-44557][INFRA] Clean up untracked/ignored files before running pip packaging test in GitHub Actions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 05:08:19 UTC, 0 replies.
- [GitHub] [spark] kumarn commented on pull request #42183: [SPARK-43871][PS] Enable SeriesDateTimeTests for pandas 2.0.0. - posted by "kumarn (via GitHub)" <gi...@apache.org> on 2023/07/27 05:29:12 UTC, 0 replies.
- [GitHub] [spark] kumarn closed pull request #42183: [SPARK-43871][PS] Enable SeriesDateTimeTests for pandas 2.0.0. - posted by "kumarn (via GitHub)" <gi...@apache.org> on 2023/07/27 05:29:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42164: [SPARK-44538][CONNECT][SQL] Reinstate Row.jsonValue and friends - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/27 05:51:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #40474: [SPARK-42849] [SQL] Session Variables - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/27 06:21:18 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42118: [SPARK-44264][PYTHON]E2E Testing for Deepspeed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 06:30:29 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #42127: [SPARK-44513][BUILD][3.4] Upgrade snappy-java to 1.1.10.3 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/27 07:02:44 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #42127: [SPARK-44513][BUILD][3.4] Upgrade snappy-java to 1.1.10.3 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/27 07:02:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42073: [SPARK-44482][CONNECT] Connect server should can specify the bind address - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 07:42:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42073: [SPARK-44482][CONNECT] Connect server should can specify the bind address - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 07:43:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42141: [SPARK-44536][BUILD] Upgrade sbt to 1.9.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/27 07:56:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42141: [SPARK-44536][BUILD] Upgrade sbt to 1.9.3 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/27 07:59:17 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #42129: [SPARK-44527][SQL] Replace ScalarSubquery with null if its maxRows is 0 - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/27 08:01:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41839: [SPARK-44287][SQL] Use PartitionEvaluator API in RowToColumnarExec & ColumnarToRowExec SQL operators. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/27 08:09:51 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42185: [SPARK-44287][SQL][FOLLOWUP] Set partition index correctly - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/27 08:15:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42185: [SPARK-44287][SQL][FOLLOWUP] Set partition index correctly - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/27 08:15:34 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42173: [SPARK-44544][INFRA][FOLLOWUP] Force run `run_python_packaging_tests` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 08:16:23 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42185: [SPARK-44287][SQL][FOLLOWUP] Set partition index correctly - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/27 08:17:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42173: [SPARK-44544][INFRA][FOLLOWUP] Force run `run_python_packaging_tests` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 08:17:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42182: [SPARK-43611][PS][CONNECT][TESTS][FOLLOWUPS] Enable more tests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 08:31:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42182: [SPARK-43611][PS][CONNECT][TESTS][FOLLOWUPS] Enable more tests - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 08:31:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42115: [SPARK-44524][BUILD] Balancing pyspark-pandas-connect and pyspark-pandas-slow-connect GA testing time - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/27 08:32:17 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #42115: [SPARK-44524][BUILD] Balancing pyspark-pandas-connect and pyspark-pandas-slow-connect GA testing time - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/27 08:54:05 UTC, 3 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42186: Test latest minikube - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/27 09:14:25 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41808: [SPARK-44162][CORE] Support G1GC in spark metrics - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/27 09:37:50 UTC, 0 replies.
- [GitHub] [spark] cdkrot commented on a diff in pull request #42175: [SPARK-44558][CONNECT][PYTHON] Export Spark Log Level - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2023/07/27 09:43:16 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42180: [SPARK-44562][SQL] Add OptimizeOneRowRelationSubquery in batch of Subquery - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/27 09:48:37 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #42033: [SPARK-44454][SQL][HIVE] HiveShim getTablesByType support fallback - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/27 10:24:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42186: Test latest minikube - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/27 11:28:59 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42129: [SPARK-44527][SQL] Replace ScalarSubquery with null if its maxRows is 0 - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/27 11:39:07 UTC, 0 replies.
- [GitHub] [spark] ejblanco opened a new pull request, #42187: fix: typo in description - posted by "ejblanco (via GitHub)" <gi...@apache.org> on 2023/07/27 11:46:30 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42181: [SPARK-44563][BUILD] Upgrade Arrow to 13.0.0 & Netty to 4.1.95.Final - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/27 11:46:42 UTC, 3 replies.
- [GitHub] [spark] ejblanco commented on pull request #40491: [SPARK-41006][K8S] Generate new ConfigMap names for each run - posted by "ejblanco (via GitHub)" <gi...@apache.org> on 2023/07/27 11:57:18 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42181: [SPARK-44563][BUILD] Upgrade Arrow to 13.0.0 & Netty to 4.1.95.Final - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/27 12:20:01 UTC, 1 replies.
- [GitHub] [spark] watfordkcf commented on pull request #42153: [DOCS] Update concat and concat_ws documentation to point out unexpected behavior - posted by "watfordkcf (via GitHub)" <gi...@apache.org> on 2023/07/27 12:42:37 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #42099: [SPARK-44505][SQL] Provide override for columnar support in Scan for DSv2 - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/27 12:54:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42170: Test py linter 34 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/27 12:54:22 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #42187: [MINOR][DOCS] fix: some minor typos - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/07/27 13:00:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42167: [SPARK-44554][INFRA] Make Python linter related checks pass of branch-3.3/3.4 daily testing - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/27 13:24:48 UTC, 3 replies.
- [GitHub] [spark] ejblanco closed pull request #42187: [MINOR][DOCS] fix: some minor typos - posted by "ejblanco (via GitHub)" <gi...@apache.org> on 2023/07/27 13:41:04 UTC, 0 replies.
- [GitHub] [spark] ejblanco opened a new pull request, #42188: [MINOR][DOCS] fix: some minor typos - posted by "ejblanco (via GitHub)" <gi...@apache.org> on 2023/07/27 13:42:24 UTC, 0 replies.
- [GitHub] [spark] ejblanco commented on pull request #42187: [MINOR][DOCS] fix: some minor typos - posted by "ejblanco (via GitHub)" <gi...@apache.org> on 2023/07/27 13:43:25 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #42188: [MINOR][DOCS] fix: some minor typos - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/27 13:54:05 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42188: [MINOR][DOCS] fix: some minor typos - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/27 13:54:06 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42109: [SPARK-44404][SQL] Assign names to the error class _LEGACY_ERROR_TEMP_[1009,1010,1013,1015,1016,1278] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/27 13:57:01 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #41839: [SPARK-44287][SQL] Use PartitionEvaluator API in RowToColumnarExec & ColumnarToRowExec SQL operators. - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/27 16:10:49 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41964: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/27 16:34:04 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #42189: [SPARK-44361][SQL][FOLLOWUP] Use PartitionEvaluator API in MapInBatchExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/27 16:51:50 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #42189: [SPARK-44361][SQL][FOLLOWUP] Use PartitionEvaluator API in MapInBatchExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/27 16:52:13 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42157: [SPARK-43968][PYTHON] Improve error messages for Python UDTFs with wrong number of outputs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/27 17:02:09 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #42190: [SPARK-44574][SQL][CONNECT] Errors that moved into sq/api should also use AnalysisException - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/27 17:11:23 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #42190: [SPARK-44574][SQL][CONNECT] Errors that moved into sq/api should also use AnalysisException - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/27 17:11:36 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #41939: [SPARK-44341][SQL][PYTHON] Define the computing logic through PartitionEvaluator API and use it in WindowExec and WindowInPandasExec - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/07/27 17:28:01 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #42177: [SPARK-44059] Add better error messages - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/07/27 18:39:04 UTC, 0 replies.
- [GitHub] [spark] rednaxelafx commented on a diff in pull request #41964: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "rednaxelafx (via GitHub)" <gi...@apache.org> on 2023/07/27 18:57:01 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42191: [SPARK-44559][PYTHON] Improve error messages for Python UDTF arrow cast - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/27 19:00:53 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42177: [SPARK-44059] Add better error messages for SQL named argumnts - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/27 19:28:14 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42158: [SPARK-44548][PYTHON] Add support for pandas-on-Spark DataFrame assertDataFrameEqual - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/27 19:34:05 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42157: [SPARK-43968][PYTHON] Improve error messages for Python UDTFs with wrong number of outputs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/27 20:02:28 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #42191: [SPARK-44559][PYTHON] Improve error messages for Python UDTF arrow cast - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/27 20:13:19 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42157: [SPARK-43968][PYTHON] Improve error messages for Python UDTFs with wrong number of outputs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/27 20:18:07 UTC, 1 replies.
- [GitHub] [spark] ueshin closed pull request #42157: [SPARK-43968][PYTHON] Improve error messages for Python UDTFs with wrong number of outputs - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/27 20:18:49 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #42178: [SPARK-44560][PYTHON][CONNECT] Improve tests and documentation for Arrow Python UDF - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/27 20:45:17 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #42178: [SPARK-44560][PYTHON][CONNECT] Improve tests and documentation for Arrow Python UDF - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/27 20:45:55 UTC, 0 replies.
- [GitHub] [spark] ukby1234 commented on pull request #42155: [SPARK-44547][CORE] Ignore fallback storage for cached RDD migration - posted by "ukby1234 (via GitHub)" <gi...@apache.org> on 2023/07/27 20:51:51 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42174: [SPARK-44503][SQL] Add analysis and planning for PARTITION BY and ORDER BY clause after TABLE arguments for TVF calls - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/27 21:06:54 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #42192: [SPARK-43968][PYTHON][3.5] Improve error messages for Python UDTFs with wrong number of outputs - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/27 21:28:47 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #41964: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/27 22:17:07 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42191: [SPARK-44559][PYTHON] Improve error messages for Python UDTF arrow cast - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 22:48:50 UTC, 0 replies.
- [GitHub] [spark] bogao007 opened a new pull request, #42193: [SPARK-44432][SS][CONNECT] Terminate streaming queries when a session times out in Spark Connect - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/27 22:52:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42150: [SPARK-44425][CONNECT] Validate that user provided sessionId is an UUID - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 22:54:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42150: [SPARK-44425][CONNECT] Validate that user provided sessionId is an UUID - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/27 22:55:12 UTC, 0 replies.
- [GitHub] [spark] attilapiros closed pull request #41746: [SPARK-44198][CORE] Support propagation of the log level to the executors - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/07/27 23:40:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42118: [SPARK-44264][PYTHON]E2E Testing for Deepspeed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 00:14:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42118: [SPARK-44264][PYTHON]E2E Testing for Deepspeed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 00:15:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40819: [WIP][SPARK-43160][PYTHON]: Removed typing.io deprecated namespace - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/28 00:18:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40805: [SPARK-40609][SQL] Unwrap cast in the join condition to unlock bucketed read - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/28 00:18:50 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40771: [SPARK-35723] set k8s pod container request, limit memory separately. - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/28 00:18:52 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37928: [SPARK-40485][SQL] Add partitionColValues to the partitioning options of the JDBC data source - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/28 00:18:54 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #42164: [SPARK-44538][CONNECT][SQL] Reinstate Row.jsonValue and friends - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/28 00:51:46 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42194: [SPARK-41471][SQL] Reduce Spark shuffle when only one side of a join is KeyGroupedPartitioning - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/28 02:10:45 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42194: [SPARK-41471][SQL] Reduce Spark shuffle when only one side of a join is KeyGroupedPartitioning - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/28 02:11:04 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #41088: [SPARK-43402][SQL] FileSourceScanExec supports push down data filter with scalar subquery - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/28 02:13:11 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42164: [SPARK-44538][CONNECT][SQL] Reinstate Row.jsonValue and friends - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/28 02:17:08 UTC, 0 replies.
- [GitHub] [spark] advancedxy opened a new pull request, #42195: [SPARK-44542][CORE] Eagerly load SparkExitCode class in exception handler - posted by "advancedxy (via GitHub)" <gi...@apache.org> on 2023/07/28 02:29:52 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42196: Improve assertDataFrameEqual error message format - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/28 02:30:04 UTC, 0 replies.
- [GitHub] [spark] advancedxy commented on pull request #42195: [SPARK-44542][CORE] Eagerly load SparkExitCode class in exception handler - posted by "advancedxy (via GitHub)" <gi...@apache.org> on 2023/07/28 02:31:24 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42197: [SPARK-44567][INFRA] maven test - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 02:31:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42175: [SPARK-44558][CONNECT][PYTHON] Export Spark Log Level - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 02:34:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42175: [SPARK-44558][CONNECT][PYTHON] Export Spark Log Level - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 02:34:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42167: [SPARK-44554][INFRA] Make Python linter related checks pass of branch-3.3/3.4 daily testing - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/28 02:35:34 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42167: [SPARK-44554][INFRA] Make Python linter related checks pass of branch-3.3/3.4 daily testing - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 02:36:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42197: [SPARK-44567][INFRA] maven test - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 02:52:46 UTC, 0 replies.
- [GitHub] [spark] advancedxy commented on pull request #37616: [SPARK-40178][PYTHON][SQL] Fix partitioning hint parameters in PySpark - posted by "advancedxy (via GitHub)" <gi...@apache.org> on 2023/07/28 03:16:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42185: [SPARK-44287][SQL][FOLLOWUP] Set partition index correctly - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/28 03:37:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42197: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 03:39:08 UTC, 0 replies.
- [GitHub] [spark] zhaomin1423 opened a new pull request, #42198: Remove redundant method parameter in kafka sink - posted by "zhaomin1423 (via GitHub)" <gi...@apache.org> on 2023/07/28 03:40:43 UTC, 1 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #41782: [SPARK-44239][SQL] Free memory allocated by large vectors when vectors are reset - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/07/28 03:44:56 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42115: [SPARK-44524][BUILD] Balancing pyspark-pandas-connect and pyspark-pandas-slow-connect GA testing time - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/28 03:45:07 UTC, 1 replies.
- [GitHub] [spark] zhaomin1423 closed pull request #42198: Remove redundant method parameter in kafka sink - posted by "zhaomin1423 (via GitHub)" <gi...@apache.org> on 2023/07/28 03:45:46 UTC, 0 replies.
- [GitHub] [spark] zhaomin1423 commented on pull request #42198: Remove redundant method parameter in kafka sink - posted by "zhaomin1423 (via GitHub)" <gi...@apache.org> on 2023/07/28 04:09:58 UTC, 3 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #42199: [SPARK-44579][SQL] Support Interrupt On Cancel in SQLExecution - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/28 04:13:46 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #42112: [SPARK-44493][SQL] Support for translating catalyst expressions into partial datasource filters - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/07/28 04:20:28 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #40921: [SPARK-43242] fix throw 'Unexpected type of BlockId' in diagnose when… - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/28 04:21:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42110: [SPARK-42098][SQL] Fix ResolveInlineTables can not handle with RuntimeReplaceable expression - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/28 04:44:21 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #42085: [SPARK-44490][WEBUI] Remove `TaskPagedTable` in StagePage - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/28 04:46:51 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42189: [SPARK-44361][SQL][FOLLOWUP] Use PartitionEvaluator API in MapInBatchExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/28 04:48:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42085: [SPARK-44490][WEBUI] Remove `TaskPagedTable` in StagePage - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 04:49:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42189: [SPARK-44361][SQL][FOLLOWUP] Use PartitionEvaluator API in MapInBatchExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/28 04:49:25 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42158: [SPARK-44548][PYTHON] Add support for pandas-on-Spark DataFrame assertDataFrameEqual - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/28 04:51:57 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #42168: [SPARK-44556][SQL] Reuse `OrcTail` when enable vectorizedReader - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/28 04:58:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #41802: [SPARK-44256][BUILD] Upgrade rocksdbjni to 8.3.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 05:33:38 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42085: [SPARK-44490][WEBUI] Remove `TaskPagedTable` in StagePage - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/28 05:34:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42158: [SPARK-44548][PYTHON] Add support for pandas-on-Spark DataFrame assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 05:42:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42196: Improve assertDataFrameEqual error message format - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 05:46:00 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42196: Improve assertDataFrameEqual error message format - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 05:46:56 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #41802: [SPARK-44256][BUILD] Upgrade rocksdbjni to 8.3.2 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/28 05:53:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42198: Remove redundant method parameter in kafka sink - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 06:25:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42197: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 06:28:34 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42192: [SPARK-43968][PYTHON][3.5] Improve error messages for Python UDTFs with wrong number of outputs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 06:30:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42192: [SPARK-43968][PYTHON][3.5] Improve error messages for Python UDTFs with wrong number of outputs - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 06:30:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42197: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 06:46:42 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42193: [SPARK-44432][SS][CONNECT] Terminate streaming queries when a session times out in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 07:03:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42085: [SPARK-44490][WEBUI] Remove `TaskPagedTable` in StagePage - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/28 07:04:56 UTC, 0 replies.
- [GitHub] [spark] sarutak commented on a diff in pull request #42085: [SPARK-44490][WEBUI] Remove `TaskPagedTable` in StagePage - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/07/28 07:58:49 UTC, 0 replies.
- [GitHub] [spark] sarutak commented on pull request #42085: [SPARK-44490][WEBUI] Remove `TaskPagedTable` in StagePage - posted by "sarutak (via GitHub)" <gi...@apache.org> on 2023/07/28 08:04:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42197: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 08:50:44 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42197: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 08:50:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42200: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 08:54:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42197: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/28 08:57:40 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42201: [MINOR][TESTS] Clearing residual files after SparkSubmitSuite - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/28 09:04:32 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #42201: [MINOR][TESTS] Clearing residual files after SparkSubmitSuite - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/28 09:09:19 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42202: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven #42200 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 09:12:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42200: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 09:12:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42202: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven #42200 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 09:13:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42198: Remove redundant method parameter in kafka sink - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 09:16:24 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42203: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 09:19:25 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #42197: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/07/28 09:22:18 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #42167: [SPARK-44554][INFRA] Make Python linter related checks pass of branch-3.3/3.4 daily testing - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/07/28 09:25:08 UTC, 4 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42204: Test maven 393 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 09:47:46 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42204: Test maven 393 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 10:03:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #42165: [SPARK-44552][SQL] Remove `private object ParseState` definition from `IntervalUtils` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 10:10:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42165: [SPARK-44552][SQL] Remove `private object ParseState` definition from `IntervalUtils` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 10:11:24 UTC, 0 replies.
- [GitHub] [spark] Don-Burns commented on pull request #33428: [SPARK-36220][PYTHON] Fix pyspark.sql.types.Row type annotation - posted by "Don-Burns (via GitHub)" <gi...@apache.org> on 2023/07/28 10:12:23 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #42199: [SPARK-44579][SQL] Support Interrupt On Cancel in SQLExecution - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/28 10:20:17 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #42205: [SPARK-44583][DOC] `spark.*.io.connectionCreationTimeout` parameter documentation - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/07/28 10:26:20 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42115: [SPARK-44524][BUILD] Balancing pyspark-pandas-connect and pyspark-pandas-slow-connect GA testing time - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/28 10:38:04 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #42206: [SPARK-44582] Skip iterator on SMJ if it was cleaned up - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/07/28 10:47:40 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42164: [SPARK-44538][CONNECT][SQL] Reinstate Row.jsonValue and friends - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/28 11:03:40 UTC, 0 replies.
- [GitHub] [spark] guilhem-depop opened a new pull request, #42207: [SPARK-XXXX][MLLIB] Fix warning condition in MLLib RankingMetrics ndcgAk - posted by "guilhem-depop (via GitHub)" <gi...@apache.org> on 2023/07/28 11:07:21 UTC, 0 replies.
- [GitHub] [spark] guilhem-depop commented on pull request #42207: [SPARK-XXXX][MLLIB] Fix warning condition in MLLib RankingMetrics ndcgAk - posted by "guilhem-depop (via GitHub)" <gi...@apache.org> on 2023/07/28 11:12:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42144: [SPARK-37377][SQL][FOLLOWUP] Fix the partitioned join of one side test case not match - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/28 11:13:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42144: [SPARK-37377][SQL][FOLLOWUP] Fix the partitioned join of one side test case not match - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/28 11:15:11 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #42206: [SPARK-44582] Skip iterator on SMJ if it was cleaned up - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/07/28 11:22:33 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42208: [SPARK-44340][SQL][FOLLOWUP] Set partition index correctly for WindowGroupLimitExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/28 11:37:09 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42207: [SPARK-XXXX][MLLIB] Fix warning condition in MLLib RankingMetrics ndcgAk - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/28 11:39:50 UTC, 1 replies.
- [GitHub] [spark] vicennial opened a new pull request, #42209: [SPARK-44584][CONNECT] Set client_type information for AddArtifactsRequest and ArtifactStatusesRequest in Scala Client - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/28 12:57:27 UTC, 0 replies.
- [GitHub] [spark] guilhem-depop commented on pull request #42207: [SPARK-44585][MLLIB] Fix warning condition in MLLib RankingMetrics ndcgAk - posted by "guilhem-depop (via GitHub)" <gi...@apache.org> on 2023/07/28 13:24:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42210: [SPARK-44586][INFRA][ML][PYTHON] `TorchDistributor` should install cpu-only Torch for testing - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/28 13:32:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42210: [SPARK-44586][INFRA][ML][PYTHON] `TorchDistributor` should install cpu-only Torch for testing - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/28 13:34:14 UTC, 0 replies.
- [GitHub] [spark] bogdanghit commented on a diff in pull request #41964: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "bogdanghit (via GitHub)" <gi...@apache.org> on 2023/07/28 13:39:09 UTC, 0 replies.
- [GitHub] [spark] danepitkin commented on pull request #41681: [SPARK-44128][BUILD] Upgrade netty to 4.1.93 - posted by "danepitkin (via GitHub)" <gi...@apache.org> on 2023/07/28 14:25:06 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #32882: [WIP][SPARK-35724][SQL] Support traversal pruning in extendedResolutionRules and postHocResolutionRules - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/28 14:25:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42184: [SPARK-41400][CONNECT] Remove Connect Client Catalyst Dependency - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/28 14:52:37 UTC, 0 replies.
- [GitHub] [spark] heyihong opened a new pull request, #42211: [SPARK-44587] Increase Protobuf Marshaller Recursion Limit - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/07/28 14:53:15 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42211: [SPARK-44587] Increase Protobuf Marshaller Recursion Limit - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/28 15:12:34 UTC, 2 replies.
- [GitHub] [spark] grundprinzip closed pull request #41080: [WIP] Initial go client - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/28 15:13:17 UTC, 0 replies.
- [GitHub] [spark] heyihong commented on pull request #42211: [SPARK-44587] Increase Protobuf Marshaller Recursion Limit - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/07/28 15:43:12 UTC, 0 replies.
- [GitHub] [spark] heyihong closed pull request #42211: [SPARK-44587] Increase Protobuf Marshaller Recursion Limit - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/07/28 15:43:15 UTC, 0 replies.
- [GitHub] [spark] heyihong opened a new pull request, #42212: [SPARK-44587] Increase Protobuf Marshaller Recursion Limit - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/07/28 15:44:18 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42195: [SPARK-44542][CORE] Eagerly load SparkExitCode class in exception handler - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/28 16:16:03 UTC, 1 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42212: [SPARK-44587] Increase Protobuf Marshaller Recursion Limit - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/28 16:16:14 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on pull request #42184: [SPARK-41400][CONNECT] Remove Connect Client Catalyst Dependency - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/28 16:23:21 UTC, 0 replies.
- [GitHub] [spark] zeruibao commented on a diff in pull request #41052: [SPARK-43380][SQL] Fix Avro data type conversion issues to avoid producing incorrect results - posted by "zeruibao (via GitHub)" <gi...@apache.org> on 2023/07/28 16:34:56 UTC, 5 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #41052: [SPARK-43380][SQL] Fix Avro data type conversion issues to avoid producing incorrect results - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/28 17:11:00 UTC, 1 replies.
- [GitHub] [spark] heyihong opened a new pull request, #42213: [SPARK-44590] Remove the arrow batch record limit for SqlCommandResult - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2023/07/28 17:16:00 UTC, 0 replies.
- [GitHub] [spark-docker] galacticgumshoe commented on a diff in pull request #52: Add Support for Scala 2.13 in Spark 3.4.1 - posted by "galacticgumshoe (via GitHub)" <gi...@apache.org> on 2023/07/28 18:14:47 UTC, 2 replies.
- [GitHub] [spark] bogao007 commented on a diff in pull request #42193: [SPARK-44432][SS][CONNECT] Terminate streaming queries when a session times out in Spark Connect - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/28 18:45:42 UTC, 0 replies.
- [GitHub] [spark-docker] galacticgumshoe commented on pull request #52: Add Support for Scala 2.13 in Spark 3.4.1 - posted by "galacticgumshoe (via GitHub)" <gi...@apache.org> on 2023/07/28 18:55:29 UTC, 0 replies.
- [GitHub] [spark] henrymai opened a new pull request, #42214: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks - posted by "henrymai (via GitHub)" <gi...@apache.org> on 2023/07/28 19:33:36 UTC, 0 replies.
- [GitHub] [spark] rayhondo opened a new pull request, #42215: Raykim/affirm 3.3.2 voluptuous - posted by "rayhondo (via GitHub)" <gi...@apache.org> on 2023/07/28 20:40:44 UTC, 0 replies.
- [GitHub] [spark] rayhondo closed pull request #42215: Raykim/affirm 3.3.2 voluptuous - posted by "rayhondo (via GitHub)" <gi...@apache.org> on 2023/07/28 20:41:07 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42023: [SPARK-44446][PYTHON] Add checks for expected list type special cases - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/28 21:04:00 UTC, 0 replies.
- [GitHub] [spark] asl3 commented on a diff in pull request #42023: [SPARK-44446][PYTHON] Add checks for expected list type special cases - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/28 21:09:46 UTC, 0 replies.
- [GitHub] [spark] learningchess2003 commented on pull request #42177: [SPARK-44059] Add better error messages for SQL named arguments - posted by "learningchess2003 (via GitHub)" <gi...@apache.org> on 2023/07/28 21:15:07 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #42191: [SPARK-44559][PYTHON] Improve error messages for Python UDTF arrow cast - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/07/28 22:15:10 UTC, 0 replies.
- [GitHub] [spark] ShreyeshArangath commented on pull request #42213: [SPARK-44590] Remove the arrow batch record limit for SqlCommandResult - posted by "ShreyeshArangath (via GitHub)" <gi...@apache.org> on 2023/07/28 22:28:58 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #42207: [SPARK-44585][MLLIB] Fix warning condition in MLLib RankingMetrics ndcgAk - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/28 22:29:56 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42207: [SPARK-44585][MLLIB] Fix warning condition in MLLib RankingMetrics ndcgAk - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/28 22:30:46 UTC, 0 replies.
- [GitHub] [spark] jdesjean opened a new pull request, #42216: [CONNECT][CORE][SQL][SPARK-44591] Add jobTags to SparkListenerSQLExecutionStart - posted by "jdesjean (via GitHub)" <gi...@apache.org> on 2023/07/28 22:42:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42069: [SPARK-43744][CONNECT] Fix class loading problem caused by stub user classes not found on the server classpath - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/29 00:04:51 UTC, 10 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40819: [WIP][SPARK-43160][PYTHON]: Removed typing.io deprecated namespace - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/29 00:18:52 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40805: [SPARK-40609][SQL] Unwrap cast in the join condition to unlock bucketed read - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/29 00:18:53 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40771: [SPARK-35723] set k8s pod container request, limit memory separately. - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/29 00:18:54 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42217: [WIP] Migrate test_sql assert_eq to assertDataFrameEqual - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/29 00:43:15 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42218: Fix pandas-on-Spark type checks for assertDataFrameEqual - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/29 00:53:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42219: [SPARK-44593][INFRA] Make `breaking-changes-buf` cancelable - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/29 00:56:24 UTC, 0 replies.
- [GitHub] [spark] zhaomin1423 commented on pull request #42198: [SPARK-44594][KAFKA] Remove redundant method parameter in kafka connector - posted by "zhaomin1423 (via GitHub)" <gi...@apache.org> on 2023/07/29 00:59:27 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42184: [SPARK-41400][CONNECT] Remove Connect Client Catalyst Dependency - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/29 01:31:10 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42220: [SPARK-44577][SQL] Fix INSERT BY NAME returns nonsensical error message - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/29 01:51:45 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42220: [SPARK-44577][SQL] Fix INSERT BY NAME returns nonsensical error message - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/29 01:53:21 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42190: [SPARK-44574][SQL][CONNECT] Errors that moved into sql/api should also use AnalysisException - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/29 02:46:44 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42190: [SPARK-44574][SQL][CONNECT] Errors that moved into sql/api should also use AnalysisException - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/29 02:47:11 UTC, 0 replies.
- [GitHub] [spark] zhaomin1423 opened a new pull request, #42221: [SPARK-44595][CONNECT] Make the user session cache number and cache time be configurable in spark connect service - posted by "zhaomin1423 (via GitHub)" <gi...@apache.org> on 2023/07/29 02:53:50 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42069: [SPARK-43744][CONNECT] Fix class loading problem caused by stub user classes not found on the server classpath - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/29 02:58:53 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42069: [SPARK-43744][CONNECT] Fix class loading problem caused by stub user classes not found on the server classpath - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/29 02:59:20 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #42208: [SPARK-44340][SPARK-44341][SQL][PYTHON][FOLLOWUP] Set partition index correctly for WindowGroupLimitExec,WindowExec and WindowInPandasExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/29 03:03:24 UTC, 2 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42221: [SPARK-44595][CONNECT] Make the user session cache number and cache time be configurable in spark connect service - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/29 03:49:25 UTC, 0 replies.
- [GitHub] [spark] zhaomin1423 commented on pull request #42221: [SPARK-44595][CONNECT] Make the user session cache number and cache time be configurable in spark connect service - posted by "zhaomin1423 (via GitHub)" <gi...@apache.org> on 2023/07/29 03:56:28 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #42222: [SPARK-43744][CONNECT][Followup]Throw error from the constructor - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/29 04:03:25 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/29 08:02:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42160: [SPARK-43838][SQL][FOLLOWUP] Add missing aggregate in `renewDuplicatedRelations` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/29 10:40:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42160: [SPARK-43838][SQL][FOLLOWUP] Add missing aggregate in `renewDuplicatedRelations` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/29 10:42:07 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42216: [SPARK-44591][CONNECT][SQL] Add jobTags to SparkListenerSQLExecutionStart - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/29 13:16:20 UTC, 2 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42069: [SPARK-43744][CONNECT] Fix class loading problem caused by stub user classes not found on the server classpath - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/29 13:30:37 UTC, 7 replies.
- [GitHub] [spark] gengliangwang closed pull request #41964: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/29 16:20:23 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #42216: [SPARK-44591][CONNECT][SQL] Add jobTags to SparkListenerSQLExecutionStart - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/29 16:28:17 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42224: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/29 16:37:55 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42224: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/29 16:38:19 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42224: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/29 20:44:13 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #42224: [SPARK-44394][CONNECT][WEBUI] Add a Spark UI page for Spark Connect - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/29 20:44:14 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42216: [SPARK-44591][CONNECT][SQL] Add jobTags to SparkListenerSQLExecutionStart - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/29 20:44:28 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #42216: [SPARK-44591][CONNECT][SQL] Add jobTags to SparkListenerSQLExecutionStart - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/29 20:44:50 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40730: [SPARK-43086][CORE] Support bin pack task scheduling on executors - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/30 00:21:46 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40638: [SPARK-42774][SQL]Expose VectorTypes API for DataSourceV2 Batch Scans - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/30 00:21:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42210: [SPARK-44586][INFRA][ML][PYTHON] `TorchDistributor` should install cpu-only Torch for testing - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:37:10 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42199: [SPARK-44579][SQL] Support Interrupt On Cancel in SQLExecution - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:41:05 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42193: [SPARK-44432][SS][CONNECT] Terminate streaming queries when a session times out in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:41:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42193: [SPARK-44432][SS][CONNECT] Terminate streaming queries when a session times out in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:42:06 UTC, 0 replies.
- [GitHub] [spark] vicennial opened a new pull request, #42225: [SPARK-43997][CONNECT] Add support for Java UDFs - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/30 01:42:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42218: [SPARK-44596] Fix pandas-on-Spark type checks for assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:50:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42218: [SPARK-44596] Fix pandas-on-Spark type checks for assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:50:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42201: [MINOR][TESTS] Clearing residual files after SparkSubmitSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:53:21 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42209: [SPARK-44584][CONNECT] Set client_type information for AddArtifactsRequest and ArtifactStatusesRequest in Scala Client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:54:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42209: [SPARK-44584][CONNECT] Set client_type information for AddArtifactsRequest and ArtifactStatusesRequest in Scala Client - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:54:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42213: [SPARK-44590] Remove the arrow batch record limit for SqlCommandResult - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:56:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42213: [SPARK-44590][SQL][CONNECT] Remove the arrow batch record limit for SqlCommandResult - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:56:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42213: [SPARK-44590][SQL][CONNECT] Remove the arrow batch record limit for SqlCommandResult - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:56:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42219: [SPARK-44593][INFRA] Make `breaking-changes-buf` cancelable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:57:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42219: [SPARK-44593][INFRA] Make `breaking-changes-buf` cancelable - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:57:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42222: [SPARK-43744][CONNECT][FOLLOW-UP]Throw error from the constructor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:58:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42222: [SPARK-43744][CONNECT][FOLLOW-UP]Throw error from the constructor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/30 01:58:38 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #42225: [SPARK-43997][CONNECT] Add support for Java UDFs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/30 03:14:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42226: [SPARK-44287][SQL][FOLLOWUP] Do not trigger execution too early - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/30 03:35:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42226: [SPARK-44287][SQL][FOLLOWUP] Do not trigger execution too early - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/30 03:35:36 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X opened a new pull request, #42227: [SPARK-35148][SQL] Support traversal pruning in transformUpWithNewOutput - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/30 04:03:21 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on pull request #42227: [SPARK-35148][SQL] Support traversal pruning in transformUpWithNewOutput - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/30 04:06:21 UTC, 0 replies.
- [GitHub] [spark] xy2953396112 commented on pull request #38534: [SPARK-38505][SQL] Make partial aggregation adaptive - posted by "xy2953396112 (via GitHub)" <gi...@apache.org> on 2023/07/30 07:30:57 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42212: [SPARK-44587] Increase protobuf marshaller recursion limit - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/30 16:34:03 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on pull request #42212: [SPARK-44587] Increase protobuf marshaller recursion limit - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/30 17:17:02 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski opened a new pull request, #42228: [SPARK-44421][CONNECT] Reattachable execution in Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/30 18:25:08 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #42193: [SPARK-44432][SS][CONNECT] Terminate streaming queries when a session times out in Spark Connect - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/30 22:50:47 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #42193: [SPARK-44432][SS][CONNECT] Terminate streaming queries when a session times out in Spark Connect - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/07/30 22:54:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42217: [SPARK-44597][PYTHON][TESTS] Migrate test_sql assert_eq to use assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 00:02:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42217: [SPARK-44597][PYTHON][TESTS] Migrate test_sql assert_eq to use assertDataFrameEqual - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 00:03:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42226: [SPARK-44287][SQL][FOLLOWUP] Do not trigger execution too early - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 00:15:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42226: [SPARK-44287][SQL][FOLLOWUP] Do not trigger execution too early - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 00:16:13 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40897: [SPARK-43228][SQL] Join keys also match PartitioningCollection in CoalesceBucketsInJoin - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/31 00:20:20 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40824: [SPARK-32064][SQL] Support temporary table - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/31 00:20:22 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40790: [SPARK-43116][SQL] Fix Cast.forceNullable - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/31 00:20:23 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40730: [SPARK-43086][CORE] Support bin pack task scheduling on executors - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/31 00:20:24 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #40638: [SPARK-42774][SQL]Expose VectorTypes API for DataSourceV2 Batch Scans - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/31 00:20:25 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40616: [SPARK-42991][SQL] Disable string type +/- interval in ANSI mode - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/31 00:20:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #40266: [SPARK-42660][SQL] Infer filters for Join produced by IN and EXISTS clause (RewritePredicateSubquery rule) - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2023/07/31 00:20:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42206: [SPARK-44582][SQL] Skip iterator on SMJ if it was cleaned up - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 00:33:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42212: [SPARK-44587][SQL][CONNECT] Increase protobuf marshaller recursion limit - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 00:39:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42212: [SPARK-44587][SQL][CONNECT] Increase protobuf marshaller recursion limit - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 00:39:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42210: [SPARK-44586][INFRA][ML][PYTHON] `TorchDistributor` should install cpu-only Torch for testing - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/31 00:49:05 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42229: [MINOR][TESTS] Rename ArrowParityTests to JobCancellationTests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 01:08:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #42115: [SPARK-44524][BUILD] Balancing pyspark-pandas-connect and pyspark-pandas-slow-connect GA testing time - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/31 01:13:31 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42201: [MINOR][TESTS] Clearing residual files after SparkSubmitSuite - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/31 01:38:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42201: [MINOR][TESTS] Clearing residual files after SparkSubmitSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 01:40:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42201: [MINOR][TESTS] Clearing residual files after SparkSubmitSuite - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 01:40:33 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #42199: [SPARK-44579][SQL] Support Interrupt On Cancel in SQLExecution - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/31 02:02:34 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #42199: [SPARK-44579][SQL] Support Interrupt On Cancel in SQLExecution - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/31 02:02:49 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/31 02:06:27 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42203: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/31 02:08:15 UTC, 1 replies.
- [GitHub] [spark] yaooqinn closed pull request #42153: [DOCS] Update concat and concat_ws documentation to point out unexpected behavior - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2023/07/31 02:11:40 UTC, 0 replies.
- [GitHub] [spark] ulysses-you closed pull request #41088: [SPARK-43402][SQL] FileSourceScanExec supports push down data filter with scalar subquery - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/31 02:15:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42203: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/31 02:18:25 UTC, 5 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42203: [SPARK-44567][INFRA] Add a new Daily testing GitHub Action job for Maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/31 02:21:16 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42229: [MINOR][TESTS] Rename ArrowParityTests to JobCancellationTests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 02:40:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42229: [MINOR][TESTS] Rename ArrowParityTests to JobCancellationTests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 02:40:32 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #41755: [SPARK-43999][SQL][CORE] Support force finish useless stage when AQE on - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/07/31 02:58:36 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #42195: [SPARK-44542][CORE] Eagerly load SparkExitCode class in exception handler - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/31 03:12:49 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #42195: [SPARK-44542][CORE] Eagerly load SparkExitCode class in exception handler - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/07/31 03:12:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #42228: [SPARK-44421][CONNECT] Reattachable execution in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 03:18:36 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42225: [SPARK-43997][CONNECT] Add support for Java UDFs - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/31 03:21:52 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #40790: [SPARK-43116][SQL] Fix Cast.forceNullable - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/31 03:24:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42230: [SPARK-44602][SQL][CONNECT][PS] Make `WidenSetOperationTypes` retains the `Plan_ID_TAG` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/31 03:57:34 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #42198: [SPARK-44594][KAFKA] Remove redundant method parameter in kafka connector - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/31 04:02:00 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42231: [SPARK-44603] Add pyspark.testing to setup.py - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/31 04:54:58 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #40972: [SPARK-43301][CORE][SHUFFLE] BlockStoreClient getHostLocalDirs RPC supports IOException retry - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/31 05:11:24 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #42136: [SPARK-43100][CORE] Mismatch of field name in log event writer and parser for push shuffle metrics - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/31 05:14:45 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #41489: [SPARK-43987][Shuffle] Separate finalizeShuffleMerge Processing to Dedicated Thread Pools - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/31 05:52:19 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42208: [SPARK-44340][SPARK-44341][SQL][PYTHON][FOLLOWUP] Set partition index correctly for WindowGroupLimitExec,WindowExec and WindowInPandasExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 05:53:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42208: [SPARK-44340][SPARK-44341][SQL][PYTHON][FOLLOWUP] Set partition index correctly for WindowGroupLimitExec,WindowExec and WindowInPandasExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 05:53:48 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #42214: [SPARK-44588][CORE] Fix double encryption issue for migrated shuffle blocks - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/07/31 05:57:09 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42232: [SPARK-44604][BUILD] Upgrade Netty to 4.1.96.Final - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/31 05:58:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42220: [SPARK-44577][SQL] Fix INSERT BY NAME returns nonsensical error message - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 05:59:00 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #42233: [SPARK-44340][SQL][PYTHON][FOLLOWUP][3.5] Set partition index correctly for WindowGroupLimitExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/31 06:25:57 UTC, 0 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #42220: [SPARK-44577][SQL] Fix INSERT BY NAME returns nonsensical error message - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/31 06:51:54 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #42230: [SPARK-44602][SQL][CONNECT][PS] Make `WidenSetOperationTypes` retain the `Plan_ID_TAG` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/31 07:03:40 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42234: [SPARK-44605][CORE] Refine internal ShuffleWriteProcessor API - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 07:21:49 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 08:00:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #42235: [SPARK-44599][CONNECT][PYTHON] Python client for reattaching to existing execute in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 08:01:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #41554: [SPARK-43781][SQL] Fix IllegalStateException when cogrouping two datasets derived from the same source - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 08:06:22 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42236: [SPARK-43646][CONNECT] Test proto case - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/31 08:21:57 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/31 08:24:34 UTC, 2 replies.
- [GitHub] [spark] Hisoka-X commented on a diff in pull request #41554: [SPARK-43781][SQL] Fix IllegalStateException when cogrouping two datasets derived from the same source - posted by "Hisoka-X (via GitHub)" <gi...@apache.org> on 2023/07/31 08:43:26 UTC, 1 replies.
- [GitHub] [spark] eejbyfeldt commented on pull request #41876: [SPARK-44311][CONNECT][SQL] Improved support for UDFs on value classes - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/07/31 08:48:45 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39673: [SPARK-42132][SQL] Deduplicate attributes in groupByKey.cogroup - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 08:53:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42220: [SPARK-44577][SQL] Fix INSERT BY NAME returns nonsensical error message - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 08:57:09 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/07/31 09:19:55 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #42223: [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/07/31 09:32:57 UTC, 5 replies.
- [GitHub] [spark] judahrand opened a new pull request, #42237: WIP: `ArrowConversionMixin` - posted by "judahrand (via GitHub)" <gi...@apache.org> on 2023/07/31 09:35:14 UTC, 0 replies.
- [GitHub] [spark] judahrand closed pull request #42237: WIP: `ArrowConversionMixin` - posted by "judahrand (via GitHub)" <gi...@apache.org> on 2023/07/31 09:36:21 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #42238: clear error - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/31 10:15:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42238: clear error - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/31 10:34:08 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42230: [SPARK-44602][SQL][CONNECT][PS] Make `WidenSetOperationTypes` retain the `Plan_ID_TAG` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 10:36:59 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #42230: [SPARK-44602][SQL][CONNECT][PS] Make `WidenSetOperationTypes` retain the `Plan_ID_TAG` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 10:38:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42086: [SPARK-43611][SQL][PS][CONNECT] Make `ExtractWindowExpressions` retain the `PLAN_ID_TAG` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 10:40:02 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42238: clear error - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/31 11:06:52 UTC, 1 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #42225: [SPARK-43997][CONNECT] Add support for Java UDFs - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/07/31 11:08:27 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #42238: Clear some unused codes in "***Errors" and extract some common logic. - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/31 11:13:28 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #42238: Clear some unused codes in "***Errors" and extract some common logic. - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/07/31 11:15:08 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42239: [SPARK-44607][SQL] Remove unused function `containsNestedColumn` from `Filter` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/31 11:42:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #42240: [SPARK-44608][SQL] Remove unused definitions from `DataTypeExpression` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/07/31 11:48:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42234: [SPARK-44605][CORE] Refine internal ShuffleWriteProcessor API - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 12:03:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42234: [SPARK-44605][CORE] Refine internal ShuffleWriteProcessor API - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 12:04:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #42241: [WIP][INFRA] Free up disk space for non-container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/31 12:06:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #42241: [WIP][INFRA] Free up disk space for non-container jobs - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/07/31 12:07:29 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #42233: [SPARK-44340][SQL][FOLLOWUP][3.5] Set partition index correctly for WindowGroupLimitExec - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/07/31 12:57:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42233: [SPARK-44340][SQL][FOLLOWUP][3.5] Set partition index correctly for WindowGroupLimitExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 12:58:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #42233: [SPARK-44340][SQL][FOLLOWUP][3.5] Set partition index correctly for WindowGroupLimitExec - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 12:59:39 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #41876: [SPARK-44311][CONNECT][SQL] Improved support for UDFs on value classes - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/31 13:31:49 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #42242: [SPARK-44610][SQL] DeduplicateRelations should retain Alias metadata when creating a new instance - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 14:49:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #42242: [SPARK-44610][SQL] DeduplicateRelations should retain Alias metadata when creating a new instance - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/07/31 14:49:32 UTC, 0 replies.
- [GitHub] [spark] thejdeep commented on pull request #42136: [SPARK-43100][CORE] Mismatch of field name in log event writer and parser for push shuffle metrics - posted by "thejdeep (via GitHub)" <gi...@apache.org> on 2023/07/31 15:25:48 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 opened a new pull request, #42243: [SPARK-38475] Use error class in org.apache.spark.serializer - posted by "bozhang2820 (via GitHub)" <gi...@apache.org> on 2023/07/31 15:30:18 UTC, 0 replies.
- [GitHub] [spark] jasonli-db opened a new pull request, #42244: [SPARK-44591][CONNECT][WEBUI] Use jobTags in SparkListenerSQLExecutionStart to link SQL Execution IDs for Spark UI Connect page - posted by "jasonli-db (via GitHub)" <gi...@apache.org> on 2023/07/31 16:57:12 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #42225: [SPARK-43997][CONNECT] Add support for Java UDFs - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/31 16:58:01 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #42225: [SPARK-43997][CONNECT] Add support for Java UDFs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/31 16:58:31 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #42225: [SPARK-43997][CONNECT] Add support for Java UDFs - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/31 16:59:29 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42231: [SPARK-44603] Add pyspark.testing to setup.py - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/31 17:47:23 UTC, 0 replies.
- [GitHub] [spark] ueshin closed pull request #42231: [SPARK-44603] Add pyspark.testing to setup.py - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/31 17:48:47 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42228: [SPARK-44421][CONNECT] Reattachable execution in Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/31 17:51:17 UTC, 7 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42191: [SPARK-44559][PYTHON] Improve error messages for Python UDTF arrow cast - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/31 17:52:07 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42180: [SPARK-44562][SQL] Add OptimizeOneRowRelationSubquery in batch of Subquery - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/31 17:56:43 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #42228: [SPARK-44421][CONNECT] Reattachable execution in Spark Connect - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/07/31 18:00:13 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #42196: [SPARK-44218] Customize diff log in assertDataFrameEqual error message format - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/07/31 18:01:03 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42245: [SPARK-29497][CONNECT] Throw error when UDF is not deserializable. - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/31 18:28:32 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #42246: [SPARK-44611][CONNECT] Do not exclude scala-xml - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/07/31 18:32:18 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #42242: [SPARK-44610][SQL] DeduplicateRelations should retain Alias metadata when creating a new instance - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/31 18:36:40 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #42242: [SPARK-44610][SQL] DeduplicateRelations should retain Alias metadata when creating a new instance - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/07/31 18:37:09 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #42191: [SPARK-44559][PYTHON] Improve error messages for Python UDTF arrow cast - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/31 18:37:32 UTC, 0 replies.
- [GitHub] [spark] cg505 opened a new pull request, #42247: rename spark connect client suites to avoid conflict - posted by "cg505 (via GitHub)" <gi...@apache.org> on 2023/07/31 18:40:29 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #42247: rename spark connect client suites to avoid conflict - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/31 18:42:30 UTC, 1 replies.
- [GitHub] [spark] ueshin opened a new pull request, #42248: [SPARK-44614][PYTHON][SQL][CONNECT] Add missing packages in setup.py - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/31 18:45:03 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42248: [SPARK-44614][PYTHON][SQL][CONNECT] Add missing packages in setup.py - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/31 18:46:54 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42191: [SPARK-44559][PYTHON] Improve error messages for Python UDTF arrow cast - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/31 18:55:19 UTC, 1 replies.
- [GitHub] [spark] ueshin closed pull request #42191: [SPARK-44559][PYTHON] Improve error messages for Python UDTF arrow cast - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/31 18:55:59 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #42246: [SPARK-44611][CONNECT] Do not exclude scala-xml - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/31 19:57:33 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #42245: [SPARK-29497][CONNECT] Throw error when UDF is not deserializable. - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2023/07/31 20:02:14 UTC, 1 replies.
- [GitHub] [spark] zhenlineo closed pull request #41596: [SPARK-43415][Connect] Alternative mapValues for KeyValueGroupedDS#agg using withContext - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/31 21:22:15 UTC, 0 replies.
- [GitHub] [spark] zhenlineo closed pull request #41501: [SPARK-43415][Connect] Add mapValues for KeyValueGroupedDS#agg using select rewrite - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/07/31 21:22:25 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42116: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/31 21:36:25 UTC, 1 replies.
- [GitHub] [spark] ueshin closed pull request #42116: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/31 21:37:04 UTC, 0 replies.
- [GitHub] [spark] bogao007 opened a new pull request, #42249: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/31 22:04:16 UTC, 0 replies.
- [GitHub] [spark] bogao007 commented on pull request #42116: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/31 22:06:45 UTC, 0 replies.
- [GitHub] [spark] bogao007 opened a new pull request, #42250: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/31 22:16:29 UTC, 0 replies.
- [GitHub] [spark] andygrove commented on a diff in pull request #39555: [SPARK-42051][SQL] Codegen Support for HiveGenericUDF - posted by "andygrove (via GitHub)" <gi...@apache.org> on 2023/07/31 22:18:17 UTC, 0 replies.
- [GitHub] [spark] bogao007 closed pull request #42249: [SPARK-42941][SS][CONNECT] Python StreamingQueryListener - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2023/07/31 22:31:45 UTC, 0 replies.
- [GitHub] [spark] juliuszsompolski commented on a diff in pull request #42235: [SPARK-44599][CONNECT][PYTHON] Python client for reattaching to existing execute in Spark Connect - posted by "juliuszsompolski (via GitHub)" <gi...@apache.org> on 2023/07/31 22:47:50 UTC, 0 replies.
- [GitHub] [spark] asl3 opened a new pull request, #42251: [SPARK-44617] Support comparison between lists of Rows - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2023/07/31 22:56:25 UTC, 0 replies.
- [GitHub] [spark] ueshin commented on pull request #42248: [SPARK-44614][PYTHON][CONNECT] Add missing packages in setup.py - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2023/07/31 23:38:49 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #42230: [SPARK-44602][SQL][CONNECT][PS] Make `WidenSetOperationTypes` retain the `Plan_ID_TAG` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/07/31 23:52:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #42251: [SPARK-44617] Support comparison between lists of Rows - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/07/31 23:59:54 UTC, 0 replies.