- [GitHub] [spark] github-actions[bot] closed pull request #37251: [SPARK-39838][SQL] Preserve explicit empty column metadata - posted by GitBox <gi...@apache.org> on 2022/11/01 00:29:07 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #38263: [SPARK-40692][SQL] Support data masking built-in function 'mask_hash' - posted by GitBox <gi...@apache.org> on 2022/11/01 00:56:38 UTC, 5 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38449: [SPARK-40798][SQL][FOLLOW-UP] Fix ansi mode in v2 ALTER TABLE PARTITION - posted by GitBox <gi...@apache.org> on 2022/11/01 01:19:10 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38422: [SPARK-40948][SQL] Introduce new error class: PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/01 01:31:12 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38433: [SPARK-40943][SQL] Make the MSCK keyword optional in MSCK REPAIR TABLE commands - posted by GitBox <gi...@apache.org> on 2022/11/01 01:33:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38433: [SPARK-40943][SQL] Make the MSCK keyword optional in MSCK REPAIR TABLE commands - posted by GitBox <gi...@apache.org> on 2022/11/01 01:35:27 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38442: [SPARK-40953][CONNECT][PYTHON][TESTS][FOLLOW-UP] Add a test case for head(n) - posted by GitBox <gi...@apache.org> on 2022/11/01 01:42:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38223: [SPARK-40770][PYTHON] Improved error messages for applyInPandas for schema mismatch - posted by GitBox <gi...@apache.org> on 2022/11/01 01:44:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38395: [SPARK-40917][SQL] Add a dedicated logical plan for `Summary` - posted by GitBox <gi...@apache.org> on 2022/11/01 01:45:25 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38223: [SPARK-40770][PYTHON] Improved error messages for applyInPandas for schema mismatch - posted by GitBox <gi...@apache.org> on 2022/11/01 01:46:38 UTC, 3 replies.
- [GitHub] [spark] mridulm commented on pull request #38377: [SPARK-40901][CORE] Unable to store Spark Driver logs with Absolute Hadoop based URI FS Path - posted by GitBox <gi...@apache.org> on 2022/11/01 01:49:18 UTC, 5 replies.
- [GitHub] [spark] itholic opened a new pull request, #38455: [SPARK-40827][PS][TEST] Re-enable the DataFrame.corrwith test after fixing in future pandas. - posted by GitBox <gi...@apache.org> on 2022/11/01 01:56:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38420: [SPARK-40947][SPARK-40966][PS][INFRA][TEST] Upgrade pandas to 1.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/01 01:59:05 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38371: [SPARK-40968] Fix a few wrong/misleading comments in DAGSchedulerSuite - posted by GitBox <gi...@apache.org> on 2022/11/01 01:59:55 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38439: [SPARK-40890][SQL][TESTS] Check error classes in DataSourceV2SQLSuite - posted by GitBox <gi...@apache.org> on 2022/11/01 01:59:55 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38395: [SPARK-40917][SQL] Add a dedicated logical plan for `Summary` - posted by GitBox <gi...@apache.org> on 2022/11/01 02:12:56 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38347: [SPARK-40883][CONNECT] Support Range in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/01 02:16:08 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38347: [SPARK-40883][CONNECT] Support Range in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/01 02:16:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38451: [SPARK-40976][BUILD] Upgrade sbt to 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/01 02:17:19 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38438: [SPARK-40748][SQL] Migrate type check failures of conditions onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 02:18:31 UTC, 2 replies.
- [GitHub] [spark] swamirishi commented on pull request #38377: [SPARK-40901][CORE] Unable to store Spark Driver logs with Absolute Hadoop based URI FS Path - posted by GitBox <gi...@apache.org> on 2022/11/01 02:19:01 UTC, 5 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38393: [SPARK-40915][CONNECT] Improve `on` in Join in Python client - posted by GitBox <gi...@apache.org> on 2022/11/01 02:26:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38393: [SPARK-40915][CONNECT] Improve `on` in Join in Python client - posted by GitBox <gi...@apache.org> on 2022/11/01 02:27:52 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38415: [SPARK-40938][CONNECT] Support Alias for every type of Relation - posted by GitBox <gi...@apache.org> on 2022/11/01 02:30:52 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38455: [SPARK-40827][PS][TESTS] Re-enable the DataFrame.corrwith test after fixing in future pandas. - posted by GitBox <gi...@apache.org> on 2022/11/01 02:34:50 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38449: [SPARK-40798][SQL][FOLLOW-UP] Fix ansi mode in v2 ALTER TABLE PARTITION - posted by GitBox <gi...@apache.org> on 2022/11/01 02:35:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38456: Spark 40950 - posted by GitBox <gi...@apache.org> on 2022/11/01 02:54:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #38456: Spark 40950 - posted by GitBox <gi...@apache.org> on 2022/11/01 02:54:27 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38457: [SPARK-40371][SQL] Migrate type check failures of NthValue and NTile onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 02:55:00 UTC, 0 replies.
- [GitHub] [spark] JiexingLi commented on a diff in pull request #38371: [SPARK-40968] Fix a few wrong/misleading comments in DAGSchedulerSuite - posted by GitBox <gi...@apache.org> on 2022/11/01 02:57:59 UTC, 1 replies.
- [GitHub] [spark-docker] Yikun closed pull request #22: [SPARK-40969] Replace spark TGZ url with apache archive url - posted by GitBox <gi...@apache.org> on 2022/11/01 03:14:15 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on pull request #22: [SPARK-40969] Replace spark TGZ url with apache archive url - posted by GitBox <gi...@apache.org> on 2022/11/01 03:14:42 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on pull request #21: [SPARK-40569] Add smoke test in standalone cluster for spark-docker - posted by GitBox <gi...@apache.org> on 2022/11/01 03:15:15 UTC, 1 replies.
- [GitHub] [spark-docker] dcoliversun commented on pull request #22: [SPARK-40969] Replace spark TGZ url with apache archive url - posted by GitBox <gi...@apache.org> on 2022/11/01 03:24:24 UTC, 0 replies.
- [GitHub] [spark] clairezhuang closed pull request #38446: [Spark-40974]When the value of quote or escape exists in the content of csv file, the character in the csv file will be misidentified - posted by GitBox <gi...@apache.org> on 2022/11/01 03:35:09 UTC, 0 replies.
- [GitHub] [spark] clairezhuang commented on pull request #38446: [Spark-40974]When the value of quote or escape exists in the content of csv file, the character in the csv file will be misidentified - posted by GitBox <gi...@apache.org> on 2022/11/01 03:43:10 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #38458: [SPARK-40983][DOC] Remove Hadoop requirements for zstd mentioned in Parquet compression codec - posted by GitBox <gi...@apache.org> on 2022/11/01 03:47:24 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38459: [SPARK-40980][CONNECT] Support session.sql in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/01 04:21:17 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38460: [SPARK-40981][CONNECT][PYTHON] Support session.range in Python client - posted by GitBox <gi...@apache.org> on 2022/11/01 04:22:18 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #38344: [SPARK-40777][SQL][PROTOBUF] Protobuf import support and move error-classes. - posted by GitBox <gi...@apache.org> on 2022/11/01 04:22:33 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38430: [SPARK-40957] Add in memory cache in HDFSMetadataLog - posted by GitBox <gi...@apache.org> on 2022/11/01 04:33:25 UTC, 5 replies.
- [GitHub] [spark] ben-zhang commented on a diff in pull request #38433: [SPARK-40943][SQL] Make the MSCK keyword optional in MSCK REPAIR TABLE commands - posted by GitBox <gi...@apache.org> on 2022/11/01 04:39:30 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38415: [SPARK-40938][CONNECT] Support Alias for every type of Relation - posted by GitBox <gi...@apache.org> on 2022/11/01 04:41:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38415: [SPARK-40938][CONNECT] Support Alias for every type of Relation - posted by GitBox <gi...@apache.org> on 2022/11/01 04:41:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38423: [SPARK-40949][CONNECT][PYTHON] Implement `DataFrame.sortWithinPartitions` - posted by GitBox <gi...@apache.org> on 2022/11/01 04:48:01 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38423: [SPARK-40949][CONNECT][PYTHON] Implement `DataFrame.sortWithinPartitions` - posted by GitBox <gi...@apache.org> on 2022/11/01 04:49:02 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38460: [SPARK-40981][CONNECT][PYTHON] Support session.range in Python client - posted by GitBox <gi...@apache.org> on 2022/11/01 04:54:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38395: [SPARK-40917][SQL] Add a dedicated logical plan for `Summary` - posted by GitBox <gi...@apache.org> on 2022/11/01 04:59:28 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38460: [SPARK-40981][CONNECT][PYTHON] Support session.range in Python client - posted by GitBox <gi...@apache.org> on 2022/11/01 05:27:50 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38460: [SPARK-40981][CONNECT][PYTHON] Support session.range in Python client - posted by GitBox <gi...@apache.org> on 2022/11/01 05:30:46 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38347: [SPARK-40883][CONNECT] Support Range in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/01 05:31:56 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38423: [SPARK-40949][CONNECT][PYTHON] Implement `DataFrame.sortWithinPartitions` - posted by GitBox <gi...@apache.org> on 2022/11/01 05:42:33 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38347: [SPARK-40883][CONNECT] Support Range in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/01 05:46:17 UTC, 1 replies.
- [GitHub] [spark] SandishKumarHN commented on a diff in pull request #38344: [SPARK-40777][SQL][PROTOBUF] Protobuf import support and move error-classes. - posted by GitBox <gi...@apache.org> on 2022/11/01 05:52:23 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on pull request #38459: [SPARK-40980][CONNECT][DSL] Support session.sql in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/01 06:01:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38263: [SPARK-40692][SQL] Support data masking built-in function 'mask_hash' - posted by GitBox <gi...@apache.org> on 2022/11/01 06:13:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38448: [SPARK-40975][SQL] Rename the error class `_LEGACY_ERROR_TEMP_0021` to `UNSUPPORTED_TYPED_LITERAL` - posted by GitBox <gi...@apache.org> on 2022/11/01 06:19:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38455: [SPARK-40827][PS][TESTS] Re-enable the DataFrame.corrwith test after fixing in future pandas. - posted by GitBox <gi...@apache.org> on 2022/11/01 06:22:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38455: [SPARK-40827][PS][TESTS] Re-enable the DataFrame.corrwith test after fixing in future pandas. - posted by GitBox <gi...@apache.org> on 2022/11/01 06:23:39 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38448: [SPARK-40975][SQL] Rename the error class `_LEGACY_ERROR_TEMP_0021` to `UNSUPPORTED_TYPED_LITERAL` - posted by GitBox <gi...@apache.org> on 2022/11/01 06:29:36 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38448: [SPARK-40975][SQL] Rename the error class `_LEGACY_ERROR_TEMP_0021` to `UNSUPPORTED_TYPED_LITERAL` - posted by GitBox <gi...@apache.org> on 2022/11/01 06:29:51 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38448: [SPARK-40975][SQL] Rename the error class `_LEGACY_ERROR_TEMP_0021` to `UNSUPPORTED_TYPED_LITERAL` - posted by GitBox <gi...@apache.org> on 2022/11/01 06:38:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38448: [SPARK-40975][SQL] Rename the error class `_LEGACY_ERROR_TEMP_0021` to `UNSUPPORTED_TYPED_LITERAL` - posted by GitBox <gi...@apache.org> on 2022/11/01 06:38:40 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38170: [WIP][SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2201-2225 - posted by GitBox <gi...@apache.org> on 2022/11/01 06:45:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38447: [SPARK-40973][SQL] Rename `_LEGACY_ERROR_TEMP_0055` to `UNCLOSED_BRACKETED_COMMENT` - posted by GitBox <gi...@apache.org> on 2022/11/01 06:52:38 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38438: [SPARK-40748][SQL] Migrate type check failures of conditions onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 07:10:54 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #38461: [SPARK-34079][SQL][FOLLOWUP] Improve the readability and simplify the code for MergeScalarSubqueries - posted by GitBox <gi...@apache.org> on 2022/11/01 07:14:20 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38462: [SPARK-40533] [CONNECT] [PYTHON] Support most built-in literal types for Python in Spark Connect - posted by GitBox <gi...@apache.org> on 2022/11/01 07:20:38 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #35594: [SPARK-38270][SQL] Spark SQL CLI's AM should keep same exit code with client side - posted by GitBox <gi...@apache.org> on 2022/11/01 07:25:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #35594: [SPARK-38270][SQL] Spark SQL CLI's AM should keep same exit code with client side - posted by GitBox <gi...@apache.org> on 2022/11/01 07:25:56 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38463: [SPARK-40374][SQL] Migrate type check failures of type creators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 07:38:05 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38170: [WIP][SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2201-2225 - posted by GitBox <gi...@apache.org> on 2022/11/01 07:39:22 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38457: [SPARK-40371][SQL] Migrate type check failures of NthValue and NTile onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 08:28:28 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38457: [SPARK-40371][SQL] Migrate type check failures of NthValue and NTile onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 08:29:07 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38457: [SPARK-40371][SQL] Migrate type check failures of NthValue and NTile onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 08:29:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38463: [SPARK-40374][SQL] Migrate type check failures of type creators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 08:40:23 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38439: [SPARK-40890][SQL][TESTS] Check error classes in DataSourceV2SQLSuite - posted by GitBox <gi...@apache.org> on 2022/11/01 08:42:10 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38439: [SPARK-40890][SQL][TESTS] Check error classes in DataSourceV2SQLSuite - posted by GitBox <gi...@apache.org> on 2022/11/01 08:42:42 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38460: [SPARK-40981][CONNECT][PYTHON] Support session.range in Python client - posted by GitBox <gi...@apache.org> on 2022/11/01 08:52:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38460: [SPARK-40981][CONNECT][PYTHON] Support session.range in Python client - posted by GitBox <gi...@apache.org> on 2022/11/01 08:53:24 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #38464: [SPARK-32628][SQL] Use bloom filter to improve dynamic partition pruning - posted by GitBox <gi...@apache.org> on 2022/11/01 08:56:50 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #38464: [SPARK-32628][SQL] Use bloom filter to improve dynamic partition pruning - posted by GitBox <gi...@apache.org> on 2022/11/01 09:01:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38465: [SPARK-40985][BUILD] Upgrade RoaringBitmap to 0.9.35 - posted by GitBox <gi...@apache.org> on 2022/11/01 09:19:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38464: [SPARK-32628][SQL] Use bloom filter to improve dynamic partition pruning - posted by GitBox <gi...@apache.org> on 2022/11/01 09:53:37 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #38466: [WIP][SPARK-40986][SQL] Using distinct to reduce the data size for bloom filter - posted by GitBox <gi...@apache.org> on 2022/11/01 10:40:03 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38463: [SPARK-40374][SQL] Migrate type check failures of type creators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 11:07:13 UTC, 3 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #38467: [SPARK-40987][CORE] Avoid creating a directory when deleting a block, causing DAGScheduler to not work - posted by GitBox <gi...@apache.org> on 2022/11/01 11:24:44 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38468: [WIP][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/01 11:51:56 UTC, 0 replies.
- [GitHub] [spark] eejbyfeldt commented on a diff in pull request #38428: [SPARK-40912][CORE][WIP] Overhead of Exceptions in DeserializationStream - posted by GitBox <gi...@apache.org> on 2022/11/01 11:55:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38459: [SPARK-40980][CONNECT][TEST] Support session.sql in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/01 12:10:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38459: [SPARK-40980][CONNECT][TEST] Support session.sql in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/01 12:11:24 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by GitBox <gi...@apache.org> on 2022/11/01 12:13:53 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38454: [SPARK-40978][SQL] Migrate `failAnalysis()` w/o a context onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 12:17:05 UTC, 1 replies.
- [GitHub] [spark] wangyum closed pull request #38458: [SPARK-40983][DOC] Remove Hadoop requirements for zstd mentioned in Parquet compression codec - posted by GitBox <gi...@apache.org> on 2022/11/01 12:33:19 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #38458: [SPARK-40983][DOC] Remove Hadoop requirements for zstd mentioned in Parquet compression codec - posted by GitBox <gi...@apache.org> on 2022/11/01 12:34:35 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #38461: [SPARK-34079][SQL][FOLLOWUP] Improve the readability and simplify the code for MergeScalarSubqueries - posted by GitBox <gi...@apache.org> on 2022/11/01 12:36:05 UTC, 2 replies.
- [GitHub] [spark] gaoyajun02 commented on pull request #38333: [SPARK-40872] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/01 12:40:52 UTC, 5 replies.
- [GitHub] [spark] MaxGekk closed pull request #38454: [SPARK-40978][SQL] Migrate `failAnalysis()` w/o a context onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 13:24:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38429: [SPARK-40800][SQL][FOLLOW-UP] Add a config to control whether to always inline one-row relation subquery - posted by GitBox <gi...@apache.org> on 2022/11/01 14:23:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38429: [SPARK-40800][SQL][FOLLOW-UP] Add a config to control whether to always inline one-row relation subquery - posted by GitBox <gi...@apache.org> on 2022/11/01 14:24:13 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #38461: [SPARK-34079][SQL][FOLLOWUP] Improve the readability and simplify the code for MergeScalarSubqueries - posted by GitBox <gi...@apache.org> on 2022/11/01 15:12:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38467: [SPARK-40987][CORE] Avoid creating a directory when deleting a block, causing DAGScheduler to not work - posted by GitBox <gi...@apache.org> on 2022/11/01 15:19:03 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38462: [SPARK-40533] [CONNECT] [PYTHON] Support most built-in literal types for Python in Spark Connect - posted by GitBox <gi...@apache.org> on 2022/11/01 15:22:56 UTC, 1 replies.
- [GitHub] [spark] thejdeep commented on a diff in pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side metrics - posted by GitBox <gi...@apache.org> on 2022/11/01 15:27:57 UTC, 4 replies.
- [GitHub] [spark] LuciferYang closed pull request #37646: [DON'T MERGE] investigate flaky test in ImageFileFormatSuite - posted by GitBox <gi...@apache.org> on 2022/11/01 15:29:24 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #37646: [DON'T MERGE] investigate flaky test in ImageFileFormatSuite - posted by GitBox <gi...@apache.org> on 2022/11/01 15:29:27 UTC, 0 replies.
- [GitHub] [spark] thejdeep commented on pull request #35969: [SPARK-38651][SQL] Add configuration to support writing out empty schemas in supported filebased datasources - posted by GitBox <gi...@apache.org> on 2022/11/01 15:46:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38469: [MINOR][BUILD] Correct the `files` contend in `checkstyle-suppressions.xml` - posted by GitBox <gi...@apache.org> on 2022/11/01 16:01:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38461: [SPARK-34079][SQL][FOLLOWUP] Improve the readability and simplify the code for MergeScalarSubqueries - posted by GitBox <gi...@apache.org> on 2022/11/01 16:02:21 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38469: [MINOR][BUILD] Correct the `files` contend in `checkstyle-suppressions.xml` - posted by GitBox <gi...@apache.org> on 2022/11/01 16:02:48 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #38469: [MINOR][BUILD] Correct the `files` contend in `checkstyle-suppressions.xml` - posted by GitBox <gi...@apache.org> on 2022/11/01 16:03:37 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38470: [CONNECT] [DOC] Defining Spark Connect Client Connection String - posted by GitBox <gi...@apache.org> on 2022/11/01 16:07:22 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #32456: [SPARK-35328][Core] Use 'SPARK_DRIVER_LOG_URL_' as env prefix for getting driver log urls by default - posted by GitBox <gi...@apache.org> on 2022/11/01 16:10:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38400: [SPARK-40921][SQL] Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO - posted by GitBox <gi...@apache.org> on 2022/11/01 16:21:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38400: [SPARK-40921][SQL] Add WHEN NOT MATCHED BY SOURCE clause to MERGE INTO - posted by GitBox <gi...@apache.org> on 2022/11/01 16:21:40 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38467: [SPARK-40987][CORE] Avoid creating a directory when deleting a block, causing DAGScheduler to not work - posted by GitBox <gi...@apache.org> on 2022/11/01 16:22:59 UTC, 0 replies.
- [GitHub] [spark] leewyang commented on a diff in pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2022/11/01 16:51:55 UTC, 16 replies.
- [GitHub] [spark] MaxGekk closed pull request #38175: [SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2251-2275 - posted by GitBox <gi...@apache.org> on 2022/11/01 16:59:01 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38175: [SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2251-2275 - posted by GitBox <gi...@apache.org> on 2022/11/01 16:59:32 UTC, 0 replies.
- [GitHub] [spark] jerrypeng commented on a diff in pull request #38430: [SPARK-40957] Add in memory cache in HDFSMetadataLog - posted by GitBox <gi...@apache.org> on 2022/11/01 17:18:23 UTC, 4 replies.
- [GitHub] [spark] carlfu-db commented on pull request #38404: [SPARK-40956] SQL Equivalent for Dataframe overwrite command - posted by GitBox <gi...@apache.org> on 2022/11/01 18:47:56 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38471: [SC-114545][SPARK-40883][CONNECT] Range.step is required and Python client should have a default value=1 - posted by GitBox <gi...@apache.org> on 2022/11/01 19:34:25 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38471: [SC-114545][SPARK-40883][CONNECT] Range.step is required and Python client should have a default value=1 - posted by GitBox <gi...@apache.org> on 2022/11/01 19:34:43 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #36698: [SPARK-39316][SQL] Merge PromotePrecision and CheckOverflow into decimal binary arithmetic - posted by GitBox <gi...@apache.org> on 2022/11/01 19:40:05 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38472: [SPARK-40989][CONNECT][PYTHON][TESTS] Improve `session.sql` testing coverage in Python client - posted by GitBox <gi...@apache.org> on 2022/11/01 19:47:15 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38472: [SPARK-40989][CONNECT][PYTHON][TESTS] Improve `session.sql` testing coverage in Python client - posted by GitBox <gi...@apache.org> on 2022/11/01 19:47:22 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38470: [CONNECT] [DOC] Defining Spark Connect Client Connection String - posted by GitBox <gi...@apache.org> on 2022/11/01 19:54:45 UTC, 0 replies.
- [GitHub] [spark] anchovYu commented on pull request #38169: [SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2176-2220 - posted by GitBox <gi...@apache.org> on 2022/11/01 20:04:55 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38470: [CONNECT] [DOC] Defining Spark Connect Client Connection String - posted by GitBox <gi...@apache.org> on 2022/11/01 20:15:07 UTC, 3 replies.
- [GitHub] [spark] kristopherkane commented on pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2022/11/01 20:45:52 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38418: [SPARK-40944][SQL] Relax ordering constraint for CREATE TABLE column options - posted by GitBox <gi...@apache.org> on 2022/11/01 20:54:38 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #38418: [SPARK-40944][SQL] Relax ordering constraint for CREATE TABLE column options - posted by GitBox <gi...@apache.org> on 2022/11/01 21:01:44 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38409: [SPARK-40930][CONNECT] Support Collect() in Python client - posted by GitBox <gi...@apache.org> on 2022/11/01 21:46:09 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38463: [SPARK-40374][SQL] Migrate type check failures of type creators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/01 22:11:37 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38462: [SPARK-40533] [CONNECT] [PYTHON] Support most built-in literal types for Python in Spark Connect - posted by GitBox <gi...@apache.org> on 2022/11/01 22:11:41 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #38473: [SPARK-40990][PYTHON] DataFrame creation from 2d NumPy array with arbitrary columns - posted by GitBox <gi...@apache.org> on 2022/11/01 22:14:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #38474: [SPARK-XXX][PYTHON] Update cloudpickle to v2.2.0 - posted by GitBox <gi...@apache.org> on 2022/11/01 23:00:14 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38469: [MINOR][BUILD] Correct the `files` contend in `checkstyle-suppressions.xml` - posted by GitBox <gi...@apache.org> on 2022/11/01 23:10:44 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38475: [SPARK-40992][CONNECT] Support toDF(columnNames) in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/01 23:51:09 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38475: [SPARK-40992][CONNECT] Support toDF(columnNames) in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/01 23:58:31 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38453: [SPARK-40977][CONNECT][PYTHON] Complete Support for Union in Python client - posted by GitBox <gi...@apache.org> on 2022/11/02 00:08:13 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38452: [SPARK-40802][SQL] Resolve JDBCRelation's schema with preparing the statement - posted by GitBox <gi...@apache.org> on 2022/11/02 00:08:16 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37259: spark-submit: throw an error when duplicate argument is provided - posted by GitBox <gi...@apache.org> on 2022/11/02 00:25:21 UTC, 0 replies.
- [GitHub] [spark] linhongliu-db opened a new pull request, #38476: Revert "[SPARK-40976][BUILD] Upgrade sbt to 1.7.3" - posted by GitBox <gi...@apache.org> on 2022/11/02 00:43:57 UTC, 0 replies.
- [GitHub] [spark] linhongliu-db commented on pull request #38476: Revert "[SPARK-40976][BUILD] Upgrade sbt to 1.7.3" - posted by GitBox <gi...@apache.org> on 2022/11/02 00:45:10 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38409: [SPARK-40930][CONNECT] Support Collect() in Python client - posted by GitBox <gi...@apache.org> on 2022/11/02 01:01:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38409: [SPARK-40930][CONNECT] Support Collect() in Python client - posted by GitBox <gi...@apache.org> on 2022/11/02 01:01:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38476: Revert "[SPARK-40976][BUILD] Upgrade sbt to 1.7.3" - posted by GitBox <gi...@apache.org> on 2022/11/02 01:02:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38476: Revert "[SPARK-40976][BUILD] Upgrade sbt to 1.7.3" - posted by GitBox <gi...@apache.org> on 2022/11/02 01:03:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38473: [SPARK-40990][PYTHON] DataFrame creation from 2d NumPy array with arbitrary columns - posted by GitBox <gi...@apache.org> on 2022/11/02 01:06:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38473: [SPARK-40990][PYTHON] DataFrame creation from 2d NumPy array with arbitrary columns - posted by GitBox <gi...@apache.org> on 2022/11/02 01:06:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38470: [CONNECT] [DOC] Defining Spark Connect Client Connection String - posted by GitBox <gi...@apache.org> on 2022/11/02 01:11:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38470: [CONNECT] [DOC] Defining Spark Connect Client Connection String - posted by GitBox <gi...@apache.org> on 2022/11/02 01:13:45 UTC, 2 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38465: [SPARK-40985][BUILD] Upgrade RoaringBitmap to 0.9.35 - posted by GitBox <gi...@apache.org> on 2022/11/02 01:18:09 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38474: [SPARK-40991][PYTHON] Update `cloudpickle` to v2.2.0 - posted by GitBox <gi...@apache.org> on 2022/11/02 01:19:12 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #36698: [SPARK-39316][SQL] Merge PromotePrecision and CheckOverflow into decimal binary arithmetic - posted by GitBox <gi...@apache.org> on 2022/11/02 01:32:31 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #38461: [SPARK-34079][SQL][FOLLOWUP] Improve the readability and simplify the code for MergeScalarSubqueries - posted by GitBox <gi...@apache.org> on 2022/11/02 02:19:55 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38465: [SPARK-40985][BUILD] Upgrade RoaringBitmap to 0.9.35 - posted by GitBox <gi...@apache.org> on 2022/11/02 02:24:31 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38477: [SPARK-40993][CONNECT]PYTHON[DOCS] Migrate markdown style README to PySpark Development Documentation - posted by GitBox <gi...@apache.org> on 2022/11/02 02:29:57 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38478: [MINOR][SQL] Wrap `given` in backticks to fix compilation warning - posted by GitBox <gi...@apache.org> on 2022/11/02 02:45:02 UTC, 0 replies.
- [GitHub] [spark] lyy-pineapple commented on pull request #38171: [SPARK-9213] [SQL] Improve regular expression performance (via joni) - posted by GitBox <gi...@apache.org> on 2022/11/02 03:11:37 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38474: [SPARK-40991][PYTHON] Update `cloudpickle` to v2.2.0 - posted by GitBox <gi...@apache.org> on 2022/11/02 03:27:02 UTC, 1 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2022/11/02 03:28:23 UTC, 42 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38476: Revert "[SPARK-40976][BUILD] Upgrade sbt to 1.7.3" - posted by GitBox <gi...@apache.org> on 2022/11/02 04:14:08 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38477: [SPARK-40993][CONNECT]PYTHON[DOCS] Migrate markdown style README to PySpark Development Documentation - posted by GitBox <gi...@apache.org> on 2022/11/02 04:26:27 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38404: [SPARK-40956] SQL Equivalent for Dataframe overwrite command - posted by GitBox <gi...@apache.org> on 2022/11/02 04:46:44 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38438: [SPARK-40748][SQL] Migrate type check failures of conditions onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/02 05:11:57 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38438: [SPARK-40748][SQL] Migrate type check failures of conditions onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/02 05:12:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38478: [MINOR][SQL] Wrap `given` in backticks to fix compilation warning - posted by GitBox <gi...@apache.org> on 2022/11/02 05:16:29 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38478: [MINOR][SQL] Wrap `given` in backticks to fix compilation warning - posted by GitBox <gi...@apache.org> on 2022/11/02 05:19:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38476: Revert "[SPARK-40976][BUILD] Upgrade sbt to 1.7.3" - posted by GitBox <gi...@apache.org> on 2022/11/02 05:21:44 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38171: [SPARK-9213] [SQL] Improve regular expression performance (via joni) - posted by GitBox <gi...@apache.org> on 2022/11/02 05:45:45 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38463: [SPARK-40374][SQL] Migrate type check failures of type creators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/02 05:56:46 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #38371: [SPARK-40968] Fix a few wrong/misleading comments in DAGSchedulerSuite - posted by GitBox <gi...@apache.org> on 2022/11/02 05:57:33 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38371: [SPARK-40968] Fix a few wrong/misleading comments in DAGSchedulerSuite - posted by GitBox <gi...@apache.org> on 2022/11/02 05:58:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38474: [SPARK-40991][PYTHON] Update `cloudpickle` to v2.2.0 - posted by GitBox <gi...@apache.org> on 2022/11/02 06:02:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38474: [SPARK-40991][PYTHON] Update `cloudpickle` to v2.2.0 - posted by GitBox <gi...@apache.org> on 2022/11/02 06:03:02 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38428: [SPARK-40912][CORE][WIP] Overhead of Exceptions in KryoDeserializationStream - posted by GitBox <gi...@apache.org> on 2022/11/02 06:06:03 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38471: [SPARK-40883][CONNECT][FOLLOW-UP] Range.step is required and Python client should have a default value=1 - posted by GitBox <gi...@apache.org> on 2022/11/02 06:10:44 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38471: [SPARK-40883][CONNECT][FOLLOW-UP] Range.step is required and Python client should have a default value=1 - posted by GitBox <gi...@apache.org> on 2022/11/02 06:11:07 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #38430: [SPARK-40957] Add in memory cache in HDFSMetadataLog - posted by GitBox <gi...@apache.org> on 2022/11/02 06:12:09 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #38428: [SPARK-40912][CORE][WIP] Overhead of Exceptions in KryoDeserializationStream - posted by GitBox <gi...@apache.org> on 2022/11/02 06:14:52 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38333: [SPARK-40872] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/02 06:16:24 UTC, 7 replies.
- [GitHub] [spark] mridulm commented on pull request #38467: [SPARK-40987][CORE] Avoid creating a directory when deleting a block, causing DAGScheduler to not work - posted by GitBox <gi...@apache.org> on 2022/11/02 06:19:51 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38475: [SPARK-40992][CONNECT] Support toDF(columnNames) in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/02 06:20:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38475: [SPARK-40992][CONNECT] Support toDF(columnNames) in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/02 06:24:50 UTC, 3 replies.
- [GitHub] [spark] mridulm commented on pull request #38064: [SPARK-40622][SQL][CORE]Result of a single task in collect() must fit in 2GB - posted by GitBox <gi...@apache.org> on 2022/11/02 06:24:51 UTC, 5 replies.
- [GitHub] [spark] mridulm commented on pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side metrics - posted by GitBox <gi...@apache.org> on 2022/11/02 06:27:22 UTC, 0 replies.
- [GitHub] [spark] jerrypeng commented on pull request #38430: [SPARK-40957] Add in memory cache in HDFSMetadataLog - posted by GitBox <gi...@apache.org> on 2022/11/02 06:30:12 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #38223: [SPARK-40770][PYTHON] Improved error messages for applyInPandas for schema mismatch - posted by GitBox <gi...@apache.org> on 2022/11/02 06:44:03 UTC, 9 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38475: [SPARK-40992][CONNECT] Support toDF(columnNames) in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/02 06:46:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38404: [SPARK-40956] SQL Equivalent for Dataframe overwrite command - posted by GitBox <gi...@apache.org> on 2022/11/02 06:47:54 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38477: [SPARK-40993][CONNECT][PYTHON][DOCS] Migrate markdown style README to PySpark Development Documentation - posted by GitBox <gi...@apache.org> on 2022/11/02 07:44:36 UTC, 0 replies.
- [GitHub] [spark] grundprinzip closed pull request #37075: [SparkConnect] Initial Protobuf Definitions - posted by GitBox <gi...@apache.org> on 2022/11/02 07:48:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38479: [SPARK-40697][SQL][FOLLOWUP] Read-side char padding should only be applied if necessary - posted by GitBox <gi...@apache.org> on 2022/11/02 08:27:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38479: [SPARK-40697][SQL][FOLLOWUP] Read-side char padding should only be applied if necessary - posted by GitBox <gi...@apache.org> on 2022/11/02 08:27:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2022/11/02 08:52:09 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38477: [SPARK-40993][CONNECT][PYTHON][DOCS] Migrate markdown style README to PySpark Development Documentation - posted by GitBox <gi...@apache.org> on 2022/11/02 08:59:57 UTC, 1 replies.
- [GitHub] [spark] c21 opened a new pull request, #38480: [SPARK-40994][DOC][SQL] Add code example in JDBC data source with partitionColumn - posted by GitBox <gi...@apache.org> on 2022/11/02 09:19:27 UTC, 0 replies.
- [GitHub] [spark] c21 commented on pull request #38480: [SPARK-40994][DOC][SQL] Add code example in JDBC data source with partitionColumn - posted by GitBox <gi...@apache.org> on 2022/11/02 09:22:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38481: [DONT'MERGE] Upgrade sbt-checkstyle-plugin and sbt - posted by GitBox <gi...@apache.org> on 2022/11/02 09:53:19 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38481: [DONT'MERGE] Upgrade sbt-checkstyle-plugin and sbt - posted by GitBox <gi...@apache.org> on 2022/11/02 09:53:55 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #37697: [SPARK-40248][SQL] Use larger number of bits to build Bloom filter - posted by GitBox <gi...@apache.org> on 2022/11/02 10:06:06 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #37697: [SPARK-40248][SQL] Use larger number of bits to build Bloom filter - posted by GitBox <gi...@apache.org> on 2022/11/02 10:06:29 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38463: [SPARK-40374][SQL] Migrate type check failures of type creators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/02 10:09:27 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38463: [SPARK-40374][SQL] Migrate type check failures of type creators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/02 11:51:03 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38463: [SPARK-40374][SQL] Migrate type check failures of type creators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/02 11:52:19 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38481: [SPARK-40996][BUILD] Upgrade `sbt-checkstyle-plugin` to 4.0.0 to resolve `dev/sbt-checkstyle` run failed with sbt 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/02 12:12:31 UTC, 5 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38481: [SPARK-40996][BUILD] Upgrade `sbt-checkstyle-plugin` to 4.0.0 to resolve `dev/sbt-checkstyle` run failed with sbt 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/02 12:14:44 UTC, 2 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #38483: [SPARK-40997][K8S] K8s resource name prefix should start w/ alphanumeric - posted by GitBox <gi...@apache.org> on 2022/11/02 13:17:11 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38477: [SPARK-40993][CONNECT][PYTHON][DOCS] Migrate markdown style README to PySpark Development Documentation - posted by GitBox <gi...@apache.org> on 2022/11/02 13:19:32 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38430: [SPARK-40957] Add in memory cache in HDFSMetadataLog - posted by GitBox <gi...@apache.org> on 2022/11/02 13:23:38 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38430: [SPARK-40957] Add in memory cache in HDFSMetadataLog - posted by GitBox <gi...@apache.org> on 2022/11/02 13:24:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38430: [SPARK-40957] Add in memory cache in HDFSMetadataLog - posted by GitBox <gi...@apache.org> on 2022/11/02 13:43:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #36540: [SPARK-38466][CORE] Use error classes in org.apache.spark.mapred - posted by GitBox <gi...@apache.org> on 2022/11/02 13:50:52 UTC, 0 replies.
- [GitHub] [spark] MaksGS09 commented on pull request #37206: [SPARK-39696][CORE] Ensure Concurrent r/w `TaskMetrics` not throw Exception - posted by GitBox <gi...@apache.org> on 2022/11/02 13:51:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38427: [SPARK-40950][CORE] Fix isRemoteAddressMaxedOut performance overhead on scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/02 14:06:27 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38430: [SPARK-40957] Add in memory cache in HDFSMetadataLog - posted by GitBox <gi...@apache.org> on 2022/11/02 14:42:57 UTC, 4 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38484: [WIP][SQL] Rename the error class `_LEGACY_ERROR_TEMP_0040` to `INVALID_IDENTIFIER` - posted by GitBox <gi...@apache.org> on 2022/11/02 14:44:44 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38427: [SPARK-40950][CORE] Fix isRemoteAddressMaxedOut performance overhead on scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/02 14:49:06 UTC, 3 replies.
- [GitHub] [spark] eejbyfeldt commented on pull request #38427: [SPARK-40950][CORE] Fix isRemoteAddressMaxedOut performance overhead on scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/02 15:16:37 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #38465: [SPARK-40985][BUILD] Upgrade RoaringBitmap to 0.9.35 - posted by GitBox <gi...@apache.org> on 2022/11/02 15:47:05 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38465: [SPARK-40985][BUILD] Upgrade RoaringBitmap to 0.9.35 - posted by GitBox <gi...@apache.org> on 2022/11/02 15:47:17 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38483: [SPARK-40997][K8S] K8s resource name prefix should start w/ alphanumeric - posted by GitBox <gi...@apache.org> on 2022/11/02 17:29:51 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38482: [WIP][SPARK-40749][SQL] Migrate type check failures of generators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/02 17:29:55 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38484: [SPARK-40998][SQL] Rename the error class `_LEGACY_ERROR_TEMP_0040` to `INVALID_IDENTIFIER` - posted by GitBox <gi...@apache.org> on 2022/11/02 18:23:22 UTC, 1 replies.
- [GitHub] [spark] clee704 commented on pull request #28450: [SPARK-31639] Revert SPARK-27528 Use Parquet logical type TIMESTAMP_MICROS by default - posted by GitBox <gi...@apache.org> on 2022/11/02 19:12:24 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38485: [SPARK-41001] [CONNECT] [PYTHON] Implementing Connection String for Python Client - posted by GitBox <gi...@apache.org> on 2022/11/02 19:27:52 UTC, 0 replies.
- [GitHub] [spark] ben-zhang commented on pull request #38433: [SPARK-40943][SQL] Make the MSCK keyword optional in REPAIR TABLE commands - posted by GitBox <gi...@apache.org> on 2022/11/02 19:55:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38433: [SPARK-40943][SQL] Make the MSCK keyword optional in REPAIR TABLE commands - posted by GitBox <gi...@apache.org> on 2022/11/02 19:56:39 UTC, 0 replies.
- [GitHub] [spark] kelvinjian-db opened a new pull request, #38486: [SPARK-41000][SQL] Make CommandResult extend Command trait - posted by GitBox <gi...@apache.org> on 2022/11/02 20:24:40 UTC, 0 replies.
- [GitHub] [spark] zhouyejoe commented on a diff in pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side metrics - posted by GitBox <gi...@apache.org> on 2022/11/02 21:43:23 UTC, 3 replies.
- [GitHub] [spark] otterc commented on a diff in pull request #38333: [SPARK-40872] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/02 22:05:05 UTC, 3 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side metrics - posted by GitBox <gi...@apache.org> on 2022/11/02 22:43:39 UTC, 5 replies.
- [GitHub] [spark] zhouyejoe commented on a diff in pull request #37638: [SPARK-33573][SHUFFLE][YARN] Shuffle server side metrics for Push-based shuffle - posted by GitBox <gi...@apache.org> on 2022/11/02 22:50:04 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38477: [SPARK-40993][CONNECT][PYTHON][DOCS] Migrate markdown style README to PySpark Development Documentation - posted by GitBox <gi...@apache.org> on 2022/11/02 23:14:49 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38475: [SPARK-40992][CONNECT] Support toDF(columnNames) in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/02 23:14:52 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38472: [SPARK-40989][CONNECT][PYTHON][TESTS] Improve `session.sql` testing coverage in Python client - posted by GitBox <gi...@apache.org> on 2022/11/02 23:14:55 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38470: [SPARK-40995] [CONNECT] [DOC] Defining Spark Connect Client Connection String - posted by GitBox <gi...@apache.org> on 2022/11/02 23:14:59 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38344: [SPARK-40777][SQL][PROTOBUF] Protobuf import support and move error-classes. - posted by GitBox <gi...@apache.org> on 2022/11/02 23:46:12 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37265: [SPARK-39850][YARN]Print applicationId once applied from yarn rm - posted by GitBox <gi...@apache.org> on 2022/11/03 00:24:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37259: spark-submit: throw an error when duplicate argument is provided - posted by GitBox <gi...@apache.org> on 2022/11/03 00:24:03 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37163: [SPARK-39750][SQL] Enable `spark.sql.cbo.enabled` by default - posted by GitBox <gi...@apache.org> on 2022/11/03 00:24:04 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37083: [SPARK-39678][SQL] Improve stats estimation for v2 tables - posted by GitBox <gi...@apache.org> on 2022/11/03 00:24:05 UTC, 0 replies.
- [GitHub] [spark] kelvinjian-db commented on pull request #38486: [SPARK-41000][SQL] Make CommandResult extend Command trait - posted by GitBox <gi...@apache.org> on 2022/11/03 00:24:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38470: [SPARK-40995] [CONNECT] [DOC] Defining Spark Connect Client Connection String - posted by GitBox <gi...@apache.org> on 2022/11/03 00:36:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38470: [SPARK-40995] [CONNECT] [DOC] Defining Spark Connect Client Connection String - posted by GitBox <gi...@apache.org> on 2022/11/03 00:37:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38477: [SPARK-40993][CONNECT][PYTHON][DOCS] Migrate markdown style README to PySpark Development Documentation - posted by GitBox <gi...@apache.org> on 2022/11/03 00:37:48 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 commented on a diff in pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2022/11/03 00:47:16 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #38468: [WIP][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/03 01:31:11 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 commented on pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2022/11/03 01:59:19 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #37556: [SPARK-39799][SQL] DataSourceV2: View catalog interface - posted by GitBox <gi...@apache.org> on 2022/11/03 02:06:03 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #28450: [SPARK-31639] Revert SPARK-27528 Use Parquet logical type TIMESTAMP_MICROS by default - posted by GitBox <gi...@apache.org> on 2022/11/03 02:07:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38472: [SPARK-40989][CONNECT][PYTHON][TESTS] Improve `session.sql` testing coverage in Python client - posted by GitBox <gi...@apache.org> on 2022/11/03 02:08:13 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38487: [SPARK-40995][CONNECT][DOC][FOLLOW-UP] Fix the type in the doc name - posted by GitBox <gi...@apache.org> on 2022/11/03 02:08:14 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38487: [SPARK-40995][CONNECT][DOC][FOLLOW-UP] Fix the type in the doc name - posted by GitBox <gi...@apache.org> on 2022/11/03 02:08:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38472: [SPARK-40989][CONNECT][PYTHON][TESTS] Improve `session.sql` testing coverage in Python client - posted by GitBox <gi...@apache.org> on 2022/11/03 02:08:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38487: [SPARK-40995][CONNECT][DOC][FOLLOW-UP] Fix the type in the doc name - posted by GitBox <gi...@apache.org> on 2022/11/03 02:09:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38487: [SPARK-40995][CONNECT][DOC][FOLLOW-UP] Fix the type in the doc name - posted by GitBox <gi...@apache.org> on 2022/11/03 02:09:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38468: [WIP][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/03 02:13:20 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38475: [SPARK-40992][CONNECT] Support toDF(columnNames) in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/03 02:36:27 UTC, 3 replies.
- [GitHub] [spark] pan3793 commented on pull request #38483: [SPARK-40997][K8S] K8s resource name prefix should start w/ alphanumeric - posted by GitBox <gi...@apache.org> on 2022/11/03 02:38:29 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38465: [SPARK-40985][BUILD] Upgrade RoaringBitmap to 0.9.35 - posted by GitBox <gi...@apache.org> on 2022/11/03 03:01:51 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38485: [SPARK-41001] [CONNECT] [PYTHON] Implementing Connection String for Python Client - posted by GitBox <gi...@apache.org> on 2022/11/03 03:14:27 UTC, 1 replies.
- [GitHub] [spark] jzhuge commented on pull request #37556: [SPARK-39799][SQL] DataSourceV2: View catalog interface - posted by GitBox <gi...@apache.org> on 2022/11/03 03:19:30 UTC, 3 replies.
- [GitHub] [spark] gaoyajun02 commented on a diff in pull request #38333: [SPARK-40872] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/03 03:30:48 UTC, 10 replies.
- [GitHub] [spark] beliefer commented on pull request #38466: [WIP][SPARK-40986][SQL] Add aggregate to reduce the data size for bloom filter - posted by GitBox <gi...@apache.org> on 2022/11/03 04:28:06 UTC, 0 replies.
- [GitHub] [spark] beliefer closed pull request #38466: [WIP][SPARK-40986][SQL] Add aggregate to reduce the data size for bloom filter - posted by GitBox <gi...@apache.org> on 2022/11/03 04:28:06 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38488: [SPARK-41002][CONNECT][PYTHON] Compatible `take` and `head` API in Python client - posted by GitBox <gi...@apache.org> on 2022/11/03 04:28:53 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38447: [SPARK-40973][SQL] Rename `_LEGACY_ERROR_TEMP_0055` to `UNCLOSED_BRACKETED_COMMENT` - posted by GitBox <gi...@apache.org> on 2022/11/03 04:29:49 UTC, 5 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38462: [SPARK-40533] [CONNECT] [PYTHON] Support most built-in literal types for Python in Spark Connect - posted by GitBox <gi...@apache.org> on 2022/11/03 04:42:45 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38485: [SPARK-41001] [CONNECT] [PYTHON] Implementing Connection String for Python Client - posted by GitBox <gi...@apache.org> on 2022/11/03 04:51:31 UTC, 8 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38453: [SPARK-40977][CONNECT][PYTHON] Complete Support for Union in Python client - posted by GitBox <gi...@apache.org> on 2022/11/03 04:53:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38453: [SPARK-40977][CONNECT][PYTHON] Complete Support for Union in Python client - posted by GitBox <gi...@apache.org> on 2022/11/03 04:53:32 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #38489: [SPARK-41003][SQL] BHJ LeftAnti does not update numOutputRows when codegen is disabled - posted by GitBox <gi...@apache.org> on 2022/11/03 04:54:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38453: [SPARK-40977][CONNECT][PYTHON] Complete Support for Union in Python client - posted by GitBox <gi...@apache.org> on 2022/11/03 04:54:27 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #38489: [SPARK-41003][SQL] BHJ LeftAnti does not update numOutputRows when codegen is disabled - posted by GitBox <gi...@apache.org> on 2022/11/03 04:58:17 UTC, 3 replies.
- [GitHub] [spark] zzzzming95 commented on pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by GitBox <gi...@apache.org> on 2022/11/03 05:18:26 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 commented on a diff in pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by GitBox <gi...@apache.org> on 2022/11/03 05:28:18 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38484: [SPARK-40998][SQL] Rename the error class `_LEGACY_ERROR_TEMP_0040` to `INVALID_IDENTIFIER` - posted by GitBox <gi...@apache.org> on 2022/11/03 05:44:54 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38462: [SPARK-40533] [CONNECT] [PYTHON] Support most built-in literal types for Python in Spark Connect - posted by GitBox <gi...@apache.org> on 2022/11/03 06:02:30 UTC, 1 replies.
- [GitHub] [spark] Yikun commented on pull request #38483: [SPARK-40997][K8S] K8s resource name prefix should start w/ alphanumeric - posted by GitBox <gi...@apache.org> on 2022/11/03 06:06:08 UTC, 0 replies.
- [GitHub] [spark] sybernatus commented on pull request #37259: spark-submit: throw an error when duplicate argument is provided - posted by GitBox <gi...@apache.org> on 2022/11/03 06:28:30 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38490: [WIP][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1070` to `LOCATION_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/03 06:43:50 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #38331: [SPARK-40869][K8S] Resource name prefix should not start with a - - posted by GitBox <gi...@apache.org> on 2022/11/03 06:45:26 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38482: [SPARK-40749][SQL] Migrate type check failures of generators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/03 06:59:25 UTC, 0 replies.
- [GitHub] [spark] dengziming opened a new pull request, #38491: [MINOR][CONNECT]: Remove unused import in commands.proto - posted by GitBox <gi...@apache.org> on 2022/11/03 07:00:50 UTC, 0 replies.
- [GitHub] [spark] tobiasstadler commented on pull request #38331: [SPARK-40869][K8S] Resource name prefix should not start with a - - posted by GitBox <gi...@apache.org> on 2022/11/03 07:33:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38492: [SPARK-38270][BUILD][FOLLOW-UP] Exclude productElementName and productElementNames in Mima for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/03 08:02:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38492: [SPARK-38270][BUILD][FOLLOW-UP] Exclude productElementName and productElementNames in Mima for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/03 08:03:07 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38277: [SPARK-40815][SQL] Add `DelegateSymlinkTextInputFormat` to workaround `SymlinkTextInputSplit` bug - posted by GitBox <gi...@apache.org> on 2022/11/03 08:06:42 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38485: [SPARK-41001] [CONNECT] [PYTHON] Implementing Connection String for Python Client - posted by GitBox <gi...@apache.org> on 2022/11/03 08:10:32 UTC, 12 replies.
- [GitHub] [spark] dengziming opened a new pull request, #38493: [MINOR][CONNECT]: Fix file name in docs - posted by GitBox <gi...@apache.org> on 2022/11/03 08:41:14 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #34815: [SPARK-37555][SQL] spark-sql should pass last unclosed comment to backend - posted by GitBox <gi...@apache.org> on 2022/11/03 08:53:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38493: [MINOR][CONNECT] Fix file name in docs - posted by GitBox <gi...@apache.org> on 2022/11/03 08:55:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38493: [MINOR][CONNECT] Fix file name in docs - posted by GitBox <gi...@apache.org> on 2022/11/03 08:55:29 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38494: [SPARK-41004][CONNECT][TESTS] Check error classes in InterceptorRegistrySuite - posted by GitBox <gi...@apache.org> on 2022/11/03 09:23:53 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38494: [SPARK-41004][CONNECT][TESTS] Check error classes in InterceptorRegistrySuite - posted by GitBox <gi...@apache.org> on 2022/11/03 09:24:16 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #38277: [SPARK-40815][SQL] Add `DelegateSymlinkTextInputFormat` to workaround `SymlinkTextInputSplit` bug - posted by GitBox <gi...@apache.org> on 2022/11/03 09:28:40 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38277: [SPARK-40815][SQL] Add `DelegateSymlinkTextInputFormat` to workaround `SymlinkTextInputSplit` bug - posted by GitBox <gi...@apache.org> on 2022/11/03 09:54:11 UTC, 2 replies.
- [GitHub] [spark] wankunde opened a new pull request, #38495: [SPARK-35531][SQL] Update hive table stats without unnecessary convert - posted by GitBox <gi...@apache.org> on 2022/11/03 09:57:23 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #38492: [SPARK-38270][BUILD][FOLLOW-UP] Exclude productElementName and productElementNames in Mima for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/03 09:59:05 UTC, 1 replies.
- [GitHub] [spark] AngersZhuuuu commented on a diff in pull request #34815: [SPARK-37555][SQL] spark-sql should pass last unclosed comment to backend - posted by GitBox <gi...@apache.org> on 2022/11/03 10:00:18 UTC, 1 replies.
- [GitHub] [spark] wankunde opened a new pull request, #38496: [SPARK-40708] Auto update table statistics based on write metrics - posted by GitBox <gi...@apache.org> on 2022/11/03 10:19:58 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38302: [SPARK-40834][SQL] Use SparkListenerSQLExecutionEnd to track final SQL status in UI - posted by GitBox <gi...@apache.org> on 2022/11/03 10:24:04 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38449: [SPARK-40798][SQL][FOLLOW-UP] Fix ansi mode in v2 ALTER TABLE PARTITION - posted by GitBox <gi...@apache.org> on 2022/11/03 10:24:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38449: [SPARK-40798][SQL][FOLLOW-UP] Fix ansi mode in v2 ALTER TABLE PARTITION - posted by GitBox <gi...@apache.org> on 2022/11/03 10:28:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38449: [SPARK-40798][SQL][FOLLOW-UP] Fix ansi mode in v2 ALTER TABLE PARTITION - posted by GitBox <gi...@apache.org> on 2022/11/03 10:29:29 UTC, 0 replies.
- [GitHub] [spark] fred-db opened a new pull request, #38497: [SPARK-40999] Hint propagation to subqueries - posted by GitBox <gi...@apache.org> on 2022/11/03 10:33:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38492: [SPARK-38270][BUILD][FOLLOW-UP] Exclude productElementName and productElementNames in Mima for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/03 10:39:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38498: [SPARK-40769][SQL] Migrate type check failures of aggregate expressions onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/03 11:06:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38498: [WIP][SPARK-40769][CORE][SQL] Migrate type check failures of aggregate expressions onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/03 11:08:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38461: [SPARK-34079][SQL][FOLLOWUP] Improve the readability and simplify the code for MergeScalarSubqueries - posted by GitBox <gi...@apache.org> on 2022/11/03 12:59:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38461: [SPARK-34079][SQL][FOLLOWUP] Improve the readability and simplify the code for MergeScalarSubqueries - posted by GitBox <gi...@apache.org> on 2022/11/03 12:59:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38302: [SPARK-40834][SQL] Use SparkListenerSQLExecutionEnd to track final SQL status in UI - posted by GitBox <gi...@apache.org> on 2022/11/03 13:02:42 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38302: [SPARK-40834][SQL] Use SparkListenerSQLExecutionEnd to track final SQL status in UI - posted by GitBox <gi...@apache.org> on 2022/11/03 13:02:56 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38481: [SPARK-40996][BUILD] Upgrade `sbt-checkstyle-plugin` to 4.0.0 to resolve `dev/sbt-checkstyle` run failed with sbt 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/03 13:32:49 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38481: [SPARK-40996][BUILD] Upgrade `sbt-checkstyle-plugin` to 4.0.0 to resolve `dev/sbt-checkstyle` run failed with sbt 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/03 13:32:58 UTC, 0 replies.
- [GitHub] [spark] dwsmith1983 opened a new pull request, #38499: [MINOR][DOC] updated some grammar and a missed period in the tuning doc - posted by GitBox <gi...@apache.org> on 2022/11/03 13:57:11 UTC, 0 replies.
- [GitHub] [spark] dwsmith1983 commented on pull request #38499: [MINOR][DOC] updated some grammar and a missed period in the tuning doc - posted by GitBox <gi...@apache.org> on 2022/11/03 14:57:27 UTC, 1 replies.
- [GitHub] [spark] dfit99 opened a new pull request, #38500: [SPARK-41007][SQL] Add missing serializer for java.math.BigInteger - posted by GitBox <gi...@apache.org> on 2022/11/03 15:31:29 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38467: [SPARK-40987][CORE] Avoid creating a directory when deleting a block, causing DAGScheduler to not work - posted by GitBox <gi...@apache.org> on 2022/11/03 16:49:32 UTC, 1 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/03 17:03:03 UTC, 21 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38500: [SPARK-41007][SQL] Add missing serializer for java.math.BigInteger - posted by GitBox <gi...@apache.org> on 2022/11/03 17:04:08 UTC, 5 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38490: [SPARK-41009][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1070` to `LOCATION_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/03 17:13:59 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38481: [SPARK-40996][BUILD] Upgrade `sbt-checkstyle-plugin` to 4.0.0 to resolve `dev/sbt-checkstyle` run failed with sbt 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/03 17:15:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38331: [SPARK-40869][K8S] Resource name prefix should not start with a - - posted by GitBox <gi...@apache.org> on 2022/11/03 17:15:37 UTC, 0 replies.
- [GitHub] [spark] dfit99 commented on a diff in pull request #38500: [SPARK-41007][SQL] Add missing serializer for java.math.BigInteger - posted by GitBox <gi...@apache.org> on 2022/11/03 17:15:55 UTC, 4 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38352: [SPARK-40801][BUILD][3.2] Upgrade `Apache commons-text` to 1.10 - posted by GitBox <gi...@apache.org> on 2022/11/03 17:16:31 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38331: [SPARK-40869][K8S] Resource name prefix should not start with a - - posted by GitBox <gi...@apache.org> on 2022/11/03 17:17:35 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38501: [SPARK-41001] [CONNECT] [DOC] Note: Connection string parameters are case-sensitive. - posted by GitBox <gi...@apache.org> on 2022/11/03 17:17:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38331: [SPARK-40869][K8S] Resource name prefix should not start with a hyphen - posted by GitBox <gi...@apache.org> on 2022/11/03 17:19:10 UTC, 2 replies.
- [GitHub] [spark] srowen commented on pull request #38352: [SPARK-40801][BUILD][3.2] Upgrade `Apache commons-text` to 1.10 - posted by GitBox <gi...@apache.org> on 2022/11/03 17:26:02 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38502: [SPARK-40976][BUILD] Re-upgrade sbt to 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/03 17:30:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38502: [SPARK-40976][BUILD] Re-upgrade sbt to 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/03 17:31:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38331: [SPARK-40869][K8S] Resource name prefix should not start with a hyphen - posted by GitBox <gi...@apache.org> on 2022/11/03 17:33:38 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38331: [SPARK-40869][K8S] Resource name prefix should not start with a hyphen - posted by GitBox <gi...@apache.org> on 2022/11/03 17:34:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38502: [SPARK-40976][BUILD] Upgrade sbt to 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/03 17:40:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38483: [SPARK-40997][K8S] K8s resource name prefix should start w/ alphanumeric - posted by GitBox <gi...@apache.org> on 2022/11/03 17:43:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38352: [SPARK-40801][BUILD][3.2] Upgrade `Apache commons-text` to 1.10 - posted by GitBox <gi...@apache.org> on 2022/11/03 17:46:16 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38490: [SPARK-41009][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1070` to `LOCATION_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/03 17:48:33 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38277: [SPARK-40815][SQL] Add `DelegateSymlinkTextInputFormat` to workaround `SymlinkTextInputSplit` bug - posted by GitBox <gi...@apache.org> on 2022/11/03 17:48:40 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38502: [SPARK-40976][BUILD] Upgrade sbt to 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/03 17:48:53 UTC, 1 replies.
- [GitHub] [spark] otterc commented on pull request #38333: [SPARK-40872] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/03 17:56:04 UTC, 1 replies.
- [GitHub] [spark] rahulsmahadev commented on pull request #38404: [SPARK-40956] SQL Equivalent for Dataframe overwrite command - posted by GitBox <gi...@apache.org> on 2022/11/03 18:06:05 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2022/11/03 18:29:00 UTC, 3 replies.
- [GitHub] [spark] WweiL opened a new pull request, #38503: [SPARK-40940] remove multi state checkers - posted by GitBox <gi...@apache.org> on 2022/11/03 18:39:04 UTC, 0 replies.
- [GitHub] [spark] tobiasstadler commented on pull request #38331: [SPARK-40869][K8S] Resource name prefix should not start with a hyphen - posted by GitBox <gi...@apache.org> on 2022/11/03 19:23:31 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38503: [SPARK-40940] Remove Multi-stateful operator checkers for streaming queries. - posted by GitBox <gi...@apache.org> on 2022/11/03 20:19:22 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38490: [SPARK-41009][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1070` to `LOCATION_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/03 20:56:07 UTC, 6 replies.
- [GitHub] [spark] sunchao commented on pull request #38352: [SPARK-40801][BUILD][3.2] Upgrade `Apache commons-text` to 1.10 - posted by GitBox <gi...@apache.org> on 2022/11/03 21:26:49 UTC, 2 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #38146: [SPARK-40687][SQL] Support data masking built-in function 'mask' - posted by GitBox <gi...@apache.org> on 2022/11/03 21:27:00 UTC, 5 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38488: [SPARK-41002][CONNECT][PYTHON] Compatible `take`, `head` and `first` API in Python client - posted by GitBox <gi...@apache.org> on 2022/11/03 21:57:08 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38475: [SPARK-40992][CONNECT] Support toDF(columnNames) in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/03 22:01:52 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #38352: [SPARK-40801][BUILD][3.2] Upgrade `Apache commons-text` to 1.10 - posted by GitBox <gi...@apache.org> on 2022/11/03 22:05:55 UTC, 0 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #38504: [SPARK-40815][SQL][FOLLOW-UP] Disable DelegateSymlinkTextInputSplit feature and ignore the test - posted by GitBox <gi...@apache.org> on 2022/11/03 22:07:15 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #38504: [SPARK-40815][SQL][FOLLOW-UP] Disable DelegateSymlinkTextInputSplit feature and ignore the test - posted by GitBox <gi...@apache.org> on 2022/11/03 22:10:14 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38501: [SPARK-41001] [CONNECT] [DOC] Note: Connection string parameters are case-sensitive. - posted by GitBox <gi...@apache.org> on 2022/11/03 22:12:00 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38500: [SPARK-41007][SQL] Add missing serializer for java.math.BigInteger - posted by GitBox <gi...@apache.org> on 2022/11/03 22:12:04 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38499: [MINOR][DOC] updated some grammar and a missed period in the tuning doc - posted by GitBox <gi...@apache.org> on 2022/11/03 22:12:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38504: [SPARK-40815][SQL][FOLLOW-UP] Disable DelegateSymlinkTextInputSplit feature and ignore the test - posted by GitBox <gi...@apache.org> on 2022/11/03 22:14:15 UTC, 2 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #38504: [SPARK-40815][SQL][FOLLOW-UP] Disable DelegateSymlinkTextInputSplit feature and ignore the test - posted by GitBox <gi...@apache.org> on 2022/11/03 22:15:20 UTC, 0 replies.
- [GitHub] [spark] liuzqt opened a new pull request, #38505: [SPARK-40622][WIP]do not merge(try to fix build error) - posted by GitBox <gi...@apache.org> on 2022/11/03 22:50:21 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38506: [SPARK-41010][CONNECT][PYTHON] Complete Support for Except and Intersect in Python client - posted by GitBox <gi...@apache.org> on 2022/11/03 23:01:57 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38506: [SPARK-41010][CONNECT][PYTHON] Complete Support for Except and Intersect in Python client - posted by GitBox <gi...@apache.org> on 2022/11/03 23:30:27 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38488: [SPARK-41002][CONNECT][PYTHON] Compatible `take`, `head` and `first` API in Python client - posted by GitBox <gi...@apache.org> on 2022/11/03 23:32:18 UTC, 3 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #38223: [SPARK-40770][PYTHON] Improved error messages for applyInPandas for schema mismatch - posted by GitBox <gi...@apache.org> on 2022/11/03 23:46:54 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38497: [SPARK-40999] Hint propagation to subqueries - posted by GitBox <gi...@apache.org> on 2022/11/04 00:07:54 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38496: [SPARK-40708][SQL] Auto update table statistics based on write metrics - posted by GitBox <gi...@apache.org> on 2022/11/04 00:07:58 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38495: [SPARK-35531][SQL] Update hive table stats without unnecessary convert - posted by GitBox <gi...@apache.org> on 2022/11/04 00:08:01 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38494: [SPARK-41004][CONNECT][TESTS] Check error classes in InterceptorRegistrySuite - posted by GitBox <gi...@apache.org> on 2022/11/04 00:08:04 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38491: [MINOR][CONNECT] Remove unused import in commands.proto - posted by GitBox <gi...@apache.org> on 2022/11/04 00:08:09 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37265: [SPARK-39850][YARN]Print applicationId once applied from yarn rm - posted by GitBox <gi...@apache.org> on 2022/11/04 00:24:37 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37239: [SPARK-39825][SQL] Fix PushDownLeftSemiAntiJoin push through project - posted by GitBox <gi...@apache.org> on 2022/11/04 00:24:38 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37226: [MINOR][SQL] Simplify the description of built-in function. - posted by GitBox <gi...@apache.org> on 2022/11/04 00:24:39 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37163: [SPARK-39750][SQL] Enable `spark.sql.cbo.enabled` by default - posted by GitBox <gi...@apache.org> on 2022/11/04 00:24:41 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38488: [SPARK-41002][CONNECT][PYTHON] Compatible `take`, `head` and `first` API in Python client - posted by GitBox <gi...@apache.org> on 2022/11/04 00:29:52 UTC, 1 replies.
- [GitHub] [spark] liuzqt commented on pull request #38064: [SPARK-40622][SQL][CORE]Result of a single task in collect() must fit in 2GB - posted by GitBox <gi...@apache.org> on 2022/11/04 00:55:21 UTC, 5 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38502: [SPARK-40976][BUILD] Upgrade sbt to 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/04 01:26:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38502: [SPARK-40976][BUILD] Upgrade sbt to 1.7.3 - posted by GitBox <gi...@apache.org> on 2022/11/04 01:26:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38504: [SPARK-40815][SQL][FOLLOW-UP] Disable DelegateSymlinkTextInputFormat tests for JDK 9+ - posted by GitBox <gi...@apache.org> on 2022/11/04 01:26:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38504: [SPARK-40815][SQL][FOLLOW-UP] Disable DelegateSymlinkTextInputFormat tests for JDK 9+ - posted by GitBox <gi...@apache.org> on 2022/11/04 01:27:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38501: [SPARK-41001] [CONNECT] [DOC] Note: Connection string parameters are case-sensitive. - posted by GitBox <gi...@apache.org> on 2022/11/04 01:27:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38501: [SPARK-41001] [CONNECT] [DOC] Note: Connection string parameters are case-sensitive. - posted by GitBox <gi...@apache.org> on 2022/11/04 01:27:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38497: [SPARK-40999] Hint propagation to subqueries - posted by GitBox <gi...@apache.org> on 2022/11/04 01:29:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/04 01:33:00 UTC, 15 replies.
- [GitHub] [spark] pan3793 closed pull request #38483: [SPARK-40997][K8S] K8s resource name prefix should start w/ alphanumeric - posted by GitBox <gi...@apache.org> on 2022/11/04 01:59:57 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38489: [SPARK-41003][SQL] BHJ LeftAnti does not update numOutputRows when codegen is disabled - posted by GitBox <gi...@apache.org> on 2022/11/04 02:03:08 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38488: [SPARK-41002][CONNECT][PYTHON] Compatible `take`, `head` and `first` API in Python client - posted by GitBox <gi...@apache.org> on 2022/11/04 02:03:13 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38486: [SPARK-41000][SQL] Make CommandResult extend Command trait - posted by GitBox <gi...@apache.org> on 2022/11/04 02:03:15 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38485: [SPARK-41001] [CONNECT] [PYTHON] Implementing Connection String for Python Client - posted by GitBox <gi...@apache.org> on 2022/11/04 02:03:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/04 02:08:04 UTC, 33 replies.
- [GitHub] [spark] wangyum commented on pull request #38071: [SPARK-36290][SQL] Pull out complex join condition - posted by GitBox <gi...@apache.org> on 2022/11/04 02:56:29 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #38344: [SPARK-40777][SQL][PROTOBUF] Protobuf import support and move error-classes. - posted by GitBox <gi...@apache.org> on 2022/11/04 03:23:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38507: [SPARK-40372][SQL] Migrate failures of array type checks onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/04 03:55:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38507: [WIP][SPARK-40372][SQL] Migrate failures of array type checks onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/04 03:56:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38507: [WIP][SPARK-40372][SQL] Migrate failures of array type checks onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/04 04:04:06 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #38170: [SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2201-2225 - posted by GitBox <gi...@apache.org> on 2022/11/04 04:15:02 UTC, 0 replies.
- [GitHub] [spark] SandishKumarHN commented on pull request #38344: [SPARK-40777][SQL][PROTOBUF] Protobuf import support and move error-classes. - posted by GitBox <gi...@apache.org> on 2022/11/04 04:41:44 UTC, 0 replies.
- [GitHub] [spark] wmoustafa commented on a diff in pull request #37556: [SPARK-39799][SQL] DataSourceV2: View catalog interface - posted by GitBox <gi...@apache.org> on 2022/11/04 04:56:56 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38485: [SPARK-41001] [CONNECT] [PYTHON] Implementing Connection String for Python Client - posted by GitBox <gi...@apache.org> on 2022/11/04 05:07:38 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38508: [SPARK-41012][SQL] Rename _LEGACY_ERROR_TEMP_1022 to ORDER_BY_POS_OUT_OF_RANGE - posted by GitBox <gi...@apache.org> on 2022/11/04 05:11:32 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on a diff in pull request #37556: [SPARK-39799][SQL] DataSourceV2: View catalog interface - posted by GitBox <gi...@apache.org> on 2022/11/04 05:13:38 UTC, 13 replies.
- [GitHub] [spark] itholic commented on pull request #38508: [SPARK-41012][SQL] Rename `_LEGACY_ERROR_TEMP_1022` to `ORDER_BY_POS_OUT_OF_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/04 05:19:56 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38170: [SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2201-2225 - posted by GitBox <gi...@apache.org> on 2022/11/04 05:34:06 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38170: [SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2201-2225 - posted by GitBox <gi...@apache.org> on 2022/11/04 05:34:53 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38344: [SPARK-40777][SQL][PROTOBUF] Protobuf import support and move error-classes. - posted by GitBox <gi...@apache.org> on 2022/11/04 05:49:47 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38344: [SPARK-40777][SQL][PROTOBUF] Protobuf import support and move error-classes. - posted by GitBox <gi...@apache.org> on 2022/11/04 05:50:26 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/04 05:53:42 UTC, 5 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/04 05:58:32 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38494: [SPARK-41004][CONNECT][TESTS] Check error classes in InterceptorRegistrySuite - posted by GitBox <gi...@apache.org> on 2022/11/04 06:30:07 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38494: [SPARK-41004][CONNECT][TESTS] Check error classes in InterceptorRegistrySuite - posted by GitBox <gi...@apache.org> on 2022/11/04 06:30:40 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #38503: [SPARK-40940] Remove Multi-stateful operator checkers for streaming queries. - posted by GitBox <gi...@apache.org> on 2022/11/04 06:46:16 UTC, 17 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38498: [SPARK-40769][CORE][SQL] Migrate type check failures of aggregate expressions onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/04 07:31:54 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38482: [SPARK-40749][SQL] Migrate type check failures of generators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/04 09:21:06 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38482: [SPARK-40749][SQL] Migrate type check failures of generators onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/04 09:21:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37972: [SPARK-40654][SQL] Protobuf support for Spark - from_protobuf AND to_protobuf - posted by GitBox <gi...@apache.org> on 2022/11/04 09:31:30 UTC, 2 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #38509: [PySpark][DOC] Improve documentation and typing of groupby and cogroup applyInPandas - posted by GitBox <gi...@apache.org> on 2022/11/04 10:26:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38496: [WIP][SPARK-40708][SQL] Auto update table statistics based on write metrics - posted by GitBox <gi...@apache.org> on 2022/11/04 10:54:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38508: [SPARK-41012][SQL] Rename `_LEGACY_ERROR_TEMP_1022` to `ORDER_BY_POS_OUT_OF_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/04 11:20:42 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38508: [SPARK-41012][SQL] Rename `_LEGACY_ERROR_TEMP_1022` to `ORDER_BY_POS_OUT_OF_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/04 11:21:20 UTC, 0 replies.
- [GitHub] [spark] SandishKumarHN commented on a diff in pull request #37972: [SPARK-40654][SQL] Protobuf support for Spark - from_protobuf AND to_protobuf - posted by GitBox <gi...@apache.org> on 2022/11/04 11:30:08 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38485: [SPARK-41001] [CONNECT] [PYTHON] Implementing Connection String for Python Client - posted by GitBox <gi...@apache.org> on 2022/11/04 11:54:24 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38490: [SPARK-41009][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1070` to `LOCATION_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/04 14:20:46 UTC, 4 replies.
- [GitHub] [spark] dwsmith1983 opened a new pull request, #38510: [MINOR][DOC] revisions for spark sql performance tuning to improve readability and grammar - posted by GitBox <gi...@apache.org> on 2022/11/04 14:26:04 UTC, 0 replies.
- [GitHub] [spark] dwsmith1983 commented on pull request #38510: [MINOR][DOC] revisions for spark sql performance tuning to improve readability and grammar - posted by GitBox <gi...@apache.org> on 2022/11/04 14:27:20 UTC, 5 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38485: [SPARK-41001] [CONNECT] [PYTHON] Implementing Connection String for Python Client - posted by GitBox <gi...@apache.org> on 2022/11/04 14:56:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38462: [SPARK-40533] [CONNECT] [PYTHON] Support most built-in literal types for Python in Spark Connect - posted by GitBox <gi...@apache.org> on 2022/11/04 14:56:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38462: [SPARK-40533] [CONNECT] [PYTHON] Support most built-in literal types for Python in Spark Connect - posted by GitBox <gi...@apache.org> on 2022/11/04 14:57:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38511: [WIP][SPARK-41017][SQL] Do not push Filter through reference-only Project - posted by GitBox <gi...@apache.org> on 2022/11/04 14:57:43 UTC, 0 replies.
- [GitHub] [spark] FouadApp commented on pull request #32679: [SPARK-28098][SQL]Support read hive table while LeafDir had multi-level paths - posted by GitBox <gi...@apache.org> on 2022/11/04 15:35:48 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38498: [SPARK-40769][CORE][SQL] Migrate type check failures of aggregate expressions onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/04 15:54:33 UTC, 0 replies.
- [GitHub] [spark] FouadApp opened a new pull request, #38512: WIP: [SPARK-38564] Support read hive table from subdirectory source - posted by GitBox <gi...@apache.org> on 2022/11/04 15:55:09 UTC, 0 replies.
- [GitHub] [spark] FouadApp closed pull request #38512: WIP: [SPARK-38564] Support read hive table from subdirectory source - posted by GitBox <gi...@apache.org> on 2022/11/04 16:11:00 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #38503: [SPARK-40940] Remove Multi-stateful operator checkers for streaming queries. - posted by GitBox <gi...@apache.org> on 2022/11/04 17:16:13 UTC, 24 replies.
- [GitHub] [spark] anchovYu commented on a diff in pull request #37887: [SPARK-40360] ALREADY_EXISTS and NOT_FOUND exceptions - posted by GitBox <gi...@apache.org> on 2022/11/04 17:18:13 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37887: [SPARK-40360] ALREADY_EXISTS and NOT_FOUND exceptions - posted by GitBox <gi...@apache.org> on 2022/11/04 17:33:59 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38498: [SPARK-40769][CORE][SQL] Migrate type check failures of aggregate expressions onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/04 17:56:43 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38498: [SPARK-40769][CORE][SQL] Migrate type check failures of aggregate expressions onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/04 17:57:37 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #38479: [SPARK-40697][SQL][FOLLOWUP] Read-side char padding should only be applied if necessary - posted by GitBox <gi...@apache.org> on 2022/11/04 18:14:22 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #38479: [SPARK-40697][SQL][FOLLOWUP] Read-side char padding should only be applied if necessary - posted by GitBox <gi...@apache.org> on 2022/11/04 18:15:20 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #38513: [SPARK-40903][SQL][FOLLOWUP] Cast canonicalized Add as its original data type if necessary - posted by GitBox <gi...@apache.org> on 2022/11/04 18:48:56 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #38513: [SPARK-40903][SQL][FOLLOWUP] Cast canonicalized Add as its original data type if necessary - posted by GitBox <gi...@apache.org> on 2022/11/04 18:49:19 UTC, 2 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38514: [WIP][SQL] Provide a query context to `failAnalysis()` - posted by GitBox <gi...@apache.org> on 2022/11/04 18:53:45 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #36304: [SPARK-38959][SQL] DS V2: Support runtime group filtering in row-level commands - posted by GitBox <gi...@apache.org> on 2022/11/04 19:03:33 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38510: [MINOR][DOC] revisions for spark sql performance tuning to improve readability and grammar - posted by GitBox <gi...@apache.org> on 2022/11/04 19:25:08 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38509: [SPARK-41014][PySpark][DOC] Improve documentation and typing of groupby and cogroup applyInPandas - posted by GitBox <gi...@apache.org> on 2022/11/04 19:25:11 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #38427: [SPARK-40950][CORE] Fix isRemoteAddressMaxedOut performance overhead on scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/04 19:36:35 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38427: [SPARK-40950][CORE] Fix isRemoteAddressMaxedOut performance overhead on scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/04 19:37:33 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #38510: [MINOR][DOC] revisions for spark sql performance tuning to improve readability and grammar - posted by GitBox <gi...@apache.org> on 2022/11/04 19:51:56 UTC, 2 replies.
- [GitHub] [spark] dwsmith1983 commented on a diff in pull request #38510: [MINOR][DOC] revisions for spark sql performance tuning to improve readability and grammar - posted by GitBox <gi...@apache.org> on 2022/11/04 19:54:01 UTC, 2 replies.
- [GitHub] [spark] alex-balikov commented on a diff in pull request #38503: [SPARK-40940] Remove Multi-stateful operator checkers for streaming queries. - posted by GitBox <gi...@apache.org> on 2022/11/04 20:02:57 UTC, 3 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38377: [SPARK-40901][CORE] Unable to store Spark Driver logs with Absolute Hadoop based URI FS Path - posted by GitBox <gi...@apache.org> on 2022/11/04 21:13:12 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38506: [SPARK-41010][CONNECT][PYTHON] Complete Support for Except and Intersect in Python client - posted by GitBox <gi...@apache.org> on 2022/11/04 21:20:21 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38505: [SPARK-40622][WIP]do not merge(try to fix build error) - posted by GitBox <gi...@apache.org> on 2022/11/04 21:20:24 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on a diff in pull request #37556: [SPARK-39799][SQL] DataSourceV2: View catalog interface - posted by GitBox <gi...@apache.org> on 2022/11/04 22:26:45 UTC, 6 replies.
- [GitHub] [spark] swamirishi commented on a diff in pull request #38377: [SPARK-40901][CORE] Unable to store Spark Driver logs with Absolute Hadoop based URI FS Path - posted by GitBox <gi...@apache.org> on 2022/11/04 23:37:42 UTC, 1 replies.
- [GitHub] [spark] SandishKumarHN opened a new pull request, #38515: [SPARK-41015][SQL][PROTOBUF] UnitTest null check for data generator - posted by GitBox <gi...@apache.org> on 2022/11/05 00:12:12 UTC, 0 replies.
- [GitHub] [spark] SandishKumarHN commented on pull request #38515: [SPARK-41015][SQL][PROTOBUF] UnitTest null check for data generator - posted by GitBox <gi...@apache.org> on 2022/11/05 00:12:56 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37315: [SPARK-39892][SQL] Use ArrowType.Decimal(precision, scale, bitWidth) instead of ArrowType.Decimal(precision, scale) - posted by GitBox <gi...@apache.org> on 2022/11/05 00:23:40 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37309: [SPARK-39871][CORE] Jmx http interface supported for SparkHistoryServer - posted by GitBox <gi...@apache.org> on 2022/11/05 00:23:42 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37226: [MINOR][SQL] Simplify the description of built-in function. - posted by GitBox <gi...@apache.org> on 2022/11/05 00:23:45 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37239: [SPARK-39825][SQL] Fix PushDownLeftSemiAntiJoin push through project - posted by GitBox <gi...@apache.org> on 2022/11/05 00:23:45 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37104: [SPARK-39698][SQL] Use `TakeOrderedAndProject` if maxRows below the `spark.sql.execution.topKSortMaxRowsThreshold` - posted by GitBox <gi...@apache.org> on 2022/11/05 00:23:46 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37009: [SPARK-38292][PYTHON]Support na_filter for pyspark.pandas.read_csv - posted by GitBox <gi...@apache.org> on 2022/11/05 00:23:47 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37083: [SPARK-39678][SQL] Improve stats estimation for v2 tables - posted by GitBox <gi...@apache.org> on 2022/11/05 00:23:48 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #34637: [SPARK-37349][SQL] add SQL Rest API parsing logic - posted by GitBox <gi...@apache.org> on 2022/11/05 00:23:50 UTC, 0 replies.
- [GitHub] [spark] ljfgem commented on a diff in pull request #37556: [SPARK-39799][SQL] DataSourceV2: View catalog interface - posted by GitBox <gi...@apache.org> on 2022/11/05 00:39:44 UTC, 1 replies.
- [GitHub] [spark] attilapiros opened a new pull request, #38516: Initial version - posted by GitBox <gi...@apache.org> on 2022/11/05 01:53:06 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #38516: [SPARK-32380][SQL] Fixing access of HBase table via Hive - posted by GitBox <gi...@apache.org> on 2022/11/05 01:56:35 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #38495: [SPARK-35531][SQL] Update hive table stats without unnecessary convert - posted by GitBox <gi...@apache.org> on 2022/11/05 04:22:47 UTC, 2 replies.
- [GitHub] [spark] jerrypeng opened a new pull request, #38517: [WIP][SPARK-39591][SS] Async Progress Tracking - posted by GitBox <gi...@apache.org> on 2022/11/05 04:33:35 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38514: [SPARK-41019][SQL] Provide a query context to `failAnalysis()` - posted by GitBox <gi...@apache.org> on 2022/11/05 07:10:22 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38422: [SPARK-40948][SQL] Introduce new error class: PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/05 07:13:43 UTC, 1 replies.
- [GitHub] [spark] Dam1029 opened a new pull request, #38518: Reset the executor pods watcher when we receive a version changed fro… - posted by GitBox <gi...@apache.org> on 2022/11/05 07:44:45 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #38513: [SPARK-40903][SQL][FOLLOWUP] Cast canonicalized Add as its original data type if necessary - posted by GitBox <gi...@apache.org> on 2022/11/05 10:51:47 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38518: [SPARK-33349][K8S] Reset the executor pods watcher when we receive a version changed from k8s - posted by GitBox <gi...@apache.org> on 2022/11/05 10:57:46 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38517: [WIP][SPARK-39591][SS] Async Progress Tracking - posted by GitBox <gi...@apache.org> on 2022/11/05 10:57:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38516: [SPARK-32380][SQL] Fixing access of HBase table via Hive from Spark - posted by GitBox <gi...@apache.org> on 2022/11/05 12:28:57 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38516: [SPARK-32380][SQL] Fixing access of HBase table via Hive from Spark - posted by GitBox <gi...@apache.org> on 2022/11/05 12:29:12 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38515: [SPARK-41015][SQL][PROTOBUF] UnitTest null check for data generator - posted by GitBox <gi...@apache.org> on 2022/11/05 12:50:13 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38510: [MINOR][DOC] revisions for spark sql performance tuning to improve readability and grammar - posted by GitBox <gi...@apache.org> on 2022/11/05 13:57:44 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38514: [SPARK-41019][SQL] Provide a query context to `failAnalysis()` - posted by GitBox <gi...@apache.org> on 2022/11/05 14:19:08 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38514: [SPARK-41019][SQL] Provide a query context to `failAnalysis()` - posted by GitBox <gi...@apache.org> on 2022/11/05 14:33:30 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38344: [SPARK-40777][SQL][PROTOBUF] Protobuf import support and move error-classes. - posted by GitBox <gi...@apache.org> on 2022/11/05 14:44:01 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38123: [SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2101-2125 - posted by GitBox <gi...@apache.org> on 2022/11/05 14:54:22 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38104: [SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2000-2025 - posted by GitBox <gi...@apache.org> on 2022/11/05 14:55:57 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38519: [MINOR][SQL] Remove unused an error class and query error methods - posted by GitBox <gi...@apache.org> on 2022/11/05 15:03:03 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38520: [WIP][SQL][TESTS] Check error classes in DataTypeMismatch... - posted by GitBox <gi...@apache.org> on 2022/11/05 15:47:04 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #38515: [SPARK-41015][SQL][PROTOBUF] UnitTest null check for data generator - posted by GitBox <gi...@apache.org> on 2022/11/05 16:38:03 UTC, 6 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38521: [SPARK-41020][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1019` to `STAR_GROUP_BY_POS` - posted by GitBox <gi...@apache.org> on 2022/11/05 17:56:11 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #38500: [SPARK-41007][SQL] Add missing serializer for java.math.BigInteger - posted by GitBox <gi...@apache.org> on 2022/11/05 18:10:13 UTC, 1 replies.
- [GitHub] [spark] SandishKumarHN commented on a diff in pull request #38515: [SPARK-41015][SQL][PROTOBUF] UnitTest null check for data generator - posted by GitBox <gi...@apache.org> on 2022/11/05 19:02:38 UTC, 5 replies.
- [GitHub] [spark] jzhuge commented on a diff in pull request #35636: [SPARK-31357][SQL][WIP] Catalog API for view metadata - posted by GitBox <gi...@apache.org> on 2022/11/05 19:18:45 UTC, 4 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37317: [SPARK-39894][SQL] Combine the similar binary comparison in boolean expression. - posted by GitBox <gi...@apache.org> on 2022/11/06 00:23:11 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37315: [SPARK-39892][SQL] Use ArrowType.Decimal(precision, scale, bitWidth) instead of ArrowType.Decimal(precision, scale) - posted by GitBox <gi...@apache.org> on 2022/11/06 00:23:13 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37309: [SPARK-39871][CORE] Jmx http interface supported for SparkHistoryServer - posted by GitBox <gi...@apache.org> on 2022/11/06 00:23:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37235: [SPARK-39824][PYTHON][PS] Introduce index where and putmask func in pyspark - posted by GitBox <gi...@apache.org> on 2022/11/06 00:23:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37065: [SPARK-38699][SQL] Use error classes in the execution errors of dictionary encoding - posted by GitBox <gi...@apache.org> on 2022/11/06 00:23:16 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37104: [SPARK-39698][SQL] Use `TakeOrderedAndProject` if maxRows below the `spark.sql.execution.topKSortMaxRowsThreshold` - posted by GitBox <gi...@apache.org> on 2022/11/06 00:23:16 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37009: [SPARK-38292][PYTHON]Support na_filter for pyspark.pandas.read_csv - posted by GitBox <gi...@apache.org> on 2022/11/06 00:23:18 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34637: [SPARK-37349][SQL] add SQL Rest API parsing logic - posted by GitBox <gi...@apache.org> on 2022/11/06 00:23:19 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38522: [WIP][SQL][TESTS] Test the error class: DEFAULT_DATABASE_NOT_EXISTS, INDEX_ALREADY_EXISTS, INDEX_NOT_FOUND, ROUTINE_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/06 03:19:09 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38520: [SPARK-41021][SQL][TESTS] Test some subclasses of error class DATATYPE_MISMATCH - posted by GitBox <gi...@apache.org> on 2022/11/06 03:24:57 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38521: [SPARK-41020][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1019` to `STAR_GROUP_BY_POS` - posted by GitBox <gi...@apache.org> on 2022/11/06 05:40:43 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38522: [SPARK-41022][SQL][TESTS] Test the error class: DEFAULT_DATABASE_NOT_EXISTS, INDEX_ALREADY_EXISTS, INDEX_NOT_FOUND, ROUTINE_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/06 06:05:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38520: [SPARK-41021][SQL][TESTS] Test some subclasses of error class DATATYPE_MISMATCH - posted by GitBox <gi...@apache.org> on 2022/11/06 06:07:24 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38522: [SPARK-41022][SQL][TESTS] Test the error class: DEFAULT_DATABASE_NOT_EXISTS, INDEX_ALREADY_EXISTS, INDEX_NOT_FOUND, ROUTINE_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/06 06:11:09 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38515: [SPARK-41015][SQL][PROTOBUF] UnitTest null check for data generator - posted by GitBox <gi...@apache.org> on 2022/11/06 06:13:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38523: [SPARK-41023][BUILD] Upgrade Jackson to 2.14.0 - posted by GitBox <gi...@apache.org> on 2022/11/06 06:38:07 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38524: [SPARK-41024][BUILD] Upgrade scala-maven-plugin to 4.7.2 - posted by GitBox <gi...@apache.org> on 2022/11/06 06:42:00 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38507: [SPARK-40372][SQL] Migrate failures of array type checks onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/06 06:44:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38507: [SPARK-40372][SQL] Migrate failures of array type checks onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/06 07:55:35 UTC, 6 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38520: [SPARK-41021][SQL][TESTS] Test some subclasses of error class DATATYPE_MISMATCH - posted by GitBox <gi...@apache.org> on 2022/11/06 08:00:23 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38522: [SPARK-41022][SQL][TESTS] Test the error class: DEFAULT_DATABASE_NOT_EXISTS, INDEX_ALREADY_EXISTS, INDEX_NOT_FOUND, ROUTINE_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/06 08:56:20 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38520: [SPARK-41021][SQL][TESTS] Test some subclasses of error class DATATYPE_MISMATCH - posted by GitBox <gi...@apache.org> on 2022/11/06 09:10:01 UTC, 8 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38525: [SPARK-40950][BUILD][FOLLOWUP] Fix Scala 2.13 Mima check - posted by GitBox <gi...@apache.org> on 2022/11/06 09:59:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38525: [SPARK-40950][BUILD][FOLLOWUP] Fix Scala 2.13 Mima check - posted by GitBox <gi...@apache.org> on 2022/11/06 10:00:10 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38491: [MINOR][CONNECT] Remove unused import in commands.proto - posted by GitBox <gi...@apache.org> on 2022/11/06 10:08:05 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38525: [SPARK-40950][BUILD][FOLLOWUP] Fix Scala 2.13 Mima check - posted by GitBox <gi...@apache.org> on 2022/11/06 11:06:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38524: [SPARK-41024][BUILD] Upgrade scala-maven-plugin to 4.7.2 - posted by GitBox <gi...@apache.org> on 2022/11/06 11:57:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38524: [SPARK-41024][BUILD] Upgrade scala-maven-plugin to 4.7.2 - posted by GitBox <gi...@apache.org> on 2022/11/06 11:57:44 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38523: [SPARK-41023][BUILD] Upgrade Jackson to 2.14.0 - posted by GitBox <gi...@apache.org> on 2022/11/06 13:05:47 UTC, 2 replies.
- [GitHub] [spark] pjfanning commented on pull request #38523: [SPARK-41023][BUILD] Upgrade Jackson to 2.14.0 - posted by GitBox <gi...@apache.org> on 2022/11/06 13:12:42 UTC, 3 replies.
- [GitHub] [spark] srowen closed pull request #38525: [SPARK-40950][BUILD][FOLLOWUP] Fix Scala 2.13 Mima check - posted by GitBox <gi...@apache.org> on 2022/11/06 14:15:28 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38525: [SPARK-40950][BUILD][FOLLOWUP] Fix Scala 2.13 Mima check - posted by GitBox <gi...@apache.org> on 2022/11/06 14:15:37 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #38499: [MINOR][DOC] updated some grammar and a missed period in the tuning doc - posted by GitBox <gi...@apache.org> on 2022/11/06 15:05:52 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38499: [MINOR][DOC] updated some grammar and a missed period in the tuning doc - posted by GitBox <gi...@apache.org> on 2022/11/06 15:05:54 UTC, 0 replies.
- [GitHub] [spark] bsikander commented on pull request #38352: [SPARK-40801][BUILD][3.2] Upgrade `Apache commons-text` to 1.10 - posted by GitBox <gi...@apache.org> on 2022/11/06 15:22:27 UTC, 2 replies.
- [GitHub] [spark] srowen commented on pull request #38523: [SPARK-41023][BUILD] Upgrade Jackson to 2.14.0 - posted by GitBox <gi...@apache.org> on 2022/11/06 15:32:58 UTC, 2 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #36304: [SPARK-38959][SQL] DS V2: Support runtime group filtering in row-level commands - posted by GitBox <gi...@apache.org> on 2022/11/06 15:40:13 UTC, 7 replies.
- [GitHub] [spark] aokolnychyi opened a new pull request, #38526: [SPARK-38959][SQL][FOLLOW-UP] Address feedback for RowLevelOperationRuntimeGroupFiltering - posted by GitBox <gi...@apache.org> on 2022/11/06 16:04:33 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #38526: [SPARK-38959][SQL][FOLLOW-UP] Address feedback for RowLevelOperationRuntimeGroupFiltering - posted by GitBox <gi...@apache.org> on 2022/11/06 16:07:32 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38521: [SPARK-41020][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1019` to `STAR_GROUP_BY_POS` - posted by GitBox <gi...@apache.org> on 2022/11/06 17:06:57 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38522: [SPARK-41022][SQL][TESTS] Test the error class: DEFAULT_DATABASE_NOT_EXISTS, INDEX_ALREADY_EXISTS, INDEX_NOT_FOUND, ROUTINE_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/06 18:06:24 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38522: [SPARK-41022][SQL][TESTS] Test the error class: DEFAULT_DATABASE_NOT_EXISTS, INDEX_ALREADY_EXISTS, INDEX_NOT_FOUND, ROUTINE_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/06 18:07:01 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38521: [SPARK-41020][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1019` to `STAR_GROUP_BY_POS` - posted by GitBox <gi...@apache.org> on 2022/11/06 18:19:34 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38519: [MINOR][SQL] Remove unused an error class and query error methods - posted by GitBox <gi...@apache.org> on 2022/11/06 18:42:56 UTC, 1 replies.
- [GitHub] [spark] ljfgem commented on a diff in pull request #35636: [SPARK-31357][SQL][WIP] Catalog API for view metadata - posted by GitBox <gi...@apache.org> on 2022/11/06 19:03:04 UTC, 4 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38527: [SPARK-40875][CONNECT] Improve aggregate in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/06 21:04:01 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38527: [SPARK-40875][CONNECT] Improve aggregate in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/06 21:04:14 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38527: [SPARK-40875][CONNECT] Improve aggregate in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/06 21:29:16 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #38528: [SPARK-41025][SS] Introduce ComparableOffset to support offset range validation - posted by GitBox <gi...@apache.org> on 2022/11/06 22:37:47 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38528: [SPARK-41025][SS] Introduce ComparableOffset to support offset range validation - posted by GitBox <gi...@apache.org> on 2022/11/06 22:38:28 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38529: [SPARK-41026][CONNECT] Support Repartition in Connect Proto - posted by GitBox <gi...@apache.org> on 2022/11/06 22:41:24 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38529: [SPARK-41026][CONNECT] Support Repartition in Connect Proto - posted by GitBox <gi...@apache.org> on 2022/11/06 22:41:37 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38523: [SPARK-41023][BUILD] Upgrade Jackson to 2.14.0 - posted by GitBox <gi...@apache.org> on 2022/11/06 23:17:03 UTC, 8 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38507: [SPARK-40372][SQL] Migrate failures of array type checks onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/06 23:45:41 UTC, 4 replies.
- [GitHub] [spark] srowen closed pull request #38510: [MINOR][DOC] revisions for spark sql performance tuning to improve readability and grammar - posted by GitBox <gi...@apache.org> on 2022/11/07 00:04:18 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37317: [SPARK-39894][SQL] Combine the similar binary comparison in boolean expression. - posted by GitBox <gi...@apache.org> on 2022/11/07 00:21:43 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37235: [SPARK-39824][PYTHON][PS] Introduce index where and putmask func in pyspark - posted by GitBox <gi...@apache.org> on 2022/11/07 00:21:45 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37316: [SPARK-39893][SQL] Push limit 1 to the aggregate's child plan if grouping expressions and aggregate expressions are foldable - posted by GitBox <gi...@apache.org> on 2022/11/07 00:21:45 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37065: [SPARK-38699][SQL] Use error classes in the execution errors of dictionary encoding - posted by GitBox <gi...@apache.org> on 2022/11/07 00:21:46 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36253: [SPARK-38932][SQL] Datasource v2 support report distinct keys - posted by GitBox <gi...@apache.org> on 2022/11/07 00:21:47 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #37630: [SPARK-40193][SQL] Merge subquery plans with different filters - posted by GitBox <gi...@apache.org> on 2022/11/07 02:01:23 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38395: [SPARK-40917][SQL] Add a dedicated logical plan for `Summary` - posted by GitBox <gi...@apache.org> on 2022/11/07 02:18:19 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38513: [SPARK-40903][SQL][FOLLOWUP] Cast canonicalized Add as its original data type if necessary - posted by GitBox <gi...@apache.org> on 2022/11/07 02:43:52 UTC, 1 replies.
- [GitHub] [spark] jackylee-ch commented on a diff in pull request #38496: [WIP][SPARK-40708][SQL] Auto update table statistics based on write metrics - posted by GitBox <gi...@apache.org> on 2022/11/07 03:04:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38530: [SPARK-41027][SQL] Use `UNEXPECTED_INPUT_TYPE` instead of `MAP_FROM_ENTRIES_WRONG_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/07 03:15:25 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38530: [SPARK-41027][SQL] Use `UNEXPECTED_INPUT_TYPE` instead of `MAP_FROM_ENTRIES_WRONG_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/07 03:18:40 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38530: [SPARK-41027][SQL] Use `UNEXPECTED_INPUT_TYPE` instead of `MAP_FROM_ENTRIES_WRONG_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/07 04:27:32 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #38513: [SPARK-40903][SQL][FOLLOWUP] Cast canonicalized Add as its original data type if necessary - posted by GitBox <gi...@apache.org> on 2022/11/07 04:42:32 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38530: [SPARK-41027][SQL] Use `UNEXPECTED_INPUT_TYPE` instead of `MAP_FROM_ENTRIES_WRONG_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/07 05:17:17 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38513: [SPARK-40903][SQL][FOLLOWUP] Cast canonicalized Add as its original data type if necessary - posted by GitBox <gi...@apache.org> on 2022/11/07 05:23:52 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38318: [SPARK-40852][CONNECT][PYTHON] Introduce `StatFunction` in proto and implement `DataFrame.summary` - posted by GitBox <gi...@apache.org> on 2022/11/07 06:14:09 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38318: [SPARK-40852][CONNECT][PYTHON] Introduce `StatFunction` in proto and implement `DataFrame.summary` - posted by GitBox <gi...@apache.org> on 2022/11/07 06:26:55 UTC, 7 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38531: [SPARK-40755][SQL] Migrate type check failures of number formatting onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/07 06:28:01 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38519: [MINOR][SQL] Remove unused an error class and query error methods - posted by GitBox <gi...@apache.org> on 2022/11/07 06:28:58 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38519: [MINOR][SQL] Remove unused an error class and query error methods - posted by GitBox <gi...@apache.org> on 2022/11/07 06:33:49 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by GitBox <gi...@apache.org> on 2022/11/07 06:46:58 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38509: [SPARK-41014][PYTHON][DOC] Improve documentation and typing of groupby and cogroup applyInPandas - posted by GitBox <gi...@apache.org> on 2022/11/07 06:56:20 UTC, 0 replies.
- [GitHub] [spark] eejbyfeldt commented on pull request #38525: [SPARK-40950][BUILD][FOLLOWUP] Fix Scala 2.13 Mima check - posted by GitBox <gi...@apache.org> on 2022/11/07 08:21:43 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38104: [SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2000-2025 - posted by GitBox <gi...@apache.org> on 2022/11/07 08:24:12 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38123: [SPARK-40663][SQL] Migrate execution errors onto error classes: _LEGACY_ERROR_TEMP_2101-2125 - posted by GitBox <gi...@apache.org> on 2022/11/07 08:25:22 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38510: [MINOR][DOC] revisions for spark sql performance tuning to improve readability and grammar - posted by GitBox <gi...@apache.org> on 2022/11/07 08:30:34 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38447: [SPARK-40973][SQL] Rename `_LEGACY_ERROR_TEMP_0055` to `UNCLOSED_BRACKETED_COMMENT` - posted by GitBox <gi...@apache.org> on 2022/11/07 08:36:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38490: [SPARK-41009][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1070` to `LOCATION_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/07 08:47:14 UTC, 2 replies.
- [GitHub] [spark] MaxGekk closed pull request #38521: [SPARK-41020][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1019` to `STAR_GROUP_BY_POS` - posted by GitBox <gi...@apache.org> on 2022/11/07 08:54:09 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38520: [SPARK-41021][SQL][TESTS] Test some subclasses of error class DATATYPE_MISMATCH - posted by GitBox <gi...@apache.org> on 2022/11/07 09:18:54 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38520: [SPARK-41021][SQL][TESTS] Test some subclasses of error class DATATYPE_MISMATCH - posted by GitBox <gi...@apache.org> on 2022/11/07 09:20:02 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #37630: [SPARK-40193][SQL] Merge subquery plans with different filters - posted by GitBox <gi...@apache.org> on 2022/11/07 09:38:40 UTC, 2 replies.
- [GitHub] [spark] MaxGekk closed pull request #38514: [SPARK-41019][SQL] Provide a query context to `failAnalysis()` - posted by GitBox <gi...@apache.org> on 2022/11/07 10:03:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38527: [SPARK-40875][CONNECT] Improve aggregate in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/07 10:05:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38527: [SPARK-40875][CONNECT] Improve aggregate in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/07 10:06:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38529: [SPARK-41026][CONNECT] Support Repartition in Connect Proto - posted by GitBox <gi...@apache.org> on 2022/11/07 10:07:48 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38422: [SPARK-40948][SQL] Introduce new error class: PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/07 11:12:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38422: [SPARK-40948][SQL] Introduce new error class: PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/07 11:13:24 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38530: [SPARK-41027][SQL] Use `UNEXPECTED_INPUT_TYPE` instead of `MAP_FROM_ENTRIES_WRONG_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/07 11:17:34 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38532: [MINOR] Replace new SparkException(errorClass = "INTERNAL_ERROR", ...) with SparkException.internalError - posted by GitBox <gi...@apache.org> on 2022/11/07 11:51:57 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38532: [MINOR] Replace new SparkException(errorClass = "INTERNAL_ERROR", ...) with SparkException.internalError - posted by GitBox <gi...@apache.org> on 2022/11/07 11:53:56 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38531: [SPARK-40755][SQL] Migrate type check failures of number formatting onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/07 11:53:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38533: [SPARK-41029][SQL] Optimize the use of `GenericArrayData` constructor for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/07 12:17:17 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #38496: [WIP][SPARK-40708][SQL] Auto update table statistics based on write metrics - posted by GitBox <gi...@apache.org> on 2022/11/07 13:18:55 UTC, 2 replies.
- [GitHub] [spark] wangyum opened a new pull request, #38534: [SPARK-38505][SQL] Make partial aggregation adaptive - posted by GitBox <gi...@apache.org> on 2022/11/07 13:26:48 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2022/11/07 13:36:39 UTC, 3 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38529: [SPARK-41026][CONNECT] Support Repartition in Connect Proto - posted by GitBox <gi...@apache.org> on 2022/11/07 13:50:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38533: [SPARK-41029][SQL] Optimize constructor use of `GenericArrayData` for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/07 15:33:18 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38523: [SPARK-41023][BUILD] Upgrade Jackson to 2.14.0 - posted by GitBox <gi...@apache.org> on 2022/11/07 16:17:08 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38535: [SPARK-41001] [CONNECT] Make `user_id` optional in SparkRemoteSession. - posted by GitBox <gi...@apache.org> on 2022/11/07 16:41:53 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38536: [SPARK-40984][CORE][SQL] Use `NON_FOLDABLE_INPUT` instead of `FRAME_LESS_OFFSET_WITHOUT_FOLDABLE` - posted by GitBox <gi...@apache.org> on 2022/11/07 17:11:30 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38537: [WIP][SQL] Rename the error class `_LEGACY_ERROR_TEMP_2429` to `COLUMNS_NUM_MISMATCH` - posted by GitBox <gi...@apache.org> on 2022/11/07 17:54:13 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38532: [MINOR] Replace `new SparkException(errorClass = "INTERNAL_ERROR", ...)` with `SparkException.internalError` - posted by GitBox <gi...@apache.org> on 2022/11/07 17:57:02 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/07 17:57:02 UTC, 12 replies.
- [GitHub] [spark] MaxGekk closed pull request #38532: [MINOR][SQL] Replace `new SparkException(errorClass = "INTERNAL_ERROR", ...)` by `SparkException.internalError` - posted by GitBox <gi...@apache.org> on 2022/11/07 18:00:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38538: [SPARK-41031][BUILD] Upgrade `xz` to 1.9 for `avro` 1.11.1 - posted by GitBox <gi...@apache.org> on 2022/11/07 18:08:17 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #37556: [SPARK-39799][SQL] DataSourceV2: View catalog interface - posted by GitBox <gi...@apache.org> on 2022/11/07 18:17:23 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #38419: [SPARK-40945][SQL] Support built-in function to truncate numbers - posted by GitBox <gi...@apache.org> on 2022/11/07 18:29:34 UTC, 3 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #38263: [SPARK-40692][SQL] Support data masking built-in function 'mask_hash' - posted by GitBox <gi...@apache.org> on 2022/11/07 18:30:50 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38318: [SPARK-40852][CONNECT][PYTHON] Introduce `StatFunction` in proto and implement `DataFrame.summary` - posted by GitBox <gi...@apache.org> on 2022/11/07 18:31:05 UTC, 4 replies.
- [GitHub] [spark] vinodkc commented on pull request #38146: [SPARK-40687][SQL] Support data masking built-in function 'mask' - posted by GitBox <gi...@apache.org> on 2022/11/07 18:31:26 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #38515: [SPARK-41015][SQL][PROTOBUF] UnitTest null check for data generator - posted by GitBox <gi...@apache.org> on 2022/11/07 19:05:13 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #38533: [SPARK-41029][SQL] Optimize constructor use of `GenericArrayData` for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/07 19:17:14 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #38539: [SPARK-41030][BUILD] Upgrade `Apache ivy` to 2.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/07 19:29:20 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/07 19:30:20 UTC, 7 replies.
- [GitHub] [spark] sunchao commented on pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2022/11/07 20:06:47 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38540: [SPARK-41033][CONNECT][PYTHON] RemoteSparkSession should only accept one `user_id`. - posted by GitBox <gi...@apache.org> on 2022/11/07 20:10:54 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38540: [SPARK-41033][CONNECT][PYTHON] RemoteSparkSession should only accept one `user_id`. - posted by GitBox <gi...@apache.org> on 2022/11/07 20:11:06 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38540: [SPARK-41033][CONNECT][PYTHON] RemoteSparkSession should only accept one `user_id` - posted by GitBox <gi...@apache.org> on 2022/11/07 20:19:52 UTC, 0 replies.
- [GitHub] [spark] aray commented on pull request #37303: [SPARK-39883][SQL][TESTS] Add DataFrame function parity check - posted by GitBox <gi...@apache.org> on 2022/11/07 20:22:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38515: [SPARK-41015][SQL][PROTOBUF] UnitTest null check for data generator - posted by GitBox <gi...@apache.org> on 2022/11/07 20:28:09 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #38540: [SPARK-41033][CONNECT][PYTHON] RemoteSparkSession should only accept one `user_id` - posted by GitBox <gi...@apache.org> on 2022/11/07 20:32:16 UTC, 0 replies.
- [GitHub] [spark] amaliujia closed pull request #38540: [SPARK-41033][CONNECT][PYTHON] RemoteSparkSession should only accept one `user_id` - posted by GitBox <gi...@apache.org> on 2022/11/07 20:32:17 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38535: [SPARK-41001] [CONNECT] Make `user_id` optional in SparkRemoteSession. - posted by GitBox <gi...@apache.org> on 2022/11/07 20:39:05 UTC, 4 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38535: [SPARK-41001] [CONNECT] Make `user_id` optional in SparkRemoteSession. - posted by GitBox <gi...@apache.org> on 2022/11/07 20:46:55 UTC, 5 replies.
- [GitHub] [spark] amaliujia commented on pull request #38535: [SPARK-41001] [CONNECT] Make `user_id` optional in SparkRemoteSession. - posted by GitBox <gi...@apache.org> on 2022/11/07 20:53:28 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38541: [SPARK-41034][CONNECT] Connect DataFrame should require RemoteSparkSession - posted by GitBox <gi...@apache.org> on 2022/11/07 21:54:56 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38541: [SPARK-41034][CONNECT] Connect DataFrame should require RemoteSparkSession - posted by GitBox <gi...@apache.org> on 2022/11/07 21:56:20 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #38542: Revert "[SPARK-38550][SQL][CORE] Use a disk-based store to save more debug information for live UI" - posted by GitBox <gi...@apache.org> on 2022/11/07 22:20:13 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #38542: Revert "[SPARK-38550][SQL][CORE] Use a disk-based store to save more debug information for live UI" - posted by GitBox <gi...@apache.org> on 2022/11/07 22:20:29 UTC, 5 replies.
- [GitHub] [spark] wangyum commented on pull request #38534: [SPARK-38505][SQL] Make partial aggregation adaptive - posted by GitBox <gi...@apache.org> on 2022/11/07 23:29:37 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #38146: [SPARK-40687][SQL] Support data masking built-in function 'mask' - posted by GitBox <gi...@apache.org> on 2022/11/07 23:38:17 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] opened a new pull request, #38543: Bump ivy from 2.5.0 to 2.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/07 23:47:29 UTC, 0 replies.
- [GitHub] [spark] leewyang commented on pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2022/11/08 00:12:38 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38539: [SPARK-41030][BUILD] Upgrade `Apache Ivy` to 2.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/08 00:16:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38539: [SPARK-41030][BUILD] Upgrade `Apache Ivy` to 2.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/08 00:16:47 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38543: Bump ivy from 2.5.0 to 2.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/08 00:16:58 UTC, 0 replies.
- [GitHub] [spark] dependabot[bot] commented on pull request #38543: Bump ivy from 2.5.0 to 2.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/08 00:17:00 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38538: [SPARK-41031][BUILD] Upgrade `xz` to 1.9 for `avro` 1.11.1 - posted by GitBox <gi...@apache.org> on 2022/11/08 00:17:06 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38543: Bump ivy from 2.5.0 to 2.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/08 00:17:18 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37316: [SPARK-39893][SQL] Push limit 1 to the aggregate's child plan if grouping expressions and aggregate expressions are foldable - posted by GitBox <gi...@apache.org> on 2022/11/08 00:22:37 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36253: [SPARK-38932][SQL] Datasource v2 support report distinct keys - posted by GitBox <gi...@apache.org> on 2022/11/08 00:22:38 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38542: Revert "[SPARK-38550][SQL][CORE] Use a disk-based store to save more debug information for live UI" - posted by GitBox <gi...@apache.org> on 2022/11/08 00:23:11 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38488: [SPARK-41002][CONNECT][PYTHON] Compatible `take`, `head` and `first` API in Python client - posted by GitBox <gi...@apache.org> on 2022/11/08 00:31:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38488: [SPARK-41002][CONNECT][PYTHON] Compatible `take`, `head` and `first` API in Python client - posted by GitBox <gi...@apache.org> on 2022/11/08 00:31:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38529: [SPARK-41026][CONNECT] Support Repartition in Connect Proto - posted by GitBox <gi...@apache.org> on 2022/11/08 01:00:36 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38531: [SPARK-40755][SQL] Migrate type check failures of number formatting onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/08 01:23:24 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38500: [SPARK-41007][SQL] Add missing serializer for java.math.BigInteger - posted by GitBox <gi...@apache.org> on 2022/11/08 01:33:29 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38500: [SPARK-41007][SQL] Add missing serializer for java.math.BigInteger - posted by GitBox <gi...@apache.org> on 2022/11/08 01:34:28 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38538: [SPARK-41031][BUILD] Upgrade `xz` to 1.9 for `avro` 1.11.1 - posted by GitBox <gi...@apache.org> on 2022/11/08 01:36:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38318: [SPARK-40852][CONNECT][PYTHON] Introduce `StatFunction` in proto and implement `DataFrame.summary` - posted by GitBox <gi...@apache.org> on 2022/11/08 01:41:31 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38542: Revert "[SPARK-38550][SQL][CORE] Use a disk-based store to save more debug information for live UI" - posted by GitBox <gi...@apache.org> on 2022/11/08 01:47:56 UTC, 0 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #38544: [SPARK-40815][SQL][FOLLOW-UP] Fix record reader in DelegateSymlinkTextInputFormat to avoid Hive ExecMapper.getDone() check - posted by GitBox <gi...@apache.org> on 2022/11/08 01:59:36 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #38544: [SPARK-40815][SQL][FOLLOW-UP] Fix record reader in DelegateSymlinkTextInputFormat to avoid Hive ExecMapper.getDone() check - posted by GitBox <gi...@apache.org> on 2022/11/08 02:00:31 UTC, 2 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38545: [MINOR][DOCS] Fix links in the sql-pyspark-pandas-with-arrow - posted by GitBox <gi...@apache.org> on 2022/11/08 02:09:10 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38538: [SPARK-41031][BUILD] Upgrade `xz` to 1.9 for `avro` 1.11.1 - posted by GitBox <gi...@apache.org> on 2022/11/08 02:37:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38533: [SPARK-41029][SQL] Optimize constructor use of `GenericArrayData` for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/08 02:38:42 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38546: [SPARK-41036][CONNECT][PYTHON] `columns` API should use `schema` API to avoid data fetching - posted by GitBox <gi...@apache.org> on 2022/11/08 02:45:49 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38546: [SPARK-41036][CONNECT][PYTHON] `columns` API should use `schema` API to avoid data fetching - posted by GitBox <gi...@apache.org> on 2022/11/08 02:46:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38536: [SPARK-40984][CORE][SQL] Use `NON_FOLDABLE_INPUT` instead of `FRAME_LESS_OFFSET_WITHOUT_FOLDABLE` - posted by GitBox <gi...@apache.org> on 2022/11/08 02:49:30 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38547: [SPARK-40798][SQL][TESTS][FOLLOW-UP] Disable ANSI at the test case for DSv2 - posted by GitBox <gi...@apache.org> on 2022/11/08 03:01:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38547: [SPARK-40798][SQL][TESTS][FOLLOW-UP] Disable ANSI at the test case for DSv2 - posted by GitBox <gi...@apache.org> on 2022/11/08 03:01:31 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38547: [SPARK-40798][SQL][TESTS][FOLLOW-UP] Disable ANSI at the test case for DSv2 - posted by GitBox <gi...@apache.org> on 2022/11/08 03:08:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38422: [SPARK-40948][SQL] Introduce new error class: PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/08 03:15:04 UTC, 5 replies.
- [GitHub] [spark] itholic opened a new pull request, #38548: [SPARK-40663][SQL][FOLLOWUP] `SparkIllegalArgumentException` should accept `cause` - posted by GitBox <gi...@apache.org> on 2022/11/08 03:16:27 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38548: [SPARK-40663][SQL][FOLLOWUP] `SparkIllegalArgumentException` should accept `cause` - posted by GitBox <gi...@apache.org> on 2022/11/08 03:17:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38547: [SPARK-40798][SQL][TESTS][FOLLOW-UP] Disable ANSI at the test case for DSv2 - posted by GitBox <gi...@apache.org> on 2022/11/08 03:23:54 UTC, 2 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38547: [SPARK-40798][SQL][TESTS][FOLLOW-UP] Disable ANSI at the test case for DSv2 - posted by GitBox <gi...@apache.org> on 2022/11/08 03:28:29 UTC, 3 replies.
- [GitHub] [spark] boneanxs commented on pull request #37362: [SPARK-39950][SQL] It's unnecessary to materialize BroadcastQueryStage firstly, because the BroadcastQueryStage does not timeout in AQE. - posted by GitBox <gi...@apache.org> on 2022/11/08 03:31:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38422: [SPARK-40948][SQL] Introduce new error class: PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/08 03:35:55 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #37303: [SPARK-39883][SQL][TESTS] Add DataFrame function parity check - posted by GitBox <gi...@apache.org> on 2022/11/08 03:38:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #37303: [SPARK-39883][SQL][TESTS] Add DataFrame function parity check - posted by GitBox <gi...@apache.org> on 2022/11/08 03:40:06 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38422: [SPARK-40948][SQL] Introduce new error class: PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/08 03:41:05 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38549: [SPARK-41026][CONNECT][PYTHON][FOLLOW-UP] Add Coalesce and Repartition API to Python client. - posted by GitBox <gi...@apache.org> on 2022/11/08 04:49:28 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38549: [SPARK-41026][CONNECT][PYTHON][FOLLOW-UP] Add Coalesce and Repartition API to Python client. - posted by GitBox <gi...@apache.org> on 2022/11/08 04:49:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38542: Revert "[SPARK-38550][SQL][CORE] Use a disk-based store to save more debug information for live UI" - posted by GitBox <gi...@apache.org> on 2022/11/08 05:00:21 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38550: [SPARK-41039][BUILD] Upgrade `scala-parallel-collections` to 1.0.4 for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/08 05:29:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38550: [SPARK-41039][BUILD] Upgrade `scala-parallel-collections` to 1.0.4 for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/08 05:30:34 UTC, 2 replies.
- [GitHub] [spark] itholic opened a new pull request, #38551: [SPARK-41038][SQL] Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS` - posted by GitBox <gi...@apache.org> on 2022/11/08 05:35:58 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38551: [SPARK-41038][SQL] Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS` - posted by GitBox <gi...@apache.org> on 2022/11/08 05:36:50 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38552: [SPARK-41041][SQL] Integrate `_LEGACY_ERROR_TEMP_1279` into `TABLE_OR_VIEW_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/08 05:52:58 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38552: [SPARK-41041][SQL] Integrate `_LEGACY_ERROR_TEMP_1279` into `TABLE_OR_VIEW_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/08 05:53:27 UTC, 0 replies.
- [GitHub] [spark] zsxwing opened a new pull request, #38553: [SPARK-41040][SS] - posted by GitBox <gi...@apache.org> on 2022/11/08 06:14:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38263: [SPARK-40692][SQL] Support data masking built-in function 'mask_hash' - posted by GitBox <gi...@apache.org> on 2022/11/08 06:16:30 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #38554: [SPARK-41042][SQL] Rename `PARSE_CHAR_MISSING_LENGTH` to `DATA_TYPE_MISSING_SIZE` - posted by GitBox <gi...@apache.org> on 2022/11/08 06:16:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38551: [SPARK-41038][SQL] Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS` - posted by GitBox <gi...@apache.org> on 2022/11/08 06:23:31 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38555: [SPARK-41044][SQL] Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to INTERNAL_ERROR - posted by GitBox <gi...@apache.org> on 2022/11/08 06:26:26 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38551: [SPARK-41038][SQL] Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS` - posted by GitBox <gi...@apache.org> on 2022/11/08 06:38:21 UTC, 1 replies.
- [GitHub] [spark] huaxingao commented on pull request #38434: [SPARK-40946][SQL] Add a new DataSource V2 interface SupportsPushDownClusterKeys - posted by GitBox <gi...@apache.org> on 2022/11/08 06:39:44 UTC, 0 replies.
- [GitHub] [spark] zsxwing opened a new pull request, #38556: [SPARK-41045][SQL] Pre-compute to eliminate ScalaReflection calls after deserializer is created - posted by GitBox <gi...@apache.org> on 2022/11/08 06:42:05 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun opened a new pull request, #23: [SPARK-40519] Add "Publish" workflow to help release apache/spark image - posted by GitBox <gi...@apache.org> on 2022/11/08 07:04:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38556: [SPARK-41045][SQL] Pre-compute to eliminate ScalaReflection calls after deserializer is created - posted by GitBox <gi...@apache.org> on 2022/11/08 07:15:35 UTC, 0 replies.
- [GitHub] [spark-docker] martin-g commented on a diff in pull request #23: [SPARK-40519] Add "Publish" workflow to help release apache/spark image - posted by GitBox <gi...@apache.org> on 2022/11/08 07:21:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37362: [SPARK-39950][SQL] It's unnecessary to materialize BroadcastQueryStage firstly, because the BroadcastQueryStage does not timeout in AQE. - posted by GitBox <gi...@apache.org> on 2022/11/08 07:23:11 UTC, 0 replies.
- [GitHub] [spark] liuzqt commented on a diff in pull request #38064: [SPARK-40622][SQL][CORE] Result of a single task in collect() must fit in 2GB - posted by GitBox <gi...@apache.org> on 2022/11/08 07:37:17 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38419: [SPARK-40945][SQL] Support built-in function to truncate numbers - posted by GitBox <gi...@apache.org> on 2022/11/08 07:47:49 UTC, 2 replies.
- [GitHub] [spark-docker] Yikun commented on a diff in pull request #23: [SPARK-40519] Add "Publish" workflow to help release apache/spark image - posted by GitBox <gi...@apache.org> on 2022/11/08 07:53:03 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38557: [SPARK-38959][SQL][FOLLOWUP] Optimizer batch `PartitionPruning` should optimize subqueries - posted by GitBox <gi...@apache.org> on 2022/11/08 08:24:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38526: [SPARK-38959][SQL][FOLLOW-UP] Address feedback for RowLevelOperationRuntimeGroupFiltering - posted by GitBox <gi...@apache.org> on 2022/11/08 08:24:56 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #38526: [SPARK-38959][SQL][FOLLOW-UP] Address feedback for RowLevelOperationRuntimeGroupFiltering - posted by GitBox <gi...@apache.org> on 2022/11/08 08:25:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38557: [SPARK-38959][SQL][FOLLOWUP] Optimizer batch `PartitionPruning` should optimize subqueries - posted by GitBox <gi...@apache.org> on 2022/11/08 08:26:24 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38555: [WIP][SPARK-41044][SQL] Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to DATATYPE_MISMATCH.INTERNAL_ERROR - posted by GitBox <gi...@apache.org> on 2022/11/08 09:24:30 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38537: [SPARK-41043][SQL] Rename the error class `_LEGACY_ERROR_TEMP_2429` to `COLUMNS_NUM_MISMATCH` - posted by GitBox <gi...@apache.org> on 2022/11/08 09:46:19 UTC, 0 replies.
- [GitHub] [spark] c21 commented on pull request #38480: [SPARK-40994][DOCS][SQL] Add code example in JDBC data source with partitionColumn - posted by GitBox <gi...@apache.org> on 2022/11/08 09:51:15 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #38548: [SPARK-40663][SQL][FOLLOWUP] `SparkIllegalArgumentException` should accept `cause` - posted by GitBox <gi...@apache.org> on 2022/11/08 10:03:58 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38555: [WIP][SPARK-41044][SQL] Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to DATATYPE_MISMATCH.INTERNAL_ERROR - posted by GitBox <gi...@apache.org> on 2022/11/08 10:05:12 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38547: [SPARK-40798][SQL][TESTS][FOLLOW-UP] Disable ANSI at the test case for DSv2 - posted by GitBox <gi...@apache.org> on 2022/11/08 10:10:18 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38536: [SPARK-40984][CORE][SQL] Use `NON_FOLDABLE_INPUT` instead of `FRAME_LESS_OFFSET_WITHOUT_FOLDABLE` - posted by GitBox <gi...@apache.org> on 2022/11/08 10:12:29 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38536: [SPARK-40984][CORE][SQL] Use `NON_FOLDABLE_INPUT` instead of `FRAME_LESS_OFFSET_WITHOUT_FOLDABLE` - posted by GitBox <gi...@apache.org> on 2022/11/08 10:13:19 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38515: [SPARK-41015][SQL][PROTOBUF] UnitTest null check for data generator - posted by GitBox <gi...@apache.org> on 2022/11/08 10:19:06 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38530: [SPARK-41027][SQL] Use `UNEXPECTED_INPUT_TYPE` instead of `MAP_FROM_ENTRIES_WRONG_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/08 10:32:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38447: [SPARK-40973][SQL] Rename `_LEGACY_ERROR_TEMP_0055` to `UNCLOSED_BRACKETED_COMMENT` - posted by GitBox <gi...@apache.org> on 2022/11/08 10:34:06 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38447: [SPARK-40973][SQL] Rename `_LEGACY_ERROR_TEMP_0055` to `UNCLOSED_BRACKETED_COMMENT` - posted by GitBox <gi...@apache.org> on 2022/11/08 10:34:45 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #38558: [SPARK-41048][SQL] Improve output partitioning and ordering with AQE cache - posted by GitBox <gi...@apache.org> on 2022/11/08 10:54:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38547: [SPARK-40798][SQL][TESTS][FOLLOW-UP] Disable ANSI at the test case for DSv2 - posted by GitBox <gi...@apache.org> on 2022/11/08 11:13:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38480: [SPARK-40994][DOCS][SQL] Add code example in JDBC data source with partitionColumn - posted by GitBox <gi...@apache.org> on 2022/11/08 11:15:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38480: [SPARK-40994][DOCS][SQL] Add code example in JDBC data source with partitionColumn - posted by GitBox <gi...@apache.org> on 2022/11/08 11:15:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38509: [SPARK-41014][PYTHON][DOC] Improve documentation and typing of groupby and cogroup applyInPandas - posted by GitBox <gi...@apache.org> on 2022/11/08 11:17:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38509: [SPARK-41014][PYTHON][DOC] Improve documentation and typing of groupby and cogroup applyInPandas - posted by GitBox <gi...@apache.org> on 2022/11/08 11:17:40 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38558: [SPARK-41048][SQL] Improve output partitioning and ordering with AQE cache - posted by GitBox <gi...@apache.org> on 2022/11/08 11:20:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37206: [SPARK-39696][CORE] Ensure Concurrent r/w `TaskMetrics` not throw Exception - posted by GitBox <gi...@apache.org> on 2022/11/08 11:21:55 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38559: [SPARK-41050][BUILD] Upgrade scalafmt from 3.5.9 to 3.6.1 - posted by GitBox <gi...@apache.org> on 2022/11/08 11:41:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38496: [WIP][SPARK-40708][SQL] Auto update table statistics based on write metrics - posted by GitBox <gi...@apache.org> on 2022/11/08 11:50:03 UTC, 2 replies.
- [GitHub] [spark] yabola opened a new pull request, #38560: [WIP][SPARK-38005][CORE] Support cleaning up merged shuffle files and state from external shuffle service - posted by GitBox <gi...@apache.org> on 2022/11/08 12:12:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38559: [SPARK-41050][BUILD] Upgrade scalafmt from 3.5.9 to 3.6.1 - posted by GitBox <gi...@apache.org> on 2022/11/08 13:09:32 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38559: [SPARK-41050][BUILD] Upgrade scalafmt from 3.5.9 to 3.6.1 - posted by GitBox <gi...@apache.org> on 2022/11/08 13:30:15 UTC, 0 replies.
- [GitHub] [spark] Narcasserun opened a new pull request, #38561: Optimizer val - posted by GitBox <gi...@apache.org> on 2022/11/08 13:39:07 UTC, 0 replies.
- [GitHub] [spark] Narcasserun closed pull request #38561: Optimize ProcfsMetrics file acquisition - posted by GitBox <gi...@apache.org> on 2022/11/08 13:45:59 UTC, 0 replies.
- [GitHub] [spark] Narcasserun opened a new pull request, #38562: [SPARK-41051][CORE] Optimize ProcfsMetrics file acquisition - posted by GitBox <gi...@apache.org> on 2022/11/08 13:47:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38558: [SPARK-41048][SQL] Improve output partitioning and ordering with AQE cache - posted by GitBox <gi...@apache.org> on 2022/11/08 13:49:05 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38555: [SPARK-41044][SQL] Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to DATATYPE_MISMATCH.INTERNAL_ERROR - posted by GitBox <gi...@apache.org> on 2022/11/08 13:49:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38480: [SPARK-40994][DOCS][SQL] Add code example in JDBC data source with partitionColumn - posted by GitBox <gi...@apache.org> on 2022/11/08 13:50:37 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #28953: [SPARK-32013][SQL] Support query execution before reading DataFrame and before/after writing DataFrame over JDBC - posted by GitBox <gi...@apache.org> on 2022/11/08 13:53:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2022/11/08 13:54:10 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38555: [SPARK-41044][SQL] Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to DATATYPE_MISMATCH.INTERNAL_ERROR - posted by GitBox <gi...@apache.org> on 2022/11/08 14:35:14 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38533: [SPARK-41029][SQL] Optimize constructor use of `GenericArrayData` for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/08 14:42:45 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38533: [SPARK-41029][SQL] Optimize constructor use of `GenericArrayData` for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/08 14:42:58 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38554: [SPARK-41042][SQL] Rename `PARSE_CHAR_MISSING_LENGTH` to `DATATYPE_MISSING_SIZE` - posted by GitBox <gi...@apache.org> on 2022/11/08 14:59:54 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38554: [SPARK-41042][SQL] Rename `PARSE_CHAR_MISSING_LENGTH` to `DATATYPE_MISSING_SIZE` - posted by GitBox <gi...@apache.org> on 2022/11/08 15:01:40 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38537: [SPARK-41043][SQL] Rename the error class `_LEGACY_ERROR_TEMP_2429` to `COLUMNS_NUM_MISMATCH` - posted by GitBox <gi...@apache.org> on 2022/11/08 15:03:17 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38537: [SPARK-41043][SQL] Rename the error class `_LEGACY_ERROR_TEMP_2429` to `COLUMNS_NUM_MISMATCH` - posted by GitBox <gi...@apache.org> on 2022/11/08 15:08:12 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38537: [SPARK-41043][SQL] Rename the error class `_LEGACY_ERROR_TEMP_2429` to `COLUMNS_NUM_MISMATCH` - posted by GitBox <gi...@apache.org> on 2022/11/08 15:08:47 UTC, 4 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38263: [SPARK-40692][SQL] Support data masking built-in function 'mask_hash' - posted by GitBox <gi...@apache.org> on 2022/11/08 15:14:48 UTC, 0 replies.
- [GitHub] [spark] Narcasserun closed pull request #38562: [SPARK-41051][CORE] Optimize ProcfsMetrics file acquisition - posted by GitBox <gi...@apache.org> on 2022/11/08 15:25:16 UTC, 0 replies.
- [GitHub] [spark] Narcasserun opened a new pull request, #38563: [SPARK-41051][CORE] Optimize ProcfsMetrics file acquisition - posted by GitBox <gi...@apache.org> on 2022/11/08 15:27:41 UTC, 0 replies.
- [GitHub] [spark] zsxwing commented on pull request #38556: [SPARK-41045][SQL]Pre-compute to eliminate ScalaReflection calls after deserializer is created - posted by GitBox <gi...@apache.org> on 2022/11/08 16:17:57 UTC, 0 replies.
- [GitHub] [spark] zsxwing commented on a diff in pull request #38556: [SPARK-41045][SQL]Pre-compute to eliminate ScalaReflection calls after deserializer is created - posted by GitBox <gi...@apache.org> on 2022/11/08 16:17:58 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #38556: [SPARK-41045][SQL]Pre-compute to eliminate ScalaReflection calls after deserializer is created - posted by GitBox <gi...@apache.org> on 2022/11/08 16:19:38 UTC, 0 replies.
- [GitHub] [spark] WweiL commented on a diff in pull request #34815: [SPARK-37555][SQL] spark-sql should pass last unclosed comment to backend - posted by GitBox <gi...@apache.org> on 2022/11/08 16:20:58 UTC, 0 replies.
- [GitHub] [spark] zsxwing commented on pull request #38553: [SPARK-41040][SS] Fix self-union streaming query failure when using readStream.table - posted by GitBox <gi...@apache.org> on 2022/11/08 16:30:49 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #38553: [SPARK-41040][SS] Fix self-union streaming query failure when using readStream.table - posted by GitBox <gi...@apache.org> on 2022/11/08 16:31:44 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #38564: [SPARK-41052][BUILD] Upgrade `Apache Mesos` to 1.11.0 - posted by GitBox <gi...@apache.org> on 2022/11/08 17:30:46 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38549: [SPARK-41026][CONNECT][PYTHON][FOLLOW-UP] Add Coalesce and Repartition API to Python client. - posted by GitBox <gi...@apache.org> on 2022/11/08 17:50:19 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37533: [SPARK-40096]Fix finalize shuffle stage slow due to connection creation slow - posted by GitBox <gi...@apache.org> on 2022/11/08 17:51:59 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #38497: [SPARK-40999] Hint propagation to subqueries - posted by GitBox <gi...@apache.org> on 2022/11/08 17:59:39 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #38497: [SPARK-40999] Hint propagation to subqueries - posted by GitBox <gi...@apache.org> on 2022/11/08 17:59:42 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38560: [WIP][SPARK-38005][core] Support cleaning up merged shuffle files and state from external shuffle service - posted by GitBox <gi...@apache.org> on 2022/11/08 18:08:17 UTC, 6 replies.
- [GitHub] [spark] mridulm commented on pull request #38091: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Fix flaky test case - posted by GitBox <gi...@apache.org> on 2022/11/08 18:11:47 UTC, 5 replies.
- [GitHub] [spark] asfgit closed pull request #38091: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Fix flaky test case - posted by GitBox <gi...@apache.org> on 2022/11/08 18:13:09 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #38563: [SPARK-41051][CORE] Optimize ProcfsMetrics file acquisition - posted by GitBox <gi...@apache.org> on 2022/11/08 18:24:53 UTC, 0 replies.
- [GitHub] [spark] fryz commented on pull request #38352: [SPARK-40801][BUILD][3.2] Upgrade `Apache commons-text` to 1.10 - posted by GitBox <gi...@apache.org> on 2022/11/08 18:30:36 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38563: [SPARK-41051][CORE] Optimize ProcfsMetrics file acquisition - posted by GitBox <gi...@apache.org> on 2022/11/08 18:35:35 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38564: [SPARK-41052][BUILD][MESOS] Upgrade `Apache Mesos` to 1.11.0 - posted by GitBox <gi...@apache.org> on 2022/11/08 18:36:59 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #38564: [SPARK-41052][BUILD][MESOS] Upgrade `Apache Mesos` to 1.11.0 - posted by GitBox <gi...@apache.org> on 2022/11/08 18:38:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38564: [SPARK-41052][BUILD][MESOS] Upgrade `Apache Mesos` to 1.11.0 - posted by GitBox <gi...@apache.org> on 2022/11/08 18:42:29 UTC, 1 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #38557: [SPARK-38959][SQL][FOLLOWUP] Optimizer batch `PartitionPruning` should optimize subqueries - posted by GitBox <gi...@apache.org> on 2022/11/08 19:07:48 UTC, 3 replies.
- [GitHub] [spark] bjornjorgensen closed pull request #38564: [SPARK-41052][BUILD][MESOS] Upgrade `Apache Mesos` to 1.11.0 - posted by GitBox <gi...@apache.org> on 2022/11/08 19:10:47 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #38565: [SPARK-41035][SQL] Don't patch foldable children of aggregate functions in `RewriteDistinctAggregates` - posted by GitBox <gi...@apache.org> on 2022/11/08 20:24:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38537: [SPARK-41043][SQL] Rename the error class `_LEGACY_ERROR_TEMP_2429` to `NUM_COLUMNS_MISMATCH` - posted by GitBox <gi...@apache.org> on 2022/11/08 20:24:55 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38537: [SPARK-41043][SQL] Rename the error class `_LEGACY_ERROR_TEMP_2429` to `NUM_COLUMNS_MISMATCH` - posted by GitBox <gi...@apache.org> on 2022/11/08 20:26:02 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38546: [SPARK-41036][CONNECT][PYTHON] `columns` API should use `schema` API to avoid data fetching - posted by GitBox <gi...@apache.org> on 2022/11/08 21:41:05 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38545: [MINOR][DOCS] Fix links in the sql-pyspark-pandas-with-arrow - posted by GitBox <gi...@apache.org> on 2022/11/08 21:41:08 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38566: [SPARK-41046][CONNECT] Support CreateView in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/08 22:12:18 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38566: [SPARK-41046][CONNECT] Support CreateView in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/08 22:13:28 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #38567: [SPARK-41054][UI][CORE] Support RocksDB as KVStore in live UI - posted by GitBox <gi...@apache.org> on 2022/11/08 22:26:12 UTC, 0 replies.
- [GitHub] [spark] yabola commented on pull request #38560: [WIP][SPARK-38005][core] Support cleaning up merged shuffle files and state from external shuffle service - posted by GitBox <gi...@apache.org> on 2022/11/08 23:42:20 UTC, 8 replies.
- [GitHub] [spark] srowen closed pull request #38559: [SPARK-41050][BUILD] Upgrade scalafmt from 3.5.9 to 3.6.1 - posted by GitBox <gi...@apache.org> on 2022/11/09 00:22:40 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38559: [SPARK-41050][BUILD] Upgrade scalafmt from 3.5.9 to 3.6.1 - posted by GitBox <gi...@apache.org> on 2022/11/09 00:22:48 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36775: [SPARK-39389]Filesystem closed should not be considered as corrupt files - posted by GitBox <gi...@apache.org> on 2022/11/09 00:23:14 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38567: [SPARK-41054][UI][CORE] Support RocksDB as KVStore in live UI - posted by GitBox <gi...@apache.org> on 2022/11/09 01:03:19 UTC, 5 replies.
- [GitHub] [spark] gengliangwang closed pull request #38513: [SPARK-40903][SQL][FOLLOWUP] Cast canonicalized Add as its original data type if necessary - posted by GitBox <gi...@apache.org> on 2022/11/09 01:15:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38318: [SPARK-40852][CONNECT][PYTHON] Introduce `StatFunction` in proto and implement `DataFrame.summary` - posted by GitBox <gi...@apache.org> on 2022/11/09 01:19:18 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #38542: Revert "[SPARK-38550][SQL][CORE] Use a disk-based store to save more debug information for live UI" - posted by GitBox <gi...@apache.org> on 2022/11/09 01:19:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38318: [SPARK-40852][CONNECT][PYTHON] Introduce `StatFunction` in proto and implement `DataFrame.summary` - posted by GitBox <gi...@apache.org> on 2022/11/09 01:20:02 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38558: [SPARK-41048][SQL] Improve output partitioning and ordering with AQE cache - posted by GitBox <gi...@apache.org> on 2022/11/09 01:21:56 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38557: [SPARK-38959][SQL][FOLLOWUP] Optimizer batch `PartitionPruning` should optimize subqueries - posted by GitBox <gi...@apache.org> on 2022/11/09 01:24:16 UTC, 3 replies.
- [GitHub] [spark] Narcasserun opened a new pull request, #38568: [SPARK-41051][CORE] Optimize ProcfsMetrics file acquisition - posted by GitBox <gi...@apache.org> on 2022/11/09 01:28:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38565: [SPARK-41035][SQL] Don't patch foldable children of aggregate functions in `RewriteDistinctAggregates` - posted by GitBox <gi...@apache.org> on 2022/11/09 01:29:20 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38541: [SPARK-41034][CONNECT][PYTHON] Connect DataFrame should require a RemoteSparkSession - posted by GitBox <gi...@apache.org> on 2022/11/09 01:31:43 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on pull request #21: [SPARK-40569][TESTS] Add smoke test in standalone cluster for spark-docker - posted by GitBox <gi...@apache.org> on 2022/11/09 01:32:17 UTC, 1 replies.
- [GitHub] [spark-docker] Yikun closed pull request #21: [SPARK-40569][TESTS] Add smoke test in standalone cluster for spark-docker - posted by GitBox <gi...@apache.org> on 2022/11/09 01:34:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38565: [SPARK-41035][SQL] Don't patch foldable children of aggregate functions in `RewriteDistinctAggregates` - posted by GitBox <gi...@apache.org> on 2022/11/09 01:43:53 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38557: [SPARK-38959][SQL][FOLLOWUP] Optimizer batch `PartitionPruning` should optimize subqueries - posted by GitBox <gi...@apache.org> on 2022/11/09 01:44:13 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #34815: [SPARK-37555][SQL] spark-sql should pass last unclosed comment to backend - posted by GitBox <gi...@apache.org> on 2022/11/09 01:45:38 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on pull request #38565: [SPARK-41035][SQL] Don't patch foldable children of aggregate functions in `RewriteDistinctAggregates` - posted by GitBox <gi...@apache.org> on 2022/11/09 01:46:06 UTC, 0 replies.
- [GitHub] [spark] maryannxue commented on a diff in pull request #38558: [SPARK-41048][SQL] Improve output partitioning and ordering with AQE cache - posted by GitBox <gi...@apache.org> on 2022/11/09 02:06:55 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #38569: [SPARK-41055][SQL] Rename `_LEGACY_ERROR_TEMP_2424` to `GROUP_BY_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/09 02:21:35 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38569: [SPARK-41055][SQL] Rename `_LEGACY_ERROR_TEMP_2424` to `GROUP_BY_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/09 02:22:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38491: [MINOR][CONNECT] Remove unused import in commands.proto - posted by GitBox <gi...@apache.org> on 2022/11/09 02:29:27 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38491: [MINOR][CONNECT] Remove unused import in commands.proto - posted by GitBox <gi...@apache.org> on 2022/11/09 02:37:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38569: [SPARK-41055][SQL] Rename `_LEGACY_ERROR_TEMP_2424` to `GROUP_BY_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/09 02:39:09 UTC, 2 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38569: [SPARK-41055][SQL] Rename `_LEGACY_ERROR_TEMP_2424` to `GROUP_BY_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/09 02:47:39 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38570: [SPARK-41056][R] Fix new R_LIBS_SITE behavior introduced in R 4.2 - posted by GitBox <gi...@apache.org> on 2022/11/09 02:49:52 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38491: [SPARK-41058][CONNECT] Remove unused import in commands.proto - posted by GitBox <gi...@apache.org> on 2022/11/09 02:56:25 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu opened a new pull request, #38571: [SPARK-37555][TEST][FOLLOWUP] Increase timeout of CLI test `spark-sql should pass last unclosed comment to backend` - posted by GitBox <gi...@apache.org> on 2022/11/09 03:07:29 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38550: [SPARK-41039][BUILD] Upgrade `scala-parallel-collections` to 1.0.4 for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/09 03:07:30 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38550: [SPARK-41039][BUILD] Upgrade `scala-parallel-collections` to 1.0.4 for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/09 03:07:31 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38569: [SPARK-41055][SQL] Rename `_LEGACY_ERROR_TEMP_2424` to `GROUP_BY_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/09 03:13:37 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38572: [SPARK-41059][SQL] Rename `_LEGACY_ERROR_TEMP_2420` to `NESTED_AGGREGATE_FUNCTION` - posted by GitBox <gi...@apache.org> on 2022/11/09 03:15:52 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38573: [SPARK-41061][CONNECT] Support SelectExpr which applies Projection by expressions in Strings in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/09 03:17:36 UTC, 0 replies.
- [GitHub] [spark] 19Serhii99 opened a new pull request, #38574: [SPARK-41060] [K8S] Made the spark submitter generate new names for driver and executor config maps - posted by GitBox <gi...@apache.org> on 2022/11/09 03:46:51 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #38567: [SPARK-41054][UI][CORE] Support RocksDB as KVStore in live UI - posted by GitBox <gi...@apache.org> on 2022/11/09 05:13:18 UTC, 5 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38535: [SPARK-41001] [CONNECT] Make `user_id` optional in SparkRemoteSession. - posted by GitBox <gi...@apache.org> on 2022/11/09 05:23:54 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38575: [WIP][SPARK-40948][SQL][FOLLOWUP] Restore PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/09 05:35:32 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38575: [WIP][SPARK-40948][SQL][FOLLOWUP] Restore PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/09 05:42:04 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #38576: [SPARK-41062][SQL] Rename `UNSUPPORTED_CORRELATED_REFERENCE` to `CORRELATED_REFERENCE` - posted by GitBox <gi...@apache.org> on 2022/11/09 05:48:25 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38545: [MINOR][DOCS] Fix links in the sql-pyspark-pandas-with-arrow - posted by GitBox <gi...@apache.org> on 2022/11/09 05:52:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38577: [SPARK-41071][BUILD] Remove `MaxMetaspaceSize` option from `make-distribution.sh` to make it run successfully - posted by GitBox <gi...@apache.org> on 2022/11/09 06:20:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38577: [SPARK-41071][BUILD] Remove `MaxMetaspaceSize` option from `make-distribution.sh` to make it run successfully - posted by GitBox <gi...@apache.org> on 2022/11/09 06:22:00 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38578: [SPARK-41064][CONNECT][PYTHON] Implement `DataFrame.crosstab` and `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/09 06:31:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38091: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Fix flaky test case - posted by GitBox <gi...@apache.org> on 2022/11/09 06:36:50 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38546: [SPARK-41036][CONNECT][PYTHON] `columns` API should use `schema` API to avoid data fetching - posted by GitBox <gi...@apache.org> on 2022/11/09 06:38:25 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38546: [SPARK-41036][CONNECT][PYTHON] `columns` API should use `schema` API to avoid data fetching - posted by GitBox <gi...@apache.org> on 2022/11/09 06:40:10 UTC, 6 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38575: [WIP][SPARK-40948][SQL][FOLLOWUP] Restore PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/09 06:40:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38491: [SPARK-41058][CONNECT] Remove unused import in commands.proto - posted by GitBox <gi...@apache.org> on 2022/11/09 06:42:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38491: [SPARK-41058][CONNECT] Remove unused import in commands.proto - posted by GitBox <gi...@apache.org> on 2022/11/09 06:42:58 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38566: [SPARK-41046][CONNECT] Support CreateView in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/09 06:48:31 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38506: [SPARK-41010][CONNECT][PYTHON] Complete Support for Except and Intersect in Python client - posted by GitBox <gi...@apache.org> on 2022/11/09 06:51:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38577: [SPARK-41071][BUILD] Remove `MaxMetaspaceSize` option from `make-distribution.sh` to make it run successfully - posted by GitBox <gi...@apache.org> on 2022/11/09 06:52:05 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38566: [SPARK-41046][CONNECT] Support CreateView in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/09 06:52:56 UTC, 3 replies.
- [GitHub] [spark] pan3793 commented on pull request #38577: [SPARK-41071][BUILD] Remove `MaxMetaspaceSize` option from `make-distribution.sh` to make it run successfully - posted by GitBox <gi...@apache.org> on 2022/11/09 06:57:52 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38577: [SPARK-41071][BUILD] Remove `MaxMetaspaceSize` option from `make-distribution.sh` to make it run successfully - posted by GitBox <gi...@apache.org> on 2022/11/09 06:58:07 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38577: [SPARK-41071][BUILD] Remove `MaxMetaspaceSize` option from `make-distribution.sh` to make it run successfully - posted by GitBox <gi...@apache.org> on 2022/11/09 07:02:07 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on pull request #38573: [SPARK-41061][CONNECT] Support SelectExpr which applies Projection by expressions in Strings in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/09 07:15:21 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38511: [SPARK-41017][SQL] Support column pruning with multiple nondeterministic Filters - posted by GitBox <gi...@apache.org> on 2022/11/09 07:17:00 UTC, 2 replies.
- [GitHub] [spark] zhengchenyu commented on pull request #37949: [SPARK-40504][YARN] Make yarn appmaster load config from client - posted by GitBox <gi...@apache.org> on 2022/11/09 07:19:11 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38570: [SPARK-41056][R] Fix new R_LIBS_SITE behavior introduced in R 4.2 - posted by GitBox <gi...@apache.org> on 2022/11/09 07:39:41 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38570: [SPARK-41056][R] Fix new R_LIBS_SITE behavior introduced in R 4.2 - posted by GitBox <gi...@apache.org> on 2022/11/09 07:41:16 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #38577: [SPARK-41071][BUILD] Remove `MaxMetaspaceSize` option from `make-distribution.sh` to make it run successfully - posted by GitBox <gi...@apache.org> on 2022/11/09 07:41:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38475: [SPARK-40992][CONNECT] Support toDF(columnNames) in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/09 07:48:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38475: [SPARK-40992][CONNECT] Support toDF(columnNames) in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/09 07:48:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38566: [SPARK-41046][CONNECT] Support CreateView in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/09 07:51:02 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38573: [SPARK-41061][CONNECT] Support SelectExpr which applies Projection by expressions in Strings in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/09 07:52:04 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38578: [SPARK-41064][CONNECT][PYTHON] Implement `DataFrame.crosstab` and `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/09 07:59:52 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2022/11/09 08:03:03 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38578: [SPARK-41064][CONNECT][PYTHON] Implement `DataFrame.crosstab` and `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/09 08:10:19 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38578: [SPARK-41064][CONNECT][PYTHON] Implement `DataFrame.crosstab` and `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/09 08:24:31 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38569: [SPARK-41055][SQL] Rename `_LEGACY_ERROR_TEMP_2424` to `GROUP_BY_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/09 08:36:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37211: [SPARK-39644][SQL] Add RangePartitioning reporting for V2 DataSources - posted by GitBox <gi...@apache.org> on 2022/11/09 08:53:39 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38552: [SPARK-41041][SQL] Integrate `_LEGACY_ERROR_TEMP_1279` into `TABLE_OR_VIEW_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/09 08:53:53 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38552: [SPARK-41041][SQL] Integrate `_LEGACY_ERROR_TEMP_1279` into `TABLE_OR_VIEW_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/09 08:54:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38579: [SPARK-40877][DOC][FOLLOW-UP] Update the doc of `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/09 09:00:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38573: [SPARK-41061][CONNECT] Support SelectExpr which applies Projection by expressions in Strings in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/09 09:03:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38579: [SPARK-40877][DOC][FOLLOW-UP] Update the doc of `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/09 09:04:50 UTC, 1 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #38580: [SPARK-40798][SQL][TESTS][FOLLOW-UP] Improve test coverage - posted by GitBox <gi...@apache.org> on 2022/11/09 09:16:20 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38490: [SPARK-41009][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1070` to `LOCATION_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/09 09:33:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38549: [SPARK-41026][CONNECT][PYTHON][FOLLOW-UP] Add Coalesce and Repartition API to Python client. - posted by GitBox <gi...@apache.org> on 2022/11/09 09:35:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38549: [SPARK-41026][CONNECT][PYTHON][FOLLOW-UP] Add Coalesce and Repartition API to Python client. - posted by GitBox <gi...@apache.org> on 2022/11/09 09:36:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38549: [SPARK-41026][CONNECT][PYTHON][FOLLOW-UP] Add Coalesce and Repartition API to Python client. - posted by GitBox <gi...@apache.org> on 2022/11/09 09:36:19 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38571: [SPARK-37555][TEST][FOLLOWUP] Increase timeout of CLI test `spark-sql should pass last unclosed comment to backend` - posted by GitBox <gi...@apache.org> on 2022/11/09 09:36:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38581: [SPARK-41074][DOC] Add option `--upgrade` in dependency installation command - posted by GitBox <gi...@apache.org> on 2022/11/09 09:43:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38581: [SPARK-41074][DOC] Add option `--upgrade` in dependency installation command - posted by GitBox <gi...@apache.org> on 2022/11/09 09:44:12 UTC, 1 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by GitBox <gi...@apache.org> on 2022/11/09 10:13:40 UTC, 3 replies.
- [GitHub] [spark] fred-db commented on a diff in pull request #38497: [SPARK-40999] Hint propagation to subqueries - posted by GitBox <gi...@apache.org> on 2022/11/09 10:25:31 UTC, 12 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38581: [SPARK-41074][DOC] Add option `--upgrade` in dependency installation command - posted by GitBox <gi...@apache.org> on 2022/11/09 11:05:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38581: [SPARK-41074][DOC] Add option `--upgrade` in dependency installation command - posted by GitBox <gi...@apache.org> on 2022/11/09 11:06:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38567: [SPARK-41054][UI][CORE] Support RocksDB as KVStore in live UI - posted by GitBox <gi...@apache.org> on 2022/11/09 12:14:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38578: [SPARK-41064][CONNECT][PYTHON] Implement `DataFrame.crosstab` and `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/09 12:30:49 UTC, 1 replies.
- [GitHub] [spark] wangyum closed pull request #38577: [SPARK-41071][BUILD] Remove `MaxMetaspaceSize` option from `make-distribution.sh` to make it run successfully - posted by GitBox <gi...@apache.org> on 2022/11/09 12:46:09 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #38577: [SPARK-41071][BUILD] Remove `MaxMetaspaceSize` option from `make-distribution.sh` to make it run successfully - posted by GitBox <gi...@apache.org> on 2022/11/09 12:46:33 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #37949: [SPARK-40504][YARN] Make yarn appmaster load config from client - posted by GitBox <gi...@apache.org> on 2022/11/09 12:58:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38434: [SPARK-40946][SQL] Add a new DataSource V2 interface SupportsPushDownClusterKeys - posted by GitBox <gi...@apache.org> on 2022/11/09 13:01:06 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38545: [MINOR][DOCS] Fix links in the sql-pyspark-pandas-with-arrow - posted by GitBox <gi...@apache.org> on 2022/11/09 13:07:40 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38545: [MINOR][DOCS] Fix links in the sql-pyspark-pandas-with-arrow - posted by GitBox <gi...@apache.org> on 2022/11/09 13:08:00 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38580: [SPARK-40798][SQL][TESTS][FOLLOW-UP] Improve test coverage - posted by GitBox <gi...@apache.org> on 2022/11/09 15:06:44 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38580: [SPARK-40798][SQL][TESTS][FOLLOW-UP] Improve test coverage - posted by GitBox <gi...@apache.org> on 2022/11/09 15:07:56 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38582: [WIP][SQL] Convert unresolved operators to internal errors - posted by GitBox <gi...@apache.org> on 2022/11/09 15:58:04 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38551: [SPARK-41038][SQL] Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS` - posted by GitBox <gi...@apache.org> on 2022/11/09 16:13:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38551: [SPARK-41038][SQL] Rename `MULTI_VALUE_SUBQUERY_ERROR` to `SCALAR_SUBQUERY_TOO_MANY_ROWS` - posted by GitBox <gi...@apache.org> on 2022/11/09 16:14:58 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38583: [WIP] Do not use identifier to match interval units - posted by GitBox <gi...@apache.org> on 2022/11/09 17:47:59 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #38584: [SPARK-40281] Memory Profiler on Executors - posted by GitBox <gi...@apache.org> on 2022/11/09 18:30:06 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38584: [SPARK-40281] Memory Profiler on Executors - posted by GitBox <gi...@apache.org> on 2022/11/09 20:02:16 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37949: [SPARK-40504][YARN] Make yarn appmaster load config from client - posted by GitBox <gi...@apache.org> on 2022/11/09 20:05:51 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #38585: [SPARK-41076][] Upgrade `protobuf` to 3.21.9 - posted by GitBox <gi...@apache.org> on 2022/11/09 20:29:37 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38586: [SPARK-41077][CONNECT][PYTHON][REFACTORING] Rename `ColumnRef` to `Column` in Python client implementation - posted by GitBox <gi...@apache.org> on 2022/11/09 20:38:10 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38586: [SPARK-41077][CONNECT][PYTHON][REFACTORING] Rename `ColumnRef` to `Column` in Python client implementation - posted by GitBox <gi...@apache.org> on 2022/11/09 20:38:18 UTC, 1 replies.
- [GitHub] [spark] WweiL commented on pull request #38503: [SPARK-40940] Remove Multi-stateful operator checkers for streaming queries. - posted by GitBox <gi...@apache.org> on 2022/11/09 21:13:27 UTC, 1 replies.
- [GitHub] [spark] xkrogen commented on pull request #37949: [SPARK-40504][YARN] Make yarn appmaster load config from client - posted by GitBox <gi...@apache.org> on 2022/11/09 21:40:32 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38587: [SPARK-41078][CONNECT] DataFrame `withColumnsRenamed` can be implemented through `RenameColumns` proto - posted by GitBox <gi...@apache.org> on 2022/11/09 23:46:22 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38587: [SPARK-41078][CONNECT] DataFrame `withColumnsRenamed` can be implemented through `RenameColumns` proto - posted by GitBox <gi...@apache.org> on 2022/11/09 23:47:23 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37234: [SPARK-39822][PYTHON][PS] Provide a good feedback to users - posted by GitBox <gi...@apache.org> on 2022/11/10 00:23:07 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36775: [SPARK-39389] Filesystem closed should not be considered as corrupt files - posted by GitBox <gi...@apache.org> on 2022/11/10 00:23:09 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38587: [SPARK-41078][CONNECT] DataFrame `withColumnsRenamed` can be implemented through `RenameColumns` proto - posted by GitBox <gi...@apache.org> on 2022/11/10 00:48:52 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38586: [SPARK-41077][CONNECT][PYTHON][REFACTORING] Rename `ColumnRef` to `Column` in Python client implementation - posted by GitBox <gi...@apache.org> on 2022/11/10 00:48:55 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38585: [SPARK-41076][BUILD][CONNECT][PROTOBUF] Upgrade `protobuf` to 3.21.9 - posted by GitBox <gi...@apache.org> on 2022/11/10 00:48:58 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38584: [SPARK-40281][PYTHON] Memory Profiler on Executors - posted by GitBox <gi...@apache.org> on 2022/11/10 00:49:02 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38588: [SPARK-41086][SQL] Consolidate SecondArgumentXXX error to INVALID_PARAMETER_VALUE - posted by GitBox <gi...@apache.org> on 2022/11/10 01:07:24 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38588: [SPARK-41086][SQL] Consolidate SecondArgumentXXX error to INVALID_PARAMETER_VALUE - posted by GitBox <gi...@apache.org> on 2022/11/10 01:07:58 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #38576: [SPARK-41062][SQL] Rename `UNSUPPORTED_CORRELATED_REFERENCE` to `CORRELATED_REFERENCE` - posted by GitBox <gi...@apache.org> on 2022/11/10 01:19:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38587: [SPARK-41078][CONNECT] DataFrame `withColumnsRenamed` can be implemented through `RenameColumns` proto - posted by GitBox <gi...@apache.org> on 2022/11/10 01:23:10 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38587: [SPARK-41078][CONNECT] DataFrame `withColumnsRenamed` can be implemented through `RenameColumns` proto - posted by GitBox <gi...@apache.org> on 2022/11/10 01:56:58 UTC, 5 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38589: [SPARK-41087][BUILD] Make `build/mvn` use the same JAVA_OPTS as `dev/make-distribution.sh` - posted by GitBox <gi...@apache.org> on 2022/11/10 02:17:23 UTC, 0 replies.
- [GitHub] [spark] Narcasserun closed pull request #38568: [SSPARK-41051][CORE] Optimize ProcfsMetrics file acquisition - posted by GitBox <gi...@apache.org> on 2022/11/10 02:27:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38590: [SPARK-40767][BUILD][3.3] Fix JAVA_OPTS to improve maven compilation speed - posted by GitBox <gi...@apache.org> on 2022/11/10 02:28:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38585: [SPARK-41076][BUILD][CONNECT][PROTOBUF] Upgrade `protobuf` to 3.21.9 - posted by GitBox <gi...@apache.org> on 2022/11/10 02:46:20 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38584: [SPARK-40281][PYTHON] Memory Profiler on Executors - posted by GitBox <gi...@apache.org> on 2022/11/10 02:49:25 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #38585: [SPARK-41076][BUILD][CONNECT][PROTOBUF] Upgrade `protobuf` to 3.21.9 - posted by GitBox <gi...@apache.org> on 2022/11/10 02:50:56 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38591: [DONT'MERGE] Unified scala maven plugin - posted by GitBox <gi...@apache.org> on 2022/11/10 03:31:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38591: [DONT'MERGE] Unified scala maven plugin - posted by GitBox <gi...@apache.org> on 2022/11/10 03:32:26 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #38591: [DONT'MERGE] Unify the version of `scala-maven-plugin` used by `hadoop-2` and `hadoop-3` profile - posted by GitBox <gi...@apache.org> on 2022/11/10 03:35:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38591: [DONT'MERGE] Unify the version of `scala-maven-plugin` used by `hadoop-2` and `hadoop-3` profile - posted by GitBox <gi...@apache.org> on 2022/11/10 03:35:00 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38591: [DONT'MERGE] Unify the version of `scala-maven-plugin` used by `hadoop-2` and `hadoop-3` profile - posted by GitBox <gi...@apache.org> on 2022/11/10 03:35:18 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #38592: [SPARK-41088][SQL] Add PartialAggregate and FinalAggregate logic operators - posted by GitBox <gi...@apache.org> on 2022/11/10 04:06:51 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #38593: [SPARK-41089][YARN][SHUFFLE] Relocate Netty native arm64 libs - posted by GitBox <gi...@apache.org> on 2022/11/10 04:07:28 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #38593: [SPARK-41089][YARN][SHUFFLE] Relocate Netty native arm64 libs - posted by GitBox <gi...@apache.org> on 2022/11/10 04:07:55 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38506: [SPARK-41010][CONNECT][PYTHON] Complete Support for Except and Intersect in Python client - posted by GitBox <gi...@apache.org> on 2022/11/10 04:11:53 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38506: [SPARK-41010][CONNECT][PYTHON] Complete Support for Except and Intersect in Python client - posted by GitBox <gi...@apache.org> on 2022/11/10 04:12:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38594: [SPARK-40852][CONNECT][PYTHON][FOLLOWUP] Make `Summary` a separate proto plan - posted by GitBox <gi...@apache.org> on 2022/11/10 04:16:20 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38595: [SPARK-41090][SQL] Enhance Dataset.createTempView testing coverage for `db_name.view_name` - posted by GitBox <gi...@apache.org> on 2022/11/10 04:20:15 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38595: [SPARK-41090][SQL] Enhance Dataset.createTempView testing coverage for `db_name.view_name` - posted by GitBox <gi...@apache.org> on 2022/11/10 04:20:52 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38593: [SPARK-41089][YARN][SHUFFLE] Relocate Netty native arm64 libs - posted by GitBox <gi...@apache.org> on 2022/11/10 04:28:41 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38595: [SPARK-41090][SQL] Enhance Dataset.createTempView testing coverage for `db_name.view_name` - posted by GitBox <gi...@apache.org> on 2022/11/10 04:29:35 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38566: [SPARK-41046][CONNECT] Support CreateView in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/10 05:02:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38566: [SPARK-41046][CONNECT] Support CreateView in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/10 05:02:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38587: [SPARK-41078][CONNECT] Implement DataFrame `withColumnsRenamed` in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/10 05:05:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38587: [SPARK-41078][CONNECT] Implement DataFrame `withColumnsRenamed` in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/10 05:05:13 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38583: [SPARK-41092][SQL] Do not use identifier to match interval units - posted by GitBox <gi...@apache.org> on 2022/11/10 05:31:24 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38506: [SPARK-41010][CONNECT][PYTHON] Complete Support for Except and Intersect in Python client - posted by GitBox <gi...@apache.org> on 2022/11/10 05:32:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38578: [SPARK-41064][CONNECT][PYTHON] Implement `DataFrame.crosstab` and `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/10 05:34:41 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38586: [SPARK-41077][CONNECT][PYTHON][REFACTORING] Rename `ColumnRef` to `Column` in Python client implementation - posted by GitBox <gi...@apache.org> on 2022/11/10 05:37:46 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38593: [SPARK-41089][YARN][SHUFFLE] Relocate Netty native arm64 libs - posted by GitBox <gi...@apache.org> on 2022/11/10 05:40:20 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38583: [SPARK-41092][SQL] Do not use identifier to match interval units - posted by GitBox <gi...@apache.org> on 2022/11/10 05:44:12 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #38557: [SPARK-38959][SQL][FOLLOWUP] Optimizer batch `PartitionPruning` should optimize subqueries - posted by GitBox <gi...@apache.org> on 2022/11/10 05:46:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #38590: [SPARK-40767][BUILD][3.3] Fix Java opts to improve maven compilation speed - posted by GitBox <gi...@apache.org> on 2022/11/10 05:49:59 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #38596: [SPARK-41093][DEPS] Remove netty-tcnative-classes from Spark dependencyList - posted by GitBox <gi...@apache.org> on 2022/11/10 05:55:27 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #38571: [SPARK-37555][TEST][FOLLOWUP] Increase timeout of CLI test `spark-sql should pass last unclosed comment to backend` - posted by GitBox <gi...@apache.org> on 2022/11/10 06:03:26 UTC, 2 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #38596: [SPARK-41093][DEPS] Remove netty-tcnative-classes from Spark dependencyList - posted by GitBox <gi...@apache.org> on 2022/11/10 06:04:44 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38586: [SPARK-41077][CONNECT][PYTHON][REFACTORING] Rename `ColumnRef` to `Column` in Python client implementation - posted by GitBox <gi...@apache.org> on 2022/11/10 06:06:24 UTC, 9 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38589: [SPARK-41087][BUILD] Make `build/mvn` use the same JAVA_OPTS as `dev/make-distribution.sh` - posted by GitBox <gi...@apache.org> on 2022/11/10 06:10:41 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38589: [SPARK-41087][BUILD] Make `build/mvn` use the same JAVA_OPTS as `dev/make-distribution.sh` - posted by GitBox <gi...@apache.org> on 2022/11/10 06:12:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38589: [SPARK-41087][BUILD] Make `build/mvn` use the same JAVA_OPTS as `dev/make-distribution.sh` - posted by GitBox <gi...@apache.org> on 2022/11/10 06:15:25 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/10 06:15:56 UTC, 1 replies.
- [GitHub] [spark-docker] Yikun commented on pull request #23: [SPARK-40519] Add "Publish" workflow to help release apache/spark image - posted by GitBox <gi...@apache.org> on 2022/11/10 06:26:32 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38579: [SPARK-40877][DOC][FOLLOW-UP] Update the doc of `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/10 06:43:53 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38585: [SPARK-41076][BUILD][CONNECT] Upgrade `protobuf` to 3.21.9 - posted by GitBox <gi...@apache.org> on 2022/11/10 06:46:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38546: [SPARK-41036][CONNECT][PYTHON] `columns` API should use `schema` API to avoid data fetching - posted by GitBox <gi...@apache.org> on 2022/11/10 06:47:04 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38541: [SPARK-41034][CONNECT][PYTHON] Connect DataFrame should require a RemoteSparkSession - posted by GitBox <gi...@apache.org> on 2022/11/10 06:48:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38541: [SPARK-41034][CONNECT][PYTHON] Connect DataFrame should require a RemoteSparkSession - posted by GitBox <gi...@apache.org> on 2022/11/10 06:48:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38589: [SPARK-41087][BUILD] Make `build/mvn` use the same JAVA_OPTS as `dev/make-distribution.sh` - posted by GitBox <gi...@apache.org> on 2022/11/10 06:48:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38589: [SPARK-41087][BUILD] Remove duplicate `-Xmx4g` from `dev/make-distribution.sh` and make `build/mvn` use the same JAVA_OPTS - posted by GitBox <gi...@apache.org> on 2022/11/10 06:53:28 UTC, 4 replies.
- [GitHub] [spark] amaliujia commented on pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/10 06:56:21 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38506: [SPARK-41010][CONNECT][PYTHON] Complete Support for Except and Intersect in Python client - posted by GitBox <gi...@apache.org> on 2022/11/10 07:01:00 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38588: [SPARK-41086][SQL] Consolidate SecondArgumentXXX error to INVALID_PARAMETER_VALUE - posted by GitBox <gi...@apache.org> on 2022/11/10 07:01:55 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38576: [SPARK-41062][SQL] Rename `UNSUPPORTED_CORRELATED_REFERENCE` to `CORRELATED_REFERENCE` - posted by GitBox <gi...@apache.org> on 2022/11/10 07:03:45 UTC, 7 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38588: [SPARK-41086][SQL] Consolidate SecondArgumentXXX error to INVALID_PARAMETER_VALUE - posted by GitBox <gi...@apache.org> on 2022/11/10 07:03:55 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38575: [WIP][SPARK-40948][SQL][FOLLOWUP] Restore PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/10 07:08:21 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38572: [SPARK-41059][SQL] Rename `_LEGACY_ERROR_TEMP_2420` to `NESTED_AGGREGATE_FUNCTION` - posted by GitBox <gi...@apache.org> on 2022/11/10 07:09:51 UTC, 2 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38558: [SPARK-41048][SQL] Improve output partitioning and ordering with AQE cache - posted by GitBox <gi...@apache.org> on 2022/11/10 07:10:16 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38544: [SPARK-40815][SQL][FOLLOW-UP] Fix record reader in DelegateSymlinkTextInputFormat to avoid Hive ExecMapper.getDone() check - posted by GitBox <gi...@apache.org> on 2022/11/10 07:27:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38544: [SPARK-40815][SQL][FOLLOW-UP] Fix record reader in DelegateSymlinkTextInputFormat to avoid Hive ExecMapper.getDone() check - posted by GitBox <gi...@apache.org> on 2022/11/10 07:27:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38595: [SPARK-41090][SQL] Enhance Dataset.createTempView testing coverage for `db_name.view_name` - posted by GitBox <gi...@apache.org> on 2022/11/10 07:28:58 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38586: [SPARK-41077][CONNECT][PYTHON][REFACTORING] Rename `ColumnRef` to `Column` in Python client implementation - posted by GitBox <gi...@apache.org> on 2022/11/10 07:35:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38594: [SPARK-40852][CONNECT][PYTHON][FOLLOWUP] Make `Summary` a separate proto plan - posted by GitBox <gi...@apache.org> on 2022/11/10 07:38:23 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38579: [SPARK-40877][DOC][FOLLOW-UP] Update the doc of `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/10 07:40:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38579: [SPARK-40877][DOC][FOLLOW-UP] Update the doc of `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/10 07:42:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38583: [SPARK-41092][SQL] Do not use identifier to match interval units - posted by GitBox <gi...@apache.org> on 2022/11/10 07:43:24 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38594: [SPARK-40852][CONNECT][PYTHON][FOLLOWUP] Make `Summary` a separate proto plan - posted by GitBox <gi...@apache.org> on 2022/11/10 07:44:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38594: [SPARK-40852][CONNECT][PYTHON][FOLLOWUP] Make `Summary` a separate proto plan - posted by GitBox <gi...@apache.org> on 2022/11/10 07:44:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38497: [SPARK-40999] Hint propagation to subqueries - posted by GitBox <gi...@apache.org> on 2022/11/10 07:55:05 UTC, 9 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38597: [SPARK-41034][CONNECT][PYTHON][FOLLOW-UP] Fix python lint - posted by GitBox <gi...@apache.org> on 2022/11/10 08:22:44 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38597: [SPARK-41034][CONNECT][PYTHON][FOLLOW-UP] Fix python lint - posted by GitBox <gi...@apache.org> on 2022/11/10 08:22:54 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38582: [SPARK-41095][SQL] Convert unresolved operators to internal errors - posted by GitBox <gi...@apache.org> on 2022/11/10 08:31:28 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38583: [SPARK-41092][SQL] Do not use identifier to match interval units - posted by GitBox <gi...@apache.org> on 2022/11/10 08:33:50 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38583: [SPARK-41092][SQL] Do not use identifier to match interval units - posted by GitBox <gi...@apache.org> on 2022/11/10 08:36:32 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38572: [SPARK-41059][SQL] Rename `_LEGACY_ERROR_TEMP_2420` to `NESTED_AGGREGATE_FUNCTION` - posted by GitBox <gi...@apache.org> on 2022/11/10 08:42:05 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38597: [SPARK-41034][CONNECT][PYTHON][FOLLOW-UP] Fix mypy annotations test - posted by GitBox <gi...@apache.org> on 2022/11/10 09:04:47 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38598: [SPARK-41097][CORE][SQL] Remove redundant collection conversion based on Scala 2.13 code - posted by GitBox <gi...@apache.org> on 2022/11/10 09:07:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38598: [SPARK-41097][CORE][SQL] Remove redundant collection conversion based on Scala 2.13 code - posted by GitBox <gi...@apache.org> on 2022/11/10 09:08:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38586: [SPARK-41077][CONNECT][PYTHON][REFACTORING] Rename `ColumnRef` to `Column` in Python client implementation - posted by GitBox <gi...@apache.org> on 2022/11/10 09:10:47 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38166: [SPARK-40713][CONNECT] Improve SET operation support in the proto and the server - posted by GitBox <gi...@apache.org> on 2022/11/10 09:18:57 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38597: [SPARK-41034][CONNECT][PYTHON][FOLLOW-UP] Fix mypy annotations test - posted by GitBox <gi...@apache.org> on 2022/11/10 09:24:08 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38597: [SPARK-41034][CONNECT][PYTHON][FOLLOW-UP] Fix mypy annotations test - posted by GitBox <gi...@apache.org> on 2022/11/10 09:25:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38597: [SPARK-41034][CONNECT][PYTHON][FOLLOW-UP] Fix mypy annotations test - posted by GitBox <gi...@apache.org> on 2022/11/10 09:25:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38535: [SPARK-41001] [CONNECT] Make `user_id` optional in SparkRemoteSession. - posted by GitBox <gi...@apache.org> on 2022/11/10 09:28:48 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38599: [SPARK-41063][BUILD] Clean all except files in Git repository before running Mima - posted by GitBox <gi...@apache.org> on 2022/11/10 09:39:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38578: [SPARK-41064][CONNECT][PYTHON] Implement `DataFrame.crosstab` and `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/10 10:16:17 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38589: [SPARK-41087][BUILD] Remove duplicate `-Xmx4g` from `dev/make-distribution.sh` and make `build/mvn` use the same JAVA_OPTS - posted by GitBox <gi...@apache.org> on 2022/11/10 11:03:55 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38574: [SPARK-41060][K8S] Fix generating driver and executor Config Maps - posted by GitBox <gi...@apache.org> on 2022/11/10 11:29:40 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38600: [SPARK-41098][SQL] Rename `GROUP_BY_POS_REFERS_AGG_EXPR` to `GROUP_BY_POS_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/10 12:46:04 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38166: [SPARK-40713][CONNECT] Improve SET operation support in the proto and the server - posted by GitBox <gi...@apache.org> on 2022/11/10 12:49:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38582: [SPARK-41095][SQL] Convert unresolved operators to internal errors - posted by GitBox <gi...@apache.org> on 2022/11/10 13:04:04 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/10 13:05:39 UTC, 9 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #38601: [WIP] Upgrade Ubuntu latest - posted by GitBox <gi...@apache.org> on 2022/11/10 13:51:17 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 opened a new pull request, #38602: [SPARK-41099][CORE] Do not wrap exceptions thrown in SparkHadoopWriter.write - posted by GitBox <gi...@apache.org> on 2022/11/10 13:53:41 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38601: [SPARK-41100][INFRA] Upgrade Ubuntu to latest - posted by GitBox <gi...@apache.org> on 2022/11/10 14:02:11 UTC, 1 replies.
- [GitHub] [spark] Yikun commented on pull request #38601: [SPARK-41100][INFRA] Upgrade Ubuntu to latest - posted by GitBox <gi...@apache.org> on 2022/11/10 14:17:09 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38599: [SPARK-41063][BUILD] Clean all except files in Git repository before running Mima - posted by GitBox <gi...@apache.org> on 2022/11/10 14:33:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38602: [SPARK-41099][CORE] Do not wrap exceptions thrown in SparkHadoopWriter.write - posted by GitBox <gi...@apache.org> on 2022/11/10 14:42:01 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38575: [WIP][SPARK-40948][SQL][FOLLOWUP] Restore PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/10 14:46:48 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #38593: [SPARK-41089][YARN][SHUFFLE] Relocate Netty native arm64 libs - posted by GitBox <gi...@apache.org> on 2022/11/10 14:58:06 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38576: [SPARK-41062][SQL] Rename `UNSUPPORTED_CORRELATED_REFERENCE` to `CORRELATED_REFERENCE` - posted by GitBox <gi...@apache.org> on 2022/11/10 15:13:45 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38569: [SPARK-41055][SQL] Rename `_LEGACY_ERROR_TEMP_2424` to `GROUP_BY_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/10 16:29:40 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38569: [SPARK-41055][SQL] Rename `_LEGACY_ERROR_TEMP_2424` to `GROUP_BY_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/10 16:30:49 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38600: [SPARK-41098][SQL] Rename `GROUP_BY_POS_REFERS_AGG_EXPR` to `GROUP_BY_POS_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/10 16:36:40 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on pull request #38596: [SPARK-41093][BUILD] Remove netty-tcnative-classes from Spark dependencyList - posted by GitBox <gi...@apache.org> on 2022/11/10 16:38:01 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38582: [SPARK-41095][SQL] Convert unresolved operators to internal errors - posted by GitBox <gi...@apache.org> on 2022/11/10 16:48:27 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #38596: [SPARK-41093][BUILD] Remove netty-tcnative-classes from Spark dependencyList - posted by GitBox <gi...@apache.org> on 2022/11/10 17:10:33 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38568: [SSPARK-41051][CORE] Optimize ProcfsMetrics file acquisition - posted by GitBox <gi...@apache.org> on 2022/11/10 17:18:41 UTC, 0 replies.
- [GitHub] [spark] Tagar commented on pull request #38584: [SPARK-40281][PYTHON] Memory Profiler on Executors - posted by GitBox <gi...@apache.org> on 2022/11/10 17:48:12 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38594: [SPARK-40852][CONNECT][PYTHON][FOLLOWUP] Make `Summary` a separate proto plan - posted by GitBox <gi...@apache.org> on 2022/11/10 18:10:45 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38578: [SPARK-41064][CONNECT][PYTHON] Implement `DataFrame.crosstab` and `DataFrame.stat.crosstab` - posted by GitBox <gi...@apache.org> on 2022/11/10 18:13:00 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38597: [SPARK-41034][CONNECT][PYTHON][FOLLOW-UP] Fix mypy annotations test - posted by GitBox <gi...@apache.org> on 2022/11/10 18:21:34 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38602: [SPARK-41099][CORE] Do not wrap exceptions thrown in SparkHadoopWriter.write - posted by GitBox <gi...@apache.org> on 2022/11/10 18:53:05 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38595: [SPARK-41090][SQL] Fix view not found issue for `db_name.view_name` - posted by GitBox <gi...@apache.org> on 2022/11/10 18:58:49 UTC, 1 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #38584: [SPARK-40281][PYTHON] Memory Profiler on Executors - posted by GitBox <gi...@apache.org> on 2022/11/10 19:07:19 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38566: [SPARK-41046][CONNECT] Support CreateView in Connect DSL - posted by GitBox <gi...@apache.org> on 2022/11/10 19:16:23 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38560: [WIP][SPARK-38005][core] Support cleaning up merged shuffle files and state from external shuffle service - posted by GitBox <gi...@apache.org> on 2022/11/10 19:16:26 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #38584: [SPARK-40281][PYTHON] Memory Profiler on Executors - posted by GitBox <gi...@apache.org> on 2022/11/10 19:17:50 UTC, 0 replies.
- [GitHub] [spark] SandishKumarHN opened a new pull request, #38603: [SPARK-41101][PYTHON][PROTOBUF] Message classname support for PYSPARK-PROTOBUF - posted by GitBox <gi...@apache.org> on 2022/11/10 19:56:50 UTC, 0 replies.
- [GitHub] [spark] ljfgem commented on pull request #37556: [SPARK-39799][SQL] DataSourceV2: View catalog interface - posted by GitBox <gi...@apache.org> on 2022/11/10 20:23:20 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38604: [SPARK-41102][CONNECT][REFACTORING] Merge SparkConnectPlanner and SparkConnectCommandPlanner. - posted by GitBox <gi...@apache.org> on 2022/11/10 21:05:26 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38604: [SPARK-41102][CONNECT][REFACTORING] Merge SparkConnectPlanner and SparkConnectCommandPlanner. - posted by GitBox <gi...@apache.org> on 2022/11/10 21:05:31 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38605: [SPARK-41103][CONNECT][DOC] Document how to add a new proto field of messages - posted by GitBox <gi...@apache.org> on 2022/11/10 21:45:30 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #38605: [SPARK-41103][CONNECT][DOC] Document how to add a new proto field of messages - posted by GitBox <gi...@apache.org> on 2022/11/10 21:45:58 UTC, 4 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38605: [SPARK-41103][CONNECT][DOC] Document how to add a new proto field of messages - posted by GitBox <gi...@apache.org> on 2022/11/10 21:55:43 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38605: [SPARK-41103][CONNECT][DOC] Document how to add a new proto field of messages - posted by GitBox <gi...@apache.org> on 2022/11/10 22:02:59 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38605: [SPARK-41103][CONNECT][DOC] Document how to add a new proto field of messages - posted by GitBox <gi...@apache.org> on 2022/11/10 22:03:54 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38535: [SPARK-41001] [CONNECT] Make `user_id` optional in SparkRemoteSession. - posted by GitBox <gi...@apache.org> on 2022/11/10 22:04:31 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38544: [SPARK-40815][SQL][FOLLOW-UP] Fix record reader in DelegateSymlinkTextInputFormat to avoid Hive ExecMapper.getDone() check - posted by GitBox <gi...@apache.org> on 2022/11/10 23:13:28 UTC, 0 replies.
- [GitHub] [spark] amaliujia closed pull request #38605: [SPARK-41103][CONNECT][DOC] Document how to add a new proto field of messages - posted by GitBox <gi...@apache.org> on 2022/11/10 23:15:45 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38606: [SPARK-41105][CONNECT] Adopt `optional` keyword from proto3 which offers `hasXXX` to differentiate if a field is set or unset - posted by GitBox <gi...@apache.org> on 2022/11/10 23:37:49 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38606: [SPARK-41105][CONNECT] Adopt `optional` keyword from proto3 which offers `hasXXX` to differentiate if a field is set or unset - posted by GitBox <gi...@apache.org> on 2022/11/10 23:38:24 UTC, 2 replies.
- [GitHub] [spark] asfgit closed pull request #38377: [SPARK-40901][CORE] Unable to store Spark Driver logs with Absolute Hadoop based URI FS Path - posted by GitBox <gi...@apache.org> on 2022/11/11 00:16:58 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37365: [SPARK-39938][PYTHON][PS] Accept all inputs of prefix/suffix which implement __str__ in add_predix/add_suffix - posted by GitBox <gi...@apache.org> on 2022/11/11 00:25:12 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37355: [SPARK-39930][SQL] Introduce Cache Hints - posted by GitBox <gi...@apache.org> on 2022/11/11 00:25:13 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37234: [SPARK-39822][PYTHON][PS] Provide a good feedback to users - posted by GitBox <gi...@apache.org> on 2022/11/11 00:25:15 UTC, 0 replies.
- [GitHub] [spark] sigmod commented on pull request #38511: [SPARK-41017][SQL] Support column pruning with multiple nondeterministic Filters - posted by GitBox <gi...@apache.org> on 2022/11/11 00:35:17 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38607: [SPARK-40938][CONNECT][PYTHON][FOLLOW-UP] Fix SubqueryAlias without the child plan when constructing Connect proto in the Python client - posted by GitBox <gi...@apache.org> on 2022/11/11 00:39:30 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #38608: [SPARK-41080][SQL] Support Bit manipulation function SETBIT - posted by GitBox <gi...@apache.org> on 2022/11/11 00:42:51 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38333: [SPARK-40872] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/11 00:45:15 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/11 01:21:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38468: [SPARK-41005][CONNECT][PYTHON] Arrow-based collect - posted by GitBox <gi...@apache.org> on 2022/11/11 01:21:54 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38606: [SPARK-41105][CONNECT] Adopt `optional` keyword from proto3 which offers `hasXXX` to differentiate if a field is set or unset - posted by GitBox <gi...@apache.org> on 2022/11/11 01:27:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38599: [SPARK-41063][BUILD] Clean all except files in Git repository before running Mima - posted by GitBox <gi...@apache.org> on 2022/11/11 01:40:23 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38599: [SPARK-41063][BUILD] Clean all except files in Git repository before running Mima - posted by GitBox <gi...@apache.org> on 2022/11/11 01:40:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38091: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Fix flaky test case - posted by GitBox <gi...@apache.org> on 2022/11/11 01:42:46 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38586: [SPARK-41077][CONNECT][PYTHON][REFACTORING] Rename `ColumnRef` to `Column` in Python client implementation - posted by GitBox <gi...@apache.org> on 2022/11/11 02:03:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38609: [SPARK-40593][CONNECT] Add profile to make user can specify custom `protocExecutable` and `pluginExecutable` when building connect module - posted by GitBox <gi...@apache.org> on 2022/11/11 02:24:58 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #33636: [SPARK-36414][SQL] Disable timeout for BroadcastQueryStageExec in AQE - posted by GitBox <gi...@apache.org> on 2022/11/11 02:31:36 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #33636: [SPARK-36414][SQL] Disable timeout for BroadcastQueryStageExec in AQE - posted by GitBox <gi...@apache.org> on 2022/11/11 02:33:46 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38609: [WIP][SPARK-40593][CONNECT] Add profile to make user can specify custom `protocExecutable` and `pluginExecutable` when building connect module - posted by GitBox <gi...@apache.org> on 2022/11/11 02:37:20 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38404: [SPARK-40956] SQL Equivalent for Dataframe overwrite command - posted by GitBox <gi...@apache.org> on 2022/11/11 02:38:26 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38595: [SPARK-41090][SQL] Fix view not found issue for `db_name.view_name` - posted by GitBox <gi...@apache.org> on 2022/11/11 02:53:27 UTC, 3 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38610: [SPARK-41106][SQL] Reduce collection conversion when create AttributeMap - posted by GitBox <gi...@apache.org> on 2022/11/11 02:57:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38584: [SPARK-40281][PYTHON] Memory Profiler on Executors - posted by GitBox <gi...@apache.org> on 2022/11/11 02:58:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38584: [SPARK-40281][PYTHON] Memory Profiler on Executors - posted by GitBox <gi...@apache.org> on 2022/11/11 02:59:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38535: [SPARK-41001][CONNECT] Make `userId` optional in SparkRemoteSession - posted by GitBox <gi...@apache.org> on 2022/11/11 03:00:45 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38535: [SPARK-41001][CONNECT] Make `userId` optional in SparkRemoteSession - posted by GitBox <gi...@apache.org> on 2022/11/11 03:01:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38610: [SPARK-41106][SQL] Reduce collection conversion when create AttributeMap - posted by GitBox <gi...@apache.org> on 2022/11/11 03:03:13 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38606: [SPARK-41105][CONNECT] Adopt `optional` keyword from proto3 which offers `hasXXX` to differentiate if a field is set or unset - posted by GitBox <gi...@apache.org> on 2022/11/11 03:04:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38606: [SPARK-41105][CONNECT] Adopt `optional` keyword from proto3 which offers `hasXXX` to differentiate if a field is set or unset - posted by GitBox <gi...@apache.org> on 2022/11/11 03:04:43 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 commented on pull request #38602: [SPARK-41099][CORE] Do not wrap exceptions thrown in SparkHadoopWriter.write - posted by GitBox <gi...@apache.org> on 2022/11/11 03:08:46 UTC, 0 replies.
- [GitHub] [spark] bozhang2820 closed pull request #38602: [SPARK-41099][CORE] Do not wrap exceptions thrown in SparkHadoopWriter.write - posted by GitBox <gi...@apache.org> on 2022/11/11 03:08:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38603: [SPARK-41101][PYTHON][PROTOBUF] Message classname support for PYSPARK-PROTOBUF - posted by GitBox <gi...@apache.org> on 2022/11/11 03:09:29 UTC, 5 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38606: [SPARK-41105][CONNECT] Adopt `optional` keyword from proto3 which offers `hasXXX` to differentiate if a field is set or unset - posted by GitBox <gi...@apache.org> on 2022/11/11 03:10:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38604: [SPARK-41102][CONNECT][REFACTORING] Merge SparkConnectPlanner and SparkConnectCommandPlanner - posted by GitBox <gi...@apache.org> on 2022/11/11 03:13:59 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38607: [SPARK-40938][CONNECT][PYTHON][FOLLOW-UP] Fix SubqueryAlias without the child plan when constructing Connect proto in the Python client - posted by GitBox <gi...@apache.org> on 2022/11/11 03:14:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38609: [WIP][SPARK-40593][CONNECT] Add profile to make user can specify custom `protocExecutable` and `pluginExecutable` when building connect module - posted by GitBox <gi...@apache.org> on 2022/11/11 03:20:08 UTC, 1 replies.
- [GitHub] [spark] SandishKumarHN commented on a diff in pull request #38603: [SPARK-41101][PYTHON][PROTOBUF] Message classname support for PYSPARK-PROTOBUF - posted by GitBox <gi...@apache.org> on 2022/11/11 03:21:10 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38546: [SPARK-41036][CONNECT][PYTHON] `columns` API should use `schema` API to avoid data fetching - posted by GitBox <gi...@apache.org> on 2022/11/11 03:22:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38546: [SPARK-41036][CONNECT][PYTHON] `columns` API should use `schema` API to avoid data fetching - posted by GitBox <gi...@apache.org> on 2022/11/11 03:22:35 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #38608: [SPARK-41080][SQL] Support Bit manipulation function SETBIT - posted by GitBox <gi...@apache.org> on 2022/11/11 03:25:31 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #38611: [SPARK-41107] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/11 03:32:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38612: [SPARK-41108][CONNECT] Control the max size of arrow batch - posted by GitBox <gi...@apache.org> on 2022/11/11 03:44:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38612: [SPARK-41108][CONNECT] Control the max size of arrow batch - posted by GitBox <gi...@apache.org> on 2022/11/11 03:50:43 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38595: [SPARK-41090][SQL] Fix view not found issue for `db_name.view_name` - posted by GitBox <gi...@apache.org> on 2022/11/11 04:11:45 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38613: [SPARK-41005][CONNECT][PYTHON][FOLLOW-UP] Fetch/send partitions in parallel for Arrow based collect - posted by GitBox <gi...@apache.org> on 2022/11/11 04:22:50 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38604: [SPARK-41102][CONNECT] Merge SparkConnectPlanner and SparkConnectCommandPlanner - posted by GitBox <gi...@apache.org> on 2022/11/11 04:25:36 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38613: [SPARK-41005][CONNECT][PYTHON][FOLLOW-UP] Fetch/send partitions in parallel for Arrow based collect - posted by GitBox <gi...@apache.org> on 2022/11/11 04:37:07 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38614: [SPARK-41005][CONNECT][FOLLOWUP] Collect should use `submitJob` instead of `runJob` - posted by GitBox <gi...@apache.org> on 2022/11/11 04:38:20 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38614: [SPARK-41005][CONNECT][FOLLOWUP] Collect should use `submitJob` instead of `runJob` - posted by GitBox <gi...@apache.org> on 2022/11/11 04:38:51 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38613: [SPARK-41005][CONNECT][PYTHON][FOLLOW-UP] Fetch/send partitions in parallel for Arrow based collect - posted by GitBox <gi...@apache.org> on 2022/11/11 04:40:37 UTC, 7 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38607: [SPARK-40938][CONNECT][PYTHON][FOLLOW-UP] Fix SubqueryAlias without the child plan when constructing Connect proto in the Python client - posted by GitBox <gi...@apache.org> on 2022/11/11 04:48:48 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38613: [SPARK-41005][CONNECT][PYTHON][FOLLOW-UP] Fetch/send partitions in parallel for Arrow based collect - posted by GitBox <gi...@apache.org> on 2022/11/11 04:49:38 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38613: [SPARK-41005][CONNECT][PYTHON][FOLLOW-UP] Fetch/send partitions in parallel for Arrow based collect - posted by GitBox <gi...@apache.org> on 2022/11/11 05:00:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38613: [SPARK-41005][CONNECT][PYTHON][FOLLOW-UP] Fetch/send partitions in parallel for Arrow based collect - posted by GitBox <gi...@apache.org> on 2022/11/11 05:01:07 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38604: [SPARK-41102][CONNECT] Merge SparkConnectPlanner and SparkConnectCommandPlanner - posted by GitBox <gi...@apache.org> on 2022/11/11 05:10:36 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38612: [SPARK-41108][CONNECT] Control the max size of arrow batch - posted by GitBox <gi...@apache.org> on 2022/11/11 05:11:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38614: [SPARK-41005][CONNECT][FOLLOWUP] Collect should use `submitJob` instead of `runJob` - posted by GitBox <gi...@apache.org> on 2022/11/11 05:12:41 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38614: [SPARK-41005][CONNECT][FOLLOWUP] Collect should use `submitJob` instead of `runJob` - posted by GitBox <gi...@apache.org> on 2022/11/11 05:13:15 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38615: [SPARK-41109][SQL] Rename the error class _LEGACY_ERROR_TEMP_1216 to INVALID_LIKE_PATTERN - posted by GitBox <gi...@apache.org> on 2022/11/11 05:28:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38612: [SPARK-41108][CONNECT] Control the max size of arrow batch - posted by GitBox <gi...@apache.org> on 2022/11/11 05:35:20 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38612: [SPARK-41108][CONNECT] Control the max size of arrow batch - posted by GitBox <gi...@apache.org> on 2022/11/11 05:35:42 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38616: [SPARK-41110][CONNECT][PYTHON] Implement `DataFrame.sparkSession` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/11 05:41:32 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #38603: [SPARK-41101][PYTHON][PROTOBUF] Message classname support for PYSPARK-PROTOBUF - posted by GitBox <gi...@apache.org> on 2022/11/11 05:42:44 UTC, 2 replies.
- [GitHub] [spark] mridulm opened a new pull request, #38617: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Fix flaky test case - posted by GitBox <gi...@apache.org> on 2022/11/11 06:01:12 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38617: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Fix flaky test case - posted by GitBox <gi...@apache.org> on 2022/11/11 06:02:11 UTC, 4 replies.
- [GitHub] [spark] mridulm commented on pull request #38617: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Fix flaky test case - posted by GitBox <gi...@apache.org> on 2022/11/11 06:06:35 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38609: [WIP][SPARK-40593][CONNECT] Add profile to make user can specify custom `protocExecutable` and `pluginExecutable` when building connect module - posted by GitBox <gi...@apache.org> on 2022/11/11 06:28:04 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38555: [SPARK-41044][SQL] Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to INTERNAL_ERROR - posted by GitBox <gi...@apache.org> on 2022/11/11 06:32:54 UTC, 3 replies.
- [GitHub] [spark] MaxGekk closed pull request #38572: [SPARK-41059][SQL] Rename `_LEGACY_ERROR_TEMP_2420` to `NESTED_AGGREGATE_FUNCTION` - posted by GitBox <gi...@apache.org> on 2022/11/11 06:34:39 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38582: [SPARK-41095][SQL] Convert unresolved operators to internal errors - posted by GitBox <gi...@apache.org> on 2022/11/11 06:37:20 UTC, 0 replies.
- [GitHub] [spark] beatbull commented on pull request #33828: [SPARK-36579][CORE][SQL] Make spark source stagingDir can be customized - posted by GitBox <gi...@apache.org> on 2022/11/11 06:54:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38618: [SPARK-41108][SPARK-41005][CONNECT][FOLLOW-UP] Deduplicate ArrowConverters codes - posted by GitBox <gi...@apache.org> on 2022/11/11 07:06:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38616: [SPARK-41110][CONNECT][PYTHON] Implement `DataFrame.sparkSession` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/11 07:11:13 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on pull request #38064: [SPARK-40622][SQL][CORE]Result of a single task in collect() must fit in 2GB - posted by GitBox <gi...@apache.org> on 2022/11/11 07:16:21 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #38619: [SPARK-41112][SQL] RuntimeFilter should apply ColumnPruning eagerly with in-subquery filter - posted by GitBox <gi...@apache.org> on 2022/11/11 07:26:00 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38619: [SPARK-41112][SQL] RuntimeFilter should apply ColumnPruning eagerly with in-subquery filter - posted by GitBox <gi...@apache.org> on 2022/11/11 07:29:50 UTC, 2 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38619: [SPARK-41112][SQL] RuntimeFilter should apply ColumnPruning eagerly with in-subquery filter - posted by GitBox <gi...@apache.org> on 2022/11/11 07:30:45 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38620: [SPARK-41113][BUILD] Upgrade sbt to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/11/11 07:32:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38620: [SPARK-41113][BUILD] Upgrade sbt to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/11/11 07:32:49 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38604: [SPARK-41102][CONNECT] Merge SparkConnectPlanner and SparkConnectCommandPlanner - posted by GitBox <gi...@apache.org> on 2022/11/11 07:48:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38604: [SPARK-41102][CONNECT] Merge SparkConnectPlanner and SparkConnectCommandPlanner - posted by GitBox <gi...@apache.org> on 2022/11/11 07:48:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38621: [SPARK-41111][CONNECT][PYTHON] Implement `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2022/11/11 08:04:07 UTC, 0 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #38622: [SPARK-39601][YARN] AllocationFailure should not be treated as exitCausedByApp when driver is shutting down - posted by GitBox <gi...@apache.org> on 2022/11/11 08:30:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38621: [SPARK-41111][CONNECT][PYTHON] Implement `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2022/11/11 08:58:43 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38619: [SPARK-41112][SQL] RuntimeFilter should apply ColumnPruning eagerly with in-subquery filter - posted by GitBox <gi...@apache.org> on 2022/11/11 09:13:50 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #38615: [SPARK-41109][SQL] Rename the error class _LEGACY_ERROR_TEMP_1216 to INVALID_LIKE_PATTERN - posted by GitBox <gi...@apache.org> on 2022/11/11 09:15:26 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38535: [SPARK-41001][CONNECT] Make `userId` optional in SparkRemoteSession - posted by GitBox <gi...@apache.org> on 2022/11/11 09:24:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38555: [SPARK-41044][SQL] Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to INTERNAL_ERROR - posted by GitBox <gi...@apache.org> on 2022/11/11 09:26:36 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38555: [SPARK-41044][SQL] Convert DATATYPE_MISMATCH.UNSPECIFIED_FRAME to INTERNAL_ERROR - posted by GitBox <gi...@apache.org> on 2022/11/11 09:27:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38616: [SPARK-41110][CONNECT][PYTHON] Implement `DataFrame.sparkSession` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/11 09:30:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38616: [SPARK-41110][CONNECT][PYTHON] Implement `DataFrame.sparkSession` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/11 09:30:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38618: [SPARK-41108][SPARK-41005][CONNECT][FOLLOW-UP] Deduplicate ArrowConverters codes - posted by GitBox <gi...@apache.org> on 2022/11/11 10:12:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38618: [SPARK-41108][SPARK-41005][CONNECT][FOLLOW-UP] Deduplicate ArrowConverters codes - posted by GitBox <gi...@apache.org> on 2022/11/11 10:12:17 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #38622: [SPARK-39601][YARN] AllocationFailure should not be treated as exitCausedByApp when driver is shutting down - posted by GitBox <gi...@apache.org> on 2022/11/11 10:53:39 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38617: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Fix flaky test case - posted by GitBox <gi...@apache.org> on 2022/11/11 10:58:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38617: [SPARK-40096][CORE][TESTS][FOLLOW-UP] Fix flaky test case - posted by GitBox <gi...@apache.org> on 2022/11/11 10:58:35 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38623: [WIP][SPARK-41072][SQL] Add the error class `STREAM_FAILED` - posted by GitBox <gi...@apache.org> on 2022/11/11 11:11:34 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #38613: [SPARK-41005][CONNECT][PYTHON][FOLLOW-UP] Fetch/send partitions in parallel for Arrow based collect - posted by GitBox <gi...@apache.org> on 2022/11/11 11:14:07 UTC, 5 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38615: [SPARK-41109][SQL] Rename the error class _LEGACY_ERROR_TEMP_1216 to INVALID_LIKE_PATTERN - posted by GitBox <gi...@apache.org> on 2022/11/11 11:56:56 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38613: [SPARK-41005][CONNECT][PYTHON][FOLLOW-UP] Fetch/send partitions in parallel for Arrow based collect - posted by GitBox <gi...@apache.org> on 2022/11/11 12:11:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38614: [SPARK-41005][CONNECT][FOLLOWUP] Collect should use `submitJob` instead of `runJob` - posted by GitBox <gi...@apache.org> on 2022/11/11 12:18:17 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by GitBox <gi...@apache.org> on 2022/11/11 12:22:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38507: [SPARK-40372][SQL] Migrate failures of array type checks onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/11 12:47:03 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #38495: [SPARK-35531][SQL] Update hive table stats without unnecessary convert - posted by GitBox <gi...@apache.org> on 2022/11/11 12:56:22 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38507: [SPARK-40372][SQL] Migrate failures of array type checks onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/11 13:12:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38621: [SPARK-41111][CONNECT][PYTHON] Implement `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2022/11/11 13:13:13 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38531: [SPARK-40755][SQL] Migrate type check failures of number formatting onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/11 13:26:34 UTC, 6 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38531: [SPARK-40755][SQL] Migrate type check failures of number formatting onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/11 13:28:00 UTC, 1 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #38495: [SPARK-35531][SQL] Update hive table stats without unnecessary convert - posted by GitBox <gi...@apache.org> on 2022/11/11 13:40:16 UTC, 7 replies.
- [GitHub] [spark] srowen closed pull request #38596: [SPARK-41093][BUILD] Remove netty-tcnative-classes from Spark dependencyList - posted by GitBox <gi...@apache.org> on 2022/11/11 14:15:47 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38624: [SPARK-40559][PYTHON] Add applyInArrow to groupBy and cogroup - posted by GitBox <gi...@apache.org> on 2022/11/11 14:50:48 UTC, 0 replies.
- [GitHub] [spark] deepyaman opened a new pull request, #38625: Update generic.py - posted by GitBox <gi...@apache.org> on 2022/11/11 15:30:30 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38531: [SPARK-40755][SQL] Migrate type check failures of number formatting onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/11 15:33:02 UTC, 3 replies.
- [GitHub] [spark] deepyaman commented on pull request #38625: [PYTHON][PS] Fix the `.groupby()` method docstring - posted by GitBox <gi...@apache.org> on 2022/11/11 15:47:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38626: [SPARK-38959][SQL][FOLLOWUP] Do not optimize subqueries twice - posted by GitBox <gi...@apache.org> on 2022/11/11 16:17:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38626: [SPARK-38959][SQL][FOLLOWUP] Do not optimize subqueries twice - posted by GitBox <gi...@apache.org> on 2022/11/11 16:17:39 UTC, 2 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38622: [SPARK-39601][YARN] AllocationFailure should not be treated as exitCausedByApp when driver is shutting down - posted by GitBox <gi...@apache.org> on 2022/11/11 16:43:32 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38615: [SPARK-41109][SQL] Rename the error class _LEGACY_ERROR_TEMP_1216 to INVALID_LIKE_PATTERN - posted by GitBox <gi...@apache.org> on 2022/11/11 17:06:05 UTC, 0 replies.
- [GitHub] [spark] 19Serhii99 commented on pull request #38574: [SPARK-41060][K8S] Fix generating driver and executor Config Maps - posted by GitBox <gi...@apache.org> on 2022/11/11 17:08:15 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38627: [SPARK-40875] [CONNECT] [FOLLOW] Retain Group expressions in aggregate. - posted by GitBox <gi...@apache.org> on 2022/11/11 17:22:56 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38627: [SPARK-40875] [CONNECT] [FOLLOW] Retain Group expressions in aggregate. - posted by GitBox <gi...@apache.org> on 2022/11/11 17:23:18 UTC, 0 replies.
- [GitHub] [spark] carlfu-db commented on a diff in pull request #38404: [SPARK-40956] SQL Equivalent for Dataframe overwrite command - posted by GitBox <gi...@apache.org> on 2022/11/11 18:37:50 UTC, 2 replies.
- [GitHub] [spark] kazuyukitanimura opened a new pull request, #38628: [SPARK-41096][SQL] Support reading parquet FIXED_LEN_BYTE_ARRAY type - posted by GitBox <gi...@apache.org> on 2022/11/11 18:49:08 UTC, 0 replies.
- [GitHub] [spark] kazuyukitanimura commented on pull request #38628: [SPARK-41096][SQL] Support reading parquet FIXED_LEN_BYTE_ARRAY type - posted by GitBox <gi...@apache.org> on 2022/11/11 18:51:02 UTC, 2 replies.
- [GitHub] [spark] ueshin commented on a diff in pull request #38611: [SPARK-41107][PYTHON][INFRA][TEST] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/11 18:51:16 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38621: [SPARK-41111][CONNECT][PYTHON] Implement `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2022/11/11 18:52:31 UTC, 6 replies.
- [GitHub] [spark] zsxwing commented on a diff in pull request #38623: [WIP][SPARK-41072][SQL][SS] Add the error class `STREAM_FAILED` - posted by GitBox <gi...@apache.org> on 2022/11/11 19:04:02 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38629: [WIP][SS] Add the error class `STREAM_FAILED` to `StreamingQueryException` - posted by GitBox <gi...@apache.org> on 2022/11/11 20:33:15 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38623: [WIP][SPARK-41072][SQL][SS] Add the error class `STREAM_FAILED` - posted by GitBox <gi...@apache.org> on 2022/11/11 20:38:22 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38630: [SPARK-41115][CONNECT] Add ClientType Enum to proto to indicate which client sends a request - posted by GitBox <gi...@apache.org> on 2022/11/11 20:49:34 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38630: [SPARK-41115][CONNECT] Add ClientType Enum to proto to indicate which client sends a request - posted by GitBox <gi...@apache.org> on 2022/11/11 20:50:17 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38630: [SPARK-41115][CONNECT] Add ClientType Enum to proto to indicate which client sends a request - posted by GitBox <gi...@apache.org> on 2022/11/11 20:56:49 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38631: [SPARK-40809] [CONNECT] [FOLLOW] - posted by GitBox <gi...@apache.org> on 2022/11/11 21:05:15 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38631: [SPARK-40809] [CONNECT] [FOLLOW] Support `alias()` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/11 21:08:13 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38630: [SPARK-41115][CONNECT] Add ClientType Enum to proto to indicate which client sends a request - posted by GitBox <gi...@apache.org> on 2022/11/11 21:12:50 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38631: [SPARK-40809] [CONNECT] [FOLLOW] Support `alias()` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/11 21:15:11 UTC, 3 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38607: [SPARK-40938][CONNECT][PYTHON][FOLLOW-UP] Fix SubqueryAlias without the child plan when constructing Connect proto in the Python client - posted by GitBox <gi...@apache.org> on 2022/11/11 21:15:56 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38631: [SPARK-40809] [CONNECT] [FOLLOW] Support `alias()` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/11 21:19:41 UTC, 15 replies.
- [GitHub] [spark] felipepessoto commented on pull request #37616: [SPARK-40178][PYTHON][SQL] Fix partitioning hint parameters in PySpark - posted by GitBox <gi...@apache.org> on 2022/11/11 22:03:20 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38615: [SPARK-41109][SQL] Rename the error class _LEGACY_ERROR_TEMP_1216 to INVALID_LIKE_PATTERN - posted by GitBox <gi...@apache.org> on 2022/11/11 22:31:11 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38611: [SPARK-41107][PYTHON][INFRA][TEST] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/11 22:31:15 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #38611: [SPARK-41107][PYTHON][INFRA][TEST] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/11 22:51:20 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38632: [SPARK-41116][CONNECT] Input relation can be optional for Project in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/11 23:39:10 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38632: [SPARK-41116][CONNECT] Input relation can be optional for Project in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/11 23:40:40 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37365: [SPARK-39938][PYTHON][PS] Accept all inputs of prefix/suffix which implement __str__ in add_prefix/add_suffix - posted by GitBox <gi...@apache.org> on 2022/11/12 00:24:16 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37346: [SPARK-37210][CORE][SQL] Allow forced use of staging directory - posted by GitBox <gi...@apache.org> on 2022/11/12 00:24:17 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37355: [SPARK-39930][SQL] Introduce Cache Hints - posted by GitBox <gi...@apache.org> on 2022/11/12 00:24:17 UTC, 1 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38626: [SPARK-38959][SQL][FOLLOWUP] Do not optimize subqueries twice - posted by GitBox <gi...@apache.org> on 2022/11/12 02:14:40 UTC, 3 replies.
- [GitHub] [spark] Dam1029 commented on pull request #38518: [SPARK-33349][K8S] Reset the executor pods watcher when we receive a version changed from k8s - posted by GitBox <gi...@apache.org> on 2022/11/12 03:54:18 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38607: [SPARK-40938][CONNECT][PYTHON][FOLLOW-UP] Fix SubqueryAlias without the child plan when constructing Connect proto in the Python client - posted by GitBox <gi...@apache.org> on 2022/11/12 04:19:21 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38603: [SPARK-41101][PYTHON][PROTOBUF] Message classname support for PYSPARK-PROTOBUF - posted by GitBox <gi...@apache.org> on 2022/11/12 04:19:24 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38601: [WIP][SPARK-41100][INFRA] Upgrade Ubuntu to latest - posted by GitBox <gi...@apache.org> on 2022/11/12 04:19:27 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38595: [SPARK-41090][SQL] Throw Exception for `db_name.view_name` when creating temp view - posted by GitBox <gi...@apache.org> on 2022/11/12 08:12:31 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38633: [SPARK-41117][BUILD] Upgrade sbt from 1.7.3 to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/11/12 08:14:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38622: [SPARK-39601][YARN] AllocationFailure should not be treated as exitCausedByApp when driver is shutting down - posted by GitBox <gi...@apache.org> on 2022/11/12 11:59:24 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38531: [SPARK-40755][SQL] Migrate type check failures of number formatting onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/12 12:01:13 UTC, 7 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38588: [SPARK-41086][SQL] Consolidate SecondArgumentXXX error to INVALID_PARAMETER_VALUE - posted by GitBox <gi...@apache.org> on 2022/11/12 12:04:46 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38633: [SPARK-41117][BUILD] Upgrade sbt from 1.7.3 to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/11/12 14:22:20 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38633: [SPARK-41117][BUILD] Upgrade sbt from 1.7.3 to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/11/12 14:39:33 UTC, 0 replies.
- [GitHub] [spark] panbingkun closed pull request #38633: [SPARK-41117][BUILD] Upgrade sbt from 1.7.3 to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/11/12 14:39:51 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38634: [WIP] [CONNECT] Support Cast and DataTypes as Literals - posted by GitBox <gi...@apache.org> on 2022/11/12 16:32:46 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38634: [WIP] [CONNECT] Support Cast and DataTypes as Literals - posted by GitBox <gi...@apache.org> on 2022/11/12 16:34:59 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #38635: [SPARK-41118][SQL] `to_number`/`try_to_number` should return `null` when format is `null` - posted by GitBox <gi...@apache.org> on 2022/11/12 19:33:24 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on a diff in pull request #38635: [SPARK-41118][SQL] `to_number`/`try_to_number` should return `null` when format is `null` - posted by GitBox <gi...@apache.org> on 2022/11/12 19:40:24 UTC, 3 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38636: [SPARK-41120][BUILD] Upgrade joda-time from 2.12.0 to 2.12.1 - posted by GitBox <gi...@apache.org> on 2022/11/13 00:18:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37346: [SPARK-37210][CORE][SQL] Allow forced use of staging directory - posted by GitBox <gi...@apache.org> on 2022/11/13 00:21:07 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36741: [SPARK-39357][SQL] Fix pmCache memory leak caused by IsolatedClassLoader - posted by GitBox <gi...@apache.org> on 2022/11/13 00:21:08 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36552: [SPARK-38506][SQL] Push partial aggregation through join - posted by GitBox <gi...@apache.org> on 2022/11/13 00:21:09 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38637: [SPARK-41121][BUILD] Upgrade sbt-assembly from 1.2.0 to 2.0.0 - posted by GitBox <gi...@apache.org> on 2022/11/13 00:44:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38600: [SPARK-41098][SQL] Rename `GROUP_BY_POS_REFERS_AGG_EXPR` to `GROUP_BY_POS_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/13 01:29:44 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38637: [SPARK-41121][BUILD] Upgrade sbt-assembly from 1.2.0 to 2.0.0 - posted by GitBox <gi...@apache.org> on 2022/11/13 01:51:40 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38636: [SPARK-41120][BUILD] Upgrade joda-time from 2.12.0 to 2.12.1 - posted by GitBox <gi...@apache.org> on 2022/11/13 01:51:43 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38634: [WIP] [CONNECT] Support Cast and DataTypes as Literals - posted by GitBox <gi...@apache.org> on 2022/11/13 03:45:52 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38632: [SPARK-41116][CONNECT] Input relation can be optional for Project in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/13 03:45:55 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38631: [SPARK-40809] [CONNECT] [FOLLOW] Support `alias()` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/13 03:45:57 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38630: [SPARK-41115][CONNECT] Add ClientType Enum to proto to indicate which client sends a request - posted by GitBox <gi...@apache.org> on 2022/11/13 03:46:00 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #38441: [SPARK-40979][CORE] Keep removed executor info due to decommission - posted by GitBox <gi...@apache.org> on 2022/11/13 04:22:30 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38638: [SPARK-41122][CONNECT] Explain API can support different modes - posted by GitBox <gi...@apache.org> on 2022/11/13 06:07:34 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38638: [SPARK-41122][CONNECT] Explain API can support different modes - posted by GitBox <gi...@apache.org> on 2022/11/13 06:08:01 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #38441: [SPARK-40979][CORE] Keep removed executor info due to decommission - posted by GitBox <gi...@apache.org> on 2022/11/13 07:27:51 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38441: [SPARK-40979][CORE] Keep removed executor info due to decommission - posted by GitBox <gi...@apache.org> on 2022/11/13 07:27:54 UTC, 2 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38627: [SPARK-40875] [CONNECT] [FOLLOW] Retain Group expressions in aggregate. - posted by GitBox <gi...@apache.org> on 2022/11/13 07:38:31 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38629: [SPARK-41072][SQL][SS] Add the error class `STREAM_FAILED` to `StreamingQueryException` - posted by GitBox <gi...@apache.org> on 2022/11/13 07:47:16 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38629: [SPARK-41072][SQL][SS] Add the error class `STREAM_FAILED` to `StreamingQueryException` - posted by GitBox <gi...@apache.org> on 2022/11/13 07:49:32 UTC, 2 replies.
- [GitHub] [spark] MaxGekk closed pull request #38623: [WIP][SPARK-41072][SQL][SS] Add the error class `STREAM_FAILED` - posted by GitBox <gi...@apache.org> on 2022/11/13 07:52:53 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38625: [PYTHON][PS] Fix the `.groupby()` method docstring - posted by GitBox <gi...@apache.org> on 2022/11/13 09:35:18 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38615: [SPARK-41109][SQL] Rename the error class _LEGACY_ERROR_TEMP_1216 to INVALID_LIKE_PATTERN - posted by GitBox <gi...@apache.org> on 2022/11/13 09:49:04 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38639: [SPARK-41123][BUILD] Upgrade mysql-connector-java from 8.0.30 to 8.0.31 - posted by GitBox <gi...@apache.org> on 2022/11/13 12:17:38 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38639: [SPARK-41123][BUILD] Upgrade mysql-connector-java from 8.0.30 to 8.0.31 - posted by GitBox <gi...@apache.org> on 2022/11/13 12:28:16 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #38640: [SPARK-41124][SQL][TEST] Add DSv2 PlanStabilitySuites - posted by GitBox <gi...@apache.org> on 2022/11/13 13:12:49 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #38640: [SPARK-41124][SQL][TEST] Add DSv2 PlanStabilitySuites - posted by GitBox <gi...@apache.org> on 2022/11/13 13:34:00 UTC, 5 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #38639: [SPARK-41123][BUILD] Upgrade mysql-connector-java from 8.0.30 to 8.0.31 - posted by GitBox <gi...@apache.org> on 2022/11/13 14:41:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38627: [SPARK-40875] [CONNECT] [FOLLOW] Retain Group expressions in aggregate. - posted by GitBox <gi...@apache.org> on 2022/11/13 23:49:34 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38621: [SPARK-41111][CONNECT][PYTHON] Implement `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2022/11/13 23:53:19 UTC, 3 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36741: [SPARK-39357][SQL] Fix pmCache memory leak caused by IsolatedClassLoader - posted by GitBox <gi...@apache.org> on 2022/11/14 00:21:04 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36552: [SPARK-38506][SQL] Push partial aggregation through join - posted by GitBox <gi...@apache.org> on 2022/11/14 00:21:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38638: [SPARK-41122][CONNECT] Explain API can support different modes - posted by GitBox <gi...@apache.org> on 2022/11/14 01:07:51 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38634: [WIP] [CONNECT] Support Cast and DataTypes as Literals - posted by GitBox <gi...@apache.org> on 2022/11/14 01:20:07 UTC, 0 replies.
- [GitHub] [spark] tianshuang commented on pull request #36741: [SPARK-39357][SQL] Fix pmCache memory leak caused by IsolatedClassLoader - posted by GitBox <gi...@apache.org> on 2022/11/14 01:28:01 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38625: [PYTHON][PS] Fix the `.groupby()` method docstring - posted by GitBox <gi...@apache.org> on 2022/11/14 01:28:52 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38638: [SPARK-41122][CONNECT] Explain API can support different modes - posted by GitBox <gi...@apache.org> on 2022/11/14 01:36:34 UTC, 6 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38607: [SPARK-40938][CONNECT][PYTHON][FOLLOW-UP] Fix SubqueryAlias without the child plan when constructing Connect proto in the Python client - posted by GitBox <gi...@apache.org> on 2022/11/14 01:36:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38607: [SPARK-40938][CONNECT][PYTHON][FOLLOW-UP] Fix SubqueryAlias without the child plan when constructing Connect proto in the Python client - posted by GitBox <gi...@apache.org> on 2022/11/14 01:37:03 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38627: [SPARK-40875] [CONNECT] [FOLLOW] Retain Group expressions in aggregate. - posted by GitBox <gi...@apache.org> on 2022/11/14 02:09:17 UTC, 2 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38611: [SPARK-41107][PYTHON][INFRA][TEST] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/14 02:33:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38571: [SPARK-37555][TEST][FOLLOWUP] Increase timeout of CLI test `spark-sql should pass last unclosed comment to backend` - posted by GitBox <gi...@apache.org> on 2022/11/14 03:07:12 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37355: [SPARK-39930][SQL] Introduce Cache Hints - posted by GitBox <gi...@apache.org> on 2022/11/14 03:10:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38626: [SPARK-38959][SQL][FOLLOWUP] Do not optimize subqueries twice - posted by GitBox <gi...@apache.org> on 2022/11/14 03:16:48 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38595: [SPARK-41090][SQL] Throw Exception for `db_name.view_name` when creating temp view by Dataset API - posted by GitBox <gi...@apache.org> on 2022/11/14 03:24:43 UTC, 10 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38576: [SPARK-41062][SQL] Rename `UNSUPPORTED_CORRELATED_REFERENCE` to `CORRELATED_REFERENCE` - posted by GitBox <gi...@apache.org> on 2022/11/14 03:26:39 UTC, 6 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38595: [SPARK-41090][SQL] Throw Exception for `db_name.view_name` when creating temp view by Dataset API - posted by GitBox <gi...@apache.org> on 2022/11/14 03:45:16 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #38641: [SPARK-41126][K8S] `entrypoint.sh` should use its WORKDIR instead of `/tmp` directory - posted by GitBox <gi...@apache.org> on 2022/11/14 03:46:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38631: [SPARK-40809] [CONNECT] [FOLLOW] Support `alias()` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/14 04:13:49 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38632: [SPARK-41116][CONNECT] Input relation can be optional for Project in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/14 04:15:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38632: [SPARK-41116][CONNECT] Input relation can be optional for Project in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/14 04:15:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38589: [SPARK-41087][BUILD] Remove duplicate `-Xmx4g` from `dev/make-distribution.sh` and make `build/mvn` use the same JAVA_OPTS - posted by GitBox <gi...@apache.org> on 2022/11/14 04:22:45 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38642: [SPARK-41127][CONNECT][PYTHON] Implement DataFrame.CreateGlobalView in Python client - posted by GitBox <gi...@apache.org> on 2022/11/14 04:23:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38571: [SPARK-37555][TEST][FOLLOWUP] Increase timeout of CLI test `spark-sql should pass last unclosed comment to backend` - posted by GitBox <gi...@apache.org> on 2022/11/14 04:24:08 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38642: [SPARK-41127][CONNECT][PYTHON] Implement DataFrame.CreateGlobalView in Python client - posted by GitBox <gi...@apache.org> on 2022/11/14 04:24:39 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38621: [SPARK-41111][CONNECT][PYTHON] Implement `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2022/11/14 04:33:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38621: [SPARK-41111][CONNECT][PYTHON] Implement `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2022/11/14 04:35:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38621: [SPARK-41111][CONNECT][PYTHON] Implement `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2022/11/14 04:36:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38627: [SPARK-40875] [CONNECT] [FOLLOW] Retain Group expressions in aggregate. - posted by GitBox <gi...@apache.org> on 2022/11/14 04:38:45 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38638: [SPARK-41122][CONNECT] Explain API can support different modes - posted by GitBox <gi...@apache.org> on 2022/11/14 04:41:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38495: [SPARK-35531][SQL] Update hive table stats without unnecessary convert - posted by GitBox <gi...@apache.org> on 2022/11/14 04:43:55 UTC, 11 replies.
- [GitHub] [spark] sunchao opened a new pull request, #38643: [SPARK-41091][BUILD][3.2] Fix Docker release tool for branch-3.2 - posted by GitBox <gi...@apache.org> on 2022/11/14 05:15:52 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #38643: [SPARK-41091][BUILD][3.2] Fix Docker release tool for branch-3.2 - posted by GitBox <gi...@apache.org> on 2022/11/14 05:17:20 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38641: [SPARK-41126][K8S] `entrypoint.sh` should use its WORKDIR instead of `/tmp` directory - posted by GitBox <gi...@apache.org> on 2022/11/14 05:51:11 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by GitBox <gi...@apache.org> on 2022/11/14 05:51:45 UTC, 3 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38641: [SPARK-41126][K8S] `entrypoint.sh` should use its WORKDIR instead of `/tmp` directory - posted by GitBox <gi...@apache.org> on 2022/11/14 06:00:35 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38641: [SPARK-41126][K8S] `entrypoint.sh` should use its WORKDIR instead of `/tmp` directory - posted by GitBox <gi...@apache.org> on 2022/11/14 06:01:20 UTC, 2 replies.
- [GitHub] [spark] viirya commented on pull request #38641: [SPARK-41126][K8S] `entrypoint.sh` should use its WORKDIR instead of `/tmp` directory - posted by GitBox <gi...@apache.org> on 2022/11/14 06:02:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38641: [SPARK-41126][K8S] `entrypoint.sh` should use its WORKDIR instead of `/tmp` directory - posted by GitBox <gi...@apache.org> on 2022/11/14 06:04:47 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38644: [SPARK-41130][SQL] Rename `OUT_OF_DECIMAL_TYPE_RANGE` to `NUMERIC_OUT_OF_SUPPORTED_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/14 06:15:00 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38644: [SPARK-41130][SQL] Rename `OUT_OF_DECIMAL_TYPE_RANGE` to `NUMERIC_OUT_OF_SUPPORTED_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/14 06:15:41 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38615: [SPARK-41109][SQL] Rename the error class _LEGACY_ERROR_TEMP_1216 to INVALID_LIKE_PATTERN - posted by GitBox <gi...@apache.org> on 2022/11/14 06:22:20 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38615: [SPARK-41109][SQL] Rename the error class _LEGACY_ERROR_TEMP_1216 to INVALID_LIKE_PATTERN - posted by GitBox <gi...@apache.org> on 2022/11/14 06:24:22 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #38384: [SPARK-40657][PROTOBUF] Require shading for Java class jar, improve error handling - posted by GitBox <gi...@apache.org> on 2022/11/14 06:26:23 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38643: [SPARK-41091][BUILD][3.2] Fix Docker release tool for branch-3.2 - posted by GitBox <gi...@apache.org> on 2022/11/14 06:28:09 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38643: [SPARK-41091][BUILD][3.2] Fix Docker release tool for branch-3.2 - posted by GitBox <gi...@apache.org> on 2022/11/14 06:28:15 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38600: [SPARK-41098][SQL] Rename `GROUP_BY_POS_REFERS_AGG_EXPR` to `GROUP_BY_POS_AGGREGATE` - posted by GitBox <gi...@apache.org> on 2022/11/14 06:30:02 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38627: [SPARK-40875] [CONNECT] [FOLLOW] Retain Group expressions in aggregate. - posted by GitBox <gi...@apache.org> on 2022/11/14 06:30:48 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38589: [SPARK-41087][BUILD] Remove duplicated `-Xmx4g` from `dev/make-distribution.sh` and make `build/mvn` use the same JAVA_OPTS - posted by GitBox <gi...@apache.org> on 2022/11/14 06:42:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38589: [SPARK-41087][BUILD] Remove duplicated `-Xmx4g` from `dev/make-distribution.sh` and make `build/mvn` use the same JAVA_OPTS - posted by GitBox <gi...@apache.org> on 2022/11/14 06:45:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38615: [SPARK-41109][SQL] Rename the error class _LEGACY_ERROR_TEMP_1216 to INVALID_LIKE_PATTERN - posted by GitBox <gi...@apache.org> on 2022/11/14 06:45:22 UTC, 1 replies.
- [GitHub] [spark] huaxingao commented on pull request #38628: [SPARK-41096][SQL] Support reading parquet FIXED_LEN_BYTE_ARRAY type - posted by GitBox <gi...@apache.org> on 2022/11/14 06:45:52 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by GitBox <gi...@apache.org> on 2022/11/14 06:49:40 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #38645: [SPARK-41109][SQL][FOLLOWUP] Fix Scalastyle - posted by GitBox <gi...@apache.org> on 2022/11/14 06:50:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38645: [SPARK-41109][SQL][FOLLOWUP] Fix Scalastyle - posted by GitBox <gi...@apache.org> on 2022/11/14 06:53:20 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38615: [SPARK-41109][SQL] Rename the error class _LEGACY_ERROR_TEMP_1216 to INVALID_LIKE_PATTERN - posted by GitBox <gi...@apache.org> on 2022/11/14 06:55:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38645: [SPARK-41109][SQL][FOLLOWUP] Fix Scalastyle - posted by GitBox <gi...@apache.org> on 2022/11/14 06:57:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38645: [SPARK-41109][SQL][FOLLOWUP] Fix Scalastyle - posted by GitBox <gi...@apache.org> on 2022/11/14 07:00:11 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38646: [SPARK-41131][SQL] Improve error message for `UNRESOLVED_MAP_KEY.WITHOUT_SUGGESTION` - posted by GitBox <gi...@apache.org> on 2022/11/14 07:03:24 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38646: [SPARK-41131][SQL] Improve error message for `UNRESOLVED_MAP_KEY.WITHOUT_SUGGESTION` - posted by GitBox <gi...@apache.org> on 2022/11/14 07:04:00 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38064: [SPARK-40622][SQL][CORE] Remove the limitation that single task result must fit in 2GB - posted by GitBox <gi...@apache.org> on 2022/11/14 07:21:52 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on pull request #38622: [SPARK-39601][YARN] AllocationFailure should not be treated as exitCausedByApp when driver is shutting down - posted by GitBox <gi...@apache.org> on 2022/11/14 07:24:14 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38384: [SPARK-40657][PROTOBUF] Require shading for Java class jar, improve error handling - posted by GitBox <gi...@apache.org> on 2022/11/14 07:25:50 UTC, 2 replies.
- [GitHub] [spark] itholic opened a new pull request, #38647: [SPARK-41133][SQL] Integrate `UNSCALED_VALUE_TOO_LARGE_FOR_PRECISION` into `NUMERIC_VALUE_OUT_OF_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/14 07:27:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38648: [SPARK-41134][SQL] Improve error message of internal errors - posted by GitBox <gi...@apache.org> on 2022/11/14 07:33:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38648: [SPARK-41134][SQL] Improve error message of internal errors - posted by GitBox <gi...@apache.org> on 2022/11/14 07:34:09 UTC, 1 replies.
- [GitHub] [spark] wankunde opened a new pull request, #38649: [SPARK-41132][SQL] Convert LikeAny and NotLikeAny to InSet if no pattern contains wildcards - posted by GitBox <gi...@apache.org> on 2022/11/14 07:48:20 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #37355: [SPARK-39930][SQL] Introduce Cache Hints - posted by GitBox <gi...@apache.org> on 2022/11/14 07:51:21 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38650: [SPARK-41135][SQL] Rename `UNSUPPORTED_EMPTY_LOCATION` to `INVALID_EMPTY_LOCATION` - posted by GitBox <gi...@apache.org> on 2022/11/14 07:58:36 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38650: [SPARK-41135][SQL] Rename `UNSUPPORTED_EMPTY_LOCATION` to `INVALID_EMPTY_LOCATION` - posted by GitBox <gi...@apache.org> on 2022/11/14 07:58:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38626: [SPARK-38959][SQL][FOLLOWUP] Do not optimize subqueries twice - posted by GitBox <gi...@apache.org> on 2022/11/14 08:06:39 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38646: [SPARK-41131][SQL] Improve error message for `UNRESOLVED_MAP_KEY.WITHOUT_SUGGESTION` - posted by GitBox <gi...@apache.org> on 2022/11/14 08:24:39 UTC, 1 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #38651: [SPARK-41136][K8S] Shorten graceful shutdown time of ExecutorPodsSnapshotsStoreImpl to prevent blocking shutdown process - posted by GitBox <gi...@apache.org> on 2022/11/14 08:29:26 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38652: [SPARK-41137][SQL] Rename `LATERAL_JOIN_OF_TYPE` to `INVALID_LATERAL_JOIN_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/14 08:31:09 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38648: [SPARK-41134][SQL] Improve error message of internal errors - posted by GitBox <gi...@apache.org> on 2022/11/14 08:31:24 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38650: [SPARK-41135][SQL] Rename `UNSUPPORTED_EMPTY_LOCATION` to `INVALID_EMPTY_LOCATION` - posted by GitBox <gi...@apache.org> on 2022/11/14 08:41:15 UTC, 2 replies.
- [GitHub] [spark] itholic commented on pull request #38652: [SPARK-41137][SQL] Rename `LATERAL_JOIN_OF_TYPE` to `INVALID_LATERAL_JOIN_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/14 08:42:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38653: [SPARK-41128][CONNECT][PYTHON] Implement `DataFrame.fillna` and `DataFrame.na.fill` - posted by GitBox <gi...@apache.org> on 2022/11/14 08:45:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38648: [SPARK-41134][SQL] Improve error message of internal errors - posted by GitBox <gi...@apache.org> on 2022/11/14 08:53:56 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38651: [SPARK-41136][K8S] Shorten graceful shutdown time of ExecutorPodsSnapshotsStoreImpl to prevent blocking shutdown process - posted by GitBox <gi...@apache.org> on 2022/11/14 08:57:46 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38654: [SPARK-41005][CONNECT][DOC][FOLLOW-UP] Document the reason of sending batch in main thread - posted by GitBox <gi...@apache.org> on 2022/11/14 09:25:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38654: [SPARK-41005][CONNECT][DOC][FOLLOW-UP] Document the reason of sending batch in main thread - posted by GitBox <gi...@apache.org> on 2022/11/14 09:26:23 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38655: [SPARK-41138][PYTHON] `DataFrame.na.fill` should have the same argument types as `DataFrame.fillna` - posted by GitBox <gi...@apache.org> on 2022/11/14 10:04:37 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by GitBox <gi...@apache.org> on 2022/11/14 10:35:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38656: [WIP][SPARK-41140][SQL] Rename the error class `_LEGACY_ERROR_TEMP_2440` to `INVALID_WHERE_CONDITION` - posted by GitBox <gi...@apache.org> on 2022/11/14 10:57:21 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38657: [SPARK-41139][SQL] Improve error class: `PYTHON_UDF_IN_ON_CLAUSE` - posted by GitBox <gi...@apache.org> on 2022/11/14 11:01:14 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38657: [SPARK-41139][SQL] Improve error class: `PYTHON_UDF_IN_ON_CLAUSE` - posted by GitBox <gi...@apache.org> on 2022/11/14 11:01:35 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #38064: [SPARK-40622][SQL][CORE]Remove the limitation that single task result must fit in 2GB - posted by GitBox <gi...@apache.org> on 2022/11/14 11:23:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38658: [SPARK-41109][CORE][FOLLOWUP] Fix error class order - posted by GitBox <gi...@apache.org> on 2022/11/14 11:25:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38615: [SPARK-41109][SQL] Rename the error class _LEGACY_ERROR_TEMP_1216 to INVALID_LIKE_PATTERN - posted by GitBox <gi...@apache.org> on 2022/11/14 11:28:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38658: [SPARK-41109][CORE][FOLLOWUP] Fix error class order - posted by GitBox <gi...@apache.org> on 2022/11/14 11:29:53 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38658: [SPARK-41109][CORE][FOLLOWUP] Re-order error class to fix `SparkThrowableSuite` - posted by GitBox <gi...@apache.org> on 2022/11/14 11:37:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38637: [SPARK-41121][BUILD] Upgrade sbt-assembly from 1.2.0 to 2.0.0 - posted by GitBox <gi...@apache.org> on 2022/11/14 11:38:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38658: [SPARK-41109][CORE][FOLLOWUP] Re-order error class to fix `SparkThrowableSuite` - posted by GitBox <gi...@apache.org> on 2022/11/14 11:42:45 UTC, 4 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38646: [SPARK-41131][SQL] Improve error message for `UNRESOLVED_MAP_KEY.WITHOUT_SUGGESTION` - posted by GitBox <gi...@apache.org> on 2022/11/14 11:50:59 UTC, 3 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38644: [SPARK-41130][SQL] Rename `OUT_OF_DECIMAL_TYPE_RANGE` to `NUMERIC_OUT_OF_SUPPORTED_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/14 11:59:04 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38640: [SPARK-41124][SQL][TEST] Add DSv2 PlanStabilitySuites - posted by GitBox <gi...@apache.org> on 2022/11/14 12:10:23 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38631: [SPARK-40809] [CONNECT] [FOLLOW] Support `alias()` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/14 12:13:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38654: [SPARK-41005][CONNECT][DOC][FOLLOW-UP] Document the reason of sending batch in main thread - posted by GitBox <gi...@apache.org> on 2022/11/14 12:14:10 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on pull request #38643: [SPARK-41091][BUILD][3.2] Fix Docker release tool for branch-3.2 - posted by GitBox <gi...@apache.org> on 2022/11/14 12:21:35 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38651: [SPARK-41136][K8S] Shorten graceful shutdown time of ExecutorPodsSnapshotsStoreImpl to prevent blocking shutdown process - posted by GitBox <gi...@apache.org> on 2022/11/14 12:55:09 UTC, 0 replies.
- [GitHub] [spark] dengziming opened a new pull request, #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/14 13:18:07 UTC, 0 replies.
- [GitHub] [spark] dengziming commented on a diff in pull request #38638: [SPARK-41122][CONNECT] Explain API can support different modes - posted by GitBox <gi...@apache.org> on 2022/11/14 13:21:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38635: [SPARK-41118][SQL] `to_number`/`try_to_number` should return `null` when format is `null` - posted by GitBox <gi...@apache.org> on 2022/11/14 13:30:01 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38649: [SPARK-41132][SQL] Convert LikeAny and NotLikeAny to InSet if no pattern contains wildcards - posted by GitBox <gi...@apache.org> on 2022/11/14 14:52:29 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #38622: [SPARK-39601][YARN] AllocationFailure should not be treated as exitCausedByApp when driver is shutting down - posted by GitBox <gi...@apache.org> on 2022/11/14 15:19:16 UTC, 1 replies.
- [GitHub] [spark] tgravescs commented on a diff in pull request #38622: [SPARK-39601][YARN] AllocationFailure should not be treated as exitCausedByApp when driver is shutting down - posted by GitBox <gi...@apache.org> on 2022/11/14 15:37:43 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38658: [SPARK-41109][CORE][FOLLOWUP] Re-order error class to fix `SparkThrowableSuite` - posted by GitBox <gi...@apache.org> on 2022/11/14 16:20:19 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38658: [SPARK-41109][CORE][FOLLOWUP] Re-order error class to fix `SparkThrowableSuite` - posted by GitBox <gi...@apache.org> on 2022/11/14 16:22:07 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #38622: [SPARK-39601][YARN] AllocationFailure should not be treated as exitCausedByApp when driver is shutting down - posted by GitBox <gi...@apache.org> on 2022/11/14 16:56:57 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #38651: [SPARK-41136][K8S] Shorten graceful shutdown time of ExecutorPodsSnapshotsStoreImpl to prevent blocking shutdown process - posted by GitBox <gi...@apache.org> on 2022/11/14 17:03:22 UTC, 7 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38656: [SPARK-41140][SQL] Rename the error class `_LEGACY_ERROR_TEMP_2440` to `INVALID_WHERE_CONDITION` - posted by GitBox <gi...@apache.org> on 2022/11/14 18:52:24 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/14 18:52:41 UTC, 3 replies.
- [GitHub] [spark] sunchao closed pull request #38628: [SPARK-41096][SQL] Support reading parquet FIXED_LEN_BYTE_ARRAY type - posted by GitBox <gi...@apache.org> on 2022/11/14 18:53:23 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #38628: [SPARK-41096][SQL] Support reading parquet FIXED_LEN_BYTE_ARRAY type - posted by GitBox <gi...@apache.org> on 2022/11/14 18:53:36 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/14 19:16:39 UTC, 2 replies.
- [GitHub] [spark] xkrogen commented on a diff in pull request #38648: [SPARK-41134][SQL] Improve error message of internal errors - posted by GitBox <gi...@apache.org> on 2022/11/14 19:30:53 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #38628: [SPARK-41096][SQL] Support reading parquet FIXED_LEN_BYTE_ARRAY type - posted by GitBox <gi...@apache.org> on 2022/11/14 19:31:20 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38609: [SPARK-40593][BUILD][CONNECT] Make user can build and test `connect` module by specifying the user-defined `protoc` and `protoc-gen-grpc-java` - posted by GitBox <gi...@apache.org> on 2022/11/14 19:54:10 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38609: [SPARK-40593][BUILD][CONNECT] Make user can build and test `connect` module by specifying the user-defined `protoc` and `protoc-gen-grpc-java` - posted by GitBox <gi...@apache.org> on 2022/11/14 19:54:57 UTC, 2 replies.
- [GitHub] [spark] SandishKumarHN commented on a diff in pull request #38384: [SPARK-40657][PROTOBUF] Require shading for Java class jar, improve error handling - posted by GitBox <gi...@apache.org> on 2022/11/14 20:11:36 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38642: [SPARK-41127][CONNECT][PYTHON] Implement DataFrame.CreateGlobalView in Python client - posted by GitBox <gi...@apache.org> on 2022/11/14 20:23:53 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38638: [SPARK-41122][CONNECT] Explain API can support different modes - posted by GitBox <gi...@apache.org> on 2022/11/14 20:29:59 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #38005: [SPARK-40550][SQL] DataSource V2: Handle DELETE commands for delta-based sources - posted by GitBox <gi...@apache.org> on 2022/11/14 21:21:58 UTC, 5 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #38005: [SPARK-40550][SQL] DataSource V2: Handle DELETE commands for delta-based sources - posted by GitBox <gi...@apache.org> on 2022/11/14 21:41:17 UTC, 2 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #38004: [SPARK-40551][SQL] DataSource V2: Add APIs for delta-based row-level operations - posted by GitBox <gi...@apache.org> on 2022/11/14 21:43:04 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38642: [SPARK-41127][CONNECT][PYTHON] Implement DataFrame.CreateGlobalView in Python client - posted by GitBox <gi...@apache.org> on 2022/11/14 22:41:14 UTC, 0 replies.
- [GitHub] [spark] xkrogen opened a new pull request, #38660: [SPARK-40199][SQL][WIP] Provide useful error when encountering null values in non-null fields - posted by GitBox <gi...@apache.org> on 2022/11/14 23:50:08 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #37634: [SPARK-40199][SQL] Provide useful error when projecting a non-null column encounters null value - posted by GitBox <gi...@apache.org> on 2022/11/14 23:50:27 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by GitBox <gi...@apache.org> on 2022/11/14 23:55:26 UTC, 1 replies.
- [GitHub] [spark] xkrogen commented on pull request #38660: [SPARK-40199][SQL][WIP] Provide useful error when encountering null values in non-null fields - posted by GitBox <gi...@apache.org> on 2022/11/15 00:00:22 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37409: [SPARK-39970][CORE] Introduce ThrottledLogger to prevent log message flooding caused by network issues - posted by GitBox <gi...@apache.org> on 2022/11/15 00:22:32 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38639: [SPARK-41123][BUILD] Upgrade mysql-connector-java from 8.0.30 to 8.0.31 - posted by GitBox <gi...@apache.org> on 2022/11/15 00:39:47 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38638: [SPARK-41122][CONNECT] Explain API can support different modes - posted by GitBox <gi...@apache.org> on 2022/11/15 00:39:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38609: [SPARK-40593][BUILD][CONNECT] Make user can build and test `connect` module by specifying the user-defined `protoc` and `protoc-gen-grpc-java` - posted by GitBox <gi...@apache.org> on 2022/11/15 01:03:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38654: [SPARK-41005][CONNECT][DOC][FOLLOW-UP] Document the reason of sending batch in main thread - posted by GitBox <gi...@apache.org> on 2022/11/15 01:08:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38654: [SPARK-41005][CONNECT][DOC][FOLLOW-UP] Document the reason of sending batch in main thread - posted by GitBox <gi...@apache.org> on 2022/11/15 01:08:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38611: [SPARK-41107][PYTHON][INFRA][TEST] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/15 01:22:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38653: [SPARK-41128][CONNECT][PYTHON] Implement `DataFrame.fillna ` and `DataFrame.na.fill ` - posted by GitBox <gi...@apache.org> on 2022/11/15 01:25:01 UTC, 1 replies.
- [GitHub] [spark] liuzqt commented on pull request #38064: [SPARK-40622][SQL][CORE]Remove the limitation that single task result must fit in 2GB - posted by GitBox <gi...@apache.org> on 2022/11/15 01:44:26 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #37355: [SPARK-39930][SQL] Introduce Cache Hints - posted by GitBox <gi...@apache.org> on 2022/11/15 01:49:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/15 02:11:09 UTC, 1 replies.
- [GitHub] [spark] dengziming commented on pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/15 02:13:50 UTC, 5 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38653: [SPARK-41128][CONNECT][PYTHON] Implement `DataFrame.fillna ` and `DataFrame.na.fill ` - posted by GitBox <gi...@apache.org> on 2022/11/15 02:21:38 UTC, 2 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #38611: [SPARK-41107][PYTHON][INFRA][TEST] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/15 02:29:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38609: [SPARK-40593][BUILD][CONNECT] Make user can build and test `connect` module by specifying the user-defined `protoc` and `protoc-gen-grpc-java` - posted by GitBox <gi...@apache.org> on 2022/11/15 02:34:34 UTC, 4 replies.
- [GitHub] [spark] itholic commented on pull request #38647: [SPARK-41133][SQL] Integrate `UNSCALED_VALUE_TOO_LARGE_FOR_PRECISION` into `NUMERIC_VALUE_OUT_OF_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/15 02:42:32 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #38661: [SPARK-41085][SQL] Support Bit manipulation function COUNTSET - posted by GitBox <gi...@apache.org> on 2022/11/15 02:55:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38635: [SPARK-41118][SQL] `to_number`/`try_to_number` should return `null` when format is `null` - posted by GitBox <gi...@apache.org> on 2022/11/15 02:56:11 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38653: [SPARK-41128][CONNECT][PYTHON] Implement `DataFrame.fillna ` and `DataFrame.na.fill ` - posted by GitBox <gi...@apache.org> on 2022/11/15 03:07:57 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38636: [SPARK-41120][BUILD] Upgrade joda-time from 2.12.0 to 2.12.1 - posted by GitBox <gi...@apache.org> on 2022/11/15 03:24:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38636: [SPARK-41120][BUILD] Upgrade joda-time from 2.12.0 to 2.12.1 - posted by GitBox <gi...@apache.org> on 2022/11/15 03:24:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38639: [SPARK-41123][BUILD] Upgrade mysql-connector-java from 8.0.30 to 8.0.31 - posted by GitBox <gi...@apache.org> on 2022/11/15 03:25:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38639: [SPARK-41123][BUILD] Upgrade mysql-connector-java from 8.0.30 to 8.0.31 - posted by GitBox <gi...@apache.org> on 2022/11/15 03:26:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38064: [SPARK-40622][SQL][CORE]Remove the limitation that single task result must fit in 2GB - posted by GitBox <gi...@apache.org> on 2022/11/15 03:33:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38638: [SPARK-41122][CONNECT] Explain API can support different modes - posted by GitBox <gi...@apache.org> on 2022/11/15 03:43:45 UTC, 1 replies.
- [GitHub] [spark] zhengchenyu closed pull request #37949: [SPARK-40504][YARN] Make yarn appmaster load config from client - posted by GitBox <gi...@apache.org> on 2022/11/15 03:46:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38653: [SPARK-41128][CONNECT][PYTHON] Implement `DataFrame.fillna ` and `DataFrame.na.fill ` - posted by GitBox <gi...@apache.org> on 2022/11/15 04:04:47 UTC, 2 replies.
- [GitHub] [spark] zhengchenyu commented on pull request #33674: [Spark-36328][CORE][SQL] Reuse the FileSystem delegation token while querying partitioned hive table. - posted by GitBox <gi...@apache.org> on 2022/11/15 04:06:46 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38631: [SPARK-40809] [CONNECT] [FOLLOW] Support `alias()` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/15 04:10:07 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38655: [SPARK-41138][PYTHON] `DataFrame.na.fill` should have the same augment types as `DataFrame.fillna` - posted by GitBox <gi...@apache.org> on 2022/11/15 04:18:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38603: [SPARK-41101][PYTHON][PROTOBUF] Message classname support for PYSPARK-PROTOBUF - posted by GitBox <gi...@apache.org> on 2022/11/15 04:19:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38603: [SPARK-41101][PYTHON][PROTOBUF] Message classname support for PYSPARK-PROTOBUF - posted by GitBox <gi...@apache.org> on 2022/11/15 04:20:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #37616: [SPARK-40178][PYTHON][SQL] Fix partitioning hint parameters in PySpark - posted by GitBox <gi...@apache.org> on 2022/11/15 04:28:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38642: [SPARK-41127][CONNECT][PYTHON] Implement DataFrame.CreateGlobalView in Python client - posted by GitBox <gi...@apache.org> on 2022/11/15 04:31:55 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38651: [SPARK-41136][K8S] Shorten graceful shutdown time of ExecutorPodsSnapshotsStoreImpl to prevent blocking shutdown process - posted by GitBox <gi...@apache.org> on 2022/11/15 04:32:15 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38620: [SPARK-41113][BUILD] Upgrade sbt to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/11/15 04:35:32 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38620: [SPARK-41113][BUILD] Upgrade sbt to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/11/15 04:42:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38648: [SPARK-41134][SQL] Improve error message of internal errors - posted by GitBox <gi...@apache.org> on 2022/11/15 04:49:27 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #38662: [SPARK-41144][SQL] Unresolved hint should not cause query failure - posted by GitBox <gi...@apache.org> on 2022/11/15 04:50:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38640: [SPARK-41124][SQL][TEST] Add DSv2 PlanStabilitySuites - posted by GitBox <gi...@apache.org> on 2022/11/15 04:53:02 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38662: [SPARK-41144][SQL] Unresolved hint should not cause query failure - posted by GitBox <gi...@apache.org> on 2022/11/15 04:53:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38619: [SPARK-41112][SQL] RuntimeFilter should apply ColumnPruning eagerly with in-subquery filter - posted by GitBox <gi...@apache.org> on 2022/11/15 04:54:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38631: [SPARK-40809] [CONNECT] [FOLLOW] Support `alias()` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/15 04:54:57 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #38653: [SPARK-41128][CONNECT][PYTHON] Implement `DataFrame.fillna ` and `DataFrame.na.fill ` - posted by GitBox <gi...@apache.org> on 2022/11/15 04:55:58 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38653: [SPARK-41128][CONNECT][PYTHON] Implement `DataFrame.fillna ` and `DataFrame.na.fill ` - posted by GitBox <gi...@apache.org> on 2022/11/15 04:59:02 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38004: [SPARK-40551][SQL] DataSource V2: Add APIs for delta-based row-level operations - posted by GitBox <gi...@apache.org> on 2022/11/15 05:14:58 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 opened a new pull request, #38663: [SPARK-41143][SQL] Add named argument function syntax support - posted by GitBox <gi...@apache.org> on 2022/11/15 05:39:58 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38629: [SPARK-41072][SQL][SS] Add the error class `STREAM_FAILED` to `StreamingQueryException` - posted by GitBox <gi...@apache.org> on 2022/11/15 05:45:44 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38656: [SPARK-41140][SQL] Rename the error class `_LEGACY_ERROR_TEMP_2440` to `INVALID_WHERE_CONDITION` - posted by GitBox <gi...@apache.org> on 2022/11/15 05:54:51 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38528: [SPARK-41025][SS] Introduce ValidateOffsetRange/ComparableOffset to support offset range validation - posted by GitBox <gi...@apache.org> on 2022/11/15 06:26:22 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38528: [SPARK-41025][SS] Introduce ValidateOffsetRange/ComparableOffset to support offset range validation - posted by GitBox <gi...@apache.org> on 2022/11/15 06:26:22 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38652: [SPARK-41137][SQL] Rename `LATERAL_JOIN_OF_TYPE` to `INVALID_LATERAL_JOIN_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/15 06:36:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38652: [SPARK-41137][SQL] Rename `LATERAL_JOIN_OF_TYPE` to `INVALID_LATERAL_JOIN_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/15 06:38:44 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38630: [SPARK-41115][CONNECT] Add ClientType to proto to indicate which client sends a request - posted by GitBox <gi...@apache.org> on 2022/11/15 07:13:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38642: [SPARK-41127][CONNECT][PYTHON] Implement DataFrame.CreateGlobalView in Python client - posted by GitBox <gi...@apache.org> on 2022/11/15 07:28:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38642: [SPARK-41127][CONNECT][PYTHON] Implement DataFrame.CreateGlobalView in Python client - posted by GitBox <gi...@apache.org> on 2022/11/15 07:29:27 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38664: [SPARK-41147][SQL] Assign a name to the legacy error class `_LEGACY_ERROR_TEMP_1042` - posted by GitBox <gi...@apache.org> on 2022/11/15 07:40:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38619: [SPARK-41112][SQL] RuntimeFilter should apply ColumnPruning eagerly with in-subquery filter - posted by GitBox <gi...@apache.org> on 2022/11/15 08:47:42 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38662: [SPARK-41144][SQL] Unresolved hint should not cause query failure - posted by GitBox <gi...@apache.org> on 2022/11/15 08:49:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38662: [SPARK-41144][SQL] Unresolved hint should not cause query failure - posted by GitBox <gi...@apache.org> on 2022/11/15 08:49:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38404: [SPARK-40956] SQL Equivalent for Dataframe overwrite command - posted by GitBox <gi...@apache.org> on 2022/11/15 08:57:04 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38531: [SPARK-40755][SQL] Migrate type check failures of number formatting onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/15 09:01:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38605: [SPARK-41103][CONNECT][DOC] Document how to add a new proto field of messages - posted by GitBox <gi...@apache.org> on 2022/11/15 09:28:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38653: [SPARK-41128][CONNECT][PYTHON] Implement `DataFrame.fillna ` and `DataFrame.na.fill ` - posted by GitBox <gi...@apache.org> on 2022/11/15 10:24:43 UTC, 0 replies.
- [GitHub] [spark] fred-db commented on pull request #38497: [SPARK-40999] Hint propagation to subqueries - posted by GitBox <gi...@apache.org> on 2022/11/15 10:53:10 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38664: [SPARK-41147][SQL] Assign a name to the legacy error class `_LEGACY_ERROR_TEMP_1042` - posted by GitBox <gi...@apache.org> on 2022/11/15 10:57:24 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #38656: [SPARK-41140][SQL] Rename the error class `_LEGACY_ERROR_TEMP_2440` to `INVALID_WHERE_CONDITION` - posted by GitBox <gi...@apache.org> on 2022/11/15 11:58:53 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #38640: [SPARK-41124][SQL][TEST] Add DSv2 PlanStabilitySuites - posted by GitBox <gi...@apache.org> on 2022/11/15 11:59:42 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38663: [SPARK-41143][SQL] Add named argument function syntax support - posted by GitBox <gi...@apache.org> on 2022/11/15 12:21:38 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #38441: [SPARK-40979][CORE] Keep removed executor info due to decommission - posted by GitBox <gi...@apache.org> on 2022/11/15 12:25:30 UTC, 2 replies.
- [GitHub] [spark] Ngone51 commented on pull request #38441: [SPARK-40979][CORE] Keep removed executor info due to decommission - posted by GitBox <gi...@apache.org> on 2022/11/15 12:27:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38627: [SPARK-40875] [CONNECT] [FOLLOW] Retain Group expressions in aggregate. - posted by GitBox <gi...@apache.org> on 2022/11/15 12:54:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38627: [SPARK-40875] [CONNECT] [FOLLOW] Retain Group expressions in aggregate. - posted by GitBox <gi...@apache.org> on 2022/11/15 12:55:20 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on pull request #38467: [SPARK-40987][CORE] Avoid creating a directory when deleting a block, causing DAGScheduler to not work - posted by GitBox <gi...@apache.org> on 2022/11/15 13:00:16 UTC, 2 replies.
- [GitHub] [spark-docker] Yikun closed pull request #23: [SPARK-40519] Add "Publish" workflow to help release apache/spark image - posted by GitBox <gi...@apache.org> on 2022/11/15 13:32:41 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/15 16:19:12 UTC, 0 replies.
- [GitHub] [spark] awdavidson commented on pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by GitBox <gi...@apache.org> on 2022/11/15 16:25:58 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #38663: [SPARK-41143][SQL] Add named argument function syntax support - posted by GitBox <gi...@apache.org> on 2022/11/15 17:38:44 UTC, 1 replies.
- [GitHub] [spark] WweiL closed pull request #38503: [SPARK-40940] Remove Multi-stateful operator checkers for streaming queries. - posted by GitBox <gi...@apache.org> on 2022/11/15 17:55:32 UTC, 0 replies.
- [GitHub] [spark] WweiL opened a new pull request, #38503: [SPARK-40940] Remove Multi-stateful operator checkers for streaming queries. - posted by GitBox <gi...@apache.org> on 2022/11/15 17:55:36 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on a diff in pull request #38441: [SPARK-40979][CORE] Keep removed executor info due to decommission - posted by GitBox <gi...@apache.org> on 2022/11/15 18:36:45 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #38661: [SPARK-41085][SQL] Support Bit manipulation function COUNTSET - posted by GitBox <gi...@apache.org> on 2022/11/15 18:38:07 UTC, 0 replies.
- [GitHub] [spark] alex-balikov commented on pull request #38503: [SPARK-40940] Remove Multi-stateful operator checkers for streaming queries. - posted by GitBox <gi...@apache.org> on 2022/11/15 18:48:11 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #38611: [SPARK-41107][PYTHON][INFRA][TESTS] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/15 19:05:22 UTC, 0 replies.
- [GitHub] [spark] anchovYu commented on a diff in pull request #38257: [SPARK-40798][SQL] Alter partition should verify value follow storeAssignmentPolicy - posted by GitBox <gi...@apache.org> on 2022/11/15 19:44:21 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #38511: [SPARK-41017][SQL] Support column pruning with multiple nondeterministic Filters - posted by GitBox <gi...@apache.org> on 2022/11/15 19:56:07 UTC, 2 replies.
- [GitHub] [spark] kyle-ai2 commented on pull request #38539: [SPARK-41030][BUILD] Upgrade `Apache Ivy` to 2.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/15 20:14:14 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #38531: [SPARK-40755][SQL] Migrate type check failures of number formatting onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/15 20:38:48 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38647: [SPARK-41133][SQL] Integrate `UNSCALED_VALUE_TOO_LARGE_FOR_PRECISION` into `NUMERIC_VALUE_OUT_OF_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/15 20:42:31 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38609: [SPARK-40593][BUILD][CONNECT] Make user can build and test `connect` module by specifying the user-defined `protoc` and `protoc-gen-grpc-java` - posted by GitBox <gi...@apache.org> on 2022/11/15 21:02:08 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38630: [SPARK-41115][CONNECT] Add ClientType to proto to indicate which client sends a request - posted by GitBox <gi...@apache.org> on 2022/11/15 21:03:03 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38620: [SPARK-41113][BUILD] Upgrade sbt to 1.8.0 - posted by GitBox <gi...@apache.org> on 2022/11/15 21:28:52 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38665: [WIP][SQL] Remove the class `TypeCheckFailure` - posted by GitBox <gi...@apache.org> on 2022/11/15 21:32:38 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #38384: [SPARK-40657][PROTOBUF] Require shading for Java class jar, improve error handling - posted by GitBox <gi...@apache.org> on 2022/11/15 23:58:16 UTC, 4 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37409: [SPARK-39970][CORE] Introduce ThrottledLogger to prevent log message flooding caused by network issues - posted by GitBox <gi...@apache.org> on 2022/11/16 00:23:05 UTC, 0 replies.
- [GitHub] [spark] Stycos commented on pull request #38262: [SPARK-40801][BUILD] Upgrade `Apache commons-text` to 1.10 - posted by GitBox <gi...@apache.org> on 2022/11/16 00:39:32 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38666: [CONENCT][PYTHON][DOC] Document how to run the module of tests for Spark Connect Python tests - posted by GitBox <gi...@apache.org> on 2022/11/16 01:15:15 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38666: [CONENCT][PYTHON][DOC] Document how to run the module of tests for Spark Connect Python tests - posted by GitBox <gi...@apache.org> on 2022/11/16 01:15:24 UTC, 1 replies.
- [GitHub] [spark] Yaohua628 commented on a diff in pull request #38663: [SPARK-41143][SQL] Add named argument function syntax support - posted by GitBox <gi...@apache.org> on 2022/11/16 02:00:53 UTC, 2 replies.
- [GitHub] [spark] asfgit closed pull request #38064: [SPARK-40622][SQL][CORE]Remove the limitation that single task result must fit in 2GB - posted by GitBox <gi...@apache.org> on 2022/11/16 02:54:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38257: [SPARK-40798][SQL] Alter partition should verify value follow storeAssignmentPolicy - posted by GitBox <gi...@apache.org> on 2022/11/16 02:55:07 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38667: [SPARK-40798][DOCS] Fix a typo in the configuration name at migration guide - posted by GitBox <gi...@apache.org> on 2022/11/16 02:57:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #34367: [SPARK-37099][SQL] Introduce a rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/16 03:01:12 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38667: [SPARK-40798][DOCS][FOLLOW-UP] Fix a typo in the configuration name at migration guide - posted by GitBox <gi...@apache.org> on 2022/11/16 03:48:04 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #38651: [SPARK-41136][K8S] Shorten graceful shutdown time of ExecutorPodsSnapshotsStoreImpl to prevent blocking shutdown process - posted by GitBox <gi...@apache.org> on 2022/11/16 03:51:28 UTC, 1 replies.
- [GitHub] [spark] warrenzhu25 opened a new pull request, #38668: [SPARK-41153][CORE] Log migrated shuffle data size and migration time - posted by GitBox <gi...@apache.org> on 2022/11/16 04:11:29 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38630: [SPARK-41115][CONNECT] Add ClientType to proto to indicate which client sends a request - posted by GitBox <gi...@apache.org> on 2022/11/16 04:47:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38667: [SPARK-40798][DOCS][FOLLOW-UP] Fix a typo in the configuration name at migration guide - posted by GitBox <gi...@apache.org> on 2022/11/16 04:50:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38667: [SPARK-40798][DOCS][FOLLOW-UP] Fix a typo in the configuration name at migration guide - posted by GitBox <gi...@apache.org> on 2022/11/16 04:50:53 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38666: [CONENCT][PYTHON][DOC] Document how to run the module of tests for Spark Connect Python tests - posted by GitBox <gi...@apache.org> on 2022/11/16 04:57:04 UTC, 1 replies.
- [GitHub] [spark] viirya opened a new pull request, #38669: [SPARK-41155][SQL] Add error message to SchemaColumnConvertNotSupportedException - posted by GitBox <gi...@apache.org> on 2022/11/16 04:57:15 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #38669: [SPARK-41155][SQL] Add error message to SchemaColumnConvertNotSupportedException - posted by GitBox <gi...@apache.org> on 2022/11/16 04:57:44 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #34367: [SPARK-37099][SQL] Introduce a rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/16 05:04:15 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38665: [SPARK-41156][SQL] Remove the class `TypeCheckFailure` - posted by GitBox <gi...@apache.org> on 2022/11/16 06:14:49 UTC, 3 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #38649: [SPARK-41132][SQL] Convert LikeAny and NotLikeAny to InSet if no pattern contains wildcards - posted by GitBox <gi...@apache.org> on 2022/11/16 06:18:38 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #34367: [SPARK-37099][SQL] Introduce a rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/16 06:53:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38670: [SPARK-41157][CONNECT][PYTHON][TEST] Show detailed differences in test - posted by GitBox <gi...@apache.org> on 2022/11/16 06:59:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38262: [SPARK-40801][BUILD] Upgrade `Apache commons-text` to 1.10 - posted by GitBox <gi...@apache.org> on 2022/11/16 07:01:29 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38665: [SPARK-41156][SQL] Remove the class `TypeCheckFailure` - posted by GitBox <gi...@apache.org> on 2022/11/16 07:23:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38668: [SPARK-41153][CORE] Log migrated shuffle data size and migration time - posted by GitBox <gi...@apache.org> on 2022/11/16 07:26:39 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38669: [SPARK-41155][SQL] Add error message to SchemaColumnConvertNotSupportedException - posted by GitBox <gi...@apache.org> on 2022/11/16 07:48:30 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38669: [SPARK-41155][SQL] Add error message to SchemaColumnConvertNotSupportedException - posted by GitBox <gi...@apache.org> on 2022/11/16 08:02:26 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38464: [SPARK-32628][SQL] Use bloom filter to improve dynamic partition pruning - posted by GitBox <gi...@apache.org> on 2022/11/16 08:36:49 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38464: [SPARK-32628][SQL] Use bloom filter to improve dynamic partition pruning - posted by GitBox <gi...@apache.org> on 2022/11/16 08:45:08 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38605: [SPARK-41103][CONNECT][DOC] Document how to add a new proto field of messages - posted by GitBox <gi...@apache.org> on 2022/11/16 08:48:51 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38605: [SPARK-41103][CONNECT][DOC] Document how to add a new proto field of messages - posted by GitBox <gi...@apache.org> on 2022/11/16 08:49:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38671: [SPARK-41158][SQL][TESTS] Use `checkError()` to check `DATATYPE_MISMATCH` in `DataFrameFunctionsSuite` - posted by GitBox <gi...@apache.org> on 2022/11/16 08:50:41 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38605: [SPARK-41103][CONNECT][DOC] Document how to add a new proto field of messages - posted by GitBox <gi...@apache.org> on 2022/11/16 08:52:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38670: [SPARK-41157][CONNECT][PYTHON][TEST] Show detailed differences in dataframe comparison - posted by GitBox <gi...@apache.org> on 2022/11/16 08:53:46 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38670: [SPARK-41157][CONNECT][PYTHON][TEST] Show detailed differences in dataframe comparison - posted by GitBox <gi...@apache.org> on 2022/11/16 09:00:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38670: [SPARK-41157][CONNECT][PYTHON][TEST] Show detailed differences in dataframe comparison - posted by GitBox <gi...@apache.org> on 2022/11/16 09:01:46 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38665: [SPARK-41156][SQL] Remove the class `TypeCheckFailure` - posted by GitBox <gi...@apache.org> on 2022/11/16 09:03:29 UTC, 1 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38511: [SPARK-41017][SQL] Support column pruning with multiple nondeterministic Filters - posted by GitBox <gi...@apache.org> on 2022/11/16 09:08:07 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38671: [SPARK-41158][SQL][TESTS] Use `checkError()` to check `DATATYPE_MISMATCH` in `DataFrameFunctionsSuite` - posted by GitBox <gi...@apache.org> on 2022/11/16 09:16:52 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38631: [SPARK-40809] [CONNECT] [FOLLOW] Support `alias()` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/16 09:19:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38631: [SPARK-40809] [CONNECT] [FOLLOW] Support `alias()` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/16 09:20:25 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #38672: [WIP][SPARK-41159][SQL] Optimize like any and like all expressions - posted by GitBox <gi...@apache.org> on 2022/11/16 09:46:38 UTC, 0 replies.
- [GitHub] [spark] yabola commented on a diff in pull request #38560: [WIP][SPARK-38005][core] Support cleaning up merged shuffle files and state from external shuffle service - posted by GitBox <gi...@apache.org> on 2022/11/16 10:42:52 UTC, 15 replies.
- [GitHub] [spark] yabola commented on a diff in pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/11/16 11:18:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38609: [SPARK-40593][BUILD][CONNECT] Make user can build and test `connect` module by specifying the user-defined `protoc` and `protoc-gen-grpc-java` - posted by GitBox <gi...@apache.org> on 2022/11/16 11:41:39 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #38673: [SPARK-41149][PYTHON] Fix `SparkSession.builder.config` to support bool - posted by GitBox <gi...@apache.org> on 2022/11/16 11:45:02 UTC, 0 replies.
- [GitHub] [spark] xiuzhu9527 opened a new pull request, #38674: [SPARK-41160][YARN] Fix error when submitting a task to the yarn that enabled the timeline service - posted by GitBox <gi...@apache.org> on 2022/11/16 12:08:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38511: [SPARK-41017][SQL] Support column pruning with multiple nondeterministic Filters - posted by GitBox <gi...@apache.org> on 2022/11/16 12:24:38 UTC, 1 replies.
- [GitHub] [spark] xiuzhu9527 commented on pull request #38674: [SPARK-41160][YARN] Fix error when submitting a task to the yarn that enabled the timeline service - posted by GitBox <gi...@apache.org> on 2022/11/16 12:32:31 UTC, 4 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38674: [SPARK-41160][YARN] Fix error when submitting a task to the yarn that enabled the timeline service - posted by GitBox <gi...@apache.org> on 2022/11/16 14:47:49 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38598: [SPARK-41097][CORE][SQL][SS][PROTOBUF] Remove redundant collection conversion base on Scala 2.13 code - posted by GitBox <gi...@apache.org> on 2022/11/16 14:49:43 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38598: [SPARK-41097][CORE][SQL][SS][PROTOBUF] Remove redundant collection conversion base on Scala 2.13 code - posted by GitBox <gi...@apache.org> on 2022/11/16 14:56:14 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38610: [SPARK-41106][SQL] Reduce collection conversion when create AttributeMap - posted by GitBox <gi...@apache.org> on 2022/11/16 14:58:34 UTC, 3 replies.
- [GitHub] [spark] LuciferYang closed pull request #37206: [SPARK-39696][CORE] Ensure Concurrent r/w `TaskMetrics` not throw Exception - posted by GitBox <gi...@apache.org> on 2022/11/16 15:00:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37646: [DON'T MERGE] investigate flaky test in ImageFileFormatSuite - posted by GitBox <gi...@apache.org> on 2022/11/16 15:01:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38675: [SPARK-41161][BUILD] Upgrade scala-parser-combinators to 2.1.1 - posted by GitBox <gi...@apache.org> on 2022/11/16 15:13:47 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #38610: [SPARK-41106][SQL] Reduce collection conversion when create AttributeMap - posted by GitBox <gi...@apache.org> on 2022/11/16 15:15:09 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38675: [SPARK-41161][BUILD] Upgrade scala-parser-combinators to 2.1.1 - posted by GitBox <gi...@apache.org> on 2022/11/16 15:16:53 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/16 15:23:14 UTC, 2 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #38676: [SPARK-41162][SQL] Do not push down join predicate that are ambiguous to both sides - posted by GitBox <gi...@apache.org> on 2022/11/16 15:35:06 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38676: [SPARK-41162][SQL] Do not push down join predicate that are ambiguous to both sides - posted by GitBox <gi...@apache.org> on 2022/11/16 15:41:46 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38671: [SPARK-41158][SQL][TESTS] Use `checkError()` to check `DATATYPE_MISMATCH` in `DataFrameFunctionsSuite` - posted by GitBox <gi...@apache.org> on 2022/11/16 16:08:57 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38671: [SPARK-41158][SQL][TESTS] Use `checkError()` to check `DATATYPE_MISMATCH` in `DataFrameFunctionsSuite` - posted by GitBox <gi...@apache.org> on 2022/11/16 16:09:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38657: [SPARK-41139][SQL] Improve error class: `PYTHON_UDF_IN_ON_CLAUSE` - posted by GitBox <gi...@apache.org> on 2022/11/16 16:13:27 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38657: [SPARK-41139][SQL] Improve error class: `PYTHON_UDF_IN_ON_CLAUSE` - posted by GitBox <gi...@apache.org> on 2022/11/16 16:14:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38441: [SPARK-40979][CORE] Keep removed executor info due to decommission - posted by GitBox <gi...@apache.org> on 2022/11/16 16:33:00 UTC, 6 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38672: [WIP][SPARK-41159][SQL] Optimize like any and like all expressions - posted by GitBox <gi...@apache.org> on 2022/11/16 16:43:32 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #37556: [SPARK-39799][SQL] DataSourceV2: View catalog interface - posted by GitBox <gi...@apache.org> on 2022/11/16 17:05:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38005: [SPARK-40550][SQL] DataSource V2: Handle DELETE commands for delta-based sources - posted by GitBox <gi...@apache.org> on 2022/11/16 17:34:06 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #38677: [SPARK-41150][PYTHON][DOC] Memory Profile for UDFs - posted by GitBox <gi...@apache.org> on 2022/11/16 19:20:11 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38670: [SPARK-41157][CONNECT][PYTHON][TEST] Show detailed differences in dataframe comparison - posted by GitBox <gi...@apache.org> on 2022/11/16 19:45:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38669: [SPARK-41155][SQL] Add error message to SchemaColumnConvertNotSupportedException - posted by GitBox <gi...@apache.org> on 2022/11/16 19:54:06 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #38618: [SPARK-41108][SPARK-41005][CONNECT][FOLLOW-UP] Deduplicate ArrowConverters codes - posted by GitBox <gi...@apache.org> on 2022/11/16 20:02:10 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38678: [SPARK-41164][CONNECT] Update relations.proto to follow Connect Proto development guide - posted by GitBox <gi...@apache.org> on 2022/11/16 20:18:50 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38678: [SPARK-41164][CONNECT] Update relations.proto to follow Connect proto development guide - posted by GitBox <gi...@apache.org> on 2022/11/16 20:20:11 UTC, 0 replies.
- [GitHub] [spark] viirya closed pull request #38669: [SPARK-41155][SQL] Add error message to SchemaColumnConvertNotSupportedException - posted by GitBox <gi...@apache.org> on 2022/11/16 20:29:33 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38668: [SPARK-41153][CORE] Log migrated shuffle data size and migration time - posted by GitBox <gi...@apache.org> on 2022/11/16 20:41:02 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38666: [CONENCT][PYTHON][DOC] Document how to run the module of tests for Spark Connect Python tests - posted by GitBox <gi...@apache.org> on 2022/11/16 20:41:06 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38503: [SPARK-40940] Remove Multi-stateful operator checkers for streaming queries. - posted by GitBox <gi...@apache.org> on 2022/11/16 20:42:50 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38503: [SPARK-40940] Remove Multi-stateful operator checkers for streaming queries. - posted by GitBox <gi...@apache.org> on 2022/11/16 20:43:55 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38384: [SPARK-40657][PROTOBUF] Require shading for Java class jar, improve error handling - posted by GitBox <gi...@apache.org> on 2022/11/16 20:59:18 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #38384: [SPARK-40657][PROTOBUF] Require shading for Java class jar, improve error handling - posted by GitBox <gi...@apache.org> on 2022/11/16 21:06:45 UTC, 4 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38677: [SPARK-41150][PYTHON][DOCS] Document debugging with PySpark memory profiler - posted by GitBox <gi...@apache.org> on 2022/11/16 21:57:07 UTC, 0 replies.
- [GitHub] [spark] huskysun opened a new pull request, #38679: [SPARK-40671][Kubernetes] Add configurability to customize labels of driver service object - posted by GitBox <gi...@apache.org> on 2022/11/16 22:33:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38618: [SPARK-41108][SPARK-41005][CONNECT][FOLLOW-UP] Deduplicate ArrowConverters codes - posted by GitBox <gi...@apache.org> on 2022/11/16 23:20:47 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38630: [SPARK-41115][CONNECT] Add ClientType to proto to indicate which client sends a request - posted by GitBox <gi...@apache.org> on 2022/11/17 00:01:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38630: [SPARK-41115][CONNECT] Add ClientType to proto to indicate which client sends a request - posted by GitBox <gi...@apache.org> on 2022/11/17 00:02:10 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #34367: [SPARK-37099][SQL] Introduce a rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/17 00:22:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38666: [CONENCT][PYTHON][DOC] Document how to run the module of tests for Spark Connect Python tests - posted by GitBox <gi...@apache.org> on 2022/11/17 00:50:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38666: [CONENCT][PYTHON][DOC] Document how to run the module of tests for Spark Connect Python tests - posted by GitBox <gi...@apache.org> on 2022/11/17 00:50:26 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38635: [SPARK-41118][SQL] `to_number`/`try_to_number` should return `null` when format is `null` - posted by GitBox <gi...@apache.org> on 2022/11/17 00:54:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38635: [SPARK-41118][SQL] `to_number`/`try_to_number` should return `null` when format is `null` - posted by GitBox <gi...@apache.org> on 2022/11/17 00:55:22 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38005: [SPARK-40550][SQL] DataSource V2: Handle DELETE commands for delta-based sources - posted by GitBox <gi...@apache.org> on 2022/11/17 01:37:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38595: [SPARK-41090][SQL] Throw Exception for `db_name.view_name` when creating temp view by Dataset API - posted by GitBox <gi...@apache.org> on 2022/11/17 01:37:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38595: [SPARK-41090][SQL] Throw Exception for `db_name.view_name` when creating temp view by Dataset API - posted by GitBox <gi...@apache.org> on 2022/11/17 01:38:05 UTC, 0 replies.
- [GitHub] [spark] huskysun commented on pull request #38679: [SPARK-40671][Kubernetes] Add configurability to customize labels of driver service object - posted by GitBox <gi...@apache.org> on 2022/11/17 01:45:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38677: [SPARK-41150][PYTHON][DOCS] Document debugging with PySpark memory profiler - posted by GitBox <gi...@apache.org> on 2022/11/17 01:46:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38677: [SPARK-41150][PYTHON][DOCS] Document debugging with PySpark memory profiler - posted by GitBox <gi...@apache.org> on 2022/11/17 01:46:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38679: [SPARK-40671][Kubernetes] Add configurability to customize labels of driver service object - posted by GitBox <gi...@apache.org> on 2022/11/17 01:48:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38679: [SPARK-40671][K8S] Support driver service labels - posted by GitBox <gi...@apache.org> on 2022/11/17 01:49:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38679: [SPARK-40671][K8S] Support driver service labels - posted by GitBox <gi...@apache.org> on 2022/11/17 01:55:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38679: [SPARK-40671][K8S] Support driver service labels - posted by GitBox <gi...@apache.org> on 2022/11/17 01:55:35 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38673: [SPARK-41149][PYTHON] Fix `SparkSession.builder.config` to support bool - posted by GitBox <gi...@apache.org> on 2022/11/17 01:56:32 UTC, 0 replies.
- [GitHub] [spark] huskysun commented on pull request #38679: [SPARK-40671][K8S] Support driver service labels - posted by GitBox <gi...@apache.org> on 2022/11/17 01:58:53 UTC, 1 replies.
- [GitHub] [spark] rangadi opened a new pull request, #38680: [SPARK-40657][FOLLOWUP]Minor: Add clarifying comment in ProtobufUtils - posted by GitBox <gi...@apache.org> on 2022/11/17 02:24:05 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #38680: [SPARK-40657][FOLLOWUP]Minor: Add clarifying comment in ProtobufUtils - posted by GitBox <gi...@apache.org> on 2022/11/17 02:24:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/17 03:32:35 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38558: [SPARK-41048][SQL] Improve output partitioning and ordering with AQE cache - posted by GitBox <gi...@apache.org> on 2022/11/17 03:34:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38558: [SPARK-41048][SQL] Improve output partitioning and ordering with AQE cache - posted by GitBox <gi...@apache.org> on 2022/11/17 03:34:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38567: [SPARK-41054][UI][CORE] Support RocksDB as KVStore in live UI - posted by GitBox <gi...@apache.org> on 2022/11/17 03:38:11 UTC, 2 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #38681: [SPARK-41165][CONNECT] Avoid hangs in the arrow collect code path - posted by GitBox <gi...@apache.org> on 2022/11/17 04:12:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38681: [SPARK-41165][CONNECT] Avoid hangs in the arrow collect code path - posted by GitBox <gi...@apache.org> on 2022/11/17 04:26:58 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #38682: [SPARK-41167][SQL] Optimize LikeSimplification rule to improve multi like performance - posted by GitBox <gi...@apache.org> on 2022/11/17 04:27:04 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/17 04:47:16 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38673: [SPARK-41149][PYTHON] Fix `SparkSession.builder.config` to support bool - posted by GitBox <gi...@apache.org> on 2022/11/17 06:01:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38673: [SPARK-41149][PYTHON] Fix `SparkSession.builder.config` to support bool - posted by GitBox <gi...@apache.org> on 2022/11/17 06:01:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38682: [SPARK-41167][SQL] Optimize LikeSimplification rule to improve multi like performance - posted by GitBox <gi...@apache.org> on 2022/11/17 06:07:26 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 opened a new pull request, #38683: [SPARK-41151][SQL][3.3] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/17 06:08:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38681: [SPARK-41165][CONNECT] Avoid hangs in the arrow collect code path - posted by GitBox <gi...@apache.org> on 2022/11/17 06:09:02 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 commented on pull request #38683: [SPARK-41151][SQL][3.3] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/17 06:09:18 UTC, 3 replies.
- [GitHub] [spark] Yaohua628 closed pull request #38663: [SPARK-41143][SQL] Add named argument syntax support for table-valued function - posted by GitBox <gi...@apache.org> on 2022/11/17 06:11:36 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #38682: [SPARK-41167][SQL] Optimize LikeSimplification rule to improve multi like performance - posted by GitBox <gi...@apache.org> on 2022/11/17 06:13:22 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #38511: [SPARK-41017][SQL] Support column pruning with multiple nondeterministic Filters - posted by GitBox <gi...@apache.org> on 2022/11/17 06:42:58 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #38511: [SPARK-41017][SQL] Support column pruning with multiple nondeterministic Filters - posted by GitBox <gi...@apache.org> on 2022/11/17 06:43:12 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38664: [SPARK-41147][SQL] Assign a name to the legacy error class `_LEGACY_ERROR_TEMP_1042` - posted by GitBox <gi...@apache.org> on 2022/11/17 06:57:46 UTC, 4 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38684: [SPARK-41017][SQL][FOLLOWUP] Respect the original Filter operator order - posted by GitBox <gi...@apache.org> on 2022/11/17 07:07:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38684: [SPARK-41017][SQL][FOLLOWUP] Respect the original Filter operator order - posted by GitBox <gi...@apache.org> on 2022/11/17 07:07:26 UTC, 1 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38685: [WIP][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1233` to `COLUMN_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/17 07:22:56 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #38441: [SPARK-40979][CORE] Keep removed executor info due to decommission - posted by GitBox <gi...@apache.org> on 2022/11/17 07:47:00 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38668: [SPARK-41153][CORE] Log migrated shuffle data size and migration time - posted by GitBox <gi...@apache.org> on 2022/11/17 07:49:48 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38674: [SPARK-41160][YARN] Fix error when submitting a task to the yarn that enabled the timeline service - posted by GitBox <gi...@apache.org> on 2022/11/17 07:50:17 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/11/17 08:04:30 UTC, 0 replies.
- [GitHub] [spark] dengziming commented on a diff in pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/17 08:05:43 UTC, 2 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38676: [SPARK-41162][SQL] Do not push down anti-join predicates that become ambiguous - posted by GitBox <gi...@apache.org> on 2022/11/17 08:18:10 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38684: [SPARK-41017][SQL][FOLLOWUP] Respect the original Filter operator order - posted by GitBox <gi...@apache.org> on 2022/11/17 08:29:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38686: [SPARK-41169][CONNECT][PYTHON] Implement `DataFrame.drop` - posted by GitBox <gi...@apache.org> on 2022/11/17 09:43:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38678: [SPARK-41164][CONNECT] Update relations.proto to follow Connect proto development guide - posted by GitBox <gi...@apache.org> on 2022/11/17 11:00:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38678: [SPARK-41164][CONNECT] Update relations.proto to follow Connect proto development guide - posted by GitBox <gi...@apache.org> on 2022/11/17 11:01:02 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #38687: [SPARK-41154][SQL] Incorrect relation caching for queries with time travel spec - posted by GitBox <gi...@apache.org> on 2022/11/17 11:17:50 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38687: [SPARK-41154][SQL] Incorrect relation caching for queries with time travel spec - posted by GitBox <gi...@apache.org> on 2022/11/17 11:18:20 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38609: [SPARK-40593][BUILD][CONNECT] Make user can build and test `connect` module by specifying the user-defined `protoc` and `protoc-gen-grpc-java` - posted by GitBox <gi...@apache.org> on 2022/11/17 11:20:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38609: [SPARK-40593][BUILD][CONNECT] Support user configurable `protoc` and `protoc-gen-grpc-java` executables when building Spark Connect. - posted by GitBox <gi...@apache.org> on 2022/11/17 11:30:25 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38609: [SPARK-40593][BUILD][CONNECT] Support user configurable `protoc` and `protoc-gen-grpc-java` executables when building Spark Connect. - posted by GitBox <gi...@apache.org> on 2022/11/17 11:31:53 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38688: [WIP][SPARK-41166][TESTS] Check errorSubClass of DataTypeMismatch in *ExpressionSuites - posted by GitBox <gi...@apache.org> on 2022/11/17 12:06:30 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #38689: [WIP][SPARK-41171][SQL] Push down filter through window when partitionSpec is empty - posted by GitBox <gi...@apache.org> on 2022/11/17 12:14:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38678: [SPARK-41164][CONNECT] Update relations.proto to follow Connect proto development guide - posted by GitBox <gi...@apache.org> on 2022/11/17 13:35:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38690: [SPARK-41177][PROTOBUF][TESTS] Fix maven test failed of `protobuf` module - posted by GitBox <gi...@apache.org> on 2022/11/17 14:43:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38691: [SPARK-41178][SQL] Fix parser rule precedence between JOIN and comma - posted by GitBox <gi...@apache.org> on 2022/11/17 15:31:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38691: [SPARK-41178][SQL] Fix parser rule precedence between JOIN and comma - posted by GitBox <gi...@apache.org> on 2022/11/17 15:32:32 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by GitBox <gi...@apache.org> on 2022/11/17 15:35:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38684: [SPARK-41017][SQL][FOLLOWUP] Respect the original Filter operator order - posted by GitBox <gi...@apache.org> on 2022/11/17 15:49:47 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38610: [SPARK-41106][SQL] Reduce collection conversion when create AttributeMap - posted by GitBox <gi...@apache.org> on 2022/11/17 16:04:54 UTC, 1 replies.
- [GitHub] [spark] awdavidson commented on a diff in pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by GitBox <gi...@apache.org> on 2022/11/17 16:22:26 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38690: [SPARK-41177][PROTOBUF][TESTS] Fix maven test failed of `protobuf` module - posted by GitBox <gi...@apache.org> on 2022/11/17 16:41:37 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38610: [SPARK-41106][SQL] Reduce collection conversion when create AttributeMap - posted by GitBox <gi...@apache.org> on 2022/11/17 16:53:12 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #37725: [DO-NOT-MERGE] Exceptions without error classes in SQL golden files - posted by GitBox <gi...@apache.org> on 2022/11/17 17:03:14 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38692: [SPARK-41183][SQL] Add an extension API to do plan normalization for caching - posted by GitBox <gi...@apache.org> on 2022/11/17 17:06:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38692: [SPARK-41183][SQL] Add an extension API to do plan normalization for caching - posted by GitBox <gi...@apache.org> on 2022/11/17 17:07:03 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #37725: [DO-NOT-MERGE] Exceptions without error classes in SQL golden files - posted by GitBox <gi...@apache.org> on 2022/11/17 17:09:38 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38693: Homogenize the python proto version - posted by GitBox <gi...@apache.org> on 2022/11/17 17:16:57 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38681: [SPARK-41165][CONNECT] Avoid hangs in the arrow collect code path - posted by GitBox <gi...@apache.org> on 2022/11/17 18:19:34 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38678: [SPARK-41164][CONNECT] Update relations.proto to follow Connect proto development guide - posted by GitBox <gi...@apache.org> on 2022/11/17 18:25:50 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38647: [SPARK-41133][SQL] Integrate `UNSCALED_VALUE_TOO_LARGE_FOR_PRECISION` into `NUMERIC_VALUE_OUT_OF_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/17 18:27:48 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38647: [SPARK-41133][SQL] Integrate `UNSCALED_VALUE_TOO_LARGE_FOR_PRECISION` into `NUMERIC_VALUE_OUT_OF_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/17 18:28:30 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #38567: [SPARK-41054][UI][CORE] Support RocksDB as KVStore in live UI - posted by GitBox <gi...@apache.org> on 2022/11/17 18:58:28 UTC, 7 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #38694: [SPARK-41184][CONNECT] Disable flakey Fill.NA tests - posted by GitBox <gi...@apache.org> on 2022/11/17 19:22:00 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38694: [SPARK-41184][CONNECT] Disable flakey Fill.NA tests - posted by GitBox <gi...@apache.org> on 2022/11/17 20:02:35 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38693: Homogenize the python proto version - posted by GitBox <gi...@apache.org> on 2022/11/17 20:14:10 UTC, 1 replies.
- [GitHub] [spark] vitas commented on pull request #38262: [SPARK-40801][BUILD] Upgrade `Apache commons-text` to 1.10 - posted by GitBox <gi...@apache.org> on 2022/11/17 20:33:34 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38262: [SPARK-40801][BUILD] Upgrade `Apache commons-text` to 1.10 - posted by GitBox <gi...@apache.org> on 2022/11/17 20:47:23 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37359: [SPARK-25342][CORE][SQL]Support rolling back a result stage and rerunning all result tasks when writing files - posted by GitBox <gi...@apache.org> on 2022/11/18 00:24:24 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37129: [SPARK-39710][SQL] Support push local topK through outer join - posted by GitBox <gi...@apache.org> on 2022/11/18 00:24:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36695: [SPARK-38474][CORE] Use error class in org.apache.spark.security - posted by GitBox <gi...@apache.org> on 2022/11/18 00:24:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38695: [TEST ONLY][DO NOT MERGE]. Test the schema of `collect` - posted by GitBox <gi...@apache.org> on 2022/11/18 00:27:47 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38680: [SPARK-40657][PROTOBUF][FOLLOWUP][MINOR] Add clarifying comment in ProtobufUtils - posted by GitBox <gi...@apache.org> on 2022/11/18 00:40:41 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38680: [SPARK-40657][PROTOBUF][FOLLOWUP][MINOR] Add clarifying comment in ProtobufUtils - posted by GitBox <gi...@apache.org> on 2022/11/18 00:41:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38694: [SPARK-41184][CONNECT] Disable flakey Fill.NA tests - posted by GitBox <gi...@apache.org> on 2022/11/18 00:43:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38681: [SPARK-41165][CONNECT] Avoid hangs in the arrow collect code path - posted by GitBox <gi...@apache.org> on 2022/11/18 00:43:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38694: [SPARK-41184][CONNECT] Disable flakey Fill.NA tests - posted by GitBox <gi...@apache.org> on 2022/11/18 00:43:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38681: [SPARK-41165][CONNECT] Avoid hangs in the arrow collect code path - posted by GitBox <gi...@apache.org> on 2022/11/18 00:43:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38609: [SPARK-40593][BUILD][CONNECT] Support user configurable `protoc` and `protoc-gen-grpc-java` executables when building Spark Connect. - posted by GitBox <gi...@apache.org> on 2022/11/18 01:09:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38609: [SPARK-40593][BUILD][CONNECT] Support user configurable `protoc` and `protoc-gen-grpc-java` executables when building Spark Connect. - posted by GitBox <gi...@apache.org> on 2022/11/18 01:09:54 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38611: [SPARK-41107][PYTHON][INFRA][TESTS] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/18 01:11:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38611: [SPARK-41107][PYTHON][INFRA][TESTS] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/18 01:24:18 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38696: [SPARK-41175][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1078 - posted by GitBox <gi...@apache.org> on 2022/11/18 01:55:15 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #38611: [SPARK-41107][PYTHON][INFRA][TESTS] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/18 02:12:11 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #38697: [SPARK-41118][SQL][3.3] `to_number`/`try_to_number` should return `null` when format is `null` - posted by GitBox <gi...@apache.org> on 2022/11/18 02:27:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38690: [SPARK-41177][PROTOBUF][TESTS] Fix maven test failed of `protobuf` module - posted by GitBox <gi...@apache.org> on 2022/11/18 02:31:04 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38690: [SPARK-41177][PROTOBUF][TESTS] Fix maven test failed of `protobuf` module - posted by GitBox <gi...@apache.org> on 2022/11/18 02:37:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38690: [SPARK-41177][PROTOBUF][TESTS] Fix maven test failed of `protobuf` module - posted by GitBox <gi...@apache.org> on 2022/11/18 02:37:55 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38683: [SPARK-41151][SQL][3.3] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/18 02:43:18 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #37725: [DO-NOT-MERGE] Exceptions without error classes in SQL golden files - posted by GitBox <gi...@apache.org> on 2022/11/18 02:48:13 UTC, 0 replies.
- [GitHub] [spark] Yikun opened a new pull request, #38698: [SPARK-41186][PS][TESTS] Replace `list_run_infos` with `search_runs` in mlflow doctest - posted by GitBox <gi...@apache.org> on 2022/11/18 03:01:12 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #38698: [SPARK-41186][PS][TESTS] Replace `list_run_infos` with `search_runs` in mlflow doctest - posted by GitBox <gi...@apache.org> on 2022/11/18 03:02:24 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38698: [SPARK-41186][PS][TESTS] Replace `list_run_infos` with `search_runs` in mlflow doctest - posted by GitBox <gi...@apache.org> on 2022/11/18 03:08:06 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #38699: [SPARK-41188][CORE][ML] Set executorEnv OMP_NUM_THREADS to be spark.task.cpus by default for spark executor JVM processes - posted by GitBox <gi...@apache.org> on 2022/11/18 03:22:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38691: [SPARK-41178][SQL] Fix parser rule precedence between JOIN and comma - posted by GitBox <gi...@apache.org> on 2022/11/18 03:26:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38700: [SPARK-41189][PYTHON] Add an environment to switch on and off namedtuple hack - posted by GitBox <gi...@apache.org> on 2022/11/18 03:33:51 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38064: [SPARK-40622][SQL][CORE]Remove the limitation that single task result must fit in 2GB - posted by GitBox <gi...@apache.org> on 2022/11/18 03:35:42 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38701: [TEST ONLY][DO NOT MERGE] Test collect after avoiding hang with arrow-collect - posted by GitBox <gi...@apache.org> on 2022/11/18 03:36:06 UTC, 0 replies.
- [GitHub] [spark] wineternity opened a new pull request, #38702: SPARK-41187 [Core] LiveExecutor MemoryLeak in AppStatusListener when ExecutorLost happen - posted by GitBox <gi...@apache.org> on 2022/11/18 03:37:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38064: [SPARK-40622][SQL][CORE]Remove the limitation that single task result must fit in 2GB - posted by GitBox <gi...@apache.org> on 2022/11/18 03:47:13 UTC, 1 replies.
- [GitHub] [spark] mcdull-zhang opened a new pull request, #38703: [SPARK-41191] [SQL] Cache Table is not working while nested caches exist - posted by GitBox <gi...@apache.org> on 2022/11/18 04:19:24 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38704: [SPARK-41193][SQL][TESTS] Ignore `collect data with single partition larger than 2GB bytes array limit` in `DatasetLargeResultCollectingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/18 04:43:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38704: [SPARK-41193][SQL][TESTS] Ignore `collect data with single partition larger than 2GB bytes array limit` in `DatasetLargeResultCollectingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/18 04:43:28 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38638: [SPARK-41122][CONNECT] Explain API can support different modes - posted by GitBox <gi...@apache.org> on 2022/11/18 04:51:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38638: [SPARK-41122][CONNECT] Explain API can support different modes - posted by GitBox <gi...@apache.org> on 2022/11/18 04:51:56 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #38700: [SPARK-41189][PYTHON] Add an environment to switch on and off namedtuple hack - posted by GitBox <gi...@apache.org> on 2022/11/18 05:00:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38700: [SPARK-41189][PYTHON] Add an environment to switch on and off namedtuple hack - posted by GitBox <gi...@apache.org> on 2022/11/18 05:18:48 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #38702: SPARK-41187 [Core] LiveExecutor MemoryLeak in AppStatusListener when ExecutorLost happen - posted by GitBox <gi...@apache.org> on 2022/11/18 05:26:51 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38683: [SPARK-41151][SQL][3.3] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/18 05:38:35 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38705: [SPARK-41173][SQL] Move `require()` out from the constructors of string expressions - posted by GitBox <gi...@apache.org> on 2022/11/18 05:50:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38706: [TEST ONLY] Come back to collect.foreach(send) - posted by GitBox <gi...@apache.org> on 2022/11/18 05:57:14 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38707: [SPARK-41176][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1042 - posted by GitBox <gi...@apache.org> on 2022/11/18 06:07:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #37725: [DO-NOT-MERGE] Exceptions without error classes in SQL golden files - posted by GitBox <gi...@apache.org> on 2022/11/18 06:51:46 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38708: [SPARK-41194][PROTOBUF][TESTS] Add `log4j2.properties` for testing to `protobuf` module - posted by GitBox <gi...@apache.org> on 2022/11/18 07:06:32 UTC, 0 replies.
- [GitHub] [spark] toujours33 opened a new pull request, #38709: [SPARK-41192][Core] Remove unscheduled speculative tasks when task finished to obtain better dynamic - posted by GitBox <gi...@apache.org> on 2022/11/18 07:35:41 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38710: [SPARK-41179][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1092 - posted by GitBox <gi...@apache.org> on 2022/11/18 07:43:28 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #38676: [SPARK-41162][SQL] Do not push down anti-join predicates that become ambiguous - posted by GitBox <gi...@apache.org> on 2022/11/18 07:49:16 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38692: [SPARK-41183][SQL] Add an extension API to do plan normalization for caching - posted by GitBox <gi...@apache.org> on 2022/11/18 08:57:30 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38700: [SPARK-41189][PYTHON] Add an environment to switch on and off namedtuple hack - posted by GitBox <gi...@apache.org> on 2022/11/18 08:59:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38700: [SPARK-41189][PYTHON] Add an environment to switch on and off namedtuple hack - posted by GitBox <gi...@apache.org> on 2022/11/18 08:59:50 UTC, 0 replies.
- [GitHub] [spark] Yikun closed pull request #38611: [SPARK-41107][PYTHON][INFRA][TESTS] Install memory-profiler in the CI - posted by GitBox <gi...@apache.org> on 2022/11/18 09:12:53 UTC, 0 replies.
- [GitHub] [spark] toujours33 closed pull request #38709: [SPARK-41192][Core] Remove unscheduled speculative tasks when task finished to obtain better dynamic - posted by GitBox <gi...@apache.org> on 2022/11/18 09:34:28 UTC, 0 replies.
- [GitHub] [spark] toujours33 opened a new pull request, #38711: [SPARK-41192][Core] Remove unscheduled speculative tasks when task finished to obtain better dynamic - posted by GitBox <gi...@apache.org> on 2022/11/18 09:41:22 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38712: [WIP][SQL] Parameterized SQL queries - posted by GitBox <gi...@apache.org> on 2022/11/18 10:00:10 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38665: [SPARK-41156][SQL] Remove the class `TypeCheckFailure` - posted by GitBox <gi...@apache.org> on 2022/11/18 10:01:56 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38644: [SPARK-41130][SQL] Rename `OUT_OF_DECIMAL_TYPE_RANGE` to `NUMERIC_OUT_OF_SUPPORTED_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/18 10:11:28 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38644: [SPARK-41130][SQL] Rename `OUT_OF_DECIMAL_TYPE_RANGE` to `NUMERIC_OUT_OF_SUPPORTED_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/18 10:12:17 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38644: [SPARK-41130][SQL] Rename `OUT_OF_DECIMAL_TYPE_RANGE` to `NUMERIC_OUT_OF_SUPPORTED_RANGE` - posted by GitBox <gi...@apache.org> on 2022/11/18 10:13:10 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #38682: [SPARK-41167][SQL] Optimize LikeSimplification rule to improve multi like performance - posted by GitBox <gi...@apache.org> on 2022/11/18 10:26:04 UTC, 7 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38688: [SPARK-41166][TESTS] Check errorSubClass of DataTypeMismatch in *ExpressionSuites - posted by GitBox <gi...@apache.org> on 2022/11/18 10:29:49 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38688: [SPARK-41166][SQL][TESTS] Check errorSubClass of DataTypeMismatch in *ExpressionSuites - posted by GitBox <gi...@apache.org> on 2022/11/18 10:31:09 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38705: [SPARK-41173][SQL] Move `require()` out from the constructors of string expressions - posted by GitBox <gi...@apache.org> on 2022/11/18 11:03:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38675: [SPARK-41161][BUILD] Upgrade scala-parser-combinators to 2.1.1 - posted by GitBox <gi...@apache.org> on 2022/11/18 11:44:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38675: [SPARK-41161][BUILD] Upgrade scala-parser-combinators to 2.1.1 - posted by GitBox <gi...@apache.org> on 2022/11/18 11:44:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/18 11:52:32 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38706: [TEST ONLY] Come back to collect.foreach(send) - posted by GitBox <gi...@apache.org> on 2022/11/18 12:18:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38687: [SPARK-41154][SQL] Incorrect relation caching for queries with time travel spec - posted by GitBox <gi...@apache.org> on 2022/11/18 13:42:53 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38497: [SPARK-40999] Hint propagation to subqueries - posted by GitBox <gi...@apache.org> on 2022/11/18 13:48:51 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38497: [SPARK-40999] Hint propagation to subqueries - posted by GitBox <gi...@apache.org> on 2022/11/18 13:50:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38692: [SPARK-41183][SQL] Add an extension API to do plan normalization for caching - posted by GitBox <gi...@apache.org> on 2022/11/18 13:50:39 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38711: [SPARK-41192][Core] Remove unscheduled speculative tasks when task finished to obtain better dynamic - posted by GitBox <gi...@apache.org> on 2022/11/18 14:00:51 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38705: [SPARK-41173][SQL] Move `require()` out from the constructors of string expressions - posted by GitBox <gi...@apache.org> on 2022/11/18 14:07:35 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38075: [WIP][SPARK-40633][BUILD] Upgrade janino to 3.1.8 - posted by GitBox <gi...@apache.org> on 2022/11/18 14:27:02 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38713: [SPARK-41195][SQL] Support PIVOT/UNPIVOT with join children - posted by GitBox <gi...@apache.org> on 2022/11/18 15:13:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38713: [SPARK-41195][SQL] Support PIVOT/UNPIVOT with join children - posted by GitBox <gi...@apache.org> on 2022/11/18 15:15:12 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38713: [SPARK-41195][SQL] Support PIVOT/UNPIVOT with join children - posted by GitBox <gi...@apache.org> on 2022/11/18 15:15:18 UTC, 0 replies.
- [GitHub] [spark] ryan-johnson-databricks commented on a diff in pull request #38692: [SPARK-41183][SQL] Add an extension API to do plan normalization for caching - posted by GitBox <gi...@apache.org> on 2022/11/18 15:24:03 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38711: [SPARK-41192][Core] Remove unscheduled speculative tasks when task finished to obtain better dynamic - posted by GitBox <gi...@apache.org> on 2022/11/18 15:57:11 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38710: [SPARK-41179][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1092 - posted by GitBox <gi...@apache.org> on 2022/11/18 15:57:14 UTC, 0 replies.
- [GitHub] [spark] antonipp commented on a diff in pull request #38376: [SPARK-40817] [Kubernetes] Do not discard remote user-specified files when launching Spark jobs on Kubernetes - posted by GitBox <gi...@apache.org> on 2022/11/18 16:31:20 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38713: [SPARK-41195][SQL] Support PIVOT/UNPIVOT with join children - posted by GitBox <gi...@apache.org> on 2022/11/18 17:05:22 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #38699: [SPARK-41188][CORE][ML] Set executorEnv OMP_NUM_THREADS to be spark.task.cpus by default for spark executor JVM processes - posted by GitBox <gi...@apache.org> on 2022/11/18 17:06:03 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38704: [SPARK-41193][SQL][TESTS] Ignore `collect data with single partition larger than 2GB bytes array limit` in `DatasetLargeResultCollectingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/18 17:10:33 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #38676: [SPARK-41162][SQL] Do not push down anti-join predicates that become ambiguous - posted by GitBox <gi...@apache.org> on 2022/11/18 17:27:34 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38707: [SPARK-41176][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1042 - posted by GitBox <gi...@apache.org> on 2022/11/18 17:53:45 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #38693: [SPARK-41196] [CONNECT] Homogenize the protobuf version across the Spark connect server to use the same major version. - posted by GitBox <gi...@apache.org> on 2022/11/18 18:34:55 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38693: [SPARK-41196] [CONNECT] Homogenize the protobuf version across the Spark connect server to use the same major version. - posted by GitBox <gi...@apache.org> on 2022/11/18 18:35:58 UTC, 0 replies.
- [GitHub] [spark] ahshahid opened a new pull request, #38714: [WIP][SPARK-41141]. avoid introducing a new aggregate expression in the analysis phase when subquery is referencing it - posted by GitBox <gi...@apache.org> on 2022/11/18 18:36:31 UTC, 0 replies.
- [GitHub] [spark] tedyu opened a new pull request, #38715: [SPARK-41197] Upgrade Kafka version to 3.3 release - posted by GitBox <gi...@apache.org> on 2022/11/18 18:46:55 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38705: [SPARK-41173][SQL] Move `require()` out from the constructors of string expressions - posted by GitBox <gi...@apache.org> on 2022/11/18 19:13:56 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38705: [SPARK-41173][SQL] Move `require()` out from the constructors of string expressions - posted by GitBox <gi...@apache.org> on 2022/11/18 19:15:17 UTC, 0 replies.
- [GitHub] [spark] geofflangenderfer commented on pull request #4093: [SPARK-5307] SerializationDebugger to help debug NotSerializableException - posted by GitBox <gi...@apache.org> on 2022/11/18 19:47:46 UTC, 0 replies.
- [GitHub] [spark] viirya opened a new pull request, #38716: [SPARK-XXXXX][SS] Use latestCommittedBatchId as currentBatchId when resuming late batch - posted by GitBox <gi...@apache.org> on 2022/11/18 19:52:42 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #38717: [SPARK-41198][SS] Fix metrics in streaming query having CTE and DSv1 streaming source - posted by GitBox <gi...@apache.org> on 2022/11/18 20:26:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38696: [SPARK-41175][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1078 - posted by GitBox <gi...@apache.org> on 2022/11/18 20:30:26 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38718: [SPARK-41196][CONNECT][FOLLOW-UP] Fix out of sync generated files for Python - posted by GitBox <gi...@apache.org> on 2022/11/18 20:51:43 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38718: [SPARK-41196][CONNECT][FOLLOW-UP] Fix out of sync generated files for Python - posted by GitBox <gi...@apache.org> on 2022/11/18 20:51:50 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #38719: [SPARK-41199][SS] Fix metrics issue when DSv1 streaming source and DSv2 streaming source are co-used - posted by GitBox <gi...@apache.org> on 2022/11/18 21:08:55 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #35969: [SPARK-38651][SQL] Add configuration to support writing out empty schemas in supported filebased datasources - posted by GitBox <gi...@apache.org> on 2022/11/18 21:12:46 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38717: [SPARK-41198][SS] Fix metrics in streaming query having CTE and DSv1 streaming source - posted by GitBox <gi...@apache.org> on 2022/11/18 21:17:49 UTC, 2 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38719: [SPARK-41199][SS] Fix metrics issue when DSv1 streaming source and DSv2 streaming source are co-used - posted by GitBox <gi...@apache.org> on 2022/11/18 21:18:04 UTC, 1 replies.
- [GitHub] [spark] tedyu commented on pull request #38715: [SPARK-41197] Upgrade Kafka version to 3.3 release - posted by GitBox <gi...@apache.org> on 2022/11/18 21:53:39 UTC, 5 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38703: [SPARK-41191] [SQL] Cache Table is not working while nested caches exist - posted by GitBox <gi...@apache.org> on 2022/11/18 22:47:04 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38702: [SPARK-41187][Core] LiveExecutor MemoryLeak in AppStatusListener when ExecutorLost happen - posted by GitBox <gi...@apache.org> on 2022/11/18 22:47:07 UTC, 0 replies.
- [GitHub] [spark] liuzqt commented on a diff in pull request #38704: [SPARK-41193][SQL][TESTS] Ignore `collect data with single partition larger than 2GB bytes array limit` in `DatasetLargeResultCollectingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/18 23:15:22 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #38716: [SPARK-XXXXX][SS] Use latestCommittedBatchId as currentBatchId when resuming late batch - posted by GitBox <gi...@apache.org> on 2022/11/18 23:16:12 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37460: [WIP][SPARK-40031][SQL] Remove unnecessary TryEval in TryCast - posted by GitBox <gi...@apache.org> on 2022/11/19 00:21:29 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37359: [SPARK-25342][CORE][SQL]Support rolling back a result stage and rerunning all result tasks when writing files - posted by GitBox <gi...@apache.org> on 2022/11/19 00:21:31 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37129: [SPARK-39710][SQL] Support push local topK through outer join - posted by GitBox <gi...@apache.org> on 2022/11/19 00:21:33 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36767: [SPARK-39363][K8S] Deprecate k8s memory overhead and make it optional - posted by GitBox <gi...@apache.org> on 2022/11/19 00:21:33 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36695: [SPARK-38474][CORE] Use error class in org.apache.spark.security - posted by GitBox <gi...@apache.org> on 2022/11/19 00:21:35 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38719: [SPARK-41199][SS] Fix metrics issue when DSv1 streaming source and DSv2 streaming source are co-used - posted by GitBox <gi...@apache.org> on 2022/11/19 00:31:41 UTC, 1 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #38720: [SPARK-41165][SPARK-41184][CONNECT] Fix arrow collect (again) and reenable tests. - posted by GitBox <gi...@apache.org> on 2022/11/19 00:33:09 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #38719: [SPARK-41199][SS] Fix metrics issue when DSv1 streaming source and DSv2 streaming source are co-used - posted by GitBox <gi...@apache.org> on 2022/11/19 00:34:26 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38718: [SPARK-41196][CONNECT][FOLLOW-UP] Fix out of sync generated files for Python - posted by GitBox <gi...@apache.org> on 2022/11/19 00:43:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38718: [SPARK-41196][CONNECT][FOLLOW-UP] Fix out of sync generated files for Python - posted by GitBox <gi...@apache.org> on 2022/11/19 00:44:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38698: [SPARK-41186][INFRA][PS][TESTS] Upgrade infra and replace `list_run_infos` with `search_runs` in mlflow doctest - posted by GitBox <gi...@apache.org> on 2022/11/19 00:52:23 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #38682: [SPARK-41167][SQL] Optimize LikeSimplification rule to improve multi like performance - posted by GitBox <gi...@apache.org> on 2022/11/19 01:45:29 UTC, 0 replies.
- [GitHub] [spark] Yikun closed pull request #38698: [SPARK-41186][INFRA][PS][TESTS] Upgrade infra and replace `list_run_infos` with `search_runs` in mlflow doctest - posted by GitBox <gi...@apache.org> on 2022/11/19 01:51:24 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #38698: [SPARK-41186][INFRA][PS][TESTS] Upgrade infra and replace `list_run_infos` with `search_runs` in mlflow doctest - posted by GitBox <gi...@apache.org> on 2022/11/19 01:51:58 UTC, 0 replies.
- [GitHub] [spark] viirya closed pull request #38716: [SPARK-XXXXX][SS] Use latestCommittedBatchId as currentBatchId when resuming late batch - posted by GitBox <gi...@apache.org> on 2022/11/19 03:57:07 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #38699: [SPARK-41188][CORE][ML] Set executorEnv OMP_NUM_THREADS to be spark.task.cpus by default for spark executor JVM processes - posted by GitBox <gi...@apache.org> on 2022/11/19 04:13:53 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38721: [WIP][SPARK-41172][SQL] Migrate the ambiguous ref error to an error class - posted by GitBox <gi...@apache.org> on 2022/11/19 04:34:42 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #38682: [SPARK-41167][SQL] Improve multi like performance by creating a balanced expression tree predicate - posted by GitBox <gi...@apache.org> on 2022/11/19 05:07:47 UTC, 0 replies.
- [GitHub] [spark] WangGuangxin opened a new pull request, #38722: [SPARK-41200][CORE] BytesToBytesMap's longArray size can be up to MAX_CAPACITY - posted by GitBox <gi...@apache.org> on 2022/11/19 05:23:54 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38696: [SPARK-41175][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1078 - posted by GitBox <gi...@apache.org> on 2022/11/19 05:35:53 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38723: [SPARK-41201][CONNECT][PYTHON] Implement `DataFrame.SelectExpr` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/19 05:59:58 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38723: [SPARK-41201][CONNECT][PYTHON] Implement `DataFrame.SelectExpr` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/19 06:00:07 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38696: [SPARK-41175][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1078 - posted by GitBox <gi...@apache.org> on 2022/11/19 06:02:45 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38693: [SPARK-41196] [CONNECT] Homogenize the protobuf version across the Spark connect server to use the same major version. - posted by GitBox <gi...@apache.org> on 2022/11/19 07:34:25 UTC, 0 replies.
- [GitHub] [spark] williamhyun opened a new pull request, #38724: [SPARK-41202][BUILD] Update ORC to 1.7.7 - posted by GitBox <gi...@apache.org> on 2022/11/19 07:57:22 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #38699: [SPARK-41188][CORE][ML] Set executorEnv OMP_NUM_THREADS to be spark.task.cpus by default for spark executor JVM processes - posted by GitBox <gi...@apache.org> on 2022/11/19 09:23:37 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38725: [SPARK-41182][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1102 - posted by GitBox <gi...@apache.org> on 2022/11/19 10:51:14 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38721: [SPARK-41172][SQL] Migrate the ambiguous ref error to an error class - posted by GitBox <gi...@apache.org> on 2022/11/19 10:52:48 UTC, 0 replies.
- [GitHub] [spark] WangGuangxin commented on pull request #38722: [SPARK-41200][CORE] BytesToBytesMap's longArray size can be up to MAX_CAPACITY - posted by GitBox <gi...@apache.org> on 2022/11/19 11:41:56 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38719: [SPARK-41199][SS] Fix metrics issue when DSv1 streaming source and DSv2 streaming source are co-used - posted by GitBox <gi...@apache.org> on 2022/11/19 13:42:52 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38723: [SPARK-41201][CONNECT][PYTHON] Implement `DataFrame.SelectExpr` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/19 14:03:35 UTC, 3 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38686: [SPARK-41169][CONNECT][PYTHON] Implement `DataFrame.drop` - posted by GitBox <gi...@apache.org> on 2022/11/19 14:13:41 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38683: [SPARK-41151][SQL][3.3] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/19 17:28:07 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38682: [SPARK-41167][SQL] Improve multi like performance by creating a balanced expression tree predicate - posted by GitBox <gi...@apache.org> on 2022/11/19 17:28:10 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38676: [SPARK-41162][SQL] Do not push down anti-join predicates that become ambiguous - posted by GitBox <gi...@apache.org> on 2022/11/19 17:28:13 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38721: [SPARK-41172][SQL] Migrate the ambiguous ref error to an error class - posted by GitBox <gi...@apache.org> on 2022/11/19 17:31:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38721: [SPARK-41172][SQL] Migrate the ambiguous ref error to an error class - posted by GitBox <gi...@apache.org> on 2022/11/19 17:31:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38724: [SPARK-41202][BUILD][3.3] Update ORC to 1.7.7 - posted by GitBox <gi...@apache.org> on 2022/11/19 19:06:33 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38720: [SPARK-41165][SPARK-41184][CONNECT] Fix arrow collect (again) and reenable tests. - posted by GitBox <gi...@apache.org> on 2022/11/19 20:20:28 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38715: [SPARK-41197] Upgrade Kafka version to 3.3 release - posted by GitBox <gi...@apache.org> on 2022/11/19 20:53:34 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38726: [SPARK-41203] Support Dataframe.tansform in Python client. - posted by GitBox <gi...@apache.org> on 2022/11/19 21:04:44 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37460: [WIP][SPARK-40031][SQL] Remove unnecessary TryEval in TryCast - posted by GitBox <gi...@apache.org> on 2022/11/20 00:23:00 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36908: [SPARK-39510][SQL][WIP] Leverage the natural partitioning and ordering of MonotonicallyIncreasingID - posted by GitBox <gi...@apache.org> on 2022/11/20 00:23:01 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36767: [SPARK-39363][K8S] Deprecate k8s memory overhead and make it optional - posted by GitBox <gi...@apache.org> on 2022/11/20 00:23:03 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36070: [SPARK-31675][CORE] Fix rename and delete files with different filesystem - posted by GitBox <gi...@apache.org> on 2022/11/20 00:23:04 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #38727: [SPARK-41205][SQL] Check that format is foldable in `TryToBinary` - posted by GitBox <gi...@apache.org> on 2022/11/20 02:02:11 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on a diff in pull request #38727: [SPARK-41205][SQL] Check that format is foldable in `TryToBinary` - posted by GitBox <gi...@apache.org> on 2022/11/20 02:04:51 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38726: [SPARK-41203] Support Dataframe.tansform in Python client. - posted by GitBox <gi...@apache.org> on 2022/11/20 04:00:04 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38723: [SPARK-41201][CONNECT][PYTHON] Implement `DataFrame.SelectExpr` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/20 04:05:49 UTC, 5 replies.
- [GitHub] [spark] aviralKumar13 commented on pull request #37196: [SPARK-39781][SS] Add support for providing max_open_files to rocksdb state store provider - posted by GitBox <gi...@apache.org> on 2022/11/20 04:34:58 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38717: [SPARK-41198][SS] Fix metrics in streaming query having CTE and DSv1 streaming source - posted by GitBox <gi...@apache.org> on 2022/11/20 07:16:09 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38727: [SPARK-41205][SQL] Check that format is foldable in `TryToBinary` - posted by GitBox <gi...@apache.org> on 2022/11/20 07:31:25 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38726: [SPARK-41203] [CONNECT] Support Dataframe.tansform in Python client. - posted by GitBox <gi...@apache.org> on 2022/11/20 08:05:15 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #38717: [SPARK-41198][SS] Fix metrics in streaming query having CTE and DSv1 streaming source - posted by GitBox <gi...@apache.org> on 2022/11/20 09:01:50 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38726: [SPARK-41203] [CONNECT] Support Dataframe.tansform in Python client. - posted by GitBox <gi...@apache.org> on 2022/11/20 09:02:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38708: [SPARK-41194][PROTOBUF][TESTS] Add `log4j2.properties` configuration file for `protobuf` module testing - posted by GitBox <gi...@apache.org> on 2022/11/20 09:04:07 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38708: [SPARK-41194][PROTOBUF][TESTS] Add `log4j2.properties` configuration file for `protobuf` module testing - posted by GitBox <gi...@apache.org> on 2022/11/20 09:04:42 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38726: [SPARK-41203] [CONNECT] Support Dataframe.tansform in Python client. - posted by GitBox <gi...@apache.org> on 2022/11/20 11:54:16 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38728: [SPARK-41204] [CONNECT] Migrate custom exceptions to use Spark exceptions - posted by GitBox <gi...@apache.org> on 2022/11/20 13:56:58 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38728: [SPARK-41204] [CONNECT] Migrate custom exceptions to use Spark exceptions - posted by GitBox <gi...@apache.org> on 2022/11/20 15:04:08 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38685: [SPARK-41206][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1233` to `COLUMN_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/20 16:42:04 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38725: [SPARK-41182][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1102 - posted by GitBox <gi...@apache.org> on 2022/11/20 16:50:37 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38726: [SPARK-41203] [CONNECT] Support Dataframe.tansform in Python client. - posted by GitBox <gi...@apache.org> on 2022/11/20 16:58:27 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38725: [SPARK-41182][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1102 - posted by GitBox <gi...@apache.org> on 2022/11/20 16:58:30 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38723: [SPARK-41201][CONNECT][PYTHON] Implement `DataFrame.SelectExpr` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/20 16:58:33 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38722: [SPARK-41200][CORE] BytesToBytesMap's longArray size can be up to MAX_CAPACITY - posted by GitBox <gi...@apache.org> on 2022/11/20 16:58:36 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #38685: [SPARK-41206][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1233` to `COLUMN_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/20 17:08:33 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38717: [SPARK-41198][SS] Fix metrics in streaming query having CTE and DSv1 streaming source - posted by GitBox <gi...@apache.org> on 2022/11/20 21:45:02 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38714: [WIP][SPARK-41141]. avoid introducing a new aggregate expression in the analysis phase when subquery is referencing it - posted by GitBox <gi...@apache.org> on 2022/11/20 21:50:50 UTC, 0 replies.
- [GitHub] [spark] HuwCampbell commented on a diff in pull request #36441: [SPARK-39091][SQL] Updating specific SQL Expression traits that don't compose when multiple are extended due to nodePatterns being final. - posted by GitBox <gi...@apache.org> on 2022/11/21 00:12:16 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #38687: [SPARK-41154][SQL] Incorrect relation caching for queries with time travel spec - posted by GitBox <gi...@apache.org> on 2022/11/21 00:13:50 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36908: [SPARK-39510][SQL][WIP] Leverage the natural partitioning and ordering of MonotonicallyIncreasingID - posted by GitBox <gi...@apache.org> on 2022/11/21 00:21:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36443: [POC][WIP][SPARK-39088][CORE] Add a "live" driver link to the UI for history server when serving in-progress applications. - posted by GitBox <gi...@apache.org> on 2022/11/21 00:21:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36070: [SPARK-31675][CORE] Fix rename and delete files with different filesystem - posted by GitBox <gi...@apache.org> on 2022/11/21 00:21:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38726: [SPARK-41203] [CONNECT] Support Dataframe.tansform in Python client. - posted by GitBox <gi...@apache.org> on 2022/11/21 00:37:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38726: [SPARK-41203] [CONNECT] Support Dataframe.tansform in Python client. - posted by GitBox <gi...@apache.org> on 2022/11/21 00:38:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38728: [SPARK-41204] [CONNECT] Migrate custom exceptions to use Spark exceptions - posted by GitBox <gi...@apache.org> on 2022/11/21 00:38:59 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38725: [SPARK-41182][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1102 - posted by GitBox <gi...@apache.org> on 2022/11/21 00:46:45 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38718: [SPARK-41196][CONNECT][FOLLOW-UP] Fix out of sync generated files for Python - posted by GitBox <gi...@apache.org> on 2022/11/21 01:05:05 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38728: [SPARK-41204] [CONNECT] Migrate custom exceptions to use Spark exceptions - posted by GitBox <gi...@apache.org> on 2022/11/21 01:33:05 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38704: [SPARK-41193][SQL][TESTS] Ignore `collect data with single partition larger than 2GB bytes array limit` in `DatasetLargeResultCollectingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/21 02:32:50 UTC, 0 replies.
- [GitHub] [spark] mcdull-zhang commented on pull request #38703: [SPARK-41191] [SQL] Cache Table is not working while nested caches exist - posted by GitBox <gi...@apache.org> on 2022/11/21 02:35:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38729: [CONNECT][INFRA] Update protobuf versions in CI - posted by GitBox <gi...@apache.org> on 2022/11/21 02:35:53 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38730: [SPARK-41181][SQL] Migrate the map options errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/21 02:37:08 UTC, 0 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #38731: [SPARK-41209] Improve PySpark type inference in _merge_type method - posted by GitBox <gi...@apache.org> on 2022/11/21 02:37:59 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #38731: [SPARK-41209] Improve PySpark type inference in _merge_type method - posted by GitBox <gi...@apache.org> on 2022/11/21 02:38:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38683: [SPARK-41151][SQL] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/21 02:54:08 UTC, 8 replies.
- [GitHub] [spark] wankunde commented on pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/11/21 03:05:27 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #38683: [SPARK-41151][SQL] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/21 03:05:49 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38686: [SPARK-41169][CONNECT][PYTHON] Implement `DataFrame.drop` - posted by GitBox <gi...@apache.org> on 2022/11/21 03:06:29 UTC, 5 replies.
- [GitHub] [spark] pan3793 opened a new pull request, #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by GitBox <gi...@apache.org> on 2022/11/21 03:14:44 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38686: [SPARK-41169][CONNECT][PYTHON] Implement `DataFrame.drop` - posted by GitBox <gi...@apache.org> on 2022/11/21 03:21:46 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #38733: [SPARK-41211][Core] Upgrade ZooKeeper from 3.6.2 to 3.6.3 - posted by GitBox <gi...@apache.org> on 2022/11/21 03:25:06 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on a diff in pull request #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by GitBox <gi...@apache.org> on 2022/11/21 03:26:16 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38734: [SPARK-41212][CONNECT][PYTHON] Implement `DataFrame.isEmpty` - posted by GitBox <gi...@apache.org> on 2022/11/21 03:44:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38735: [SPARK-41213][CONNECT][PYTHON] Implement `DataFrame.__repr__` and `DataFrame.dtypes` - posted by GitBox <gi...@apache.org> on 2022/11/21 05:07:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38735: [SPARK-41213][CONNECT][PYTHON] Implement `DataFrame.__repr__` and `DataFrame.dtypes` - posted by GitBox <gi...@apache.org> on 2022/11/21 05:08:06 UTC, 1 replies.
- [GitHub] [spark] sadikovi commented on pull request #38731: [SPARK-41209][PYSPARK] Improve PySpark type inference in _merge_type method - posted by GitBox <gi...@apache.org> on 2022/11/21 05:22:26 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38692: [SPARK-41183][SQL] Add an extension API to do plan normalization for caching - posted by GitBox <gi...@apache.org> on 2022/11/21 05:31:36 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #38333: [SPARK-40872] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/21 06:14:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38333: [SPARK-40872] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/21 06:22:07 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38567: [SPARK-41054][UI][CORE] Support RocksDB as KVStore in live UI - posted by GitBox <gi...@apache.org> on 2022/11/21 06:36:25 UTC, 0 replies.
- [GitHub] [spark] erenavsarogullari opened a new pull request, #38736: [WIP][SPARK-41214][SQL] - SubPlan metrics under InMemoryRelation are missed when … - posted by GitBox <gi...@apache.org> on 2022/11/21 06:55:36 UTC, 0 replies.
- [GitHub] [spark] erenavsarogullari commented on pull request #38736: [WIP][SPARK-41214][SQL] - SubPlan metrics are missed when AQE is enabled under InMemoryRelation - posted by GitBox <gi...@apache.org> on 2022/11/21 07:02:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38737: [SPARK-41174][SQL] Propagate an error class to users for invalid `format` of `to_binary()` - posted by GitBox <gi...@apache.org> on 2022/11/21 07:03:10 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2022/11/21 07:03:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38703: [SPARK-41191] [SQL] Cache Table is not working while nested caches exist - posted by GitBox <gi...@apache.org> on 2022/11/21 07:21:19 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #38711: [SPARK-41192][Core] Remove unscheduled speculative tasks when task finished to obtain better dynamic - posted by GitBox <gi...@apache.org> on 2022/11/21 07:37:12 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38722: [SPARK-41200][CORE] BytesToBytesMap's longArray size can be up to MAX_CAPACITY - posted by GitBox <gi...@apache.org> on 2022/11/21 07:37:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38075: [WIP][SPARK-40633][BUILD] Upgrade janino to 3.1.8 - posted by GitBox <gi...@apache.org> on 2022/11/21 07:55:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38075: [WIP][SPARK-40633][BUILD] Upgrade janino to 3.1.9 - posted by GitBox <gi...@apache.org> on 2022/11/21 08:11:40 UTC, 0 replies.
- [GitHub] [spark] toujours33 commented on pull request #38711: [SPARK-41192][Core] Remove unscheduled speculative tasks when task finished to obtain better dynamic - posted by GitBox <gi...@apache.org> on 2022/11/21 08:12:30 UTC, 3 replies.
- [GitHub] [spark] yaooqinn commented on pull request #38733: [SPARK-41211][BUILD] Upgrade ZooKeeper from 3.6.2 to 3.6.3 - posted by GitBox <gi...@apache.org> on 2022/11/21 08:15:41 UTC, 0 replies.
- [GitHub] [spark] yaooqinn closed pull request #38733: [SPARK-41211][BUILD] Upgrade ZooKeeper from 3.6.2 to 3.6.3 - posted by GitBox <gi...@apache.org> on 2022/11/21 08:15:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38738: WIP - posted by GitBox <gi...@apache.org> on 2022/11/21 08:55:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38665: [SPARK-41156][SQL] Remove the class `TypeCheckFailure` - posted by GitBox <gi...@apache.org> on 2022/11/21 08:55:58 UTC, 2 replies.
- [GitHub] [spark] zengqinchris commented on pull request #35539: [SPARK-38228][SQL] Legacy store assignment should not fail on error under ANSI mode - posted by GitBox <gi...@apache.org> on 2022/11/21 09:25:09 UTC, 2 replies.
- [GitHub] [spark] wineternity commented on pull request #38702: [SPARK-41187][Core] LiveExecutor MemoryLeak in AppStatusListener when ExecutorLost happen - posted by GitBox <gi...@apache.org> on 2022/11/21 09:42:32 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #38739: [SPARK-41207][SQL] Fix BinaryArithmetic with negative scale - posted by GitBox <gi...@apache.org> on 2022/11/21 09:51:14 UTC, 0 replies.
- [GitHub] [spark] DmytroMitin opened a new pull request, #38740: [WIP] Add product encoders for local classes - posted by GitBox <gi...@apache.org> on 2022/11/21 10:02:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38687: [SPARK-41154][SQL] Incorrect relation caching for queries with time travel spec - posted by GitBox <gi...@apache.org> on 2022/11/21 10:19:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38737: [SPARK-41174][CORE][SQL] Propagate an error class to users for invalid `format` of `to_binary()` - posted by GitBox <gi...@apache.org> on 2022/11/21 10:35:24 UTC, 1 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #38741: [SPARK-41154][SQL][3.3] Incorrect relation caching for queries with time travel spec - posted by GitBox <gi...@apache.org> on 2022/11/21 10:43:12 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38665: [SPARK-41156][SQL] Remove the class `TypeCheckFailure` - posted by GitBox <gi...@apache.org> on 2022/11/21 10:45:40 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #35253: [SPARK-37965][SQL] Remove check field name when reading/writing existing data in Orc - posted by GitBox <gi...@apache.org> on 2022/11/21 10:54:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38598: [SPARK-41097][CORE][SQL][SS][PROTOBUF] Remove redundant collection conversion base on Scala 2.13 code - posted by GitBox <gi...@apache.org> on 2022/11/21 11:17:37 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38742: [SPARK-41216][CONNECT][PYTHON] Make AnalyzePlan support multiple analysis tasks And implement isLocal/isStreaming/printSchema/semanticHash/sameSemantics/inputFiles - posted by GitBox <gi...@apache.org> on 2022/11/21 11:23:31 UTC, 0 replies.
- [GitHub] [spark] jbguerraz commented on pull request #32397: [WIP][SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode - posted by GitBox <gi...@apache.org> on 2022/11/21 11:31:32 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38739: [SPARK-41207][SQL] Fix BinaryArithmetic with negative scale - posted by GitBox <gi...@apache.org> on 2022/11/21 11:35:29 UTC, 0 replies.
- [GitHub] [spark] WangGuangxin closed pull request #38722: [SPARK-41200][CORE] BytesToBytesMap's longArray size can be up to MAX_CAPACITY - posted by GitBox <gi...@apache.org> on 2022/11/21 11:35:47 UTC, 0 replies.
- [GitHub] [spark] WangGuangxin commented on a diff in pull request #38722: [SPARK-41200][CORE] BytesToBytesMap's longArray size can be up to MAX_CAPACITY - posted by GitBox <gi...@apache.org> on 2022/11/21 11:35:53 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38742: [SPARK-41216][CONNECT][PYTHON] Make AnalyzePlan support multiple analysis tasks And implement isLocal/isStreaming/printSchema/semanticHash/sameSemantics/inputFiles - posted by GitBox <gi...@apache.org> on 2022/11/21 11:36:03 UTC, 13 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38729: [SPARK-41196][CONNECT][INFRA][FOLLOW-UP] Change protobuf versions in CI - posted by GitBox <gi...@apache.org> on 2022/11/21 11:38:52 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38075: [SPARK-40633][BUILD] Upgrade janino to 3.1.9 - posted by GitBox <gi...@apache.org> on 2022/11/21 11:39:10 UTC, 6 replies.
- [GitHub] [spark] mcdull-zhang commented on a diff in pull request #38703: [SPARK-41191] [SQL] Cache Table is not working while nested caches exist - posted by GitBox <gi...@apache.org> on 2022/11/21 11:39:23 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38739: [SPARK-41207][SQL] Fix BinaryArithmetic with negative scale - posted by GitBox <gi...@apache.org> on 2022/11/21 12:25:17 UTC, 6 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38739: [SPARK-41207][SQL] Fix BinaryArithmetic with negative scale - posted by GitBox <gi...@apache.org> on 2022/11/21 12:26:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #35253: [SPARK-37965][SQL] Remove check field name when reading/writing existing data in Orc - posted by GitBox <gi...@apache.org> on 2022/11/21 12:30:30 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38739: [SPARK-41207][SQL] Fix BinaryArithmetic with negative scale - posted by GitBox <gi...@apache.org> on 2022/11/21 12:42:53 UTC, 2 replies.
- [GitHub] [spark] 19855134604 opened a new pull request, #38743: [SPARK-41215][BUILD][PROTOBUF] Support user configurable protoc executables when building Spark Protobuf. - posted by GitBox <gi...@apache.org> on 2022/11/21 13:07:44 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38744: [WIP][SPARK-41217][SQL] Add the error class `FAILED_FUNCTION_CALL` - posted by GitBox <gi...@apache.org> on 2022/11/21 13:12:04 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #38745: [WIP][SPARK-37099][SQL] Optimize the filter based on rank-like window function by reduce not required rows - posted by GitBox <gi...@apache.org> on 2022/11/21 14:08:47 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by GitBox <gi...@apache.org> on 2022/11/21 14:10:03 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on a diff in pull request #38567: [SPARK-41054][UI][CORE] Support RocksDB as KVStore in live UI - posted by GitBox <gi...@apache.org> on 2022/11/21 15:18:54 UTC, 2 replies.
- [GitHub] [spark] revans2 commented on a diff in pull request #38739: [SPARK-41207][SQL] Fix BinaryArithmetic with negative scale - posted by GitBox <gi...@apache.org> on 2022/11/21 15:24:29 UTC, 5 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38744: [SPARK-41217][SQL] Add the error class `FAILED_FUNCTION_CALL` - posted by GitBox <gi...@apache.org> on 2022/11/21 15:54:56 UTC, 2 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38746: [SPARK-41017][SQL][FOLLOWUP] Push Filter with both deterministic and nondeterministic predicates - posted by GitBox <gi...@apache.org> on 2022/11/21 15:56:30 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38746: [SPARK-41017][SQL][FOLLOWUP] Push Filter with both deterministic and nondeterministic predicates - posted by GitBox <gi...@apache.org> on 2022/11/21 15:57:05 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38744: [SPARK-41217][SQL] Add the error class `FAILED_FUNCTION_CALL` - posted by GitBox <gi...@apache.org> on 2022/11/21 15:57:21 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37556: [SPARK-39799][SQL] DataSourceV2: View catalog interface - posted by GitBox <gi...@apache.org> on 2022/11/21 16:04:16 UTC, 1 replies.
- [GitHub] [spark] tgravescs commented on pull request #38674: [SPARK-41160][YARN] Fix error when submitting a task to the yarn that enabled the timeline service - posted by GitBox <gi...@apache.org> on 2022/11/21 16:29:44 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38302: [SPARK-40834][SQL] Use SparkListenerSQLExecutionEnd to track final SQL status in UI - posted by GitBox <gi...@apache.org> on 2022/11/21 16:48:15 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on pull request #38726: [SPARK-41203] [CONNECT] Support Dataframe.tansform in Python client. - posted by GitBox <gi...@apache.org> on 2022/11/21 17:56:10 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #38713: [SPARK-41195][SQL] Support PIVOT/UNPIVOT with join children - posted by GitBox <gi...@apache.org> on 2022/11/21 18:10:48 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #38713: [SPARK-41195][SQL] Support PIVOT/UNPIVOT with join children - posted by GitBox <gi...@apache.org> on 2022/11/21 18:11:45 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38746: [SPARK-41017][SQL][FOLLOWUP] Push Filter with both deterministic and nondeterministic predicates - posted by GitBox <gi...@apache.org> on 2022/11/21 18:25:08 UTC, 1 replies.
- [GitHub] [spark] Yaohua628 commented on a diff in pull request #38683: [SPARK-41151][SQL] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/21 18:43:49 UTC, 1 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38747: [SPARK-40834][SQL][FOLLOWUP] Take care of legacy query end events - posted by GitBox <gi...@apache.org> on 2022/11/21 19:28:13 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38747: [SPARK-40834][SQL][FOLLOWUP] Take care of legacy query end events - posted by GitBox <gi...@apache.org> on 2022/11/21 19:29:12 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38746: [SPARK-41017][SQL][FOLLOWUP] Push Filter with both deterministic and nondeterministic predicates - posted by GitBox <gi...@apache.org> on 2022/11/21 19:32:52 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38683: [SPARK-41151][SQL] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/21 23:54:25 UTC, 2 replies.
- [GitHub] [spark] bersprockets commented on pull request #38727: [SPARK-41205][SQL] Check that format is foldable in `TryToBinary` - posted by GitBox <gi...@apache.org> on 2022/11/22 00:04:14 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36443: [POC][WIP][SPARK-39088][CORE] Add a "live" driver link to the UI for history server when serving in-progress applications. - posted by GitBox <gi...@apache.org> on 2022/11/22 00:22:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38742: [SPARK-41216][CONNECT][PYTHON] Make AnalyzePlan support multiple analysis tasks And implement isLocal/isStreaming/printSchema/semanticHash/sameSemantics/inputFiles - posted by GitBox <gi...@apache.org> on 2022/11/22 00:36:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38706: [SPARK-41005][COLLECT][FOLLOWUP] Remove JSON code path and use `RDD.collect` in Arrow code path - posted by GitBox <gi...@apache.org> on 2022/11/22 00:43:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38706: [SPARK-41005][COLLECT][FOLLOWUP] Remove JSON code path and use `RDD.collect` in Arrow code path - posted by GitBox <gi...@apache.org> on 2022/11/22 00:44:06 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38706: [SPARK-41005][COLLECT][FOLLOWUP] Remove JSON code path and use `RDD.collect` in Arrow code path - posted by GitBox <gi...@apache.org> on 2022/11/22 00:46:13 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38706: [SPARK-41005][COLLECT][FOLLOWUP] Remove JSON code path and use `RDD.collect` in Arrow code path - posted by GitBox <gi...@apache.org> on 2022/11/22 00:46:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38701: [TEST ONLY][DO NOT MERGE] Test arrow-collect after avoiding hang - posted by GitBox <gi...@apache.org> on 2022/11/22 00:47:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38701: [TEST ONLY][DO NOT MERGE] Test arrow-collect after avoiding hang - posted by GitBox <gi...@apache.org> on 2022/11/22 00:47:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38695: [TEST ONLY][DO NOT MERGE]. Test the schema of `collect` - posted by GitBox <gi...@apache.org> on 2022/11/22 00:49:02 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38683: [SPARK-41151][SQL] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/22 00:54:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38734: [SPARK-41212][CONNECT][PYTHON] Implement `DataFrame.isEmpty` - posted by GitBox <gi...@apache.org> on 2022/11/22 01:05:50 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38735: [SPARK-41213][CONNECT][PYTHON] Implement `DataFrame.__repr__` and `DataFrame.dtypes` - posted by GitBox <gi...@apache.org> on 2022/11/22 01:06:18 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38729: [SPARK-41196][CONNECT][INFRA][FOLLOW-UP] Change protobuf versions in CI - posted by GitBox <gi...@apache.org> on 2022/11/22 01:12:23 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 opened a new pull request, #38748: [SPARK-41151][SQL][3.3] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/22 01:28:54 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 commented on pull request #38748: [SPARK-41151][SQL][3.3] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/22 01:29:13 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 commented on pull request #38683: [SPARK-41151][SQL] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/22 01:29:31 UTC, 0 replies.
- [GitHub] [spark] desmondcheongzx opened a new pull request, #38749: Refactor by introducing physical types - posted by GitBox <gi...@apache.org> on 2022/11/22 01:36:00 UTC, 0 replies.
- [GitHub] [spark] desmondcheongzx closed pull request #38749: Refactor by introducing physical types - posted by GitBox <gi...@apache.org> on 2022/11/22 01:44:38 UTC, 0 replies.
- [GitHub] [spark] desmondcheongzx opened a new pull request, #38750: Refactor by introducing physical types - posted by GitBox <gi...@apache.org> on 2022/11/22 01:45:15 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38734: [SPARK-41212][CONNECT][PYTHON] Implement `DataFrame.isEmpty` - posted by GitBox <gi...@apache.org> on 2022/11/22 01:54:40 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38735: [SPARK-41213][CONNECT][PYTHON] Implement `DataFrame.__repr__` and `DataFrame.dtypes` - posted by GitBox <gi...@apache.org> on 2022/11/22 01:56:05 UTC, 1 replies.
- [GitHub] [spark] jerrypeng commented on pull request #38517: [SPARK-39591][SS] Async Progress Tracking - posted by GitBox <gi...@apache.org> on 2022/11/22 02:00:57 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38731: [SPARK-41209][PYSPARK] Improve PySpark type inference in _merge_type method - posted by GitBox <gi...@apache.org> on 2022/11/22 02:05:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38706: [SPARK-41005][COLLECT][FOLLOWUP] Remove JSON code path and use `RDD.collect` in Arrow code path - posted by GitBox <gi...@apache.org> on 2022/11/22 02:05:21 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38741: [SPARK-41154][SQL][3.3] Incorrect relation caching for queries with time travel spec - posted by GitBox <gi...@apache.org> on 2022/11/22 02:06:41 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #38741: [SPARK-41154][SQL][3.3] Incorrect relation caching for queries with time travel spec - posted by GitBox <gi...@apache.org> on 2022/11/22 02:07:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38734: [SPARK-41212][CONNECT][PYTHON] Implement `DataFrame.isEmpty` - posted by GitBox <gi...@apache.org> on 2022/11/22 02:14:12 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38734: [SPARK-41212][CONNECT][PYTHON] Implement `DataFrame.isEmpty` - posted by GitBox <gi...@apache.org> on 2022/11/22 02:20:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38743: [SPARK-41215][BUILD][PROTOBUF] Support user configurable protoc executables when building Spark Protobuf. - posted by GitBox <gi...@apache.org> on 2022/11/22 02:28:18 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38747: [SPARK-40834][SQL][FOLLOWUP] Take care of legacy query end events - posted by GitBox <gi...@apache.org> on 2022/11/22 02:29:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38743: [SPARK-41215][BUILD][PROTOBUF] Support user configurable protoc executables when building Spark Protobuf. - posted by GitBox <gi...@apache.org> on 2022/11/22 02:30:25 UTC, 3 replies.
- [GitHub] [spark] AngersZhuuuu commented on a diff in pull request #38622: [SPARK-39601][YARN] AllocationFailure should not be treated as exitCausedByApp when driver is shutting down - posted by GitBox <gi...@apache.org> on 2022/11/22 02:30:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38746: [SPARK-41017][SQL][FOLLOWUP] Push Filter with both deterministic and nondeterministic predicates - posted by GitBox <gi...@apache.org> on 2022/11/22 02:40:38 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #38731: [SPARK-41209][PYTHON] Improve PySpark type inference in _merge_type method - posted by GitBox <gi...@apache.org> on 2022/11/22 03:03:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38711: [SPARK-41192][Core] Remove unscheduled speculative tasks when task finished to obtain better dynamic - posted by GitBox <gi...@apache.org> on 2022/11/22 03:06:17 UTC, 5 replies.
- [GitHub] [spark] toujours33 commented on a diff in pull request #38711: [SPARK-41192][Core] Remove unscheduled speculative tasks when task finished to obtain better dynamic - posted by GitBox <gi...@apache.org> on 2022/11/22 03:19:12 UTC, 14 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38734: [SPARK-41212][CONNECT][PYTHON] Implement `DataFrame.isEmpty` - posted by GitBox <gi...@apache.org> on 2022/11/22 03:20:17 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #38738: WIP - posted by GitBox <gi...@apache.org> on 2022/11/22 03:21:46 UTC, 0 replies.
- [GitHub] [spark] 19855134604 commented on a diff in pull request #38743: [SPARK-41215][BUILD][PROTOBUF] Support user configurable protoc executables when building Spark Protobuf. - posted by GitBox <gi...@apache.org> on 2022/11/22 03:23:58 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38664: [SPARK-41147][SQL] Assign a name to the legacy error class `_LEGACY_ERROR_TEMP_1042` - posted by GitBox <gi...@apache.org> on 2022/11/22 04:21:02 UTC, 1 replies.
- [GitHub] [spark] dengziming commented on pull request #38715: [SPARK-41197] Upgrade Kafka version to 3.3 release - posted by GitBox <gi...@apache.org> on 2022/11/22 04:31:08 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38650: [SPARK-41135][SQL] Rename `UNSUPPORTED_EMPTY_LOCATION` to `INVALID_EMPTY_LOCATION` - posted by GitBox <gi...@apache.org> on 2022/11/22 04:35:18 UTC, 1 replies.
- [GitHub] [spark] gaoyajun02 opened a new pull request, #38751: [SPARK-40872][3.3] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/22 05:24:25 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38744: [SPARK-41217][SQL] Add the error class `FAILED_FUNCTION_CALL` - posted by GitBox <gi...@apache.org> on 2022/11/22 06:11:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38752: [SPARK-40809][CONNECT][FOLLOW-UP] Do not use Buffer to make Scala 2.13 test pass - posted by GitBox <gi...@apache.org> on 2022/11/22 06:15:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38302: [SPARK-40834][SQL] Use SparkListenerSQLExecutionEnd to track final SQL status in UI - posted by GitBox <gi...@apache.org> on 2022/11/22 06:18:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38631: [SPARK-40809] [CONNECT] [FOLLOW] Support `alias()` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/22 06:24:20 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38748: [SPARK-41151][SQL][3.3] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/22 06:29:33 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38742: [SPARK-41216][CONNECT][PYTHON] Make AnalyzePlan support multiple analysis tasks And implement isLocal/isStreaming/printSchema/semanticHash/sameSemantics/inputFiles - posted by GitBox <gi...@apache.org> on 2022/11/22 06:31:36 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38748: [SPARK-41151][SQL][3.3] Keep built-in file `_metadata` column nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/22 06:33:28 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38753: [SPARK-40809][CONNECT][TESTS] Fix pyspark-connect test failed with Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/22 06:35:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38735: [SPARK-41213][CONNECT][PYTHON] Implement `DataFrame.__repr__` and `DataFrame.dtypes` - posted by GitBox <gi...@apache.org> on 2022/11/22 06:44:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38735: [SPARK-41213][CONNECT][PYTHON] Implement `DataFrame.__repr__` and `DataFrame.dtypes` - posted by GitBox <gi...@apache.org> on 2022/11/22 06:44:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38731: [SPARK-41209][PYTHON] Improve PySpark type inference in _merge_type method - posted by GitBox <gi...@apache.org> on 2022/11/22 06:45:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38731: [SPARK-41209][PYTHON] Improve PySpark type inference in _merge_type method - posted by GitBox <gi...@apache.org> on 2022/11/22 06:45:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38723: [SPARK-41201][CONNECT][PYTHON] Implement `DataFrame.SelectExpr` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/22 06:47:17 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38704: [SPARK-41193][SQL][TESTS] Ignore `collect data with single partition larger than 2GB bytes array limit` in `DatasetLargeResultCollectingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/22 06:50:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38704: [SPARK-41193][SQL][TESTS] Ignore `collect data with single partition larger than 2GB bytes array limit` in `DatasetLargeResultCollectingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/22 06:51:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38686: [SPARK-41169][CONNECT][PYTHON] Implement `DataFrame.drop` - posted by GitBox <gi...@apache.org> on 2022/11/22 06:52:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38686: [SPARK-41169][CONNECT][PYTHON] Implement `DataFrame.drop` - posted by GitBox <gi...@apache.org> on 2022/11/22 06:52:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38752: [SPARK-40809][CONNECT][FOLLOW-UP] Do not use Buffer to make Scala 2.13 test pass - posted by GitBox <gi...@apache.org> on 2022/11/22 06:53:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38752: [SPARK-40809][CONNECT][FOLLOW-UP] Do not use Buffer to make Scala 2.13 test pass - posted by GitBox <gi...@apache.org> on 2022/11/22 06:53:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38753: [SPARK-40809][CONNECT][PYTHON][TESTS] Fix pyspark-connect test failed with Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/22 06:53:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38697: [SPARK-41118][SQL][3.3] `to_number`/`try_to_number` should return `null` when format is `null` - posted by GitBox <gi...@apache.org> on 2022/11/22 06:53:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #38753: [SPARK-40809][CONNECT][PYTHON][TESTS] Fix pyspark-connect test failed with Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/22 06:53:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38752: [SPARK-40809][CONNECT][FOLLOW-UP] Do not use Buffer to make Scala 2.13 test pass - posted by GitBox <gi...@apache.org> on 2022/11/22 06:54:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38697: [SPARK-41118][SQL][3.3] `to_number`/`try_to_number` should return `null` when format is `null` - posted by GitBox <gi...@apache.org> on 2022/11/22 06:54:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38715: [SPARK-41197] Upgrade Kafka version to 3.3 release - posted by GitBox <gi...@apache.org> on 2022/11/22 07:04:17 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38744: [SPARK-41217][SQL] Add the error class `FAILED_FUNCTION_CALL` - posted by GitBox <gi...@apache.org> on 2022/11/22 07:05:48 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38744: [SPARK-41217][SQL] Add the error class `FAILED_FUNCTION_CALL` - posted by GitBox <gi...@apache.org> on 2022/11/22 07:05:49 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38685: [SPARK-41206][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1233` to `COLUMN_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/22 07:24:30 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38734: [SPARK-41212][CONNECT][PYTHON] Implement `DataFrame.isEmpty` - posted by GitBox <gi...@apache.org> on 2022/11/22 07:33:57 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38754: [SPARK-41180][SQL] Assign an error class to "Cannot parse the data type" - posted by GitBox <gi...@apache.org> on 2022/11/22 08:49:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38734: [SPARK-41212][CONNECT][PYTHON] Implement `DataFrame.isEmpty` - posted by GitBox <gi...@apache.org> on 2022/11/22 09:15:55 UTC, 0 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38755: [WIP][SPARK-41221][SQL] Add the error class `INVALID_FORMAT` - posted by GitBox <gi...@apache.org> on 2022/11/22 09:55:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38650: [SPARK-41135][SQL] Rename `UNSUPPORTED_EMPTY_LOCATION` to `INVALID_EMPTY_LOCATION` - posted by GitBox <gi...@apache.org> on 2022/11/22 10:13:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38650: [SPARK-41135][SQL] Rename `UNSUPPORTED_EMPTY_LOCATION` to `INVALID_EMPTY_LOCATION` - posted by GitBox <gi...@apache.org> on 2022/11/22 10:14:25 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #38756: [SPARK-41220][SQL] Range partitioner sample supports column pruning - posted by GitBox <gi...@apache.org> on 2022/11/22 10:28:28 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38756: [SPARK-41220][SQL] Range partitioner sample supports column pruning - posted by GitBox <gi...@apache.org> on 2022/11/22 10:33:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38757: [SPARK-41222][CONNECT][PYTHON] Unify the typing definitions - posted by GitBox <gi...@apache.org> on 2022/11/22 11:20:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38758: [SPARK-41223][BUILD] Upgrade slf4j to 2.0.4 - posted by GitBox <gi...@apache.org> on 2022/11/22 11:37:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38759: [SPARK-41224][SPARK-41165][SPARK-41184] Optimized Arrow-based collect implementation to stream from server to client - posted by GitBox <gi...@apache.org> on 2022/11/22 12:00:34 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #38760: [SPARK-41219][SQL] Decimal changePrecision should work with decimal(0, 0) - posted by GitBox <gi...@apache.org> on 2022/11/22 12:46:26 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #38759: [SPARK-41224][SPARK-41165][SPARK-41184] Optimized Arrow-based collect implementation to stream from server to client - posted by GitBox <gi...@apache.org> on 2022/11/22 13:03:44 UTC, 3 replies.
- [GitHub] [spark] rangareddy opened a new pull request, #38761: [SPARK-40988][SQL][TEST] Test case for insert partition should verify value - posted by GitBox <gi...@apache.org> on 2022/11/22 13:12:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38495: [SPARK-35531][SQL] Update hive table stats without unnecessary convert - posted by GitBox <gi...@apache.org> on 2022/11/22 14:15:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38495: [SPARK-35531][SQL] Update hive table stats without unnecessary convert - posted by GitBox <gi...@apache.org> on 2022/11/22 14:16:06 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #38742: [SPARK-41216][CONNECT][PYTHON] Make AnalyzePlan support multiple analysis tasks And implement isLocal/isStreaming/printSchema/semanticHash/sameSemantics/inputFiles - posted by GitBox <gi...@apache.org> on 2022/11/22 15:04:41 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38742: [SPARK-41216][CONNECT][PYTHON] Make AnalyzePlan support multiple analysis tasks And implement isLocal/isStreaming/printSchema/semanticHash/sameSemantics/inputFiles - posted by GitBox <gi...@apache.org> on 2022/11/22 15:37:09 UTC, 8 replies.
- [GitHub] [spark] steveloughran commented on a diff in pull request #36070: [SPARK-31675][CORE] Fix rename and delete files with different filesystem - posted by GitBox <gi...@apache.org> on 2022/11/22 16:49:10 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38761: [SPARK-40988][SQL][TEST] Test case for insert partition should verify value - posted by GitBox <gi...@apache.org> on 2022/11/22 17:08:41 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #38496: [SPARK-40708][SQL] Auto update table statistics based on write metrics - posted by GitBox <gi...@apache.org> on 2022/11/22 17:13:38 UTC, 0 replies.
- [GitHub] [spark] bersprockets commented on pull request #38697: [SPARK-41118][SQL][3.3] `to_number`/`try_to_number` should return `null` when format is `null` - posted by GitBox <gi...@apache.org> on 2022/11/22 17:26:53 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38755: [SPARK-41221][SQL] Add the error class `INVALID_FORMAT` - posted by GitBox <gi...@apache.org> on 2022/11/22 17:37:17 UTC, 1 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38755: [SPARK-41221][SQL] Add the error class `INVALID_FORMAT` - posted by GitBox <gi...@apache.org> on 2022/11/22 17:39:54 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38755: [SPARK-41221][SQL] Add the error class `INVALID_FORMAT` - posted by GitBox <gi...@apache.org> on 2022/11/22 17:43:29 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #38714: [WIP][SPARK-41141]. avoid introducing a new aggregate expression in the analysis phase when subquery is referencing it - posted by GitBox <gi...@apache.org> on 2022/11/22 18:09:31 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #38567: [SPARK-41054][UI][CORE] Support RocksDB as KVStore in live UI - posted by GitBox <gi...@apache.org> on 2022/11/22 18:29:52 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/22 18:53:07 UTC, 0 replies.
- [GitHub] [spark] leandrohmvieira-db commented on a diff in pull request #38013: [SPARK-40509][SS][PYTHON] Add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/11/22 19:07:02 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #38752: [SPARK-40809][CONNECT][FOLLOW-UP] Do not use Buffer to make Scala 2.13 test pass - posted by GitBox <gi...@apache.org> on 2022/11/22 19:28:58 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38742: [SPARK-41216][CONNECT][PYTHON] Make AnalyzePlan support multiple analysis tasks And implement isLocal/isStreaming/printSchema/semanticHash/sameSemantics/inputFiles - posted by GitBox <gi...@apache.org> on 2022/11/22 19:46:32 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38707: [SPARK-41176][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1042 - posted by GitBox <gi...@apache.org> on 2022/11/22 19:48:27 UTC, 2 replies.
- [GitHub] [spark] ahshahid commented on a diff in pull request #38714: [WIP][SPARK-41141]. avoid introducing a new aggregate expression in the analysis phase when subquery is referencing it - posted by GitBox <gi...@apache.org> on 2022/11/22 19:50:37 UTC, 5 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38575: [SPARK-40948][SQL][FOLLOWUP] Restore PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/22 19:51:03 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38762: [SPARK-41225] [CONNECT] [PYTHON] Disable unsupported functions. - posted by GitBox <gi...@apache.org> on 2022/11/22 21:12:33 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38762: [SPARK-41225] [CONNECT] [PYTHON] Disable unsupported functions. - posted by GitBox <gi...@apache.org> on 2022/11/22 21:13:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38013: [SPARK-40509][SS][PYTHON] Add example for applyInPandasWithState - posted by GitBox <gi...@apache.org> on 2022/11/23 00:23:54 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38686: [SPARK-41169][CONNECT][PYTHON] Implement `DataFrame.drop` - posted by GitBox <gi...@apache.org> on 2022/11/23 01:05:23 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38723: [SPARK-41201][CONNECT][PYTHON] Implement `DataFrame.SelectExpr` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/23 01:09:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38723: [SPARK-41201][CONNECT][PYTHON] Implement `DataFrame.SelectExpr` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/23 01:09:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38751: [SPARK-40872][3.3] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/23 01:11:52 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38723: [SPARK-41201][CONNECT][PYTHON] Implement `DataFrame.SelectExpr` in Python client - posted by GitBox <gi...@apache.org> on 2022/11/23 01:13:29 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38763: [SPARK-41201][CONNECT][PYTHON][TEST][FOLLOWUP] Reenable test_fill_na - posted by GitBox <gi...@apache.org> on 2022/11/23 01:20:29 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38763: [SPARK-41201][CONNECT][PYTHON][TEST][FOLLOWUP] Reenable test_fill_na - posted by GitBox <gi...@apache.org> on 2022/11/23 01:23:10 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #38686: [SPARK-41169][CONNECT][PYTHON] Implement `DataFrame.drop` - posted by GitBox <gi...@apache.org> on 2022/11/23 01:29:45 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38760: [SPARK-41219][SQL] Decimal changePrecision should work with decimal(0, 0) - posted by GitBox <gi...@apache.org> on 2022/11/23 01:52:48 UTC, 3 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38760: [SPARK-41219][SQL] Decimal changePrecision should work with decimal(0, 0) - posted by GitBox <gi...@apache.org> on 2022/11/23 01:53:49 UTC, 0 replies.
- [GitHub] [spark] bersprockets closed pull request #38727: [SPARK-41205][SQL] Check that format is foldable in `TryToBinary` - posted by GitBox <gi...@apache.org> on 2022/11/23 02:00:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38685: [SPARK-41206][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1233` to `COLUMN_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/23 02:04:54 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38763: [SPARK-41201][CONNECT][PYTHON][TEST][FOLLOWUP] Reenable test_fill_na - posted by GitBox <gi...@apache.org> on 2022/11/23 02:08:41 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #38745: [SPARK-37099][SQL] Optimize the filter based on rank-like window function by reduce not required rows - posted by GitBox <gi...@apache.org> on 2022/11/23 02:18:22 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38762: [SPARK-41225] [CONNECT] [PYTHON] Disable unsupported functions. - posted by GitBox <gi...@apache.org> on 2022/11/23 02:29:57 UTC, 1 replies.
- [GitHub] [spark] pan3793 commented on pull request #38205: [SPARK-40747][CORE] Support setting driver log url using env vars on other resource managers - posted by GitBox <gi...@apache.org> on 2022/11/23 02:40:54 UTC, 0 replies.
- [GitHub] [spark] pan3793 closed pull request #38205: [SPARK-40747][CORE] Support setting driver log url using env vars on other resource managers - posted by GitBox <gi...@apache.org> on 2022/11/23 02:40:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38685: [SPARK-41206][SQL] Rename the error class `_LEGACY_ERROR_TEMP_1233` to `COLUMN_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2022/11/23 02:45:09 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38731: [SPARK-41209][PYTHON] Improve PySpark type inference in _merge_type method - posted by GitBox <gi...@apache.org> on 2022/11/23 03:07:40 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38575: [SPARK-40948][SQL][FOLLOWUP] Restore PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/23 03:16:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38763: [SPARK-41201][CONNECT][PYTHON][TEST][FOLLOWUP] Reenable test_fill_na - posted by GitBox <gi...@apache.org> on 2022/11/23 03:35:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38575: [SPARK-40948][SQL][FOLLOWUP] Restore PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/23 03:46:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38764: [SPARK-41206][SQL][FOLLOWUP] Make result of `checkColumnNameDuplication` stable to fix `COLUMN_ALREADY_EXISTS` check with Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/23 03:53:58 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #38560: [WIP][SPARK-38005][core] Support cleaning up merged shuffle files and state from external shuffle service - posted by GitBox <gi...@apache.org> on 2022/11/23 04:00:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38760: [SPARK-41219][SQL] Decimal changePrecision should work with decimal(0, 0) - posted by GitBox <gi...@apache.org> on 2022/11/23 04:03:01 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38764: [SPARK-41206][SQL][FOLLOWUP] Make result of `checkColumnNameDuplication` stable to fix `COLUMN_ALREADY_EXISTS` check failed with Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/23 04:21:59 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38762: [SPARK-41225] [CONNECT] [PYTHON] Disable unsupported functions. - posted by GitBox <gi...@apache.org> on 2022/11/23 04:28:53 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38575: [SPARK-40948][SQL][FOLLOWUP] Restore PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/23 04:51:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38757: [SPARK-41222][CONNECT][PYTHON] Unify the typing definitions - posted by GitBox <gi...@apache.org> on 2022/11/23 04:55:45 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38757: [SPARK-41222][CONNECT][PYTHON] Unify the typing definitions - posted by GitBox <gi...@apache.org> on 2022/11/23 05:00:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #25004: [SPARK-28205][SQL] useV1SourceList configuration should be for all data sources - posted by GitBox <gi...@apache.org> on 2022/11/23 05:34:03 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38575: [SPARK-40948][SQL][FOLLOWUP] Restore PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/23 05:37:46 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38575: [SPARK-40948][SQL][FOLLOWUP] Restore PATH_NOT_FOUND - posted by GitBox <gi...@apache.org> on 2022/11/23 05:38:35 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38751: [SPARK-40872][3.3] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/23 05:56:00 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38750: [SPARK-41226][SQL] Refactor Spark types by introducing physical types - posted by GitBox <gi...@apache.org> on 2022/11/23 05:56:03 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38764: [SPARK-41206][SQL][FOLLOWUP] Make result of `checkColumnNameDuplication` stable to fix `COLUMN_ALREADY_EXISTS` check failed with Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/23 06:12:47 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38764: [SPARK-41206][SQL][FOLLOWUP] Make result of `checkColumnNameDuplication` stable to fix `COLUMN_ALREADY_EXISTS` check failed with Scala 2.13 - posted by GitBox <gi...@apache.org> on 2022/11/23 06:14:09 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38737: [SPARK-41174][CORE][SQL] Propagate an error class to users for invalid `format` of `to_binary()` - posted by GitBox <gi...@apache.org> on 2022/11/23 06:42:10 UTC, 2 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #35799: [SPARK-38498][STREAM] Support customized StreamingListener by configuration - posted by GitBox <gi...@apache.org> on 2022/11/23 06:43:04 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38760: [SPARK-41219][SQL] Decimal changePrecision should work with decimal(0, 0) - posted by GitBox <gi...@apache.org> on 2022/11/23 06:43:37 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38730: [SPARK-41181][SQL] Migrate the map options errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/23 06:43:49 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38710: [SPARK-41179][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1092 - posted by GitBox <gi...@apache.org> on 2022/11/23 06:46:25 UTC, 1 replies.
- [GitHub] [spark] wankunde opened a new pull request, #38765: [SPARK-35531][SQL][FOLLOWUP] Support alter table command with CASE_SENSITIVE is true - posted by GitBox <gi...@apache.org> on 2022/11/23 06:49:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38737: [SPARK-41174][CORE][SQL] Propagate an error class to users for invalid `format` of `to_binary()` - posted by GitBox <gi...@apache.org> on 2022/11/23 06:51:30 UTC, 2 replies.
- [GitHub] [spark] itholic opened a new pull request, #38766: [MINOR][SQL] Fix error message for `UNEXPECTED_INPUT_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/23 06:55:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38767: [SPARK-41183][SQL][FOLLOWUP] Fix a typo - posted by GitBox <gi...@apache.org> on 2022/11/23 06:56:40 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38767: [SPARK-41183][SQL][FOLLOWUP] Fix a typo - posted by GitBox <gi...@apache.org> on 2022/11/23 06:56:54 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38768: [SPARK-41230][CONNECT][PYTHON] Remove `str` from Aggregate expression type - posted by GitBox <gi...@apache.org> on 2022/11/23 07:02:19 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38769: [SPARK-41228][SQL] Rename & Improve error message for `COLUMN_NOT_IN_GROUP_BY_CLAUSE`. - posted by GitBox <gi...@apache.org> on 2022/11/23 07:07:25 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38766: [MINOR][SQL] Fix error message for `UNEXPECTED_INPUT_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/23 07:09:10 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38768: [SPARK-41230][CONNECT][PYTHON] Remove `str` from Aggregate expression type - posted by GitBox <gi...@apache.org> on 2022/11/23 08:21:28 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #38760: [SPARK-41219][SQL] Decimal changePrecision should work with decimal(0, 0) - posted by GitBox <gi...@apache.org> on 2022/11/23 08:26:18 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38766: [MINOR][SQL] Fix error message for `UNEXPECTED_INPUT_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/23 09:16:10 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38767: [SPARK-41183][SQL][FOLLOWUP] Fix a typo - posted by GitBox <gi...@apache.org> on 2022/11/23 09:18:15 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38743: [SPARK-41215][BUILD][PROTOBUF] Support user configurable protoc executables when building Spark Protobuf. - posted by GitBox <gi...@apache.org> on 2022/11/23 09:51:16 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38768: [SPARK-41230][CONNECT][PYTHON] Remove `str` from Aggregate expression type - posted by GitBox <gi...@apache.org> on 2022/11/23 10:13:15 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38769: [SPARK-41228][SQL] Rename & Improve error message for `COLUMN_NOT_IN_GROUP_BY_CLAUSE`. - posted by GitBox <gi...@apache.org> on 2022/11/23 10:17:16 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38767: [SPARK-41183][SQL][FOLLOWUP] Fix a typo - posted by GitBox <gi...@apache.org> on 2022/11/23 11:28:03 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38769: [SPARK-41228][SQL] Rename & Improve error message for `COLUMN_NOT_IN_GROUP_BY_CLAUSE`. - posted by GitBox <gi...@apache.org> on 2022/11/23 11:29:43 UTC, 9 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38747: [SPARK-40834][SQL][FOLLOWUP] Take care of legacy query end events - posted by GitBox <gi...@apache.org> on 2022/11/23 11:30:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38747: [SPARK-40834][SQL][FOLLOWUP] Take care of legacy query end events - posted by GitBox <gi...@apache.org> on 2022/11/23 11:31:16 UTC, 0 replies.
- [GitHub] [spark] rangareddy commented on pull request #38761: [SPARK-40988][SQL][TEST] Test case for insert partition should verify value - posted by GitBox <gi...@apache.org> on 2022/11/23 11:34:44 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38740: [SQL] Add product encoders for local classes - posted by GitBox <gi...@apache.org> on 2022/11/23 11:48:46 UTC, 0 replies.
- [GitHub] [spark] pralabhkumar commented on pull request #37417: [SPARK-33782][K8S][CORE]Place spark.files, spark.jars and spark.files under the current working directory on the driver in K8S cluster mode - posted by GitBox <gi...@apache.org> on 2022/11/23 12:30:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38770: [SPARK-41238][CONNECT][PYTHON] Support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/23 12:33:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/23 13:02:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38567: [SPARK-41054][UI][CORE] Support RocksDB as KVStore in live UI - posted by GitBox <gi...@apache.org> on 2022/11/23 13:31:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38771: [SPARK-41239][BUILD] Upgrade jackson to 2.14.1 - posted by GitBox <gi...@apache.org> on 2022/11/23 13:43:22 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #38772: [SPARK-41237][SQL] Assign a name to the error class `_LEGACY_ERROR_TEMP_0030` - posted by GitBox <gi...@apache.org> on 2022/11/23 13:55:39 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #38773: [SPARK-41016][PANDAS ON SPARK API] Identical expressions should not be used on both sides of a binary operator - posted by GitBox <gi...@apache.org> on 2022/11/23 14:07:30 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38075: [SPARK-40633][BUILD] Upgrade janino to 3.1.9 - posted by GitBox <gi...@apache.org> on 2022/11/23 14:09:50 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38075: [SPARK-40633][BUILD] Upgrade janino to 3.1.9 - posted by GitBox <gi...@apache.org> on 2022/11/23 14:10:01 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38758: [SPARK-41223][BUILD] Upgrade slf4j to 2.0.4 - posted by GitBox <gi...@apache.org> on 2022/11/23 14:10:34 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38758: [SPARK-41223][BUILD] Upgrade slf4j to 2.0.4 - posted by GitBox <gi...@apache.org> on 2022/11/23 14:10:36 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38755: [SPARK-41221][SQL] Add the error class `INVALID_FORMAT` - posted by GitBox <gi...@apache.org> on 2022/11/23 14:23:08 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38737: [SPARK-41174][CORE][SQL] Propagate an error class to users for invalid `format` of `to_binary()` - posted by GitBox <gi...@apache.org> on 2022/11/23 14:24:46 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38737: [SPARK-41174][CORE][SQL] Propagate an error class to users for invalid `format` of `to_binary()` - posted by GitBox <gi...@apache.org> on 2022/11/23 14:25:18 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #38774: [SPARK-XXXX][CONNECT] Upgrade `Protobuf` to 3.19.5 - posted by GitBox <gi...@apache.org> on 2022/11/23 14:26:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38646: [SPARK-41131][SQL] Improve error message for `UNRESOLVED_MAP_KEY.WITHOUT_SUGGESTION` - posted by GitBox <gi...@apache.org> on 2022/11/23 14:28:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38646: [SPARK-41131][SQL] Improve error message for `UNRESOLVED_MAP_KEY.WITHOUT_SUGGESTION` - posted by GitBox <gi...@apache.org> on 2022/11/23 14:29:08 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38762: [SPARK-41225] [CONNECT] [PYTHON] Disable unsupported functions. - posted by GitBox <gi...@apache.org> on 2022/11/23 14:34:22 UTC, 1 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38707: [SPARK-41176][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1042 - posted by GitBox <gi...@apache.org> on 2022/11/23 15:32:43 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38760: [SPARK-41219][SQL] Decimal changePrecision should work with decimal(0, 0) - posted by GitBox <gi...@apache.org> on 2022/11/23 15:40:24 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by GitBox <gi...@apache.org> on 2022/11/23 16:43:23 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38730: [SPARK-41181][SQL] Migrate the map options errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/23 16:43:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38754: [SPARK-41180][SQL] Assign an error class to "Cannot parse the data type" - posted by GitBox <gi...@apache.org> on 2022/11/23 16:50:01 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38771: [SPARK-41239][BUILD] Upgrade jackson to 2.14.1 - posted by GitBox <gi...@apache.org> on 2022/11/23 17:12:42 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38771: [SPARK-41239][BUILD] Upgrade jackson to 2.14.1 - posted by GitBox <gi...@apache.org> on 2022/11/23 17:12:43 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #38775: [SPARK-41243][CONNECT][PYTHON] Update the protobuf version in README - posted by GitBox <gi...@apache.org> on 2022/11/23 17:42:53 UTC, 0 replies.
- [GitHub] [spark] otterc commented on a diff in pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side metrics - posted by GitBox <gi...@apache.org> on 2022/11/23 18:02:17 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38775: [SPARK-41243][CONNECT][PYTHON][DOCS] Update the protobuf version in README - posted by GitBox <gi...@apache.org> on 2022/11/23 18:35:46 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38768: [SPARK-41230][CONNECT][PYTHON] Remove `str` from Aggregate expression type - posted by GitBox <gi...@apache.org> on 2022/11/23 20:22:50 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38567: [SPARK-41054][UI][CORE] Support RocksDB as KVStore in live UI - posted by GitBox <gi...@apache.org> on 2022/11/23 20:29:55 UTC, 2 replies.
- [GitHub] [spark] anchovYu opened a new pull request, #38776: [WIP] Refactor Analyzer by moving several public methods to the Analyzer object - posted by GitBox <gi...@apache.org> on 2022/11/23 21:05:21 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 opened a new pull request, #38777: [SPARK-41151][FOLLOW-UP][SQL] Keep built-in file _metadata fields nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/23 22:57:21 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #38778: [SPARK-41227][CONNECT][PYTHON] Implement `DataFrame.crossJoin` - posted by GitBox <gi...@apache.org> on 2022/11/23 23:04:59 UTC, 0 replies.
- [GitHub] [spark] Yaohua628 commented on pull request #38777: [SPARK-41151][FOLLOW-UP][SQL] Keep built-in file _metadata fields nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/23 23:05:09 UTC, 2 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement `DataFrame.crossJoin` - posted by GitBox <gi...@apache.org> on 2022/11/23 23:07:01 UTC, 1 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #38779: [SPARK-41244][UI] Introducing Protobuf serializer for KV store - posted by GitBox <gi...@apache.org> on 2022/11/23 23:07:17 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #38779: [SPARK-41244][UI] Introducing Protobuf serializer for KV store - posted by GitBox <gi...@apache.org> on 2022/11/23 23:08:37 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #38779: [SPARK-41244][UI] Introducing Protobuf serializer for KV store - posted by GitBox <gi...@apache.org> on 2022/11/23 23:10:18 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38777: [SPARK-41151][FOLLOW-UP][SQL] Keep built-in file _metadata fields nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/23 23:30:12 UTC, 3 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #38779: [SPARK-41244][UI] Introducing a Protobuf serializer for UI data on KV store - posted by GitBox <gi...@apache.org> on 2022/11/23 23:30:55 UTC, 9 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement `DataFrame.crossJoin` - posted by GitBox <gi...@apache.org> on 2022/11/23 23:58:46 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #38762: [SPARK-41225] [CONNECT] [PYTHON] Disable unsupported functions. - posted by GitBox <gi...@apache.org> on 2022/11/24 00:03:37 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37440: [SPARK-40076] [SQL] Support number-only column names in ORC data sources when orc impl is hive - posted by GitBox <gi...@apache.org> on 2022/11/24 00:21:55 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37406: [SPARK-39921][SQL] SkewJoin--Stream side skew in BroadcastHashJoin - posted by GitBox <gi...@apache.org> on 2022/11/24 00:21:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38775: [SPARK-41243][CONNECT][PYTHON][DOCS] Update the protobuf version in README - posted by GitBox <gi...@apache.org> on 2022/11/24 01:03:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38775: [SPARK-41243][CONNECT][PYTHON][DOCS] Update the protobuf version in README - posted by GitBox <gi...@apache.org> on 2022/11/24 01:03:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38757: [SPARK-41222][CONNECT][PYTHON] Unify the typing definitions - posted by GitBox <gi...@apache.org> on 2022/11/24 01:04:31 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38759: [SPARK-41224][SPARK-41165][SPARK-41184] Optimized Arrow-based collect implementation to stream from server to client - posted by GitBox <gi...@apache.org> on 2022/11/24 01:07:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38762: [SPARK-41225] [CONNECT] [PYTHON] Disable unsupported functions. - posted by GitBox <gi...@apache.org> on 2022/11/24 01:07:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38762: [SPARK-41225] [CONNECT] [PYTHON] Disable unsupported functions. - posted by GitBox <gi...@apache.org> on 2022/11/24 01:08:02 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #38707: [SPARK-41176][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1042 - posted by GitBox <gi...@apache.org> on 2022/11/24 01:21:34 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #38766: [MINOR][SQL] Fix error message for `UNEXPECTED_INPUT_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/24 01:35:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38767: [SPARK-41183][SQL][FOLLOWUP] Change the name from injectPlanNormalizationRules to injectPlanNormalizationRule - posted by GitBox <gi...@apache.org> on 2022/11/24 01:37:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38767: [SPARK-41183][SQL][FOLLOWUP] Change the name from injectPlanNormalizationRules to injectPlanNormalizationRule - posted by GitBox <gi...@apache.org> on 2022/11/24 01:37:41 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38770: [SPARK-41238][CONNECT][PYTHON] Support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/24 01:39:22 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38710: [SPARK-41179][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1092 - posted by GitBox <gi...@apache.org> on 2022/11/24 02:07:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38757: [SPARK-41222][CONNECT][PYTHON] Unify the typing definitions - posted by GitBox <gi...@apache.org> on 2022/11/24 02:44:23 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38742: [SPARK-41216][CONNECT][PYTHON] Make AnalyzePlan support multiple analysis tasks And implement isLocal/isStreaming/printSchema/semanticHash/sameSemantics/inputFiles - posted by GitBox <gi...@apache.org> on 2022/11/24 02:59:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38743: [SPARK-41215][BUILD][PROTOBUF] Support user configurable protoc executables when building Spark Protobuf. - posted by GitBox <gi...@apache.org> on 2022/11/24 03:00:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38743: [SPARK-41215][BUILD][PROTOBUF] Support user configurable protoc executables when building Spark Protobuf. - posted by GitBox <gi...@apache.org> on 2022/11/24 03:01:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement `DataFrame.crossJoin` - posted by GitBox <gi...@apache.org> on 2022/11/24 03:04:19 UTC, 0 replies.
- [GitHub] [spark] WolverineJiang commented on pull request #38743: [SPARK-41215][BUILD][PROTOBUF] Support user configurable protoc executables when building Spark Protobuf. - posted by GitBox <gi...@apache.org> on 2022/11/24 03:06:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38779: [SPARK-41244][UI] Introducing a Protobuf serializer for UI data on KV store - posted by GitBox <gi...@apache.org> on 2022/11/24 03:29:22 UTC, 10 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38779: [SPARK-41244][UI] Introducing a Protobuf serializer for UI data on KV store - posted by GitBox <gi...@apache.org> on 2022/11/24 03:35:33 UTC, 4 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38761: [SPARK-40988][SQL][TEST] Test case for insert partition should verify value - posted by GitBox <gi...@apache.org> on 2022/11/24 03:39:30 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38768: [SPARK-41230][CONNECT][PYTHON] Remove `str` from Aggregate expression type - posted by GitBox <gi...@apache.org> on 2022/11/24 03:58:35 UTC, 1 replies.
- [GitHub] [spark] wilfred-s opened a new pull request, #38780: [SPARK-41185][K8S][DOCS] Remove ARM limitation for YuniKorn from docs - posted by GitBox <gi...@apache.org> on 2022/11/24 04:05:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38640: [WIP][SPARK-41124][SQL][TEST] Add DSv2 PlanStabilitySuites - posted by GitBox <gi...@apache.org> on 2022/11/24 04:35:18 UTC, 1 replies.
- [GitHub] [spark] yorksity opened a new pull request, #38781: [SPARK-41246][core] Solve the problem of RddId negative - posted by GitBox <gi...@apache.org> on 2022/11/24 04:37:12 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #38780: [SPARK-41185][K8S][DOCS] Remove ARM limitation for YuniKorn from docs - posted by GitBox <gi...@apache.org> on 2022/11/24 04:37:13 UTC, 1 replies.
- [GitHub] [spark] ibuder opened a new pull request, #38782: [SPARK-38728][SQL] Test the error class: FAILED_RENAME_PATH - posted by GitBox <gi...@apache.org> on 2022/11/24 04:42:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38750: [SPARK-41226][SQL] Refactor Spark types by introducing physical types - posted by GitBox <gi...@apache.org> on 2022/11/24 04:43:04 UTC, 9 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38750: [SPARK-41226][SQL] Refactor Spark types by introducing physical types - posted by GitBox <gi...@apache.org> on 2022/11/24 04:51:53 UTC, 1 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #38783: [SPARK-41247][BUILD] Unify the Protobuf versions in Spark connect and Protobuf connector - posted by GitBox <gi...@apache.org> on 2022/11/24 05:05:49 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #38779: [WIP][SPARK-41244][UI] Introducing a Protobuf serializer for UI data on KV store - posted by GitBox <gi...@apache.org> on 2022/11/24 05:07:05 UTC, 0 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #38784: [SPARK-41248] Add "spark.sql.json.enablePartialResults" to enable/disable JSON partial results parsing added in SPARK-40646 - posted by GitBox <gi...@apache.org> on 2022/11/24 05:14:37 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #38784: [SPARK-41248] Add "spark.sql.json.enablePartialResults" to enable/disable JSON partial results parsing added in SPARK-40646 - posted by GitBox <gi...@apache.org> on 2022/11/24 05:17:03 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #38785: [SPARK-41249][SS] Add acceptance test for self-union on streaming query - posted by GitBox <gi...@apache.org> on 2022/11/24 05:22:56 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38785: [SPARK-41249][SS] Add acceptance test for self-union on streaming query - posted by GitBox <gi...@apache.org> on 2022/11/24 05:23:24 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38772: [SPARK-41237][SQL] Assign a name to the error class `_LEGACY_ERROR_TEMP_0030` - posted by GitBox <gi...@apache.org> on 2022/11/24 05:23:41 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/24 05:23:44 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38659: [SPARK-41114][CONNECT] Support local data for LocalRelation - posted by GitBox <gi...@apache.org> on 2022/11/24 05:24:08 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38772: [SPARK-41237][SQL] Assign a name to the error class `_LEGACY_ERROR_TEMP_0030` - posted by GitBox <gi...@apache.org> on 2022/11/24 05:42:06 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38784: [SPARK-41248][SQL] Add "spark.sql.json.enablePartialResults" to enable/disable JSON partial results parsing added in SPARK-40646 - posted by GitBox <gi...@apache.org> on 2022/11/24 05:49:56 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38766: [MINOR][SQL] Fix error message for `UNEXPECTED_INPUT_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/24 05:52:49 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38766: [MINOR][SQL] Fix error message for `UNEXPECTED_INPUT_TYPE` - posted by GitBox <gi...@apache.org> on 2022/11/24 05:53:27 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38782: [SPARK-38728][SQL] Test the error class: FAILED_RENAME_PATH - posted by GitBox <gi...@apache.org> on 2022/11/24 05:55:46 UTC, 4 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38707: [SPARK-41176][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1042 - posted by GitBox <gi...@apache.org> on 2022/11/24 06:00:41 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38707: [SPARK-41176][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1042 - posted by GitBox <gi...@apache.org> on 2022/11/24 06:01:17 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #38769: [SPARK-41228][SQL] Rename & Improve error message for `COLUMN_NOT_IN_GROUP_BY_CLAUSE`. - posted by GitBox <gi...@apache.org> on 2022/11/24 06:04:23 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38786: [SPARK-41250][CONNECT][PYTHON] DataFrame.to_pandas should not return optional pandas dataframe - posted by GitBox <gi...@apache.org> on 2022/11/24 06:11:34 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38786: [SPARK-41250][CONNECT][PYTHON] DataFrame.to_pandas should not return optional pandas dataframe - posted by GitBox <gi...@apache.org> on 2022/11/24 06:13:50 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38710: [SPARK-41179][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1092 - posted by GitBox <gi...@apache.org> on 2022/11/24 06:17:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38783: [SPARK-41247][BUILD] Unify the Protobuf versions in Spark connect and Protobuf connector - posted by GitBox <gi...@apache.org> on 2022/11/24 06:23:19 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38783: [SPARK-41247][BUILD] Unify the Protobuf versions in Spark connect and Protobuf connector - posted by GitBox <gi...@apache.org> on 2022/11/24 06:24:11 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38768: [SPARK-41230][CONNECT][PYTHON] Remove `str` from Aggregate expression type - posted by GitBox <gi...@apache.org> on 2022/11/24 06:37:40 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #38783: [SPARK-41247][BUILD] Unify the Protobuf versions in Spark connect and Protobuf connector - posted by GitBox <gi...@apache.org> on 2022/11/24 06:53:17 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38782: [SPARK-38728][SQL] Test the error class: FAILED_RENAME_PATH - posted by GitBox <gi...@apache.org> on 2022/11/24 07:29:28 UTC, 4 replies.
- [GitHub] [spark] wilfred-s commented on pull request #38780: [SPARK-41185][K8S][DOCS] Remove ARM limitation for YuniKorn from docs - posted by GitBox <gi...@apache.org> on 2022/11/24 07:31:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38770: [SPARK-41238][CONNECT][PYTHON] Support more built-in datatypes - posted by GitBox <gi...@apache.org> on 2022/11/24 07:32:24 UTC, 2 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38787: [SPARK-41251][PS][INFRA] Upgrade pandas from 1.5.1 to 1.5.2 - posted by GitBox <gi...@apache.org> on 2022/11/24 07:32:53 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38774: [SPARK-41240][CONNECT][BUILD][INFRA] Upgrade `Protobuf` to 3.19.5 - posted by GitBox <gi...@apache.org> on 2022/11/24 07:46:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38779: [WIP][SPARK-41244][UI] Introducing a Protobuf serializer for UI data on KV store - posted by GitBox <gi...@apache.org> on 2022/11/24 07:46:50 UTC, 5 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38787: [SPARK-41251][PS][INFRA] Upgrade pandas from 1.5.1 to 1.5.2 - posted by GitBox <gi...@apache.org> on 2022/11/24 07:50:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38745: [SPARK-37099][SQL] Optimize the filter based on rank-like window function by reduce not required rows - posted by GitBox <gi...@apache.org> on 2022/11/24 08:02:01 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38770: [SPARK-41238][CONNECT][PYTHON] Support more built-in datatypes - posted by GitBox <gi...@apache.org> on 2022/11/24 08:02:23 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38788: [SPARK-41252][BUILD] Upgrade arrow from 10.0.0 to 10.0.1 - posted by GitBox <gi...@apache.org> on 2022/11/24 08:02:35 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38770: [SPARK-41238][CONNECT][PYTHON] Support more built-in datatypes - posted by GitBox <gi...@apache.org> on 2022/11/24 08:04:32 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement `DataFrame.crossJoin` - posted by GitBox <gi...@apache.org> on 2022/11/24 08:09:41 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38725: [SPARK-41182][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1102 - posted by GitBox <gi...@apache.org> on 2022/11/24 08:12:19 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38725: [SPARK-41182][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_1102 - posted by GitBox <gi...@apache.org> on 2022/11/24 08:13:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38489: [SPARK-41003][SQL] BHJ LeftAnti does not update numOutputRows when codegen is disabled - posted by GitBox <gi...@apache.org> on 2022/11/24 08:38:29 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38489: [SPARK-41003][SQL] BHJ LeftAnti does not update numOutputRows when codegen is disabled - posted by GitBox <gi...@apache.org> on 2022/11/24 08:39:06 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #38745: [SPARK-37099][SQL] Optimize the filter based on rank-like window function by reduce not required rows - posted by GitBox <gi...@apache.org> on 2022/11/24 08:53:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38786: [SPARK-41250][CONNECT][PYTHON] DataFrame. toPandas should not return optional pandas dataframe - posted by GitBox <gi...@apache.org> on 2022/11/24 08:58:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38786: [SPARK-41250][CONNECT][PYTHON] DataFrame. toPandas should not return optional pandas dataframe - posted by GitBox <gi...@apache.org> on 2022/11/24 08:58:56 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #38779: [WIP][SPARK-41244][UI] Introducing a Protobuf serializer for UI data on KV store - posted by GitBox <gi...@apache.org> on 2022/11/24 09:02:21 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38768: [SPARK-41230][CONNECT][PYTHON] Remove `str` from Aggregate expression type - posted by GitBox <gi...@apache.org> on 2022/11/24 09:03:58 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38223: [SPARK-40770][PYTHON] Improved error messages for applyInPandas for schema mismatch - posted by GitBox <gi...@apache.org> on 2022/11/24 09:16:48 UTC, 4 replies.
- [GitHub] [spark] yaooqinn commented on pull request #38489: [SPARK-41003][SQL] BHJ LeftAnti does not update numOutputRows when codegen is disabled - posted by GitBox <gi...@apache.org> on 2022/11/24 09:17:29 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #38730: [SPARK-41181][SQL] Migrate the map options errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/24 09:26:42 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38774: [SPARK-41240][CONNECT][BUILD][INFRA] Upgrade `Protobuf` to 3.19.5 - posted by GitBox <gi...@apache.org> on 2022/11/24 09:29:07 UTC, 1 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38785: [SPARK-41249][SS][TEST] Add acceptance test for self-union on streaming query - posted by GitBox <gi...@apache.org> on 2022/11/24 09:33:08 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #38765: [SPARK-35531][SQL][FOLLOWUP] Support alter table command with CASE_SENSITIVE is true - posted by GitBox <gi...@apache.org> on 2022/11/24 09:34:00 UTC, 1 replies.
- [GitHub] [spark] desmondcheongzx commented on a diff in pull request #38750: [SPARK-41226][SQL] Refactor Spark types by introducing physical types - posted by GitBox <gi...@apache.org> on 2022/11/24 09:38:53 UTC, 8 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #38785: [SPARK-41249][SS][TEST] Add acceptance test for self-union on streaming query - posted by GitBox <gi...@apache.org> on 2022/11/24 09:42:03 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38785: [SPARK-41249][SS][TEST] Add acceptance test for self-union on streaming query - posted by GitBox <gi...@apache.org> on 2022/11/24 11:26:46 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38785: [SPARK-41249][SS][TEST] Add acceptance test for self-union on streaming query - posted by GitBox <gi...@apache.org> on 2022/11/24 11:28:03 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #36265: [SPARK-38951][SQL] Aggregate aliases override field names in ResolveAggregateFunctions - posted by GitBox <gi...@apache.org> on 2022/11/24 12:10:55 UTC, 0 replies.
- [GitHub] [spark] Yikun opened a new pull request, #38789: [SPARK-41253][K8S][TESTS] Make Spark K8S volcano IT work in Github Action - posted by GitBox <gi...@apache.org> on 2022/11/24 13:18:58 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #38789: [SPARK-41253][K8S][TESTS] Make Spark K8S volcano IT work in Github Action - posted by GitBox <gi...@apache.org> on 2022/11/24 13:20:28 UTC, 1 replies.
- [GitHub] [spark] CavemanIV opened a new pull request, #38790: [SPARK-41254] [Yarn] bugfix wrong usage when check YarnAllocator.rpIdToYarnResource key existence - posted by GitBox <gi...@apache.org> on 2022/11/24 13:25:49 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #38720: [SPARK-41165][SPARK-41184][CONNECT] Fix arrow collect (again) and reenable tests. - posted by GitBox <gi...@apache.org> on 2022/11/24 13:28:49 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #38759: [SPARK-41224][SPARK-41165][SPARK-41184] Optimized Arrow-based collect implementation to stream from server to client - posted by GitBox <gi...@apache.org> on 2022/11/24 13:28:50 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #38640: [WIP][SPARK-41124][SQL][TEST] Add DSv2 PlanStabilitySuites - posted by GitBox <gi...@apache.org> on 2022/11/24 14:12:38 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #38598: [SPARK-41097][CORE][SQL][SS][PROTOBUF] Remove redundant collection conversion base on Scala 2.13 code - posted by GitBox <gi...@apache.org> on 2022/11/24 15:16:05 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #38791: [SPARK-41245][BUILD] Upgrade postgressql to 42.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/24 16:03:10 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38770: [SPARK-41238][CONNECT][PYTHON] Support more built-in datatypes - posted by GitBox <gi...@apache.org> on 2022/11/24 17:31:55 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #38783: [SPARK-41247][BUILD] Unify the Protobuf versions in Spark connect and Protobuf connector - posted by GitBox <gi...@apache.org> on 2022/11/24 19:14:15 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38730: [SPARK-41181][SQL] Migrate the map options errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/24 19:49:45 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38792: [SPARK-41255] [CONNECT] Rename RemoteSparkSession - posted by GitBox <gi...@apache.org> on 2022/11/24 20:05:54 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38792: [SPARK-41255] [CONNECT] Rename RemoteSparkSession - posted by GitBox <gi...@apache.org> on 2022/11/24 20:09:32 UTC, 3 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #38760: [SPARK-41219][SQL] Decimal changePrecision should work with decimal(0, 0) - posted by GitBox <gi...@apache.org> on 2022/11/24 20:17:57 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/24 20:46:10 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/24 20:53:11 UTC, 10 replies.
- [GitHub] [spark] amaliujia commented on pull request #38792: [SPARK-41255] [CONNECT] Rename RemoteSparkSession - posted by GitBox <gi...@apache.org> on 2022/11/24 20:55:05 UTC, 2 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #38784: [SPARK-41248][SQL] Add "spark.sql.json.enablePartialResults" to enable/disable JSON partial results parsing added in SPARK-40646 - posted by GitBox <gi...@apache.org> on 2022/11/24 21:04:55 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/24 23:57:01 UTC, 5 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37440: [SPARK-40076] [SQL] Support number-only column names in ORC data sources when orc impl is hive - posted by GitBox <gi...@apache.org> on 2022/11/25 00:19:37 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37406: [SPARK-39921][SQL] SkewJoin--Stream side skew in BroadcastHashJoin - posted by GitBox <gi...@apache.org> on 2022/11/25 00:19:38 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/25 00:25:54 UTC, 19 replies.
- [GitHub] [spark] Yikun closed pull request #38787: [SPARK-41251][PS][INFRA] Upgrade pandas from 1.5.1 to 1.5.2 - posted by GitBox <gi...@apache.org> on 2022/11/25 00:44:01 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #38787: [SPARK-41251][PS][INFRA] Upgrade pandas from 1.5.1 to 1.5.2 - posted by GitBox <gi...@apache.org> on 2022/11/25 00:44:31 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #38689: [SPARK-41171][SQL] Push down filter through window when partitionSpec is empty - posted by GitBox <gi...@apache.org> on 2022/11/25 01:07:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38770: [SPARK-41238][CONNECT][PYTHON] Support more built-in datatypes - posted by GitBox <gi...@apache.org> on 2022/11/25 01:26:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38768: [SPARK-41230][CONNECT][PYTHON] Remove `str` from Aggregate expression type - posted by GitBox <gi...@apache.org> on 2022/11/25 01:32:54 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38770: [SPARK-41238][CONNECT][PYTHON] Support more built-in datatypes - posted by GitBox <gi...@apache.org> on 2022/11/25 01:36:26 UTC, 0 replies.
- [GitHub] [spark] Yikun opened a new pull request, #38794: [SPARK-41257][INFRA] Upgrade actions/labeler to v4 - posted by GitBox <gi...@apache.org> on 2022/11/25 01:55:53 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #38794: [SPARK-41257][INFRA] Upgrade actions/labeler to v4 - posted by GitBox <gi...@apache.org> on 2022/11/25 01:57:00 UTC, 1 replies.
- [GitHub] [spark] Yikun closed pull request #38794: [SPARK-41257][INFRA] Upgrade actions/labeler to v4 - posted by GitBox <gi...@apache.org> on 2022/11/25 02:18:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/25 02:26:45 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/25 02:31:56 UTC, 1 replies.
- [GitHub] [spark-docker] Yikun opened a new pull request, #24: [SPARK-41258][INFRA] Upgrade docker and actions to cleanup warnning - posted by GitBox <gi...@apache.org> on 2022/11/25 02:36:00 UTC, 0 replies.
- [GitHub] [spark] Yikf opened a new pull request, #38795: [SPARK-41259][SQL] Spark-sql cli query results should correspond to schema - posted by GitBox <gi...@apache.org> on 2022/11/25 02:42:04 UTC, 0 replies.
- [GitHub] [spark] Yikf commented on pull request #38795: [SPARK-41259][SQL] Spark-sql cli query results should correspond to schema - posted by GitBox <gi...@apache.org> on 2022/11/25 02:42:39 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #38467: [SPARK-40987][CORE] Avoid creating a directory when deleting a block, causing DAGScheduler to not work - posted by GitBox <gi...@apache.org> on 2022/11/25 02:50:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38796: [SPARK-41260][PYTHON][SS] Cast NumPy instances to Python primitive types in GroupState update - posted by GitBox <gi...@apache.org> on 2022/11/25 03:30:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38796: [SPARK-41260][PYTHON][SS] Cast NumPy instances to Python primitive types in GroupState update - posted by GitBox <gi...@apache.org> on 2022/11/25 03:31:12 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38796: [SPARK-41260][PYTHON][SS] Cast NumPy instances to Python primitive types in GroupState update - posted by GitBox <gi...@apache.org> on 2022/11/25 03:31:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38742: [SPARK-41216][CONNECT][PYTHON] Implement `DataFrame.{isLocal, isStreaming, printSchema, inputFiles}` - posted by GitBox <gi...@apache.org> on 2022/11/25 03:58:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38742: [SPARK-41216][CONNECT][PYTHON] Implement `DataFrame.{isLocal, isStreaming, printSchema, inputFiles}` - posted by GitBox <gi...@apache.org> on 2022/11/25 03:58:25 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38797: [TEST ONLY] Upgrade buf to 1.9.0 - posted by GitBox <gi...@apache.org> on 2022/11/25 04:15:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38742: [SPARK-41216][CONNECT][PYTHON] Implement `DataFrame.{isLocal, isStreaming, printSchema, inputFiles}` - posted by GitBox <gi...@apache.org> on 2022/11/25 05:31:13 UTC, 0 replies.
- [GitHub] [spark] ibuder commented on a diff in pull request #38782: [SPARK-38728][SQL] Test the error class: FAILED_RENAME_PATH - posted by GitBox <gi...@apache.org> on 2022/11/25 05:31:53 UTC, 5 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/25 05:38:07 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38782: [SPARK-38728][SQL] Test the error class: FAILED_RENAME_PATH - posted by GitBox <gi...@apache.org> on 2022/11/25 05:54:25 UTC, 4 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38795: [SPARK-41259][SQL] Spark-sql cli query results should correspond to schema - posted by GitBox <gi...@apache.org> on 2022/11/25 05:58:58 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/25 05:59:01 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38792: [SPARK-41255] [CONNECT] Rename RemoteSparkSession - posted by GitBox <gi...@apache.org> on 2022/11/25 05:59:04 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38791: [SPARK-41245][BUILD] Upgrade `postgresql` to 42.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/25 05:59:08 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38790: [SPARK-41254] [YARN] bugfix wrong usage when check YarnAllocator.rpIdToYarnResource key existence - posted by GitBox <gi...@apache.org> on 2022/11/25 05:59:11 UTC, 0 replies.
- [GitHub] [spark] wineternity commented on pull request #38702: [SPARK-41187][CORE] LiveExecutor MemoryLeak in AppStatusListener when ExecutorLost happen - posted by GitBox <gi...@apache.org> on 2022/11/25 06:03:08 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38730: [SPARK-41181][SQL] Migrate the map options errors onto error classes - posted by GitBox <gi...@apache.org> on 2022/11/25 06:03:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38736: [SPARK-41214][SQL] - SQL Metrics are missing from Spark UI when AQE for Cached DataFrame is enabled - posted by GitBox <gi...@apache.org> on 2022/11/25 06:09:52 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38736: [SPARK-41214][SQL] - SQL Metrics are missing from Spark UI when AQE for Cached DataFrame is enabled - posted by GitBox <gi...@apache.org> on 2022/11/25 06:16:20 UTC, 2 replies.
- [GitHub] [spark-docker] Yikun closed pull request #24: [SPARK-41258][INFRA] Upgrade docker and actions to cleanup warnning - posted by GitBox <gi...@apache.org> on 2022/11/25 06:57:38 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on pull request #24: [SPARK-41258][INFRA] Upgrade docker and actions to cleanup warnning - posted by GitBox <gi...@apache.org> on 2022/11/25 06:57:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38702: [SPARK-41187][CORE] LiveExecutor MemoryLeak in AppStatusListener when ExecutorLost happen - posted by GitBox <gi...@apache.org> on 2022/11/25 06:58:52 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38689: [SPARK-41171][SQL] Push down filter through window when partitionSpec is empty - posted by GitBox <gi...@apache.org> on 2022/11/25 06:59:32 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/25 07:07:38 UTC, 6 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #38798: [SPARK-41261][SS] Fix issue for applyInPandasWithState when the columns of grouping keys are not placed in order from earliest - posted by GitBox <gi...@apache.org> on 2022/11/25 07:21:05 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38798: [SPARK-41261][SS] Fix issue for applyInPandasWithState when the columns of grouping keys are not placed in order from earliest - posted by GitBox <gi...@apache.org> on 2022/11/25 07:21:26 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38790: [SPARK-41254] [YARN] bugfix wrong usage when check YarnAllocator.rpIdToYarnResource key existence - posted by GitBox <gi...@apache.org> on 2022/11/25 07:21:49 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/25 07:25:49 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #38796: [SPARK-41260][PYTHON][SS] Cast NumPy instances to Python primitive types in GroupState update - posted by GitBox <gi...@apache.org> on 2022/11/25 07:28:45 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38788: [SPARK-41252][BUILD] Upgrade arrow from 10.0.0 to 10.0.1 - posted by GitBox <gi...@apache.org> on 2022/11/25 07:56:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38742: [SPARK-41216][CONNECT][PYTHON] Implement `DataFrame.{isLocal, isStreaming, printSchema, inputFiles}` - posted by GitBox <gi...@apache.org> on 2022/11/25 08:12:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38800: [SPARK-41264][CONNECT][PYTHON] Make Literal support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/25 08:28:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38797: [SPARK-41263][CONNECT][INFRA] Upgrade buf to 1.9.0 - posted by GitBox <gi...@apache.org> on 2022/11/25 08:52:46 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38788: [SPARK-41252][BUILD] Upgrade arrow from 10.0.0 to 10.0.1 - posted by GitBox <gi...@apache.org> on 2022/11/25 08:55:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38788: [SPARK-41252][BUILD] Upgrade arrow from 10.0.0 to 10.0.1 - posted by GitBox <gi...@apache.org> on 2022/11/25 08:56:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38791: [SPARK-41245][BUILD] Upgrade `postgresql` to 42.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/25 09:01:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38791: [SPARK-41245][BUILD] Upgrade `postgresql` to 42.5.1 - posted by GitBox <gi...@apache.org> on 2022/11/25 09:01:55 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38774: [SPARK-41240][CONNECT][BUILD][INFRA][DOCS] Upgrade `Protobuf` to 3.19.5 - posted by GitBox <gi...@apache.org> on 2022/11/25 09:10:14 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #38689: [SPARK-41171][SQL] Push down filter through window when partitionSpec is empty - posted by GitBox <gi...@apache.org> on 2022/11/25 09:10:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38774: [SPARK-41240][CONNECT][BUILD][INFRA][DOCS] Upgrade `Protobuf` to 3.19.5 - posted by GitBox <gi...@apache.org> on 2022/11/25 09:11:56 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38800: [SPARK-41264][CONNECT][PYTHON][WIP] Make Literal support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/25 09:15:10 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38782: [SPARK-38728][SQL] Test the error class: FAILED_RENAME_PATH - posted by GitBox <gi...@apache.org> on 2022/11/25 09:55:21 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38781: [SPARK-41246][core] Solve the problem of RddId negative - posted by GitBox <gi...@apache.org> on 2022/11/25 09:55:24 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38780: [SPARK-41185][K8S][DOCS] Remove ARM limitation for YuniKorn from docs - posted by GitBox <gi...@apache.org> on 2022/11/25 09:55:27 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/25 10:01:34 UTC, 0 replies.
- [GitHub] [spark] beliefer closed pull request #38745: [SPARK-37099][SQL] Optimize the filter based on rank-like window function by reduce not required rows - posted by GitBox <gi...@apache.org> on 2022/11/25 10:06:42 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38800: [SPARK-41264][CONNECT][PYTHON][WIP] Make Literal support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/25 10:25:09 UTC, 1 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38223: [SPARK-40770][PYTHON] Improved error messages for applyInPandas for schema mismatch - posted by GitBox <gi...@apache.org> on 2022/11/25 11:16:26 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement `DataFrame.crossJoin` - posted by GitBox <gi...@apache.org> on 2022/11/25 11:54:38 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38777: [SPARK-41151][FOLLOW-UP][SQL] Keep built-in file _metadata fields nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/25 11:54:41 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38776: [WIP][SQL] Refactor Analyzer by moving several helper public methods to the new Analyzer object - posted by GitBox <gi...@apache.org> on 2022/11/25 11:54:44 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38774: [SPARK-41240][CONNECT][BUILD][INFRA][DOCS] Upgrade `Protobuf` to 3.19.5 - posted by GitBox <gi...@apache.org> on 2022/11/25 11:54:47 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38773: [SPARK-41016][PS] Identical expressions should not be used on both sides of a binary operator - posted by GitBox <gi...@apache.org> on 2022/11/25 11:54:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/25 12:01:04 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38801: [SPARK-40539] [CONNECT] [PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/25 13:46:07 UTC, 0 replies.
- [GitHub] [spark] gaoyajun02 commented on pull request #38751: [SPARK-40872][3.3] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/25 14:29:03 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #38801: [SPARK-40539] [CONNECT] [PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/25 15:35:09 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38801: [SPARK-40539] [CONNECT] [PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/25 15:49:56 UTC, 12 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38765: [SPARK-35531][SQL][FOLLOWUP] Support alter table command with CASE_SENSITIVE is true - posted by GitBox <gi...@apache.org> on 2022/11/25 15:57:14 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38588: [SPARK-41086][SQL] Consolidate SecondArgumentXXX error to INVALID_PARAMETER_VALUE - posted by GitBox <gi...@apache.org> on 2022/11/25 16:10:18 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38802: [WIP] Packaging for Spark Connect Preview - posted by GitBox <gi...@apache.org> on 2022/11/25 16:44:22 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on a diff in pull request #38020: [SPARK-39877][FOLLOW-UP] PySpark DataFrame.unpivot allows for column names only - posted by GitBox <gi...@apache.org> on 2022/11/25 17:05:28 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38742: [SPARK-41216][CONNECT][PYTHON] Implement `DataFrame.{isLocal, isStreaming, printSchema, inputFiles}` - posted by GitBox <gi...@apache.org> on 2022/11/25 17:27:44 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38779: [SPARK-41244][UI] Introducing a Protobuf serializer for UI data on KV store - posted by GitBox <gi...@apache.org> on 2022/11/25 17:35:10 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38803: [SPARK-41114] [CONNECT] [PYTHON] [FOLLOW-UP] Python Client support for local data - posted by GitBox <gi...@apache.org> on 2022/11/25 17:48:11 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38803: [SPARK-41114] [CONNECT] [PYTHON] [FOLLOW-UP] Python Client support for local data - posted by GitBox <gi...@apache.org> on 2022/11/25 17:48:37 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #38774: [SPARK-41240][CONNECT][BUILD][INFRA][DOCS] Upgrade `Protobuf` to 3.19.5 - posted by GitBox <gi...@apache.org> on 2022/11/25 18:08:53 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #38783: [SPARK-41247][BUILD] Unify the Protobuf versions in Spark connect and Protobuf connector - posted by GitBox <gi...@apache.org> on 2022/11/25 18:25:57 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #38789: [SPARK-41253][K8S][TESTS] Make Spark K8S volcano IT work in Github Action - posted by GitBox <gi...@apache.org> on 2022/11/25 18:44:25 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #38790: [SPARK-41254] [YARN] bugfix wrong usage when check YarnAllocator.rpIdToYarnResource key existence - posted by GitBox <gi...@apache.org> on 2022/11/25 18:44:50 UTC, 0 replies.
- [GitHub] [spark] zero323 opened a new pull request, #38804: [SPARK-41267][R][SQL] Add unpivot / melt to SparkR - posted by GitBox <gi...@apache.org> on 2022/11/25 18:54:03 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38803: [SPARK-41114] [CONNECT] [PYTHON] [FOLLOW-UP] Python Client support for local data - posted by GitBox <gi...@apache.org> on 2022/11/25 18:56:26 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #38802: [WIP] Packaging for Spark Connect Preview - posted by GitBox <gi...@apache.org> on 2022/11/25 19:05:07 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38802: [WIP] Packaging for Spark Connect Preview - posted by GitBox <gi...@apache.org> on 2022/11/25 19:15:55 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement `DataFrame.crossJoin` - posted by GitBox <gi...@apache.org> on 2022/11/25 19:16:03 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #38801: [SPARK-40539] [CONNECT] [PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/25 19:35:06 UTC, 14 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #38805: [MINOR][SQL] Fix the pretty name of the `AnyValue` expression - posted by GitBox <gi...@apache.org> on 2022/11/25 19:49:49 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38801: [SPARK-40539] [CONNECT] [PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/25 20:26:44 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38797: [SPARK-41263][CONNECT][INFRA] Upgrade buf to 1.9.0 - posted by GitBox <gi...@apache.org> on 2022/11/25 21:00:57 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/25 21:06:24 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/25 21:07:29 UTC, 5 replies.
- [GitHub] [spark] hvanhovell commented on pull request #38792: [SPARK-41255] [CONNECT] Rename RemoteSparkSession - posted by GitBox <gi...@apache.org> on 2022/11/25 22:12:13 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #38792: [SPARK-41255] [CONNECT] Rename RemoteSparkSession - posted by GitBox <gi...@apache.org> on 2022/11/25 22:12:46 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37550: [SPARK-40125][INFRA] Add separate infra image for lint job - posted by GitBox <gi...@apache.org> on 2022/11/26 00:18:32 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37341: [SPARK-38639][SQL]Ignore the corrupted rows that failed to deserialize in hive sequence table - posted by GitBox <gi...@apache.org> on 2022/11/26 00:18:34 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37014: [SPARK-39624][SQL] Support coalesce partition through CartesianProduct - posted by GitBox <gi...@apache.org> on 2022/11/26 00:18:36 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #38789: [SPARK-41253][K8S][TESTS] Make Spark K8S volcano IT work in Github Action - posted by GitBox <gi...@apache.org> on 2022/11/26 01:14:06 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/26 02:52:09 UTC, 2 replies.
- [GitHub] [spark-docker] Yikun opened a new pull request, #25: [SPARK-41269][INFRA] Move image matrix into version's workflow - posted by GitBox <gi...@apache.org> on 2022/11/26 03:02:07 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38807: [SPARK-41270][CONNECT] Add Catalog tableExists and databaseExists in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/26 04:41:58 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38807: [SPARK-41270][CONNECT] Add Catalog tableExists and databaseExists in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/26 04:41:59 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/26 05:34:59 UTC, 5 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38801: [SPARK-40539] [CONNECT] [PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/26 05:51:55 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on pull request #25: [SPARK-41269][INFRA] Move image matrix into version's workflow - posted by GitBox <gi...@apache.org> on 2022/11/26 06:11:24 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/26 06:41:13 UTC, 4 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38754: [SPARK-41180][SQL] Assign an error class to "Cannot parse the data type" - posted by GitBox <gi...@apache.org> on 2022/11/26 10:30:35 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #38808: [SPARK-41272][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_2019 - posted by GitBox <gi...@apache.org> on 2022/11/26 12:21:51 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38781: [SPARK-41246][core] Solve the problem of RddId negative - posted by GitBox <gi...@apache.org> on 2022/11/26 16:55:43 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38754: [SPARK-41180][SQL] Reuse `INVALID_SCHEMA` instead of `_LEGACY_ERROR_TEMP_1227` - posted by GitBox <gi...@apache.org> on 2022/11/26 18:38:56 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/26 21:04:47 UTC, 4 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38807: [SPARK-41270][CONNECT] Add Catalog tableExists and databaseExists in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/26 21:31:15 UTC, 0 replies.
- [GitHub] [spark] tomvanbussel commented on a diff in pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/26 22:18:57 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37550: [SPARK-40125][INFRA] Add separate infra image for lint job - posted by GitBox <gi...@apache.org> on 2022/11/27 00:22:04 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37536: [WIP][SPARK-40100][SQL] Add DataType class for Int128 type - posted by GitBox <gi...@apache.org> on 2022/11/27 00:22:05 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37522: [WIP][SPARK-40083][SHUFFLE] Add shuffle index cache timebased expire policy - posted by GitBox <gi...@apache.org> on 2022/11/27 00:22:06 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37451: [SPARK-40017][SQL] Adaptive adjustment `spark.sql.adaptive.advisoryPartitionSizeInBytes` - posted by GitBox <gi...@apache.org> on 2022/11/27 00:22:08 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37341: [SPARK-38639][SQL]Ignore the corrupted rows that failed to deserialize in hive sequence table - posted by GitBox <gi...@apache.org> on 2022/11/27 00:22:11 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37014: [SPARK-39624][SQL] Support coalesce partition through CartesianProduct - posted by GitBox <gi...@apache.org> on 2022/11/27 00:22:11 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38809: [WIP][SPARK-41273][BUILD] Update plugins to latest versions - posted by GitBox <gi...@apache.org> on 2022/11/27 00:32:15 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38808: [SPARK-41272][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_2019 - posted by GitBox <gi...@apache.org> on 2022/11/27 00:32:18 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38807: [SPARK-41270][CONNECT] Add Catalog tableExists and databaseExists in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/27 00:32:21 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/27 00:32:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38803: [SPARK-41114] [CONNECT] [PYTHON] [FOLLOW-UP] Python Client support for local data - posted by GitBox <gi...@apache.org> on 2022/11/27 01:55:00 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38751: [SPARK-40872][3.3] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/27 01:57:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38798: [SPARK-41261][PYTHON][SS] Fix issue for applyInPandasWithState when the columns of grouping keys are not placed in order from earliest - posted by GitBox <gi...@apache.org> on 2022/11/27 02:01:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38798: [SPARK-41261][PYTHON][SS] Fix issue for applyInPandasWithState when the columns of grouping keys are not placed in order from earliest - posted by GitBox <gi...@apache.org> on 2022/11/27 02:01:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38804: [SPARK-41267][R][SQL] Add unpivot / melt to SparkR - posted by GitBox <gi...@apache.org> on 2022/11/27 02:02:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38804: [SPARK-41267][R][SQL] Add unpivot / melt to SparkR - posted by GitBox <gi...@apache.org> on 2022/11/27 02:02:40 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #38781: [SPARK-41246][core] Solve the problem of RddId negative - posted by GitBox <gi...@apache.org> on 2022/11/27 02:24:29 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #38751: [SPARK-40872][3.3] Fallback to original shuffle block when a push-merged shuffle chunk is zero-size - posted by GitBox <gi...@apache.org> on 2022/11/27 03:25:59 UTC, 0 replies.
- [GitHub] [spark] DmytroMitin commented on pull request #38740: [SQL] Add product encoders for local classes - posted by GitBox <gi...@apache.org> on 2022/11/27 04:19:54 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38803: [SPARK-41114] [CONNECT] [PYTHON] [FOLLOW-UP] Python Client support for local data - posted by GitBox <gi...@apache.org> on 2022/11/27 04:28:14 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38802: [WIP] Packaging for Spark Connect Preview - posted by GitBox <gi...@apache.org> on 2022/11/27 04:28:17 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38801: [SPARK-40539] [CONNECT] [PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/27 04:28:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38810: [SPARK-41275][BUILD] Upgrade pickle to 1.3 - posted by GitBox <gi...@apache.org> on 2022/11/27 05:44:13 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38803: [SPARK-41114] [CONNECT] [PYTHON] [FOLLOW-UP] Python Client support for local data - posted by GitBox <gi...@apache.org> on 2022/11/27 06:00:44 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38811: [SPARK-41276][SQL][MLLIB] - posted by GitBox <gi...@apache.org> on 2022/11/27 06:07:18 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38805: [MINOR][SQL] Fix the pretty name of the `AnyValue` expression - posted by GitBox <gi...@apache.org> on 2022/11/27 07:33:01 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38805: [MINOR][SQL] Fix the pretty name of the `AnyValue` expression - posted by GitBox <gi...@apache.org> on 2022/11/27 07:33:38 UTC, 0 replies.
- [GitHub] [spark] zero323 commented on a diff in pull request #38168: fix(type): add bool and List[bool] for join's on input - posted by GitBox <gi...@apache.org> on 2022/11/27 11:10:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38812: [MINOR][TESTS] Restore the code style check of `QueryExecutionErrorsSuite` - posted by GitBox <gi...@apache.org> on 2022/11/27 11:15:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38812: [MINOR][SQL][TESTS] Restore the code style check of `QueryExecutionErrorsSuite` - posted by GitBox <gi...@apache.org> on 2022/11/27 12:02:51 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38812: [MINOR][SQL][TESTS] Restore the code style check of `QueryExecutionErrorsSuite` - posted by GitBox <gi...@apache.org> on 2022/11/27 12:04:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38812: [MINOR][SQL][TESTS] Restore the code style check of `QueryExecutionErrorsSuite` - posted by GitBox <gi...@apache.org> on 2022/11/27 12:42:38 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38784: [SPARK-41248][SQL] Add "spark.sql.json.enablePartialResults" to enable/disable JSON partial results parsing added in SPARK-40646 - posted by GitBox <gi...@apache.org> on 2022/11/27 15:20:26 UTC, 0 replies.
- [GitHub] [spark] ibuder commented on pull request #38782: [SPARK-38728][SQL] Test the error class: FAILED_RENAME_PATH - posted by GitBox <gi...@apache.org> on 2022/11/27 17:24:33 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/27 17:55:53 UTC, 6 replies.
- [GitHub] [spark] zero323 commented on pull request #38804: [SPARK-41267][R][SQL] Add unpivot / melt to SparkR - posted by GitBox <gi...@apache.org> on 2022/11/27 18:12:51 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38803: [SPARK-41114] [CONNECT] [PYTHON] [FOLLOW-UP] Python Client support for local data - posted by GitBox <gi...@apache.org> on 2022/11/27 18:38:18 UTC, 3 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38813: [SPARK-41278][CONNECT] Clean up unused QualifiedAttribute in Expression.proto - posted by GitBox <gi...@apache.org> on 2022/11/27 19:22:13 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38813: [SPARK-41278][CONNECT] Clean up unused QualifiedAttribute in Expression.proto - posted by GitBox <gi...@apache.org> on 2022/11/27 19:22:43 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38808: [SPARK-41272][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_2019 - posted by GitBox <gi...@apache.org> on 2022/11/27 19:45:14 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38754: [WIP][SPARK-41180][SQL] Reuse `INVALID_SCHEMA` instead of `_LEGACY_ERROR_TEMP_1227` - posted by GitBox <gi...@apache.org> on 2022/11/27 19:49:30 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38782: [SPARK-38728][SQL] Test the error class: FAILED_RENAME_PATH - posted by GitBox <gi...@apache.org> on 2022/11/27 21:29:34 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38728: [SPARK-41204] [CONNECT] Migrate custom exceptions to use Spark exceptions - posted by GitBox <gi...@apache.org> on 2022/11/27 21:59:53 UTC, 10 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38728: [SPARK-41204] [CONNECT] Migrate custom exceptions to use Spark exceptions - posted by GitBox <gi...@apache.org> on 2022/11/27 22:00:13 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37522: [WIP][SPARK-40083][SHUFFLE] Add shuffle index cache timebased expire policy - posted by GitBox <gi...@apache.org> on 2022/11/28 00:19:13 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37536: [WIP][SPARK-40100][SQL] Add DataType class for Int128 type - posted by GitBox <gi...@apache.org> on 2022/11/28 00:19:13 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37451: [SPARK-40017][SQL] Adaptive adjustment `spark.sql.adaptive.advisoryPartitionSizeInBytes` - posted by GitBox <gi...@apache.org> on 2022/11/28 00:19:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36336: [SPARK-38692][SQL] Use error classes in the compilation errors of function args - posted by GitBox <gi...@apache.org> on 2022/11/28 00:19:15 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun closed pull request #25: [SPARK-41269][INFRA] Move image matrix into version's workflow - posted by GitBox <gi...@apache.org> on 2022/11/28 01:37:05 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38814: [SPARK-41280][CONNECT] Implement DataFrame.tail - posted by GitBox <gi...@apache.org> on 2022/11/28 01:37:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38803: [SPARK-41114] [CONNECT] [PYTHON] [FOLLOW-UP] Python Client support for local data - posted by GitBox <gi...@apache.org> on 2022/11/28 02:06:41 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38754: [WIP][SPARK-41180][SQL] Reuse `INVALID_SCHEMA` instead of `_LEGACY_ERROR_TEMP_1227` - posted by GitBox <gi...@apache.org> on 2022/11/28 02:28:02 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38815: [SPARK-41225][CONNECT][PYTHON][FOLLOWUP] Disable `semanticHash`, `sameSemantics`, `_repr_html_ ` - posted by GitBox <gi...@apache.org> on 2022/11/28 02:37:00 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #38807: [SPARK-41270][CONNECT] Add Catalog tableExists and databaseExists in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/28 02:43:09 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38807: [SPARK-41270][CONNECT] Add Catalog tableExists and databaseExists in Connect proto - posted by GitBox <gi...@apache.org> on 2022/11/28 02:43:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/28 03:30:14 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38800: [SPARK-41264][CONNECT][PYTHON] Make Literal support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/28 03:30:59 UTC, 2 replies.
- [GitHub] [spark] schlosna commented on a diff in pull request #38781: [SPARK-41246][core] Solve the problem of RddId negative - posted by GitBox <gi...@apache.org> on 2022/11/28 03:43:30 UTC, 1 replies.
- [GitHub] [spark-docker] Yikun opened a new pull request, #26: [SPARK-41287][INFRA] Add test workflow to help self-build image test in fork repo - posted by GitBox <gi...@apache.org> on 2022/11/28 03:51:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38810: [SPARK-41275][BUILD] Upgrade pickle to 1.3 - posted by GitBox <gi...@apache.org> on 2022/11/28 04:21:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38810: [SPARK-41275][BUILD] Upgrade pickle to 1.3 - posted by GitBox <gi...@apache.org> on 2022/11/28 04:21:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38813: [SPARK-41278][CONNECT] Clean up unused QualifiedAttribute in Expression.proto - posted by GitBox <gi...@apache.org> on 2022/11/28 04:28:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38813: [SPARK-41278][CONNECT] Clean up unused QualifiedAttribute in Expression.proto - posted by GitBox <gi...@apache.org> on 2022/11/28 04:28:46 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38781: [SPARK-41246][core] Solve the problem of RddId negative - posted by GitBox <gi...@apache.org> on 2022/11/28 04:29:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38813: [SPARK-41278][CONNECT] Clean up unused QualifiedAttribute in Expression.proto - posted by GitBox <gi...@apache.org> on 2022/11/28 04:36:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38813: [SPARK-41278][CONNECT] Clean up unused QualifiedAttribute in Expression.proto - posted by GitBox <gi...@apache.org> on 2022/11/28 04:42:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38810: [SPARK-41275][BUILD] Upgrade pickle to 1.3 - posted by GitBox <gi...@apache.org> on 2022/11/28 04:56:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38814: [SPARK-41280][CONNECT] Implement DataFrame.tail - posted by GitBox <gi...@apache.org> on 2022/11/28 05:45:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38814: [SPARK-41280][CONNECT] Implement DataFrame.tail - posted by GitBox <gi...@apache.org> on 2022/11/28 05:45:18 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38816: [SPARK-41291][CONNECT][PYTHON] `DataFrame.explain` should print and return None - posted by GitBox <gi...@apache.org> on 2022/11/28 05:47:09 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38814: [SPARK-41280][CONNECT] Implement DataFrame.tail - posted by GitBox <gi...@apache.org> on 2022/11/28 05:52:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38816: [SPARK-41291][CONNECT][PYTHON] `DataFrame.explain` should print and return None - posted by GitBox <gi...@apache.org> on 2022/11/28 05:53:27 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38797: [SPARK-41263][CONNECT][INFRA] Upgrade buf to 1.9.0 - posted by GitBox <gi...@apache.org> on 2022/11/28 06:06:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38801: [SPARK-40539] [CONNECT] [PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/28 06:44:29 UTC, 0 replies.
- [GitHub] [spark] mile95 opened a new pull request, #38817: Fix typo in pydoc for pyspark.sql.function.from_utc_timestamp - posted by GitBox <gi...@apache.org> on 2022/11/28 06:44:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38815: [SPARK-41225][CONNECT][PYTHON][FOLLOWUP] Disable `semanticHash`, `sameSemantics`, `_repr_html_ ` - posted by GitBox <gi...@apache.org> on 2022/11/28 06:50:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38815: [SPARK-41225][CONNECT][PYTHON][FOLLOWUP] Disable `semanticHash`, `sameSemantics`, `_repr_html_ ` - posted by GitBox <gi...@apache.org> on 2022/11/28 06:51:30 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38816: [SPARK-41291][CONNECT][PYTHON] `DataFrame.explain` should print and return None - posted by GitBox <gi...@apache.org> on 2022/11/28 07:24:35 UTC, 0 replies.
- [GitHub] [spark-docker] martin-g commented on a diff in pull request #26: [SPARK-41287][INFRA] Add test workflow to help self-build image test in fork repo - posted by GitBox <gi...@apache.org> on 2022/11/28 07:34:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38816: [SPARK-41291][CONNECT][PYTHON] `DataFrame.explain` should print and return None - posted by GitBox <gi...@apache.org> on 2022/11/28 07:48:50 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on pull request #26: [SPARK-41287][INFRA] Add test workflow to help self-build image test in fork repo - posted by GitBox <gi...@apache.org> on 2022/11/28 08:14:54 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/28 08:17:37 UTC, 0 replies.
- [GitHub] [spark-docker] Yikun commented on a diff in pull request #26: [SPARK-41287][INFRA] Add test workflow to help self-build image test in fork repo - posted by GitBox <gi...@apache.org> on 2022/11/28 08:39:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38489: [SPARK-41003][SQL] BHJ LeftAnti does not update numOutputRows when codegen is disabled - posted by GitBox <gi...@apache.org> on 2022/11/28 08:45:08 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38728: [SPARK-41204] [CONNECT] Migrate custom exceptions to use Spark exceptions - posted by GitBox <gi...@apache.org> on 2022/11/28 08:45:19 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38808: [SPARK-41272][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_2019 - posted by GitBox <gi...@apache.org> on 2022/11/28 09:00:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38808: [SPARK-41272][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_2019 - posted by GitBox <gi...@apache.org> on 2022/11/28 09:01:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38818: [SPARK-41238][CONNECT][PYTHON] Support `DayTimeIntervalType` in the client - posted by GitBox <gi...@apache.org> on 2022/11/28 09:04:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38800: [SPARK-41264][CONNECT][PYTHON] Make Literal support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/28 09:06:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38819: [SPARK-41148][CONNECT][PYTHON] Implement `DataFrame.dropna ` and `DataFrame.na.drop ` - posted by GitBox <gi...@apache.org> on 2022/11/28 09:10:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38819: [SPARK-41148][CONNECT][PYTHON] Implement `DataFrame.dropna ` and `DataFrame.na.drop ` - posted by GitBox <gi...@apache.org> on 2022/11/28 09:12:25 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38769: [SPARK-41228][SQL] Rename & Improve error message for `COLUMN_NOT_IN_GROUP_BY_CLAUSE`. - posted by GitBox <gi...@apache.org> on 2022/11/28 09:21:16 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/28 09:24:26 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/28 09:27:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38817: [MINOR][DOC] Fix typo in pydoc for pyspark.sql.function.from_utc_timestamp - posted by GitBox <gi...@apache.org> on 2022/11/28 09:31:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38817: [MINOR][DOC] Fix typo in pydoc for pyspark.sql.function.from_utc_timestamp - posted by GitBox <gi...@apache.org> on 2022/11/28 09:31:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38809: [SPARK-41273][BUILD] Update plugins to latest versions - posted by GitBox <gi...@apache.org> on 2022/11/28 09:33:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38809: [SPARK-41273][BUILD] Update plugins to latest versions - posted by GitBox <gi...@apache.org> on 2022/11/28 09:33:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38820: [SPARK-41293][TESTS] Extract a general `assertError` method to deduplicate code in `ExpressionTypeCheckingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/28 09:35:05 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38820: [SPARK-41293][TESTS] Extract a general `assertError` method to deduplicate code in `ExpressionTypeCheckingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/28 09:36:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38820: [SPARK-41293][TESTS] Extract a general `assertError` method to deduplicate code in `ExpressionTypeCheckingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/28 09:37:12 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38819: [SPARK-41148][CONNECT][PYTHON] Implement `DataFrame.dropna` and `DataFrame.na.drop` - posted by GitBox <gi...@apache.org> on 2022/11/28 09:43:48 UTC, 6 replies.
- [GitHub] [spark-docker] Yikun closed pull request #26: [SPARK-41287][INFRA] Add test workflow to help self-build image test in fork repo - posted by GitBox <gi...@apache.org> on 2022/11/28 09:55:27 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38754: [SPARK-41180][SQL] Reuse `INVALID_SCHEMA` instead of `_LEGACY_ERROR_TEMP_1227` - posted by GitBox <gi...@apache.org> on 2022/11/28 10:07:56 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38754: [SPARK-41180][SQL] Reuse `INVALID_SCHEMA` instead of `_LEGACY_ERROR_TEMP_1227` - posted by GitBox <gi...@apache.org> on 2022/11/28 10:18:21 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38819: [SPARK-41148][CONNECT][PYTHON] Implement `DataFrame.dropna` and `DataFrame.na.drop` - posted by GitBox <gi...@apache.org> on 2022/11/28 11:44:02 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38820: [SPARK-41293][SQL][TESTS] Code cleanup for `assertXXX` methods in `ExpressionTypeCheckingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/28 11:53:06 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38820: [SPARK-41293][SQL][TESTS] Code cleanup for `assertXXX` methods in `ExpressionTypeCheckingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/28 11:55:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38820: [SPARK-41293][SQL][TESTS] Code cleanup for `assertXXX` methods in `ExpressionTypeCheckingSuite` - posted by GitBox <gi...@apache.org> on 2022/11/28 11:56:08 UTC, 0 replies.
- [GitHub] [spark] CavemanIV commented on a diff in pull request #38790: [SPARK-41254] [YARN] bugfix wrong usage when check YarnAllocator.rpIdToYarnResource key existence - posted by GitBox <gi...@apache.org> on 2022/11/28 12:41:50 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/28 12:53:00 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38821: [SPARK-41300] [CONNECT] Unset schema is interpreted as Schema - posted by GitBox <gi...@apache.org> on 2022/11/28 13:16:24 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #38803: [SPARK-41114] [CONNECT] [PYTHON] [FOLLOW-UP] Python Client support for local data - posted by GitBox <gi...@apache.org> on 2022/11/28 13:40:02 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #38803: [SPARK-41114] [CONNECT] [PYTHON] [FOLLOW-UP] Python Client support for local data - posted by GitBox <gi...@apache.org> on 2022/11/28 13:41:15 UTC, 0 replies.
- [GitHub] [spark] Fokko commented on pull request #37588: [SPARK-33393][SQL] Support SHOW TABLE EXTENDED in v2 - posted by GitBox <gi...@apache.org> on 2022/11/28 14:12:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38756: [SPARK-41220][SQL] Range partitioner sample supports column pruning - posted by GitBox <gi...@apache.org> on 2022/11/28 14:21:16 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38790: [SPARK-41254] [YARN] bugfix wrong usage when check YarnAllocator.rpIdToYarnResource key existence - posted by GitBox <gi...@apache.org> on 2022/11/28 14:25:09 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38790: [SPARK-41254] [YARN] bugfix wrong usage when check YarnAllocator.rpIdToYarnResource key existence - posted by GitBox <gi...@apache.org> on 2022/11/28 14:25:32 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38790: [SPARK-41254] [YARN] bugfix wrong usage when check YarnAllocator.rpIdToYarnResource key existence - posted by GitBox <gi...@apache.org> on 2022/11/28 14:30:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38765: [SPARK-35531][SQL][FOLLOWUP] Support alter table command with CASE_SENSITIVE is true - posted by GitBox <gi...@apache.org> on 2022/11/28 14:52:20 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #38821: [SPARK-41300] [CONNECT] Unset schema is interpreted as Schema - posted by GitBox <gi...@apache.org> on 2022/11/28 15:05:46 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #38821: [SPARK-41300] [CONNECT] Unset schema is interpreted as Schema - posted by GitBox <gi...@apache.org> on 2022/11/28 15:06:52 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38822: [SPARK-41301] [CONNECT] Homogenize Behavior for SparkSession.range() - posted by GitBox <gi...@apache.org> on 2022/11/28 15:31:45 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #38668: [SPARK-41153][CORE] Log migrated shuffle data size and migration time - posted by GitBox <gi...@apache.org> on 2022/11/28 17:09:10 UTC, 1 replies.
- [GitHub] [spark] zhouyejoe commented on pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side metrics - posted by GitBox <gi...@apache.org> on 2022/11/28 17:16:30 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38754: [SPARK-41180][SQL] Reuse `INVALID_SCHEMA` instead of `_LEGACY_ERROR_TEMP_1227` - posted by GitBox <gi...@apache.org> on 2022/11/28 17:26:02 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38754: [SPARK-41180][SQL] Reuse `INVALID_SCHEMA` instead of `_LEGACY_ERROR_TEMP_1227` - posted by GitBox <gi...@apache.org> on 2022/11/28 17:26:41 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38821: [SPARK-41300] [CONNECT] Unset schema is interpreted as Schema - posted by GitBox <gi...@apache.org> on 2022/11/28 18:02:27 UTC, 0 replies.
- [GitHub] [spark] allisonport-db opened a new pull request, #38823: [SPARK-41290] Support defining generated columns in create table (support GENERATED ALWAYS AS syntax) - posted by GitBox <gi...@apache.org> on 2022/11/28 18:28:28 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38803: [SPARK-41114] [CONNECT] [PYTHON] [FOLLOW-UP] Python Client support for local data - posted by GitBox <gi...@apache.org> on 2022/11/28 18:35:40 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38819: [SPARK-41148][CONNECT][PYTHON] Implement `DataFrame.dropna` and `DataFrame.na.drop` - posted by GitBox <gi...@apache.org> on 2022/11/28 18:38:11 UTC, 2 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38800: [SPARK-41264][CONNECT][PYTHON] Make Literal support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/28 18:42:26 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38800: [SPARK-41264][CONNECT][PYTHON] Make Literal support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/28 18:45:29 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #38822: [SPARK-41301] [CONNECT] Homogenize Behavior for SparkSession.range() - posted by GitBox <gi...@apache.org> on 2022/11/28 18:47:49 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #38800: [SPARK-41264][CONNECT][PYTHON] Make Literal support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/28 18:50:32 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/28 19:38:40 UTC, 1 replies.
- [GitHub] [spark] thejdeep commented on pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side metrics - posted by GitBox <gi...@apache.org> on 2022/11/28 20:17:44 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38824: [SPARK-41304][CONNECT] Add missing docs for DataFrame API - posted by GitBox <gi...@apache.org> on 2022/11/28 20:36:13 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38824: [SPARK-41304][CONNECT] Add missing docs for DataFrame API - posted by GitBox <gi...@apache.org> on 2022/11/28 20:36:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38780: [SPARK-41185][K8S][DOCS] Remove ARM limitation for YuniKorn from docs - posted by GitBox <gi...@apache.org> on 2022/11/28 20:39:35 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38825: [SPARK-41306][CONNECT] Improve Connect Expression proto documentation - posted by GitBox <gi...@apache.org> on 2022/11/28 20:43:09 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38825: [SPARK-41306][CONNECT] Improve Connect Expression proto documentation - posted by GitBox <gi...@apache.org> on 2022/11/28 20:43:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38715: [SPARK-41197][BUILD] Upgrade Kafka to 3.3.1 - posted by GitBox <gi...@apache.org> on 2022/11/28 20:49:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38715: [SPARK-41197][BUILD] Upgrade Kafka to 3.3.1 - posted by GitBox <gi...@apache.org> on 2022/11/28 20:58:39 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #38824: [SPARK-41304][CONNECT] Add missing docs for DataFrame API - posted by GitBox <gi...@apache.org> on 2022/11/28 21:07:28 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38800: [SPARK-41264][CONNECT][PYTHON] Make Literal support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/28 22:02:09 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38826: [SPARK-41297] [CONNECT] [PYTHON] Support String Expressions in filter. - posted by GitBox <gi...@apache.org> on 2022/11/28 22:18:56 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38819: [SPARK-41148][CONNECT][PYTHON] Implement `DataFrame.dropna` and `DataFrame.na.drop` - posted by GitBox <gi...@apache.org> on 2022/11/28 22:24:46 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement `DataFrame.crossJoin` - posted by GitBox <gi...@apache.org> on 2022/11/28 22:34:43 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38779: [SPARK-41244][UI] Introducing a Protobuf serializer for UI data on KV store - posted by GitBox <gi...@apache.org> on 2022/11/28 22:44:09 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38824: [SPARK-41304][CONNECT] Add missing docs for DataFrame API - posted by GitBox <gi...@apache.org> on 2022/11/28 22:58:29 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38826: [SPARK-41297] [CONNECT] [PYTHON] Support String Expressions in filter. - posted by GitBox <gi...@apache.org> on 2022/11/28 23:08:30 UTC, 0 replies.
- [GitHub] [spark] jkhalid commented on pull request #37738: add Support Java Class with circular references - posted by GitBox <gi...@apache.org> on 2022/11/28 23:16:32 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement `DataFrame.crossJoin` - posted by GitBox <gi...@apache.org> on 2022/11/28 23:18:11 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38779: [SPARK-41244][UI] Introducing a Protobuf serializer for UI data on KV store - posted by GitBox <gi...@apache.org> on 2022/11/28 23:32:12 UTC, 1 replies.
- [GitHub] [spark] holdenk commented on pull request #32397: [WIP][SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode - posted by GitBox <gi...@apache.org> on 2022/11/28 23:40:02 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on a diff in pull request #38732: [SPARK-41210][K8S] Window based executor failure tracking mechanism - posted by GitBox <gi...@apache.org> on 2022/11/28 23:50:19 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on a diff in pull request #38668: [SPARK-41153][CORE] Log migrated shuffle data size and migration time - posted by GitBox <gi...@apache.org> on 2022/11/28 23:53:49 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #38668: [SPARK-41153][CORE] Log migrated shuffle data size and migration time - posted by GitBox <gi...@apache.org> on 2022/11/28 23:53:56 UTC, 0 replies.
- [GitHub] [spark] ocworld commented on pull request #32397: [WIP][SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode - posted by GitBox <gi...@apache.org> on 2022/11/29 00:04:37 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37578: [SPARK-39791][LAUNCHER] In Spark 3.0 standalone cluster mode, unable to customize driver JVM path - posted by GitBox <gi...@apache.org> on 2022/11/29 00:21:17 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36336: [SPARK-38692][SQL] Use error classes in the compilation errors of function args - posted by GitBox <gi...@apache.org> on 2022/11/29 00:21:20 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #38712: [WIP][SPARK-41271][SQL] Parameterized SQL queries - posted by GitBox <gi...@apache.org> on 2022/11/29 00:27:14 UTC, 1 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38827: [SPARK-41308][CONNECT][PYTHON] Improve DataFrame.count() - posted by GitBox <gi...@apache.org> on 2022/11/29 00:29:49 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38827: [SPARK-41308][CONNECT][PYTHON] Improve DataFrame.count() - posted by GitBox <gi...@apache.org> on 2022/11/29 00:30:03 UTC, 0 replies.
- [GitHub] [spark] ocworld opened a new pull request, #38828: [SPARK-35084][CORE] Spark 3: supporting --packages in k8s cluster mode - posted by GitBox <gi...@apache.org> on 2022/11/29 00:31:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38824: [SPARK-41304][CONNECT][PYTHON][DOCS] Add missing docs for DataFrame API - posted by GitBox <gi...@apache.org> on 2022/11/29 01:01:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38824: [SPARK-41304][CONNECT][PYTHON][DOCS] Add missing docs for DataFrame API - posted by GitBox <gi...@apache.org> on 2022/11/29 01:02:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38825: [SPARK-41306][CONNECT] Improve Connect Expression proto documentation - posted by GitBox <gi...@apache.org> on 2022/11/29 01:02:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38825: [SPARK-41306][CONNECT] Improve Connect Expression proto documentation - posted by GitBox <gi...@apache.org> on 2022/11/29 01:02:31 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side read metrics - posted by GitBox <gi...@apache.org> on 2022/11/29 01:18:40 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side read metrics - posted by GitBox <gi...@apache.org> on 2022/11/29 01:25:00 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38824: [SPARK-41304][CONNECT][PYTHON][DOCS] Add missing docs for DataFrame API - posted by GitBox <gi...@apache.org> on 2022/11/29 01:28:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38818: [SPARK-41238][CONNECT][PYTHON][FOLLOWUP] Support `DayTimeIntervalType` in the client - posted by GitBox <gi...@apache.org> on 2022/11/29 01:43:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38818: [SPARK-41238][CONNECT][PYTHON][FOLLOWUP] Support `DayTimeIntervalType` in the client - posted by GitBox <gi...@apache.org> on 2022/11/29 01:43:46 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/29 01:44:36 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/29 02:04:07 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38827: [SPARK-41308][CONNECT][PYTHON] Improve DataFrame.count() - posted by GitBox <gi...@apache.org> on 2022/11/29 02:06:26 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement DataFrame cross join - posted by GitBox <gi...@apache.org> on 2022/11/29 02:43:15 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement DataFrame cross join - posted by GitBox <gi...@apache.org> on 2022/11/29 02:44:26 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38711: [SPARK-41192][Core] Remove unscheduled speculative tasks when task finished to obtain better dynamic - posted by GitBox <gi...@apache.org> on 2022/11/29 03:11:13 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38800: [SPARK-41264][CONNECT][PYTHON] Make Literal support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/29 03:22:14 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38827: [SPARK-41308][CONNECT][PYTHON] Improve DataFrame.count() - posted by GitBox <gi...@apache.org> on 2022/11/29 03:53:15 UTC, 0 replies.
- [GitHub] [spark] SparksFyz commented on a diff in pull request #38171: [SPARK-9213] [SQL] Improve regular expression performance (via joni) - posted by GitBox <gi...@apache.org> on 2022/11/29 04:24:21 UTC, 0 replies.
- [GitHub] [spark] SparksFyz commented on pull request #38171: [SPARK-9213] [SQL] Improve regular expression performance (via joni) - posted by GitBox <gi...@apache.org> on 2022/11/29 04:31:53 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38829: [SPARK-41310][CONNECT][PYTHON] Implement DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 04:43:59 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38829: [SPARK-41310][CONNECT][PYTHON] Implement DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 04:44:05 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38829: [SPARK-41310][CONNECT][PYTHON] Implement DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 04:45:28 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38829: [SPARK-41310][CONNECT][PYTHON] Implement DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 04:54:44 UTC, 3 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38830: [SPARK-41309][SQL] Reuse `INVALID_SCHEMA.NON_STRING_LITERAL` instead of `_LEGACY_ERROR_TEMP_1093` - posted by GitBox <gi...@apache.org> on 2022/11/29 04:57:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38830: [SPARK-41309][SQL] Reuse `INVALID_SCHEMA.NON_STRING_LITERAL` instead of `_LEGACY_ERROR_TEMP_1093` - posted by GitBox <gi...@apache.org> on 2022/11/29 05:05:14 UTC, 5 replies.
- [GitHub] [spark] wineternity commented on a diff in pull request #38702: [SPARK-41187][CORE] LiveExecutor MemoryLeak in AppStatusListener when ExecutorLost happen - posted by GitBox <gi...@apache.org> on 2022/11/29 05:18:06 UTC, 11 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38819: [SPARK-41148][CONNECT][PYTHON] Implement `DataFrame.dropna` and `DataFrame.na.drop` - posted by GitBox <gi...@apache.org> on 2022/11/29 05:27:44 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38831: [SPARK-41312][CONNECT][PYTHON] Implement DataFrame.withColumnRenamed - posted by GitBox <gi...@apache.org> on 2022/11/29 05:27:58 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38831: [SPARK-41312][CONNECT][PYTHON] Implement DataFrame.withColumnRenamed - posted by GitBox <gi...@apache.org> on 2022/11/29 05:28:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/29 05:42:18 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38712: [WIP][SPARK-41271][SQL] Parameterized SQL queries - posted by GitBox <gi...@apache.org> on 2022/11/29 05:46:35 UTC, 0 replies.
- [GitHub] [spark] xinglin opened a new pull request, #38832: SPARK-41313 Combine fixes for SPARK-3900 and SPARK-21138 - posted by GitBox <gi...@apache.org> on 2022/11/29 05:55:31 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38728: [SPARK-41204] [CONNECT] Migrate custom exceptions to use Spark exceptions - posted by GitBox <gi...@apache.org> on 2022/11/29 06:00:49 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/29 06:01:56 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #38728: [SPARK-41204] [CONNECT] Migrate custom exceptions to use Spark exceptions - posted by GitBox <gi...@apache.org> on 2022/11/29 06:03:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/29 06:35:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38800: [SPARK-41264][CONNECT][PYTHON] Make Literal support more datatypes - posted by GitBox <gi...@apache.org> on 2022/11/29 06:36:42 UTC, 0 replies.
- [GitHub] [spark] pkudinov commented on pull request #37536: [WIP][SPARK-40100][SQL] Add DataType class for Int128 type - posted by GitBox <gi...@apache.org> on 2022/11/29 07:09:51 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #38796: [SPARK-41260][PYTHON][SS] Cast NumPy instances to Python primitive types in GroupState update - posted by GitBox <gi...@apache.org> on 2022/11/29 07:13:13 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #38796: [SPARK-41260][PYTHON][SS] Cast NumPy instances to Python primitive types in GroupState update - posted by GitBox <gi...@apache.org> on 2022/11/29 07:13:43 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38833: Test Apache ORC 1.8.1 SNAPSHOT - posted by GitBox <gi...@apache.org> on 2022/11/29 07:15:49 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #38711: [SPARK-41192][Core] Remove unscheduled speculative tasks when task finished to obtain better dynamic - posted by GitBox <gi...@apache.org> on 2022/11/29 07:32:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38829: [SPARK-41310][CONNECT][PYTHON] Implement DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 07:41:33 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38831: [SPARK-41312][CONNECT][PYTHON] Implement DataFrame.withColumnRenamed - posted by GitBox <gi...@apache.org> on 2022/11/29 07:42:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38831: [SPARK-41312][CONNECT][PYTHON] Implement DataFrame.withColumnRenamed - posted by GitBox <gi...@apache.org> on 2022/11/29 07:42:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #38834: [MINOR][DOCS] Fix types and docstring in DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 07:45:46 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38834: [MINOR][PYTHON][DOCS] Fix types and docstring in DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 07:52:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #38835: [SPARK-41316][SQL] Enable tail-recursion wherever possible - posted by GitBox <gi...@apache.org> on 2022/11/29 07:54:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38834: [MINOR][PYTHON][DOCS] Fix types and docstring in DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 07:54:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38834: [MINOR][PYTHON][DOCS] Fix types and docstring in DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 07:54:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38834: [MINOR][PYTHON][DOCS] Fix types and docstring in DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 07:54:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38801: [SPARK-41317][CONNECT][PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/29 08:10:31 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38793: [SPARK-41256][CONNECT] Implement DataFrame.withColumn(s) - posted by GitBox <gi...@apache.org> on 2022/11/29 08:28:46 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on a diff in pull request #38824: [SPARK-41304][CONNECT][PYTHON][DOCS] Add missing docs for DataFrame API - posted by GitBox <gi...@apache.org> on 2022/11/29 08:47:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38702: [SPARK-41187][CORE] LiveExecutor MemoryLeak in AppStatusListener when ExecutorLost happen - posted by GitBox <gi...@apache.org> on 2022/11/29 08:54:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38836: [SPARK-41315][CONNECT][PYTHON] Implement `DataFrame.replace` and `DataFrame.na.replace` - posted by GitBox <gi...@apache.org> on 2022/11/29 08:54:43 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38829: [SPARK-41310][CONNECT][PYTHON] Implement DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 08:58:39 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38829: [SPARK-41310][CONNECT][PYTHON] Implement DataFrame.toDF - posted by GitBox <gi...@apache.org> on 2022/11/29 08:59:07 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/29 09:06:06 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38837: [SPARK-41256][CONNECT][FOLLOWUP] Fix compile error - posted by GitBox <gi...@apache.org> on 2022/11/29 09:29:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38837: [SPARK-41256][CONNECT][FOLLOWUP] Fix compile error - posted by GitBox <gi...@apache.org> on 2022/11/29 09:29:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38837: [SPARK-41256][CONNECT][FOLLOWUP] Fix compile error - posted by GitBox <gi...@apache.org> on 2022/11/29 09:46:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38837: [SPARK-41256][CONNECT][FOLLOWUP] Fix compile error - posted by GitBox <gi...@apache.org> on 2022/11/29 09:47:15 UTC, 0 replies.
- [GitHub] [spark] rmcyang commented on a diff in pull request #37638: [SPARK-33573][SHUFFLE][YARN] Shuffle server side metrics for Push-based shuffle - posted by GitBox <gi...@apache.org> on 2022/11/29 09:56:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38801: [SPARK-41317][CONNECT][PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/29 09:59:08 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38801: [SPARK-41317][CONNECT][PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/29 10:28:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38835: [SPARK-41316][SQL] Enable tail-recursion wherever possible - posted by GitBox <gi...@apache.org> on 2022/11/29 11:20:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38835: [SPARK-41316][SQL] Enable tail-recursion wherever possible - posted by GitBox <gi...@apache.org> on 2022/11/29 11:20:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38826: [SPARK-41297] [CONNECT] [PYTHON] Support String Expressions in filter. - posted by GitBox <gi...@apache.org> on 2022/11/29 11:31:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38801: [SPARK-41317][CONNECT][PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/29 12:04:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38801: [SPARK-41317][CONNECT][PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/29 12:05:03 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #38702: [SPARK-41187][CORE] LiveExecutor MemoryLeak in AppStatusListener when ExecutorLost happen - posted by GitBox <gi...@apache.org> on 2022/11/29 12:13:24 UTC, 2 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38826: [SPARK-41297] [CONNECT] [PYTHON] Support String Expressions in filter. - posted by GitBox <gi...@apache.org> on 2022/11/29 12:24:19 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on pull request #38467: [SPARK-40987][CORE] `BlockManager#removeBlockInternal` should ensure the lock is unlocked gracefully - posted by GitBox <gi...@apache.org> on 2022/11/29 12:25:55 UTC, 0 replies.
- [GitHub] [spark] dengziming opened a new pull request, #38838: SPARK-41321: Support target field UnresolvedStar - posted by GitBox <gi...@apache.org> on 2022/11/29 13:32:05 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38832: SPARK-41313 Combine fixes for SPARK-3900 and SPARK-21138 - posted by GitBox <gi...@apache.org> on 2022/11/29 13:43:43 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #38811: [SPARK-41276][SQL][ML][MLLIB][PROTOBUF][PYTHON][R][SS][AVRO] Optimize constructor use of `StructType` - posted by GitBox <gi...@apache.org> on 2022/11/29 14:15:45 UTC, 2 replies.
- [GitHub] [spark] srowen closed pull request #38168: fix(type): add bool and List[bool] for join's on input - posted by GitBox <gi...@apache.org> on 2022/11/29 14:16:30 UTC, 0 replies.
- [GitHub] [spark] ala commented on pull request #38777: [SPARK-41151][FOLLOW-UP][SQL] Keep built-in file _metadata fields nullable value consistent - posted by GitBox <gi...@apache.org> on 2022/11/29 14:26:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38811: [SPARK-41276][SQL][ML][MLLIB][PROTOBUF][PYTHON][R][SS][AVRO] Optimize constructor use of `StructType` - posted by GitBox <gi...@apache.org> on 2022/11/29 14:32:15 UTC, 4 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38832: SPARK-41313 Combine fixes for SPARK-3900 and SPARK-21138 - posted by GitBox <gi...@apache.org> on 2022/11/29 14:56:08 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38839: [SPARK-41282] [CONNECT] [PYTHON] [FOLLOWUP] Add missing functionality to column expression - posted by GitBox <gi...@apache.org> on 2022/11/29 16:16:17 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38836: [SPARK-41315][CONNECT][PYTHON] Implement `DataFrame.replace` and `DataFrame.na.replace` - posted by GitBox <gi...@apache.org> on 2022/11/29 16:31:35 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #38830: [SPARK-41309][SQL] Reuse `INVALID_SCHEMA.NON_STRING_LITERAL` instead of `_LEGACY_ERROR_TEMP_1093` - posted by GitBox <gi...@apache.org> on 2022/11/29 16:36:34 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #38830: [SPARK-41309][SQL] Reuse `INVALID_SCHEMA.NON_STRING_LITERAL` instead of `_LEGACY_ERROR_TEMP_1093` - posted by GitBox <gi...@apache.org> on 2022/11/29 16:37:18 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #38772: [SPARK-41237][SQL] Assign a name to the error class `_LEGACY_ERROR_TEMP_0030` - posted by GitBox <gi...@apache.org> on 2022/11/29 16:52:27 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38828: [SPARK-35084][CORE] Spark 3: supporting --packages in k8s cluster mode - posted by GitBox <gi...@apache.org> on 2022/11/29 16:58:44 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38826: [SPARK-41297] [CONNECT] [PYTHON] Support String Expressions in filter. - posted by GitBox <gi...@apache.org> on 2022/11/29 16:58:47 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38823: [SPARK-41290] Support defining generated columns in create table (support GENERATED ALWAYS AS syntax) - posted by GitBox <gi...@apache.org> on 2022/11/29 16:58:50 UTC, 0 replies.
- [GitHub] [spark] anchovYu commented on pull request #38776: [SPARK-27561][SQL] Support implicit lateral column alias resolution on Project and refactor Analyzer - posted by GitBox <gi...@apache.org> on 2022/11/29 17:06:27 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38837: [SPARK-41256][CONNECT][FOLLOWUP] Fix compile error - posted by GitBox <gi...@apache.org> on 2022/11/29 17:56:59 UTC, 0 replies.
- [GitHub] [spark] xinglin commented on pull request #38832: SPARK-41313 Combine fixes for SPARK-3900 and SPARK-21138 - posted by GitBox <gi...@apache.org> on 2022/11/29 17:58:08 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on pull request #38801: [SPARK-41317][CONNECT][PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/29 17:59:58 UTC, 1 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #38832: SPARK-41313 Combine fixes for SPARK-3900 and SPARK-21138 - posted by GitBox <gi...@apache.org> on 2022/11/29 18:02:54 UTC, 1 replies.
- [GitHub] [spark] xinglin commented on a diff in pull request #38832: SPARK-41313 Combine fixes for SPARK-3900 and SPARK-21138 - posted by GitBox <gi...@apache.org> on 2022/11/29 18:09:14 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38801: [SPARK-41317][CONNECT][PYTHON] Add basic support for DataFrameWriter - posted by GitBox <gi...@apache.org> on 2022/11/29 18:31:36 UTC, 0 replies.
- [GitHub] [spark] thejdeep commented on a diff in pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side read metrics - posted by GitBox <gi...@apache.org> on 2022/11/29 18:43:43 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement DataFrame cross join - posted by GitBox <gi...@apache.org> on 2022/11/29 19:07:26 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement DataFrame cross join - posted by GitBox <gi...@apache.org> on 2022/11/29 19:10:36 UTC, 0 replies.
- [GitHub] [spark] srielau opened a new pull request, #38840: [SPARK-41323] support current_schema - posted by GitBox <gi...@apache.org> on 2022/11/29 19:31:49 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #38826: [SPARK-41297] [CONNECT] [PYTHON] Support String Expressions in filter. - posted by GitBox <gi...@apache.org> on 2022/11/29 19:56:11 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38841: [SPARK-41325] [CONNECT] Fix missing avg() for GroupBy on DF - posted by GitBox <gi...@apache.org> on 2022/11/29 20:46:45 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #38842: [SPARK-41326] [CONNECT] Fix deduplicate is missing input - posted by GitBox <gi...@apache.org> on 2022/11/29 21:03:21 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38842: [SPARK-41326] [CONNECT] Fix deduplicate is missing input - posted by GitBox <gi...@apache.org> on 2022/11/29 21:14:02 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38841: [SPARK-41325] [CONNECT] Fix missing avg() for GroupBy on DF - posted by GitBox <gi...@apache.org> on 2022/11/29 21:19:02 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38842: [SPARK-41326] [CONNECT] Fix deduplicate is missing input - posted by GitBox <gi...@apache.org> on 2022/11/29 21:20:52 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #38841: [SPARK-41325] [CONNECT] Fix missing avg() for GroupBy on DF - posted by GitBox <gi...@apache.org> on 2022/11/29 21:35:58 UTC, 2 replies.
- [GitHub] [spark] ylybest opened a new pull request, #38843: SPARK-41327 Fix SparkStatusTracker.getExecutorInfos by switch On/OffH… - posted by GitBox <gi...@apache.org> on 2022/11/29 22:25:46 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38843: [SPARK-41327][CORE] Fix SparkStatusTracker.getExecutorInfos by switch On/OffHeapStorageMemory info - posted by GitBox <gi...@apache.org> on 2022/11/29 22:59:56 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38842: [SPARK-41326] [CONNECT] Fix deduplicate is missing input - posted by GitBox <gi...@apache.org> on 2022/11/29 23:00:00 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38841: [SPARK-41325] [CONNECT] Fix missing avg() for GroupBy on DF - posted by GitBox <gi...@apache.org> on 2022/11/29 23:00:04 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38840: [SPARK-41323] support current_schema - posted by GitBox <gi...@apache.org> on 2022/11/29 23:00:07 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38839: [SPARK-41282] [CONNECT] [PYTHON] [FOLLOWUP] Add missing functionality to column expression - posted by GitBox <gi...@apache.org> on 2022/11/29 23:00:10 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38838: [SPARK-41321][CONNECT] Support target field for UnresolvedStar - posted by GitBox <gi...@apache.org> on 2022/11/29 23:00:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38843: [SPARK-41327][CORE] Fix `SparkStatusTracker.getExecutorInfos` by switch On/OffHeapStorageMemory info - posted by GitBox <gi...@apache.org> on 2022/11/29 23:08:06 UTC, 0 replies.
- [GitHub] [spark] ylybest commented on a diff in pull request #38843: [SPARK-41327][CORE] Fix `SparkStatusTracker.getExecutorInfos` by switch On/OffHeapStorageMemory info - posted by GitBox <gi...@apache.org> on 2022/11/29 23:10:56 UTC, 0 replies.
- [GitHub] [spark] ylybest commented on pull request #38843: [SPARK-41327][CORE] Fix `SparkStatusTracker.getExecutorInfos` by switch On/OffHeapStorageMemory info - posted by GitBox <gi...@apache.org> on 2022/11/29 23:29:10 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37578: [SPARK-39791][LAUNCHER] In Spark 3.0 standalone cluster mode, unable to customize driver JVM path - posted by GitBox <gi...@apache.org> on 2022/11/30 00:22:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38839: [SPARK-41282] [CONNECT] [PYTHON] [FOLLOWUP] Add missing functionality to column expression - posted by GitBox <gi...@apache.org> on 2022/11/30 00:35:28 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #38778: [SPARK-41227][CONNECT][PYTHON] Implement DataFrame cross join - posted by GitBox <gi...@apache.org> on 2022/11/30 00:39:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38841: [SPARK-41325] [CONNECT] Fix missing avg() for GroupBy on DF - posted by GitBox <gi...@apache.org> on 2022/11/30 00:42:48 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/30 00:55:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38806: [SPARK-41268][CONNECT][PYTHON] Refactor "Column" for API Compatibility - posted by GitBox <gi...@apache.org> on 2022/11/30 00:56:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38842: [SPARK-41326] [CONNECT] Fix deduplicate is missing input - posted by GitBox <gi...@apache.org> on 2022/11/30 01:00:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38842: [SPARK-41326] [CONNECT] Fix deduplicate is missing input - posted by GitBox <gi...@apache.org> on 2022/11/30 01:00:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38842: [SPARK-41326] [CONNECT] Fix deduplicate is missing input - posted by GitBox <gi...@apache.org> on 2022/11/30 01:00:40 UTC, 0 replies.
- [GitHub] [spark] desmondcheongzx commented on pull request #38750: [SPARK-41226][SQL] Refactor Spark types by introducing physical types - posted by GitBox <gi...@apache.org> on 2022/11/30 01:05:56 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38844: [SPARK-41328][CONNECT][PYTHON] Add logical and string API to Column - posted by GitBox <gi...@apache.org> on 2022/11/30 01:18:47 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38844: [SPARK-41328][CONNECT][PYTHON] Add logical and string API to Column - posted by GitBox <gi...@apache.org> on 2022/11/30 01:19:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38844: [SPARK-41328][CONNECT][PYTHON] Add logical and string API to Column - posted by GitBox <gi...@apache.org> on 2022/11/30 01:28:18 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38839: [SPARK-41282] [CONNECT] [PYTHON] [FOLLOWUP] Add missing functionality to column expression - posted by GitBox <gi...@apache.org> on 2022/11/30 01:30:25 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38844: [SPARK-41328][CONNECT][PYTHON] Add logical and string API to Column - posted by GitBox <gi...@apache.org> on 2022/11/30 02:38:19 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38841: [SPARK-41325] [CONNECT] Fix missing avg() for GroupBy on DF - posted by GitBox <gi...@apache.org> on 2022/11/30 02:46:16 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38836: [SPARK-41315][CONNECT][PYTHON] Implement `DataFrame.replace` and `DataFrame.na.replace` - posted by GitBox <gi...@apache.org> on 2022/11/30 02:48:10 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38844: [SPARK-41328][CONNECT][PYTHON] Add logical and string API to Column - posted by GitBox <gi...@apache.org> on 2022/11/30 02:54:20 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38826: [SPARK-41297] [CONNECT] [PYTHON] Support String Expressions in filter. - posted by GitBox <gi...@apache.org> on 2022/11/30 03:18:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38075: [SPARK-40633][BUILD] Upgrade janino to 3.1.9 - posted by GitBox <gi...@apache.org> on 2022/11/30 03:19:21 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38841: [SPARK-41325] [CONNECT] Fix missing avg() for GroupBy on DF - posted by GitBox <gi...@apache.org> on 2022/11/30 03:40:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38836: [SPARK-41315][CONNECT][PYTHON] Implement `DataFrame.replace` and `DataFrame.na.replace` - posted by GitBox <gi...@apache.org> on 2022/11/30 03:43:13 UTC, 1 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38836: [SPARK-41315][CONNECT][PYTHON] Implement `DataFrame.replace` and `DataFrame.na.replace` - posted by GitBox <gi...@apache.org> on 2022/11/30 03:46:29 UTC, 3 replies.
- [GitHub] [spark] asfgit closed pull request #38467: [SPARK-40987][CORE] `BlockManager#removeBlockInternal` should ensure the lock is unlocked gracefully - posted by GitBox <gi...@apache.org> on 2022/11/30 03:52:59 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38467: [SPARK-40987][CORE] `BlockManager#removeBlockInternal` should ensure the lock is unlocked gracefully - posted by GitBox <gi...@apache.org> on 2022/11/30 03:56:39 UTC, 0 replies.
- [GitHub] [spark] rangareddy closed pull request #38761: [SPARK-40988][SQL][TEST] Test case for insert partition should verify value - posted by GitBox <gi...@apache.org> on 2022/11/30 04:04:34 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38845: [SPARK-41330][CONNECT][PYSPARK][DOC] Improve Documentation for Take,Tail, Limit and Offset - posted by GitBox <gi...@apache.org> on 2022/11/30 04:10:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38844: [SPARK-41328][CONNECT][PYTHON] Add logical and string API to Column - posted by GitBox <gi...@apache.org> on 2022/11/30 04:13:44 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38844: [SPARK-41328][CONNECT][PYTHON] Add logical and string API to Column - posted by GitBox <gi...@apache.org> on 2022/11/30 04:14:06 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38845: [SPARK-41330][CONNECT][PYSPARK][DOC] Improve Documentation for Take,Tail, Limit and Offset - posted by GitBox <gi...@apache.org> on 2022/11/30 04:15:48 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38702: [SPARK-41187][CORE] LiveExecutor MemoryLeak in AppStatusListener when ExecutorLost happen - posted by GitBox <gi...@apache.org> on 2022/11/30 04:20:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38846: [SPARK-41331][CONNECT][PYTHON] Add `orderBy` and `drop_duplicates` - posted by GitBox <gi...@apache.org> on 2022/11/30 04:21:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2022/11/30 04:58:15 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38841: [SPARK-41325] [CONNECT] Fix missing avg() for GroupBy on DF - posted by GitBox <gi...@apache.org> on 2022/11/30 05:06:09 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #38847: [SPARK-41332][CONNECT][PYTHON] Fix `nullOrdering` in `SortOrder` - posted by GitBox <gi...@apache.org> on 2022/11/30 05:59:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38847: [SPARK-41332][CONNECT][PYTHON] Fix `nullOrdering` in `SortOrder` - posted by GitBox <gi...@apache.org> on 2022/11/30 05:59:56 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38845: [SPARK-41330][CONNECT][PYSPARK][DOC] Improve Documentation for Take,Tail, Limit and Offset - posted by GitBox <gi...@apache.org> on 2022/11/30 06:22:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38845: [SPARK-41330][CONNECT][PYSPARK][DOC] Improve Documentation for Take,Tail, Limit and Offset - posted by GitBox <gi...@apache.org> on 2022/11/30 06:23:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38846: [SPARK-41331][CONNECT][PYTHON] Add `orderBy` and `drop_duplicates` - posted by GitBox <gi...@apache.org> on 2022/11/30 06:34:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38846: [SPARK-41331][CONNECT][PYTHON] Add `orderBy` and `drop_duplicates` - posted by GitBox <gi...@apache.org> on 2022/11/30 06:34:41 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38846: [SPARK-41331][CONNECT][PYTHON] Add `orderBy` and `drop_duplicates` - posted by GitBox <gi...@apache.org> on 2022/11/30 06:54:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38840: [SPARK-41323][SQL] Support current_schema - posted by GitBox <gi...@apache.org> on 2022/11/30 06:56:19 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38840: [SPARK-41323][SQL] Support current_schema - posted by GitBox <gi...@apache.org> on 2022/11/30 06:56:37 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38847: [SPARK-41332][CONNECT][PYTHON] Fix `nullOrdering` in `SortOrder` - posted by GitBox <gi...@apache.org> on 2022/11/30 06:57:45 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38838: [SPARK-41321][CONNECT] Support target field for UnresolvedStar - posted by GitBox <gi...@apache.org> on 2022/11/30 07:03:29 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38838: [SPARK-41321][CONNECT] Support target field for UnresolvedStar - posted by GitBox <gi...@apache.org> on 2022/11/30 07:03:35 UTC, 2 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38848: [SPARK-41335][CONNECT][PYTHON] Support IsNull and IsNotNull in Column - posted by GitBox <gi...@apache.org> on 2022/11/30 07:32:00 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38848: [SPARK-41335][CONNECT][PYTHON] Support IsNull and IsNotNull in Column - posted by GitBox <gi...@apache.org> on 2022/11/30 07:32:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38836: [SPARK-41315][CONNECT][PYTHON] Implement `DataFrame.replace` and `DataFrame.na.replace` - posted by GitBox <gi...@apache.org> on 2022/11/30 07:41:09 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38836: [SPARK-41315][CONNECT][PYTHON] Implement `DataFrame.replace` and `DataFrame.na.replace` - posted by GitBox <gi...@apache.org> on 2022/11/30 07:46:53 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38849: [SPARK-41328][CONNECT][PYTHON][FOLLOW-UP] Simplify startsWith and endsWith - posted by GitBox <gi...@apache.org> on 2022/11/30 07:53:44 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38849: [SPARK-41328][CONNECT][PYTHON][FOLLOW-UP] Simplify startsWith and endsWith - posted by GitBox <gi...@apache.org> on 2022/11/30 07:53:51 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38849: [SPARK-41328][CONNECT][PYTHON][FOLLOW-UP] Simplify startsWith and endsWith - posted by GitBox <gi...@apache.org> on 2022/11/30 07:54:29 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38838: [SPARK-41321][CONNECT] Support target field for UnresolvedStar - posted by GitBox <gi...@apache.org> on 2022/11/30 08:06:44 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38849: [SPARK-41328][CONNECT][PYTHON][FOLLOW-UP] Simplify startsWith and endsWith - posted by GitBox <gi...@apache.org> on 2022/11/30 08:07:24 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38848: [SPARK-41335][CONNECT][PYTHON] Support IsNull and IsNotNull in Column - posted by GitBox <gi...@apache.org> on 2022/11/30 08:10:00 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #38848: [SPARK-41335][CONNECT][PYTHON] Support IsNull and IsNotNull in Column - posted by GitBox <gi...@apache.org> on 2022/11/30 08:11:20 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #38850: [SPARK-41326][CONNECT][FOLLOW-UP] Add e2e tests for distinct and dropDuplicates - posted by GitBox <gi...@apache.org> on 2022/11/30 08:14:15 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #38850: [SPARK-41326][CONNECT][FOLLOW-UP] Add e2e tests for distinct and dropDuplicates - posted by GitBox <gi...@apache.org> on 2022/11/30 08:14:21 UTC, 0 replies.
- [GitHub] [spark] ja-michel commented on pull request #37588: [SPARK-33393][SQL] Support SHOW TABLE EXTENDED in v2 - posted by GitBox <gi...@apache.org> on 2022/11/30 08:43:16 UTC, 0 replies.
- [GitHub] [spark] dengziming commented on a diff in pull request #38838: [SPARK-41321][CONNECT] Support target field for UnresolvedStar - posted by GitBox <gi...@apache.org> on 2022/11/30 09:51:01 UTC, 0 replies.
- [GitHub] [spark] phxism commented on pull request #18127: [SPARK-6628][SQL][Branch-2.1] Fix ClassCastException when executing sql statement 'insert into' on hbase table - posted by GitBox <gi...@apache.org> on 2022/11/30 10:20:35 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38847: [SPARK-41332][CONNECT][PYTHON] Fix `nullOrdering` in `SortOrder` - posted by GitBox <gi...@apache.org> on 2022/11/30 10:25:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38847: [SPARK-41332][CONNECT][PYTHON] Fix `nullOrdering` in `SortOrder` - posted by GitBox <gi...@apache.org> on 2022/11/30 10:26:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38841: [SPARK-41325] [CONNECT] Fix missing avg() for GroupBy on DF - posted by GitBox <gi...@apache.org> on 2022/11/30 11:06:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38849: [SPARK-41328][CONNECT][PYTHON][FOLLOW-UP] Simplify startsWith and endsWith - posted by GitBox <gi...@apache.org> on 2022/11/30 11:16:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38849: [SPARK-41328][CONNECT][PYTHON][FOLLOW-UP] Simplify startsWith and endsWith - posted by GitBox <gi...@apache.org> on 2022/11/30 11:16:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38850: [SPARK-41326][CONNECT][FOLLOW-UP] Add e2e tests for distinct and dropDuplicates - posted by GitBox <gi...@apache.org> on 2022/11/30 11:50:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38850: [SPARK-41326][CONNECT][FOLLOW-UP] Add e2e tests for distinct and dropDuplicates - posted by GitBox <gi...@apache.org> on 2022/11/30 11:50:44 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #38848: [SPARK-41335][CONNECT][PYTHON] Support IsNull and IsNotNull in Column - posted by GitBox <gi...@apache.org> on 2022/11/30 12:04:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38848: [SPARK-41335][CONNECT][PYTHON] Support IsNull and IsNotNull in Column - posted by GitBox <gi...@apache.org> on 2022/11/30 12:04:49 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #38668: [SPARK-41153][CORE] Log migrated shuffle data size and migration time - posted by GitBox <gi...@apache.org> on 2022/11/30 13:23:26 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38843: [SPARK-41327][CORE] Fix `SparkStatusTracker.getExecutorInfos` by switch On/OffHeapStorageMemory info - posted by GitBox <gi...@apache.org> on 2022/11/30 13:50:17 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen closed pull request #38773: [SPARK-41016][PS] Identical expressions should not be used on both sides of a binary operator - posted by GitBox <gi...@apache.org> on 2022/11/30 13:54:06 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #38811: [SPARK-41276][SQL][ML][MLLIB][PROTOBUF][PYTHON][R][SS][AVRO] Optimize constructor use of `StructType` - posted by GitBox <gi...@apache.org> on 2022/11/30 14:21:07 UTC, 0 replies.
- [GitHub] [spark] roczei commented on pull request #38828: [SPARK-35084][CORE] Spark 3: supporting --packages in k8s cluster mode - posted by GitBox <gi...@apache.org> on 2022/11/30 15:04:55 UTC, 0 replies.
- [GitHub] [spark] kmozaid commented on pull request #31573: [SPARK-34444][SQL] Pushdown scalar-subquery filter to FileSourceScan - posted by GitBox <gi...@apache.org> on 2022/11/30 15:12:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #38851: [SPARK-41338][SQL] Resolve outer references and normal columns in the same analyzer batch - posted by GitBox <gi...@apache.org> on 2022/11/30 15:47:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38851: [SPARK-41338][SQL] Resolve outer references and normal columns in the same analyzer batch - posted by GitBox <gi...@apache.org> on 2022/11/30 15:47:57 UTC, 0 replies.
- [GitHub] [spark] xinglin commented on a diff in pull request #38832: [WIP] SPARK-41313 Combine fixes for SPARK-3900 and SPARK-21138 - posted by GitBox <gi...@apache.org> on 2022/11/30 16:46:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38843: [SPARK-41327][CORE] Fix `SparkStatusTracker.getExecutorInfos` by switch On/OffHeapStorageMemory info - posted by GitBox <gi...@apache.org> on 2022/11/30 17:19:45 UTC, 2 replies.
- [GitHub] [spark] anchovYu commented on a diff in pull request #38851: [SPARK-41338][SQL] Resolve outer references and normal columns in the same analyzer batch - posted by GitBox <gi...@apache.org> on 2022/11/30 17:44:14 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38843: [SPARK-41327][CORE] Fix `SparkStatusTracker.getExecutorInfos` by switch On/OffHeapStorageMemory info - posted by GitBox <gi...@apache.org> on 2022/11/30 19:47:33 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 opened a new pull request, #38852: [SPARK-41341][CORE] Wait shuffle fetch to finish when decommission executor - posted by GitBox <gi...@apache.org> on 2022/11/30 20:21:06 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #38789: [SPARK-41253][K8S][TESTS] Make Spark K8S volcano IT work in Github Action - posted by GitBox <gi...@apache.org> on 2022/11/30 21:41:00 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on pull request #38852: [SPARK-41341][CORE] Wait shuffle fetch to finish when decommission executor - posted by GitBox <gi...@apache.org> on 2022/11/30 21:47:22 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #38574: [SPARK-41060][K8S] Fix generating driver and executor Config Maps - posted by GitBox <gi...@apache.org> on 2022/11/30 22:15:55 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #37821: [SPARK-40379][K8S] Propagate decommission executor loss reason in K8s - posted by GitBox <gi...@apache.org> on 2022/11/30 22:19:14 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on a diff in pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by GitBox <gi...@apache.org> on 2022/11/30 22:22:18 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on a diff in pull request #38518: [SPARK-33349][K8S] Reset the executor pods watcher when we receive a version changed from k8s - posted by GitBox <gi...@apache.org> on 2022/11/30 22:29:43 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #38852: [SPARK-41341][CORE] Wait shuffle fetch to finish when decommission executor - posted by GitBox <gi...@apache.org> on 2022/11/30 23:49:27 UTC, 0 replies.
- [GitHub] [spark] Kimahriman opened a new pull request, #38853: [SPARK-41339][SQL] Close and recreate RocksDB write batch instead of just clearing - posted by GitBox <gi...@apache.org> on 2022/11/30 23:54:17 UTC, 0 replies.