You are viewing a plain text version of this content. The canonical link for it is here.
- [GitHub] [spark] gengliangwang commented on pull request #39286: [SPARK-41768][CORE] Refactor the definition of enum - `JobExecutionStatus` to follow with the code style - posted by GitBox <gi...@apache.org> on 2023/01/01 00:07:14 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39286: [SPARK-41768][CORE] Refactor the definition of enum - `JobExecutionStatus` to follow with the code style - posted by GitBox <gi...@apache.org> on 2023/01/01 00:07:23 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39331: [WIP] [SPARK-41659] [CONNECT] Enable doctests in pyspark.sql.connect.readwriter - posted by GitBox <gi...@apache.org> on 2023/01/01 00:09:08 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37870: [SPARK-33152] [SQL] Improved constraint propagation - posted by GitBox <gi...@apache.org> on 2023/01/01 00:21:21 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37558: [SPARK-38954][CORE] Implement sharing of cloud credentials among driver and executors - posted by GitBox <gi...@apache.org> on 2023/01/01 00:21:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39325: [SPARK-41065][CONNECT][PYTHON] Implement `DataFrame.freqItems ` and `DataFrame.stat.freqItems ` - posted by GitBox <gi...@apache.org> on 2023/01/01 00:55:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39325: [SPARK-41065][CONNECT][PYTHON] Implement `DataFrame.freqItems ` and `DataFrame.stat.freqItems ` - posted by GitBox <gi...@apache.org> on 2023/01/01 00:56:27 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #39320: [SPARK-41796][TESTS] Test the error class: UNSUPPORTED_SUBQUERY_EXPRESSION_CATEGORY.UNSUPPORTED_CORRELATED_REFERENCE_DATA_TYPE - posted by GitBox <gi...@apache.org> on 2023/01/01 01:19:47 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39325: [SPARK-41065][CONNECT][PYTHON] Implement `DataFrame.freqItems ` and `DataFrame.stat.freqItems ` - posted by GitBox <gi...@apache.org> on 2023/01/01 01:58:01 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39328: [SPARK-41066][CONNECT][PYTHON] Implement `DataFrame.sampleBy ` and `DataFrame.stat.sampleBy ` - posted by GitBox <gi...@apache.org> on 2023/01/01 02:17:16 UTC, 2 replies.
- [GitHub] [spark] yabola commented on pull request #39316: [SPARK-41792][Shuffle] Fix DB update for push based shuffle when newer shuffle merge is received - posted by GitBox <gi...@apache.org> on 2023/01/01 04:19:03 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39316: [SPARK-41792][Shuffle] Fix DB update for push based shuffle when newer shuffle merge is received - posted by GitBox <gi...@apache.org> on 2023/01/01 05:08:55 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39328: [SPARK-41066][CONNECT][PYTHON] Implement `DataFrame.sampleBy ` and `DataFrame.stat.sampleBy ` - posted by GitBox <gi...@apache.org> on 2023/01/01 07:44:26 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39323: [SPARK-41799][CONNECT][PYTHON][TESTS] Combine plan-related tests into single file - posted by GitBox <gi...@apache.org> on 2023/01/01 08:07:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39323: [SPARK-41799][CONNECT][PYTHON][TESTS] Combine plan-related tests into single file - posted by GitBox <gi...@apache.org> on 2023/01/01 08:07:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39330: [SPARK-41742][SPARK-41745][CONNECT] Reenable doc tests and add missing column alias to count() - posted by GitBox <gi...@apache.org> on 2023/01/01 09:09:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39330: [SPARK-41742][SPARK-41745][CONNECT] Reenable doc tests and add missing column alias to count() - posted by GitBox <gi...@apache.org> on 2023/01/01 09:10:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39324: [SPARK-41493][CONNECT][PYTHON] Make csv functions support options - posted by GitBox <gi...@apache.org> on 2023/01/01 09:11:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39324: [SPARK-41493][CONNECT][PYTHON] Make csv functions support options - posted by GitBox <gi...@apache.org> on 2023/01/01 09:12:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39327: [SPARK-41801][CORE][PYTHON][PS] Remove `*args: Any, **kwargs: Any` for `def transpose` - posted by GitBox <gi...@apache.org> on 2023/01/01 09:14:11 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #39286: [SPARK-41768][CORE] Refactor the definition of enum - `JobExecutionStatus` to follow with the code style - posted by GitBox <gi...@apache.org> on 2023/01/01 09:25:32 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39296: [SPARK-41757][CONNECT] Fixing String representation for Column class - posted by GitBox <gi...@apache.org> on 2023/01/01 09:26:37 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39294: [SPARK-41537] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by GitBox <gi...@apache.org> on 2023/01/01 09:34:23 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39294: [SPARK-41537][INFRA][TESTS] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by GitBox <gi...@apache.org> on 2023/01/01 09:42:19 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39294: [SPARK-41537][INFRA][TESTS] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by GitBox <gi...@apache.org> on 2023/01/01 09:48:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39211: [SPARK-41705][CONNECT] Move generate_protos.sh to dev/ - posted by GitBox <gi...@apache.org> on 2023/01/01 09:49:26 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39286: [SPARK-41768][CORE] Refactor the definition of enum - `JobExecutionStatus` to follow with the code style - posted by GitBox <gi...@apache.org> on 2023/01/01 10:54:27 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39233: [SPARK-41676][CORE][SQL][SS][UI] Protobuf serializer for `StreamingQueryData` - posted by GitBox <gi...@apache.org> on 2023/01/01 11:09:19 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39329: [SPARK-41802][BUILD] Upgrade Apache httpcore to 4.4.16 - posted by GitBox <gi...@apache.org> on 2023/01/01 11:09:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39294: [SPARK-41537][INFRA][TESTS] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by GitBox <gi...@apache.org> on 2023/01/01 11:41:06 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #39294: [SPARK-41537][INFRA][TESTS] Github Workflow Check for Breaking Changes in Spark Connect Proto - posted by GitBox <gi...@apache.org> on 2023/01/01 11:46:46 UTC, 1 replies.
- [GitHub] [spark] wankunde commented on pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2023/01/01 13:28:14 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39286: [SPARK-41768][CORE] Refactor the definition of enum - `JobExecutionStatus` to follow with the code style - posted by GitBox <gi...@apache.org> on 2023/01/01 13:44:56 UTC, 0 replies.
- [GitHub] [spark] Daniel-Davies commented on pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by GitBox <gi...@apache.org> on 2023/01/01 14:00:28 UTC, 2 replies.
- [GitHub] [spark] MaxGekk opened a new pull request, #39332: [WIP][SPARK-40822][SQL] Stable derived column aliases - posted by GitBox <gi...@apache.org> on 2023/01/01 16:08:47 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on pull request #39097: Implement code generation for to_csv function (doGenCode) - posted by GitBox <gi...@apache.org> on 2023/01/01 16:36:18 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39320: [SPARK-41796][TESTS] Test the error class: UNSUPPORTED_SUBQUERY_EXPRESSION_CATEGORY.UNSUPPORTED_CORRELATED_REFERENCE_DATA_TYPE - posted by GitBox <gi...@apache.org> on 2023/01/01 16:45:29 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39320: [SPARK-41796][TESTS] Test the error class: UNSUPPORTED_SUBQUERY_EXPRESSION_CATEGORY.UNSUPPORTED_CORRELATED_REFERENCE_DATA_TYPE - posted by GitBox <gi...@apache.org> on 2023/01/01 16:46:10 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #39333: [SPARK-41805] Reuse expressions in WindowSpecDefinition - posted by GitBox <gi...@apache.org> on 2023/01/01 17:48:43 UTC, 0 replies.
- [GitHub] [spark] Daniel-Davies commented on a diff in pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by GitBox <gi...@apache.org> on 2023/01/01 18:02:12 UTC, 19 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39282: [SPARK-41581][SQL] Assign name to _LEGACY_ERROR_TEMP_1230 - posted by GitBox <gi...@apache.org> on 2023/01/01 19:07:40 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39316: [SPARK-41792][Shuffle] Fix DB update for push based shuffle when newer shuffle merge is received - posted by GitBox <gi...@apache.org> on 2023/01/01 19:39:57 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #39316: [SPARK-41792][Shuffle] Fix DB update for push based shuffle when newer shuffle merge is received - posted by GitBox <gi...@apache.org> on 2023/01/01 19:42:25 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2023/01/01 19:48:40 UTC, 2 replies.
- [GitHub] [spark] allisonport-db opened a new pull request, #39334: [SPARK-41806][SQL] Use AppendData.byName for SQL INSERT INTO by name for DSV2 - posted by GitBox <gi...@apache.org> on 2023/01/01 23:16:04 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37870: [SPARK-33152] [SQL] Improved constraint propagation - posted by GitBox <gi...@apache.org> on 2023/01/02 00:19:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39328: [SPARK-41066][CONNECT][PYTHON] Implement `DataFrame.sampleBy ` and `DataFrame.stat.sampleBy ` - posted by GitBox <gi...@apache.org> on 2023/01/02 00:31:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39328: [SPARK-41066][CONNECT][PYTHON] Implement `DataFrame.sampleBy ` and `DataFrame.stat.sampleBy ` - posted by GitBox <gi...@apache.org> on 2023/01/02 00:31:35 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39335: [SPARK-41807][SQL] Remove non-existent error class: UNSUPPORTED_FEATURE.DISTRIBUTE_BY - posted by GitBox <gi...@apache.org> on 2023/01/02 02:47:25 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39336: [SPARK-41808][CONNECT][PYTHON] Make json functions support options - posted by GitBox <gi...@apache.org> on 2023/01/02 02:58:15 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39337: [SPARK-41066][CONNECT][PYTHON][FOLLOWUP] Simplify the server code and add comments for `DataFrame.sampleBy` - posted by GitBox <gi...@apache.org> on 2023/01/02 03:01:12 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39337: [SPARK-41066][CONNECT][PYTHON][FOLLOWUP] Simplify the server code and add comments for `DataFrame.sampleBy` - posted by GitBox <gi...@apache.org> on 2023/01/02 03:02:52 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39338: [SPARK-40993][SPARK-41705][CONNECT] Move Spark Connect documentation and script to dev/ and Python documentation - posted by GitBox <gi...@apache.org> on 2023/01/02 03:31:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39211: [SPARK-41705][CONNECT] Move generate_protos.sh to dev/ - posted by GitBox <gi...@apache.org> on 2023/01/02 03:31:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39338: [SPARK-40993][SPARK-41705][CONNECT] Move Spark Connect documentation and script to dev/ and Python documentation - posted by GitBox <gi...@apache.org> on 2023/01/02 03:32:45 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39338: [SPARK-40993][SPARK-41705][CONNECT] Move Spark Connect documentation and script to dev/ and Python documentation - posted by GitBox <gi...@apache.org> on 2023/01/02 03:33:49 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39339: [SPARK-41803][CONNECT][PYTHON] Add missing function `log(arg1, arg2)` - posted by GitBox <gi...@apache.org> on 2023/01/02 03:34:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39339: [SPARK-41803][CONNECT][PYTHON] Add missing function `log(arg1, arg2)` - posted by GitBox <gi...@apache.org> on 2023/01/02 03:35:55 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39331: [WIP] [SPARK-41659] [CONNECT] Enable doctests in pyspark.sql.connect.readwriter - posted by GitBox <gi...@apache.org> on 2023/01/02 03:38:25 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39339: [SPARK-41803][CONNECT][PYTHON] Add missing function `log(arg1, arg2)` - posted by GitBox <gi...@apache.org> on 2023/01/02 03:45:11 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39336: [SPARK-41808][CONNECT][PYTHON] Make JSON functions support options - posted by GitBox <gi...@apache.org> on 2023/01/02 04:00:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39336: [SPARK-41808][CONNECT][PYTHON] Make JSON functions support options - posted by GitBox <gi...@apache.org> on 2023/01/02 04:00:19 UTC, 0 replies.
- [GitHub] [spark] shrprasa commented on pull request #37880: [SPARK-39399] [CORE] [K8S]: Fix proxy-user authentication for Spark on k8s in cluster deploy mode - posted by GitBox <gi...@apache.org> on 2023/01/02 04:39:11 UTC, 1 replies.
- [GitHub] [spark] shrprasa commented on pull request #39221: [SPARK-41719] [CORE]: SSLOptions sub settings should be set only when ssl is enabled - posted by GitBox <gi...@apache.org> on 2023/01/02 04:39:31 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #39338: [SPARK-40993][SPARK-41705][CONNECT] Move Spark Connect documentation and script to dev/ and Python documentation - posted by GitBox <gi...@apache.org> on 2023/01/02 05:48:11 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng opened a new pull request, #39340: [MINOR]Fix typos in ReceiverSupervisorImpl - posted by GitBox <gi...@apache.org> on 2023/01/02 06:06:17 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39282: [SPARK-41581][SQL] Assign name to _LEGACY_ERROR_TEMP_1230 - posted by GitBox <gi...@apache.org> on 2023/01/02 06:52:13 UTC, 3 replies.
- [GitHub] [spark] MaxGekk closed pull request #39285: [SPARK-41571][SQL] Assign name to _LEGACY_ERROR_TEMP_2310 - posted by GitBox <gi...@apache.org> on 2023/01/02 06:53:39 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39281: [SPARK-41576][SQL] Assign name to _LEGACY_ERROR_TEMP_2051 - posted by GitBox <gi...@apache.org> on 2023/01/02 06:56:19 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39281: [SPARK-41576][SQL] Assign name to _LEGACY_ERROR_TEMP_2051 - posted by GitBox <gi...@apache.org> on 2023/01/02 07:27:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39341: [SPARK-41657][CONNECT][DOCS][TESTS] Enable doctests in pyspark.sql.connect.session - posted by GitBox <gi...@apache.org> on 2023/01/02 07:28:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39337: [SPARK-41066][CONNECT][PYTHON][FOLLOWUP] Simplify the server code and add comments for `DataFrame.sampleBy` - posted by GitBox <gi...@apache.org> on 2023/01/02 08:02:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39342: [SPARK-41745][CONNECT][TESTS][FOLLOW-UP] Reeanble related test cases - posted by GitBox <gi...@apache.org> on 2023/01/02 08:34:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39342: [SPARK-41745][CONNECT][TESTS][FOLLOW-UP] Reeanble related test cases - posted by GitBox <gi...@apache.org> on 2023/01/02 08:35:08 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39338: [SPARK-40993][SPARK-41705][CONNECT] Move Spark Connect documentation and script to dev/ and Python documentation - posted by GitBox <gi...@apache.org> on 2023/01/02 08:36:52 UTC, 0 replies.
- [GitHub] [spark] adrian-wang opened a new pull request, #39343: [SPARK-41816][SQL][ThriftServer] Not close filesystem when log out ThriftServer - posted by GitBox <gi...@apache.org> on 2023/01/02 08:43:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39344: [SPARK-41810][CONNECT] Infer names from a list of dictionaries in SparkSession.createDataFrame - posted by GitBox <gi...@apache.org> on 2023/01/02 08:55:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39344: [SPARK-41810][CONNECT] Infer names from a list of dictionaries in SparkSession.createDataFrame - posted by GitBox <gi...@apache.org> on 2023/01/02 08:56:10 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39344: [SPARK-41810][CONNECT] Infer names from a list of dictionaries in SparkSession.createDataFrame - posted by GitBox <gi...@apache.org> on 2023/01/02 09:02:16 UTC, 1 replies.
- [GitHub] [spark] ezamyatin commented on pull request #37967: Scalable SkipGram-Word2Vec implementation - posted by GitBox <gi...@apache.org> on 2023/01/02 09:08:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39345: [SPARK-41809][CONNECT][PYTHON] Make function `from_json` support DataType Schema - posted by GitBox <gi...@apache.org> on 2023/01/02 09:09:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39340: [MINOR]Fix typos in ReceiverSupervisorImpl - posted by GitBox <gi...@apache.org> on 2023/01/02 09:52:00 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39340: [MINOR] Fix typos in ReceiverSupervisorImpl - posted by GitBox <gi...@apache.org> on 2023/01/02 09:56:05 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #39340: [MINOR] Fix typos in ReceiverSupervisorImpl - posted by GitBox <gi...@apache.org> on 2023/01/02 09:57:16 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39345: [SPARK-41809][CONNECT][PYTHON] Make function `from_json` support DataType Schema - posted by GitBox <gi...@apache.org> on 2023/01/02 10:53:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39345: [SPARK-41809][CONNECT][PYTHON] Make function `from_json` support DataType Schema - posted by GitBox <gi...@apache.org> on 2023/01/02 10:53:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39341: [SPARK-41657][CONNECT][DOCS][TESTS] Enable doctests in pyspark.sql.connect.session - posted by GitBox <gi...@apache.org> on 2023/01/02 10:58:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39341: [SPARK-41657][CONNECT][DOCS][TESTS] Enable doctests in pyspark.sql.connect.session - posted by GitBox <gi...@apache.org> on 2023/01/02 10:58:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39342: [SPARK-41745][CONNECT][TESTS][FOLLOW-UP] Reeanble related test cases - posted by GitBox <gi...@apache.org> on 2023/01/02 10:59:54 UTC, 0 replies.
- [GitHub] [spark] infoankitp commented on pull request #38865: [SPARK-41232][SQL][PYTHON] Adding array_append function - posted by GitBox <gi...@apache.org> on 2023/01/02 11:39:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39344: [SPARK-41810][CONNECT] Infer names from a list of dictionaries in SparkSession.createDataFrame - posted by GitBox <gi...@apache.org> on 2023/01/02 12:24:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39331: [WIP] [SPARK-41659] [CONNECT] Enable doctests in pyspark.sql.connect.readwriter - posted by GitBox <gi...@apache.org> on 2023/01/02 12:26:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39284: [SPARK-41573][SQL] Assign name to _LEGACY_ERROR_TEMP_2136 - posted by GitBox <gi...@apache.org> on 2023/01/02 14:19:49 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39305: [SPARK-41580][SQL] Assign name to _LEGACY_ERROR_TEMP_2137 - posted by GitBox <gi...@apache.org> on 2023/01/02 14:39:22 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39335: [WIP][SPARK-41807][SQL] Remove non-existent error class: UNSUPPORTED_FEATURE.DISTRIBUTE_BY - posted by GitBox <gi...@apache.org> on 2023/01/02 16:10:24 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39334: [SPARK-41806][SQL] Use AppendData.byName for SQL INSERT INTO by name for DSV2 - posted by GitBox <gi...@apache.org> on 2023/01/02 16:10:27 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39333: [SPARK-41805][SQL] Reuse expressions in WindowSpecDefinition - posted by GitBox <gi...@apache.org> on 2023/01/02 16:10:31 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on a diff in pull request #39331: [WIP] [SPARK-41659] [CONNECT] Enable doctests in pyspark.sql.connect.readwriter - posted by GitBox <gi...@apache.org> on 2023/01/02 16:13:17 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39346: [SPARK-41656][CONNECT] Enable doctests in pyspark.sql.connect.dataframe - posted by GitBox <gi...@apache.org> on 2023/01/02 17:54:22 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39347: [SPARK-41658][CONNECT] Enable doctests in pyspark.sql.connect.functions - posted by GitBox <gi...@apache.org> on 2023/01/02 18:29:21 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39327: [SPARK-41801][CORE][PYTHON][PS] Remove `*args: Any, **kwargs: Any` for `def transpose` - posted by GitBox <gi...@apache.org> on 2023/01/02 20:17:32 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39326: [SPARK-41800][BUILD] Upgrade commons-compress to 1.22 - posted by GitBox <gi...@apache.org> on 2023/01/02 20:17:36 UTC, 0 replies.
- [GitHub] [spark] ibuder opened a new pull request, #39348: [SPARK-41311][SQL] Rewrite test RENAME_SRC_PATH_NOT_FOUND to trigger the error from user space - posted by GitBox <gi...@apache.org> on 2023/01/02 21:49:59 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #39349: [SPARK-41804][SQL] Choose correct element size in `InterpretedUnsafeProjection` for array of UDTs - posted by GitBox <gi...@apache.org> on 2023/01/02 23:56:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39331: [SPARK-41659][CONNECT] Enable doctests in pyspark.sql.connect.readwriter - posted by GitBox <gi...@apache.org> on 2023/01/03 00:41:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39331: [SPARK-41659][CONNECT] Enable doctests in pyspark.sql.connect.readwriter - posted by GitBox <gi...@apache.org> on 2023/01/03 00:42:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39347: [SPARK-41658][CONNECT] Enable doctests in pyspark.sql.connect.functions - posted by GitBox <gi...@apache.org> on 2023/01/03 00:46:15 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39347: [SPARK-41658][CONNECT] Enable doctests in pyspark.sql.connect.functions - posted by GitBox <gi...@apache.org> on 2023/01/03 00:52:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39346: [SPARK-41656][CONNECT] Enable doctests in pyspark.sql.connect.dataframe - posted by GitBox <gi...@apache.org> on 2023/01/03 00:52:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39339: [SPARK-41803][CONNECT][PYTHON] Add missing function `log(arg1, arg2)` - posted by GitBox <gi...@apache.org> on 2023/01/03 00:56:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39339: [SPARK-41803][CONNECT][PYTHON] Add missing function `log(arg1, arg2)` - posted by GitBox <gi...@apache.org> on 2023/01/03 00:57:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39340: [MINOR] Fix typos - posted by GitBox <gi...@apache.org> on 2023/01/03 00:57:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39340: [MINOR] Fix typos - posted by GitBox <gi...@apache.org> on 2023/01/03 00:57:59 UTC, 0 replies.
- [GitHub] [spark] dcoliversun commented on pull request #39306: [WIP][SPARK-41781][K8S] Add the ability to create pvc before creating driver/executor pod - posted by GitBox <gi...@apache.org> on 2023/01/03 01:11:14 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39349: [SPARK-41804][SQL] Choose correct element size in `InterpretedUnsafeProjection` for array of UDTs - posted by GitBox <gi...@apache.org> on 2023/01/03 01:22:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39349: [SPARK-41804][SQL] Choose correct element size in `InterpretedUnsafeProjection` for array of UDTs - posted by GitBox <gi...@apache.org> on 2023/01/03 01:22:45 UTC, 0 replies.
- [GitHub] [spark] neshkeev opened a new pull request, #39350: Fix a typo "from from" -> "from" - posted by GitBox <gi...@apache.org> on 2023/01/03 01:41:16 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #39335: [SPARK-41807][SQL] Remove non-existent error class: UNSUPPORTED_FEATURE.DISTRIBUTE_BY - posted by GitBox <gi...@apache.org> on 2023/01/03 01:46:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39350: Fix a typo "from from" -> "from" - posted by GitBox <gi...@apache.org> on 2023/01/03 01:51:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39346: [SPARK-41656][CONNECT][TESTS] Enable doctests in pyspark.sql.connect.dataframe - posted by GitBox <gi...@apache.org> on 2023/01/03 02:09:59 UTC, 0 replies.
- [GitHub] [spark] neshkeev commented on pull request #39350: [MINOR] Fix a typo "from from" -> "from" - posted by GitBox <gi...@apache.org> on 2023/01/03 02:10:08 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39346: [SPARK-41656][CONNECT][TESTS] Enable doctests in pyspark.sql.connect.dataframe - posted by GitBox <gi...@apache.org> on 2023/01/03 02:10:19 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39305: [SPARK-41580][SQL] Assign name to _LEGACY_ERROR_TEMP_2137 - posted by GitBox <gi...@apache.org> on 2023/01/03 02:19:12 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #39333: [SPARK-41805][SQL] Reuse expressions in WindowSpecDefinition - posted by GitBox <gi...@apache.org> on 2023/01/03 02:28:09 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39226: [SPARK-41694][CORE] Add new config to clean up `spark.ui.store.path` directory when `SparkContext.stop()` - posted by GitBox <gi...@apache.org> on 2023/01/03 02:32:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39221: [SPARK-41719] [CORE]: SSLOptions sub settings should be set only when ssl is enabled - posted by GitBox <gi...@apache.org> on 2023/01/03 02:33:06 UTC, 5 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39275: [SPARK-41759][CORE] Use `weakIntern` on string values in create new objects during deserialization - posted by GitBox <gi...@apache.org> on 2023/01/03 02:44:05 UTC, 1 replies.
- [GitHub] [spark] tedyu opened a new pull request, #39351: [CORE] Use Map in place of SortedMap for ErrorClassesJsonReader - posted by GitBox <gi...@apache.org> on 2023/01/03 02:51:32 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #39351: [CORE] Use Map in place of SortedMap for ErrorClassesJsonReader - posted by GitBox <gi...@apache.org> on 2023/01/03 02:51:54 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #39221: [SPARK-41719][CORE] Skip SSLOptions sub-settings if `ssl` is disabled - posted by GitBox <gi...@apache.org> on 2023/01/03 02:55:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39343: [SPARK-41816][SQL][ThriftServer] Not close filesystem when log out ThriftServer - posted by GitBox <gi...@apache.org> on 2023/01/03 02:56:59 UTC, 1 replies.
- [GitHub] [spark] techaddict commented on a diff in pull request #39347: [SPARK-41658][CONNECT][TESTS] Enable doctests in pyspark.sql.connect.functions - posted by GitBox <gi...@apache.org> on 2023/01/03 02:57:39 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39351: [SPARK-41853] [CORE] Use Map in place of SortedMap for ErrorClassesJsonReader - posted by GitBox <gi...@apache.org> on 2023/01/03 03:00:04 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on a diff in pull request #39315: [SPARK-41790][SQL] Set TRANSFORM reader and writer's format correctly - posted by GitBox <gi...@apache.org> on 2023/01/03 03:07:45 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39352: [SPARK-41854][PYTHON][BUILD] Automatic reformat/check python/setup.py - posted by GitBox <gi...@apache.org> on 2023/01/03 03:34:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39352: [SPARK-41854][PYTHON][BUILD] Automatic reformat/check python/setup.py - posted by GitBox <gi...@apache.org> on 2023/01/03 04:05:26 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39352: [SPARK-41854][PYTHON][BUILD] Automatic reformat/check python/setup.py - posted by GitBox <gi...@apache.org> on 2023/01/03 04:05:38 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by GitBox <gi...@apache.org> on 2023/01/03 04:10:37 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39353: [WIP][SPARK-41841][CONNECT][BUILD] Support PySpark installation without JVM through PyPI - posted by GitBox <gi...@apache.org> on 2023/01/03 04:17:37 UTC, 0 replies.
- [GitHub] [spark] adrian-wang commented on pull request #39343: [SPARK-41816][SQL][ThriftServer] Not close filesystem when log out ThriftServer - posted by GitBox <gi...@apache.org> on 2023/01/03 04:18:10 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39335: [SPARK-41807][CORE] Remove non-existent error class: UNSUPPORTED_FEATURE.DISTRIBUTE_BY - posted by GitBox <gi...@apache.org> on 2023/01/03 04:26:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39335: [SPARK-41807][CORE] Remove non-existent error class: UNSUPPORTED_FEATURE.DISTRIBUTE_BY - posted by GitBox <gi...@apache.org> on 2023/01/03 04:27:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39347: [SPARK-41658][CONNECT][TESTS] Enable doctests in pyspark.sql.connect.functions - posted by GitBox <gi...@apache.org> on 2023/01/03 04:39:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39347: [SPARK-41658][CONNECT][TESTS] Enable doctests in pyspark.sql.connect.functions - posted by GitBox <gi...@apache.org> on 2023/01/03 04:39:36 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39348: [SPARK-41311][SQL][TESTS] Rewrite test RENAME_SRC_PATH_NOT_FOUND to trigger the error from user space - posted by GitBox <gi...@apache.org> on 2023/01/03 04:42:40 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39348: [SPARK-41311][SQL][TESTS] Rewrite test RENAME_SRC_PATH_NOT_FOUND to trigger the error from user space - posted by GitBox <gi...@apache.org> on 2023/01/03 04:43:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by GitBox <gi...@apache.org> on 2023/01/03 04:47:46 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39314: [SPARK-41791] Add new metadata types - posted by GitBox <gi...@apache.org> on 2023/01/03 04:50:11 UTC, 8 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39354: [SPARK-41658][SPARK-41656][DOCS][FOLLOW-UP] Update JIRAs in skipped tests' comments - posted by GitBox <gi...@apache.org> on 2023/01/03 05:03:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39354: [SPARK-41658][SPARK-41656][DOCS][FOLLOW-UP] Update JIRAs in skipped tests' comments - posted by GitBox <gi...@apache.org> on 2023/01/03 05:04:05 UTC, 1 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39355: [SPARK-40263][CORE] Use interruptible lock instead of synchronized in TransportClientFactory.createClient() - posted by GitBox <gi...@apache.org> on 2023/01/03 05:20:19 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39258: [SPARK-41572][SQL] Assign name to _LEGACY_ERROR_TEMP_2149 - posted by GitBox <gi...@apache.org> on 2023/01/03 05:24:44 UTC, 4 replies.
- [GitHub] [spark] shrprasa commented on a diff in pull request #39221: [SPARK-41719][CORE] Skip SSLOptions sub-settings if `ssl` is disabled - posted by GitBox <gi...@apache.org> on 2023/01/03 05:32:25 UTC, 1 replies.
- [GitHub] [spark] shrprasa commented on pull request #39221: [SPARK-41719][CORE] Skip SSLOptions sub-settings if `ssl` is disabled - posted by GitBox <gi...@apache.org> on 2023/01/03 05:34:27 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39343: [SPARK-41816][SQL] Not close filesystem when log out ThriftServer - posted by GitBox <gi...@apache.org> on 2023/01/03 05:35:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39354: [SPARK-41658][SPARK-41656][DOCS][FOLLOW-UP] Update JIRAs in skipped tests' comments - posted by GitBox <gi...@apache.org> on 2023/01/03 05:39:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39356: [SPARK-41423][CORE][BUILD] Exclude StageData.rddIds and accumulatorUpdates for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/03 05:46:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39356: [SPARK-41423][CORE][BUILD] Exclude StageData.rddIds and accumulatorUpdates for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/03 05:46:26 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39357: [SPARK-41677][CORE][SQL][SS] Add Protobuf serializer for StreamingQueryProgressWrapper - posted by GitBox <gi...@apache.org> on 2023/01/03 06:04:53 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #39280: [SPARK-41766][CORE] Handle decommission request sent before executor registration - posted by GitBox <gi...@apache.org> on 2023/01/03 06:16:58 UTC, 1 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39358: [SPARK-41856][CONNECT][TESTS] Enable test_create_nan_decimal_dataframe, test_freqItems, test_input_files, test_toDF_with_schema_string, test_to_pandas_required_pandas_not_found - posted by GitBox <gi...@apache.org> on 2023/01/03 06:45:25 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39359: [SPARK-41857][CONNECT][TESTS] Enable test_between_function, test_datetime_functions, test_expr, test_function_parity, test_math_functions, test_window_functions_cumulative_sum, test_corr, test_cov, test_crosstab, test_approxQuantile - posted by GitBox <gi...@apache.org> on 2023/01/03 07:05:28 UTC, 0 replies.
- [GitHub] [spark] olaky commented on a diff in pull request #39314: [SPARK-41791] Add new metadata types - posted by GitBox <gi...@apache.org> on 2023/01/03 07:24:48 UTC, 8 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39358: [SPARK-41856][CONNECT][TESTS] Enable test_create_nan_decimal_dataframe, test_freqItems, test_input_files, test_toDF_with_schema_string, test_to_pandas_required_pandas_not_found - posted by GitBox <gi...@apache.org> on 2023/01/03 07:26:09 UTC, 3 replies.
- [GitHub] [spark] panbingkun commented on pull request #39326: [SPARK-41800][BUILD] Upgrade commons-compress to 1.22 - posted by GitBox <gi...@apache.org> on 2023/01/03 07:28:01 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #39326: [SPARK-41800][BUILD] Upgrade commons-compress to 1.22 - posted by GitBox <gi...@apache.org> on 2023/01/03 07:28:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39359: [SPARK-41857][CONNECT][TESTS] Enable test_between_function, test_datetime_functions, test_expr, test_function_parity, test_math_functions, test_window_functions_cumulative_sum, test_corr, test_cov, test_crosstab, test_approxQuantile - posted by GitBox <gi...@apache.org> on 2023/01/03 07:29:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39359: [SPARK-41857][CONNECT][TESTS] Enable test_between_function, test_datetime_functions, test_expr, test_math_functions, test_window_functions_cumulative_sum, test_corr, test_cov, test_crosstab, test_approxQuantile - posted by GitBox <gi...@apache.org> on 2023/01/03 08:49:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39359: [SPARK-41857][CONNECT][TESTS] Enable test_between_function, test_datetime_functions, test_expr, test_math_functions, test_window_functions_cumulative_sum, test_corr, test_cov, test_crosstab, test_approxQuantile - posted by GitBox <gi...@apache.org> on 2023/01/03 08:50:07 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39170: [SPARK-41674][SQL] Runtime filter should supports multi level shuffle join side as filter creation side - posted by GitBox <gi...@apache.org> on 2023/01/03 09:28:49 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39301: [SPARK-41782][TESTS] Regenerate benchmark results - posted by GitBox <gi...@apache.org> on 2023/01/03 09:31:40 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39353: [WIP][SPARK-41841][CONNECT][BUILD] Support PySpark installation without JVM through PyPI - posted by GitBox <gi...@apache.org> on 2023/01/03 09:37:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39353: [WIP][SPARK-41841][CONNECT][BUILD] Support PySpark installation without JVM through PyPI - posted by GitBox <gi...@apache.org> on 2023/01/03 09:37:13 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by GitBox <gi...@apache.org> on 2023/01/03 09:48:43 UTC, 5 replies.
- [GitHub] [spark] mattshma commented on a diff in pull request #39315: [SPARK-41790][SQL] Set TRANSFORM reader and writer's format correctly - posted by GitBox <gi...@apache.org> on 2023/01/03 10:05:49 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39360: [SPARK-41855][SPARK-41814][SPARK-41851][SPARK-41852][CONNECT][PYTHON] Make `createDataFrame` handle None/NaN properly - posted by GitBox <gi...@apache.org> on 2023/01/03 10:07:42 UTC, 0 replies.
- [GitHub] [spark] vicennial opened a new pull request, #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/03 10:08:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39360: [SPARK-41855][SPARK-41814][SPARK-41851][SPARK-41852][CONNECT][PYTHON] Make `createDataFrame` handle None/NaN properly - posted by GitBox <gi...@apache.org> on 2023/01/03 10:08:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39360: [SPARK-41855][SPARK-41814][SPARK-41851][SPARK-41852][CONNECT][PYTHON] Make `createDataFrame` handle None/NaN properly - posted by GitBox <gi...@apache.org> on 2023/01/03 10:14:45 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #37280: [SPARK-39862][SQL] Fix two bugs in existence DEFAULT value lookups - posted by GitBox <gi...@apache.org> on 2023/01/03 10:15:47 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39360: [SPARK-41855][SPARK-41814][SPARK-41851][SPARK-41852][CONNECT][PYTHON] Make `createDataFrame` handle None/NaN properly - posted by GitBox <gi...@apache.org> on 2023/01/03 10:23:20 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39356: [SPARK-41423][CORE][BUILD] Exclude StageData.rddIds and accumulatorUpdates for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/03 10:34:21 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39362: [SPARK-41858][SQL] Fix ORC reader perf regression due to DEFAULT value feature - posted by GitBox <gi...@apache.org> on 2023/01/03 10:43:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #37280: [SPARK-39862][SQL] Fix two bugs in existence DEFAULT value lookups - posted by GitBox <gi...@apache.org> on 2023/01/03 10:49:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39360: [SPARK-41855][SPARK-41814][SPARK-41851][SPARK-41852][CONNECT][PYTHON] Make `createDataFrame` handle None/NaN properly - posted by GitBox <gi...@apache.org> on 2023/01/03 11:53:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39360: [SPARK-41855][SPARK-41814][SPARK-41851][SPARK-41852][CONNECT][PYTHON] Make `createDataFrame` handle None/NaN properly - posted by GitBox <gi...@apache.org> on 2023/01/03 11:54:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39363: [SPARK-41814][SPARK-41851][SPARK-41852][FOLLOW-UP] Reeanble skipped doctests - posted by GitBox <gi...@apache.org> on 2023/01/03 12:00:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39363: [SPARK-41814][SPARK-41851][SPARK-41852][FOLLOW-UP] Reeanble skipped doctests - posted by GitBox <gi...@apache.org> on 2023/01/03 12:01:10 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/03 12:07:55 UTC, 8 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38888: [SPARK-41405][SQL] Centralize the column resolution logic - posted by GitBox <gi...@apache.org> on 2023/01/03 12:09:23 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38888: [SPARK-41405][SQL] Centralize the column resolution logic - posted by GitBox <gi...@apache.org> on 2023/01/03 12:09:36 UTC, 0 replies.
- [GitHub] [spark] grundprinzip closed pull request #38802: [WIP] Packaging for Spark Connect Preview - posted by GitBox <gi...@apache.org> on 2023/01/03 12:10:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39364: [SPARK-41049][SQL][FOLLOWUP] Move expression initialization code to the base class - posted by GitBox <gi...@apache.org> on 2023/01/03 12:28:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39364: [SPARK-41049][SQL][FOLLOWUP] Move expression initialization code to the base class - posted by GitBox <gi...@apache.org> on 2023/01/03 12:29:40 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39364: [SPARK-41049][SQL][FOLLOWUP] Move expression initialization code to the base class - posted by GitBox <gi...@apache.org> on 2023/01/03 12:29:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39357: [SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for StreamingQueryProgressWrapper - posted by GitBox <gi...@apache.org> on 2023/01/03 12:52:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39365: [SPARK-41859][SQL] CreateHiveTableAsSelectCommand should set the overwrite flag correctly - posted by GitBox <gi...@apache.org> on 2023/01/03 13:00:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39365: [SPARK-41859][SQL] CreateHiveTableAsSelectCommand should set the overwrite flag correctly - posted by GitBox <gi...@apache.org> on 2023/01/03 13:00:39 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39362: [SPARK-41858][SQL] Fix ORC reader perf regression due to DEFAULT value feature - posted by GitBox <gi...@apache.org> on 2023/01/03 13:09:50 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39351: [SPARK-41853] [CORE] Use Map in place of SortedMap for ErrorClassesJsonReader - posted by GitBox <gi...@apache.org> on 2023/01/03 14:00:58 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39351: [SPARK-41853] [CORE] Use Map in place of SortedMap for ErrorClassesJsonReader - posted by GitBox <gi...@apache.org> on 2023/01/03 14:01:25 UTC, 0 replies.
- [GitHub] [spark] LorenzoMartini opened a new pull request, #39366: [SPARK-41860][SQL] Make AvroScanBuilder and JsonScanBuilder case classes - posted by GitBox <gi...@apache.org> on 2023/01/03 15:28:21 UTC, 0 replies.
- [GitHub] [spark] thejdeep commented on a diff in pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side read metrics - posted by GitBox <gi...@apache.org> on 2023/01/03 15:56:14 UTC, 2 replies.
- [GitHub] [spark] LorenzoMartini opened a new pull request, #39367: [SPARK-41861][SQL] Make v2 ScanBuilders' build() return typed scan - posted by GitBox <gi...@apache.org> on 2023/01/03 16:08:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39362: [SPARK-41858][SQL] Fix ORC reader perf regression due to DEFAULT value feature - posted by GitBox <gi...@apache.org> on 2023/01/03 17:03:18 UTC, 3 replies.
- [GitHub] [spark] smallzhongfeng opened a new pull request, #39368: [SPARK-28764] remove writePartitionedFile in ExternalSorter - posted by GitBox <gi...@apache.org> on 2023/01/03 17:12:10 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #39362: [SPARK-41858][SQL] Fix ORC reader perf regression due to DEFAULT value feature - posted by GitBox <gi...@apache.org> on 2023/01/03 17:31:10 UTC, 6 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39362: [SPARK-41858][SQL] Fix ORC reader perf regression due to DEFAULT value feature - posted by GitBox <gi...@apache.org> on 2023/01/03 17:42:16 UTC, 10 replies.
- [GitHub] [spark] rithwik-db opened a new pull request, #39369: [WIP][SPARK-41775][PYTHON][ML] Adding support for PyForch functions - posted by GitBox <gi...@apache.org> on 2023/01/03 18:35:20 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39362: [SPARK-41858][SQL] Fix ORC reader perf regression due to DEFAULT value feature - posted by GitBox <gi...@apache.org> on 2023/01/03 18:41:20 UTC, 0 replies.
- [GitHub] [spark] viirya closed pull request #39364: [SPARK-41049][SQL][FOLLOWUP] Move expression initialization code to the base class - posted by GitBox <gi...@apache.org> on 2023/01/03 18:46:54 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #39364: [SPARK-41049][SQL][FOLLOWUP] Move expression initialization code to the base class - posted by GitBox <gi...@apache.org> on 2023/01/03 18:47:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39363: [SPARK-41814][SPARK-41851][SPARK-41852][FOLLOW-UP] Reeanble skipped doctests - posted by GitBox <gi...@apache.org> on 2023/01/03 18:51:17 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #39370: [SPARK-41862][SQL] Fix correctness bug related to DEFAULT values in Orc reader - posted by GitBox <gi...@apache.org> on 2023/01/03 19:49:02 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #39370: [SPARK-41862][SQL] Fix correctness bug related to DEFAULT values in Orc reader - posted by GitBox <gi...@apache.org> on 2023/01/03 19:54:25 UTC, 1 replies.
- [GitHub] [spark] kyle-ai2 commented on pull request #38539: [SPARK-41030][BUILD] Upgrade `Apache Ivy` to 2.5.1 - posted by GitBox <gi...@apache.org> on 2023/01/03 19:55:47 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38539: [SPARK-41030][BUILD] Upgrade `Apache Ivy` to 2.5.1 - posted by GitBox <gi...@apache.org> on 2023/01/03 20:03:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38539: [SPARK-41030][BUILD] Upgrade `Apache Ivy` to 2.5.1 - posted by GitBox <gi...@apache.org> on 2023/01/03 20:12:37 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #39371: [SPARK-41030][BUILD][3.2] Upgrade Apache Ivy to 2.5.1 - posted by GitBox <gi...@apache.org> on 2023/01/03 20:17:12 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #39371: [SPARK-41030][BUILD][3.2] Upgrade Apache Ivy to 2.5.1 - posted by GitBox <gi...@apache.org> on 2023/01/03 20:19:21 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #39371: [SPARK-41030][BUILD][3.2] Upgrade `Apache Ivy` to 2.5.1 - posted by GitBox <gi...@apache.org> on 2023/01/03 20:24:07 UTC, 1 replies.
- [GitHub] [spark] thejdeep commented on pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side read metrics - posted by GitBox <gi...@apache.org> on 2023/01/03 20:59:54 UTC, 1 replies.
- [GitHub] [spark] rmcyang commented on a diff in pull request #37638: [SPARK-33573][SHUFFLE][YARN] Shuffle server side metrics for Push-based shuffle - posted by GitBox <gi...@apache.org> on 2023/01/03 21:00:14 UTC, 11 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39370: [SPARK-41862][SQL] Fix correctness bug related to DEFAULT values in Orc reader - posted by GitBox <gi...@apache.org> on 2023/01/03 21:09:40 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #39370: [SPARK-41862][SQL] Fix correctness bug related to DEFAULT values in Orc reader - posted by GitBox <gi...@apache.org> on 2023/01/03 21:22:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39372: [SPARK-41863][INFRA][PYTHON][TESTS] Skip flake8 tests if the command is not available - posted by GitBox <gi...@apache.org> on 2023/01/03 22:04:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39373: [SPARK-41864][INFRA][PYTHON] Fix mypy linter errors - posted by GitBox <gi...@apache.org> on 2023/01/03 22:48:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39373: [SPARK-41864][INFRA][PYTHON] Fix mypy linter errors - posted by GitBox <gi...@apache.org> on 2023/01/03 22:51:50 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39373: [SPARK-41864][INFRA][PYTHON] Fix mypy linter errors - posted by GitBox <gi...@apache.org> on 2023/01/03 23:00:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39372: [SPARK-41863][INFRA][PYTHON][TESTS] Skip `flake8` tests if the command is not available - posted by GitBox <gi...@apache.org> on 2023/01/03 23:01:30 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39372: [SPARK-41863][INFRA][PYTHON][TESTS] Skip `flake8` tests if the command is not available - posted by GitBox <gi...@apache.org> on 2023/01/03 23:01:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39356: [SPARK-41423][CORE][BUILD] Exclude StageData.rddIds, this and accumulatorUpdates for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/03 23:21:02 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39356: [SPARK-41423][CORE][BUILD] Exclude StageData.rddIds, this and accumulatorUpdates for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/03 23:21:37 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39356: [SPARK-41423][CORE][BUILD] Exclude StageData.rddIds, this and accumulatorUpdates for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/03 23:22:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39358: [SPARK-41856][CONNECT][TESTS] Enable test_create_nan_decimal_dataframe, test_freqItems, test_input_files, test_to_pandas_required_pandas_not_found - posted by GitBox <gi...@apache.org> on 2023/01/04 00:29:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39358: [SPARK-41856][CONNECT][TESTS] Enable test_create_nan_decimal_dataframe, test_freqItems, test_input_files, test_to_pandas_required_pandas_not_found - posted by GitBox <gi...@apache.org> on 2023/01/04 00:29:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39370: [SPARK-41862][SQL] Fix correctness bug related to DEFAULT values in Orc reader - posted by GitBox <gi...@apache.org> on 2023/01/04 00:30:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39370: [SPARK-41862][SQL] Fix correctness bug related to DEFAULT values in Orc reader - posted by GitBox <gi...@apache.org> on 2023/01/04 00:30:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/04 00:32:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39370: [SPARK-41862][SQL] Fix correctness bug related to DEFAULT values in Orc reader - posted by GitBox <gi...@apache.org> on 2023/01/04 00:33:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39374: [SPARK-41865][INFRA][3.2] Use pycodestyle to 2.7.0 to fix pycodestyle errors - posted by GitBox <gi...@apache.org> on 2023/01/04 00:35:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39374: [SPARK-41865][INFRA][3.2] Use pycodestyle to 2.7.0 to fix pycodestyle errors - posted by GitBox <gi...@apache.org> on 2023/01/04 00:40:44 UTC, 2 replies.
- [GitHub] [spark] itholic closed pull request #39137: [SPARK-41586][SPARK-41598][PYTHON] Introduce PySpark errors package and error classes - posted by GitBox <gi...@apache.org> on 2023/01/04 00:55:07 UTC, 0 replies.
- [GitHub] [spark] jchen5 opened a new pull request, #39375: [SPARK-36124][SQL] Support subqueries with correlation through UNION - posted by GitBox <gi...@apache.org> on 2023/01/04 01:20:09 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on pull request #39375: [SPARK-36124][SQL] Support subqueries with correlation through UNION - posted by GitBox <gi...@apache.org> on 2023/01/04 01:22:10 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39374: [SPARK-41865][INFRA][3.2] Use pycodestyle to 2.7.0 to fix pycodestyle errors - posted by GitBox <gi...@apache.org> on 2023/01/04 01:22:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39371: [SPARK-41030][BUILD][3.2] Upgrade `Apache Ivy` to 2.5.1 - posted by GitBox <gi...@apache.org> on 2023/01/04 01:24:18 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39371: [SPARK-41030][BUILD][3.2] Upgrade `Apache Ivy` to 2.5.1 - posted by GitBox <gi...@apache.org> on 2023/01/04 01:24:19 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39365: [SPARK-41859][SQL] CreateHiveTableAsSelectCommand should set the overwrite flag correctly - posted by GitBox <gi...@apache.org> on 2023/01/04 01:28:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39376: [SPARK-41850][CONNECT][PYTHON][TESTS] Enable doctest for `isnan` - posted by GitBox <gi...@apache.org> on 2023/01/04 01:40:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39221: [SPARK-41719][CORE] Skip SSLOptions sub-settings if `ssl` is disabled - posted by GitBox <gi...@apache.org> on 2023/01/04 01:40:48 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39362: [SPARK-41858][SQL] Fix ORC reader perf regression due to DEFAULT value feature - posted by GitBox <gi...@apache.org> on 2023/01/04 01:43:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39221: [SPARK-41719][CORE] Skip SSLOptions sub-settings if `ssl` is disabled - posted by GitBox <gi...@apache.org> on 2023/01/04 01:43:36 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #38865: [SPARK-41232][SQL][PYTHON] Adding array_append function - posted by GitBox <gi...@apache.org> on 2023/01/04 02:39:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38865: [SPARK-41232][SQL][PYTHON] Adding array_append function - posted by GitBox <gi...@apache.org> on 2023/01/04 02:39:47 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #39377: [SPARK-41867][SQL] Selective predicate should respect InMemoryRelation - posted by GitBox <gi...@apache.org> on 2023/01/04 03:10:23 UTC, 0 replies.
- [GitHub] [spark] linhongliu-db commented on pull request #39268: [SPARK-41752][SQL][UI] Group nested executions under the root execution - posted by GitBox <gi...@apache.org> on 2023/01/04 03:10:55 UTC, 3 replies.
- [GitHub] [spark] huangxiaopingRD commented on pull request #39062: [SPARK-41516] [SQL] Allow jdbc dialects to override the query used to create a table - posted by GitBox <gi...@apache.org> on 2023/01/04 03:23:12 UTC, 2 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39378: [SPARK-41821][CONNECT][PYTHON] Fix doc test for DataFrame.describe - posted by GitBox <gi...@apache.org> on 2023/01/04 03:25:48 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39062: [SPARK-41516] [SQL] Allow jdbc dialects to override the query used to create a table - posted by GitBox <gi...@apache.org> on 2023/01/04 03:28:29 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39375: [SPARK-36124][SQL] Support subqueries with correlation through UNION - posted by GitBox <gi...@apache.org> on 2023/01/04 04:20:42 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39369: [WIP][SPARK-41775][PYTHON][ML] Adding support for PyForch functions - posted by GitBox <gi...@apache.org> on 2023/01/04 04:20:45 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39368: [WIP][SPARK-28764]remove writePartitionedFile in ExternalSorter - posted by GitBox <gi...@apache.org> on 2023/01/04 04:20:48 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39367: [SPARK-41861][SQL] Make v2 ScanBuilders' build() return typed scan - posted by GitBox <gi...@apache.org> on 2023/01/04 04:20:52 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39366: [SPARK-41860][SQL] Make AvroScanBuilder and JsonScanBuilder case classes - posted by GitBox <gi...@apache.org> on 2023/01/04 04:20:55 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39376: [SPARK-41850][CONNECT][PYTHON][TESTS] Enable doctest for `isnan` - posted by GitBox <gi...@apache.org> on 2023/01/04 04:28:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39376: [SPARK-41850][CONNECT][PYTHON][TESTS] Enable doctest for `isnan` - posted by GitBox <gi...@apache.org> on 2023/01/04 04:29:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39379: [SPARK-41828][CONNECT][PYTHON] Make `createDataFrame` support empty dataframe - posted by GitBox <gi...@apache.org> on 2023/01/04 04:31:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39379: [SPARK-41828][CONNECT][PYTHON] Make `createDataFrame` support empty dataframe - posted by GitBox <gi...@apache.org> on 2023/01/04 04:34:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39380: [SPARK-41862][SQL][TESTS][FOLLOWUP] Update OrcReadBenchmark result - posted by GitBox <gi...@apache.org> on 2023/01/04 04:35:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39380: [SPARK-41862][SQL][TESTS][FOLLOWUP] Update OrcReadBenchmark result - posted by GitBox <gi...@apache.org> on 2023/01/04 04:43:13 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39370: [SPARK-41862][SQL] Fix correctness bug related to DEFAULT values in Orc reader - posted by GitBox <gi...@apache.org> on 2023/01/04 04:51:24 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/04 04:54:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #34195: [SPARK-36939][PYTHON][DOCS] Add orphan migration page into list in PySpark documentation - posted by GitBox <gi...@apache.org> on 2023/01/04 05:03:06 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side read metrics - posted by GitBox <gi...@apache.org> on 2023/01/04 05:03:40 UTC, 3 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side read metrics - posted by GitBox <gi...@apache.org> on 2023/01/04 05:05:40 UTC, 2 replies.
- [GitHub] [spark] cloud-fan closed pull request #39365: [SPARK-41859][SQL] CreateHiveTableAsSelectCommand should set the overwrite flag correctly - posted by GitBox <gi...@apache.org> on 2023/01/04 05:09:40 UTC, 0 replies.
- [GitHub] [spark] fe2s opened a new pull request, #39381: [SPARK-41554] fix changing of Decimal scale when scale decreased by m… - posted by GitBox <gi...@apache.org> on 2023/01/04 05:13:02 UTC, 0 replies.
- [GitHub] [spark] linhongliu-db commented on a diff in pull request #39268: [SPARK-41752][SQL][UI] Group nested executions under the root execution - posted by GitBox <gi...@apache.org> on 2023/01/04 05:13:56 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #34195: [SPARK-36939][PYTHON][DOCS] Add orphan migration page into list in PySpark documentation - posted by GitBox <gi...@apache.org> on 2023/01/04 05:16:55 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by GitBox <gi...@apache.org> on 2023/01/04 05:17:32 UTC, 1 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39382: [SPARK-41878][CONNECT][TESTS] pyspark.sql.tests.test_dataframe - Add JIRAs or messages for skipped messages - posted by GitBox <gi...@apache.org> on 2023/01/04 05:27:23 UTC, 0 replies.
- [GitHub] [spark] infoankitp commented on a diff in pull request #38865: [SPARK-41232][SQL][PYTHON] Adding array_append function - posted by GitBox <gi...@apache.org> on 2023/01/04 05:28:43 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39383: [SPARK-41780][SQL] Should throw INVALID_PARAMETER_VALUE when the parameters `regexp` in regexp_replace is invalid - posted by GitBox <gi...@apache.org> on 2023/01/04 05:55:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39380: [SPARK-41862][SQL][TESTS][FOLLOWUP] Update OrcReadBenchmark result - posted by GitBox <gi...@apache.org> on 2023/01/04 06:04:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39380: [SPARK-41862][SQL][TESTS][FOLLOWUP] Update OrcReadBenchmark result - posted by GitBox <gi...@apache.org> on 2023/01/04 06:05:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #35583: [SPARK-38261][INFRA] Add missing R packages from base image - posted by GitBox <gi...@apache.org> on 2023/01/04 06:08:12 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #36987: [SPARK-39596][INFRA] Install `ggplot2` for GitHub Action linter job - posted by GitBox <gi...@apache.org> on 2023/01/04 06:08:58 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #39383: [SPARK-41780][SQL] Should throw INVALID_PARAMETER_VALUE when the parameters `regexp` in regexp_replace is invalid - posted by GitBox <gi...@apache.org> on 2023/01/04 06:16:23 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39377: [SPARK-41867][SQL] Selective predicate should respect InMemoryRelation - posted by GitBox <gi...@apache.org> on 2023/01/04 06:17:12 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/04 06:24:31 UTC, 0 replies.
- [GitHub] [spark] kuwii commented on pull request #39190: [SPARK-41683][CORE] Fix issue of getting incorrect property numActiveStages in jobs API - posted by GitBox <gi...@apache.org> on 2023/01/04 06:33:06 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39379: [SPARK-41828][CONNECT][PYTHON] Make `createDataFrame` support empty dataframe - posted by GitBox <gi...@apache.org> on 2023/01/04 06:55:12 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #39384: [WIP][SPARK-40307][PYTHON] Introduce Arrow-optimized Python UDFs - posted by GitBox <gi...@apache.org> on 2023/01/04 07:27:38 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39385: [SPARK-41432][SQL][FOLLOWUP] Fix npe when `SparkPlanGraphWrapperSerializer#serialize` - posted by GitBox <gi...@apache.org> on 2023/01/04 07:28:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39385: [SPARK-41432][SQL][FOLLOWUP] Fix npe when `SparkPlanGraphWrapperSerializer#serialize` - posted by GitBox <gi...@apache.org> on 2023/01/04 07:28:59 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side read metrics - posted by GitBox <gi...@apache.org> on 2023/01/04 07:34:52 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39386: [SPARK-41833][CONNECT][PYTHON] Make `DataFrame.collect` handle ArrayType and BinaryType porperly - posted by GitBox <gi...@apache.org> on 2023/01/04 07:37:08 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39385: [SPARK-41432][SQL][FOLLOWUP] Fix npe when `SparkPlanGraphWrapperSerializer#serializeSparkPlanGraphNodeWrapper` - posted by GitBox <gi...@apache.org> on 2023/01/04 07:43:18 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39226: [SPARK-41694][CORE] Isolate RocksDB path for Live UI and automatically cleanup when `SparkContext.stop()` - posted by GitBox <gi...@apache.org> on 2023/01/04 07:45:33 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39226: [SPARK-41694][CORE] Isolate RocksDB path for Live UI and automatically cleanup when `SparkContext.stop()` - posted by GitBox <gi...@apache.org> on 2023/01/04 07:53:47 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39286: [SPARK-41768][CORE] Refactor the definition of enum to follow with the code style - posted by GitBox <gi...@apache.org> on 2023/01/04 08:02:55 UTC, 2 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #39146: [WIP][SPARK-41589][PYTHON][ML] PyTorch Distributor Baseline API Changes - posted by GitBox <gi...@apache.org> on 2023/01/04 08:23:42 UTC, 15 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39382: [SPARK-41878][CONNECT][TESTS] pyspark.sql.tests.test_dataframe - Add JIRAs or messages for skipped messages - posted by GitBox <gi...@apache.org> on 2023/01/04 08:26:10 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #36987: [SPARK-39596][INFRA] Install `ggplot2` for GitHub Action linter job - posted by GitBox <gi...@apache.org> on 2023/01/04 08:40:45 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #39286: [SPARK-41768][CORE] Refactor the definition of enum to follow with the code style - posted by GitBox <gi...@apache.org> on 2023/01/04 08:43:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39379: [SPARK-41828][CONNECT][PYTHON] Make `createDataFrame` support empty dataframe - posted by GitBox <gi...@apache.org> on 2023/01/04 08:45:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39379: [SPARK-41828][CONNECT][PYTHON] Make `createDataFrame` support empty dataframe - posted by GitBox <gi...@apache.org> on 2023/01/04 08:45:54 UTC, 0 replies.
- [GitHub] [spark] LorenzoMartini commented on pull request #39366: [SPARK-41860][SQL] Make AvroScanBuilder and JsonScanBuilder case classes - posted by GitBox <gi...@apache.org> on 2023/01/04 09:20:22 UTC, 1 replies.
- [GitHub] [spark] LorenzoMartini commented on pull request #39367: [SPARK-41861][SQL] Make v2 ScanBuilders' build() return typed scan - posted by GitBox <gi...@apache.org> on 2023/01/04 09:21:44 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/04 09:51:05 UTC, 10 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39350: [MINOR] Fix a typo "from from" -> "from" - posted by GitBox <gi...@apache.org> on 2023/01/04 10:35:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39386: [SPARK-41833][SPARK-41881][SPARK-41815][CONNECT][PYTHON] Make `DataFrame.collect` handle None/NaN/Array/Binary porperly - posted by GitBox <gi...@apache.org> on 2023/01/04 11:03:15 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39387: [SPARK-41586][PYTHON] Introduce `pyspark.errors` and error classes for PySpark. - posted by GitBox <gi...@apache.org> on 2023/01/04 11:03:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39385: [SPARK-41432][SQL][FOLLOWUP] Fix npe when `SparkPlanGraphWrapperSerializer#serializeSparkPlanGraphNodeWrapper` - posted by GitBox <gi...@apache.org> on 2023/01/04 11:04:58 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39387: [SPARK-41586][PYTHON] Introduce `pyspark.errors` and error classes for PySpark. - posted by GitBox <gi...@apache.org> on 2023/01/04 11:08:23 UTC, 11 replies.
- [GitHub] [spark] dengziming opened a new pull request, #39388: [SPARK-41354][CONNECT][PYTHON] implement RepartitionByExpression - posted by GitBox <gi...@apache.org> on 2023/01/04 11:26:06 UTC, 0 replies.
- [GitHub] [spark] juechen507 commented on pull request #30003: [SPARK-32709][SQL] Support write Hive ORC/Parquet bucketed table (for Hive 1,2) - posted by GitBox <gi...@apache.org> on 2023/01/04 11:42:38 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #39188: [WIP][SPARK-41591][PYTHON][ML] Training PyTorch Files on Single Node Multi GPU - posted by GitBox <gi...@apache.org> on 2023/01/04 11:44:07 UTC, 5 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39385: [SPARK-41882][SQL] Add tests for `SQLAppStatusStore` with RocksDB backend and fix some bugs - posted by GitBox <gi...@apache.org> on 2023/01/04 11:46:57 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39389: [SPARK-41574][SQL] Assign name to _LEGACY_ERROR_TEMP_2009 - posted by GitBox <gi...@apache.org> on 2023/01/04 11:51:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39385: [SPARK-41882][SQL] Add tests for `SQLAppStatusStore` with RocksDB backend and fix some bugs - posted by GitBox <gi...@apache.org> on 2023/01/04 12:00:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39390: [SPARK-41840][CONNECT][PYTHON] Add the missing alias `groupby` - posted by GitBox <gi...@apache.org> on 2023/01/04 12:07:44 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39390: [SPARK-41840][CONNECT][PYTHON] Add the missing alias `groupby` - posted by GitBox <gi...@apache.org> on 2023/01/04 12:15:18 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39391: [SPARK-41883][BUILD] Upgrade dropwizard metrics 4.2.15 - posted by GitBox <gi...@apache.org> on 2023/01/04 12:17:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39392: [SPARK-41846][CONNECT][PYTHON] Enable doctests for window functions - posted by GitBox <gi...@apache.org> on 2023/01/04 12:38:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39170: [SPARK-41674][SQL] Runtime filter should supports multi level shuffle join side as filter creation side - posted by GitBox <gi...@apache.org> on 2023/01/04 14:55:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38163: [SPARK-40711][SQL] Add spill size metrics for window - posted by GitBox <gi...@apache.org> on 2023/01/04 15:04:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39343: [SPARK-41816][SQL] Not close filesystem when log out ThriftServer - posted by GitBox <gi...@apache.org> on 2023/01/04 15:05:57 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39393: [SPARK-41871][CONNECT] DataFrame hint parameter can be str, list, float or int - posted by GitBox <gi...@apache.org> on 2023/01/04 15:41:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39277: [SPARK-41708][SQL] Pull v1write information to `WriteFiles` - posted by GitBox <gi...@apache.org> on 2023/01/04 15:44:06 UTC, 8 replies.
- [GitHub] [spark] techaddict closed pull request #39355: [SPARK-40263][CORE] Use interruptible lock instead of synchronized in TransportClientFactory.createClient() - posted by GitBox <gi...@apache.org> on 2023/01/04 15:55:28 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39394: [SPARK-41575][SQL] Assign name to _LEGACY_ERROR_TEMP_2054 - posted by GitBox <gi...@apache.org> on 2023/01/04 16:11:59 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on pull request #39385: [SPARK-41882][CORE][SQL][UI] Add tests for `SQLAppStatusStore` with RocksDB backend and fix some bugs - posted by GitBox <gi...@apache.org> on 2023/01/04 16:14:44 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39260: [SPARK-41579][SQL] Assign name to _LEGACY_ERROR_TEMP_1249 - posted by GitBox <gi...@apache.org> on 2023/01/04 16:36:26 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #39350: [MINOR] Fix a typo "from from" -> "from" - posted by GitBox <gi...@apache.org> on 2023/01/04 16:56:22 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39350: [MINOR] Fix a typo "from from" -> "from" - posted by GitBox <gi...@apache.org> on 2023/01/04 16:56:30 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39286: [SPARK-41768][CORE] Refactor the definition of enum to follow with the code style - posted by GitBox <gi...@apache.org> on 2023/01/04 16:59:13 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39393: [SPARK-41871][CONNECT] DataFrame hint parameter can be str, list, float or int - posted by GitBox <gi...@apache.org> on 2023/01/04 17:00:03 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39326: [SPARK-41800][BUILD] Upgrade commons-compress to 1.22 - posted by GitBox <gi...@apache.org> on 2023/01/04 17:00:33 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38676: [SPARK-41162][SQL] Do not push down anti-join predicates that become ambiguous - posted by GitBox <gi...@apache.org> on 2023/01/04 17:07:45 UTC, 0 replies.
- [GitHub] [spark] EnricoMi closed pull request #38676: [SPARK-41162][SQL] Do not push down anti-join predicates that become ambiguous - posted by GitBox <gi...@apache.org> on 2023/01/04 17:07:46 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38223: [SPARK-40770][PYTHON] Improved error messages for applyInPandas for schema mismatch - posted by GitBox <gi...@apache.org> on 2023/01/04 17:10:40 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39383: [SPARK-41780][SQL] Should throw INVALID_PARAMETER_VALUE when the parameters `regexp` in regexp_replace is invalid - posted by GitBox <gi...@apache.org> on 2023/01/04 17:10:44 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39388: [SPARK-41354][CONNECT][PYTHON] implement RepartitionByExpression - posted by GitBox <gi...@apache.org> on 2023/01/04 17:13:12 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39131: [SPARK-41162][SQL] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/04 17:38:28 UTC, 2 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #39368: [SPARK-28764][CORE][TEST] Remove writePartitionedFile in ExternalSorter - posted by GitBox <gi...@apache.org> on 2023/01/04 17:41:08 UTC, 2 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39131: [SPARK-41162][SQL] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/04 17:59:52 UTC, 0 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #39188: [WIP][SPARK-41591][PYTHON][ML] Training PyTorch Files on Single Node Multi GPU - posted by GitBox <gi...@apache.org> on 2023/01/04 18:25:09 UTC, 13 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39284: [SPARK-41573][SQL] Assign name to _LEGACY_ERROR_TEMP_2136 - posted by GitBox <gi...@apache.org> on 2023/01/04 19:20:57 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39284: [SPARK-41573][SQL] Assign name to _LEGACY_ERROR_TEMP_2136 - posted by GitBox <gi...@apache.org> on 2023/01/04 19:21:48 UTC, 0 replies.
- [GitHub] [spark] kyle-ai2 commented on pull request #39371: [SPARK-41030][BUILD][3.2] Upgrade `Apache Ivy` to 2.5.1 - posted by GitBox <gi...@apache.org> on 2023/01/04 19:26:26 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39385: [SPARK-41882][CORE][SQL][UI] Add tests for `SQLAppStatusStore` with RocksDB backend and fix some bugs - posted by GitBox <gi...@apache.org> on 2023/01/04 20:01:47 UTC, 3 replies.
- [GitHub] [spark] gerashegalov commented on a diff in pull request #39383: [SPARK-41780][SQL] Should throw INVALID_PARAMETER_VALUE when the parameters `regexp` in regexp_replace is invalid - posted by GitBox <gi...@apache.org> on 2023/01/04 20:26:40 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39391: [SPARK-41883][BUILD] Upgrade dropwizard metrics 4.2.15 - posted by GitBox <gi...@apache.org> on 2023/01/04 20:51:46 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39286: [SPARK-41768][CORE] Refactor the definition of enum to follow with the code style - posted by GitBox <gi...@apache.org> on 2023/01/04 21:14:23 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39286: [SPARK-41768][CORE] Refactor the definition of enum to follow with the code style - posted by GitBox <gi...@apache.org> on 2023/01/04 21:15:15 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39226: [SPARK-41694][CORE] Isolate RocksDB path for Live UI and automatically cleanup when `SparkContext.stop()` - posted by GitBox <gi...@apache.org> on 2023/01/04 21:21:51 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39357: [SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for StreamingQueryProgressWrapper - posted by GitBox <gi...@apache.org> on 2023/01/04 21:57:39 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39357: [SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for StreamingQueryProgressWrapper - posted by GitBox <gi...@apache.org> on 2023/01/04 22:03:44 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39357: [SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for StreamingQueryProgressWrapper - posted by GitBox <gi...@apache.org> on 2023/01/04 22:04:10 UTC, 0 replies.
- [GitHub] [spark] leewyang commented on a diff in pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2023/01/04 22:08:40 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39268: [SPARK-41752][SQL][UI] Group nested executions under the root execution - posted by GitBox <gi...@apache.org> on 2023/01/04 22:10:56 UTC, 6 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #39188: [WIP][SPARK-41591][PYTHON][ML] Training PyTorch Files on Single Node Multi GPU - posted by GitBox <gi...@apache.org> on 2023/01/04 22:34:13 UTC, 14 replies.
- [GitHub] [spark] akpatnam25 commented on pull request #38959: SPARK-41415: SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/04 23:39:47 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39393: [SPARK-41871][CONNECT] DataFrame hint parameter can be str, list, float or int - posted by GitBox <gi...@apache.org> on 2023/01/04 23:48:31 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39390: [SPARK-41840][CONNECT][PYTHON] Add the missing alias `groupby` - posted by GitBox <gi...@apache.org> on 2023/01/04 23:49:04 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39390: [SPARK-41840][CONNECT][PYTHON] Add the missing alias `groupby` - posted by GitBox <gi...@apache.org> on 2023/01/04 23:49:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39392: [SPARK-41846][CONNECT][PYTHON] Enable doctests for window functions - posted by GitBox <gi...@apache.org> on 2023/01/04 23:50:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39392: [SPARK-41846][CONNECT][PYTHON] Enable doctests for window functions - posted by GitBox <gi...@apache.org> on 2023/01/04 23:50:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39386: [SPARK-41833][SPARK-41881][SPARK-41815][CONNECT][PYTHON] Make `DataFrame.collect` handle None/NaN/Array/Binary porperly - posted by GitBox <gi...@apache.org> on 2023/01/04 23:52:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39386: [SPARK-41833][SPARK-41881][SPARK-41815][CONNECT][PYTHON] Make `DataFrame.collect` handle None/NaN/Array/Binary porperly - posted by GitBox <gi...@apache.org> on 2023/01/04 23:52:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39382: [SPARK-41878][CONNECT][TESTS] pyspark.sql.tests.test_dataframe - Add JIRAs or messages for skipped messages - posted by GitBox <gi...@apache.org> on 2023/01/04 23:53:16 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37899: [SPARK-40455][CORE]Abort result stage directly when it failed caused by FetchFailedException - posted by GitBox <gi...@apache.org> on 2023/01/05 00:19:09 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36700: [SPARK-39318][SQL] Remove tpch-plan-stability WithStats golden files - posted by GitBox <gi...@apache.org> on 2023/01/05 00:19:13 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36052: [SPARK-38777][YARN] Add `bin/spark-submit --kill / --status` support for yarn - posted by GitBox <gi...@apache.org> on 2023/01/05 00:19:14 UTC, 0 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #39146: [WIP][SPARK-41589][PYTHON][ML] PyTorch Distributor Baseline API Changes - posted by GitBox <gi...@apache.org> on 2023/01/05 00:26:15 UTC, 17 replies.
- [GitHub] [spark] tedyu opened a new pull request, #39395: [SQL] Use foldLeft for DeduplicateRelations - posted by GitBox <gi...@apache.org> on 2023/01/05 00:32:14 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #39395: [SQL] Use foldLeft for DeduplicateRelations - posted by GitBox <gi...@apache.org> on 2023/01/05 00:33:06 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39378: [SPARK-41821][CONNECT][PYTHON] Fix doc test for DataFrame.describe - posted by GitBox <gi...@apache.org> on 2023/01/05 00:58:12 UTC, 1 replies.
- [GitHub] [spark] xkrogen commented on pull request #38660: [SPARK-40199][SQL][WIP] Provide useful error when encountering null values in non-null fields - posted by GitBox <gi...@apache.org> on 2023/01/05 00:59:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39385: [SPARK-41882][CORE][SQL][UI] Add tests for `SQLAppStatusStore` with RocksDB backend and fix some bugs - posted by GitBox <gi...@apache.org> on 2023/01/05 01:12:30 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39378: [SPARK-41821][CONNECT][PYTHON] Fix doc test for DataFrame.describe - posted by GitBox <gi...@apache.org> on 2023/01/05 01:14:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39388: [SPARK-41354][CONNECT][PYTHON] implement RepartitionByExpression - posted by GitBox <gi...@apache.org> on 2023/01/05 01:17:34 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39277: [SPARK-41708][SQL] Pull v1write information to `WriteFiles` - posted by GitBox <gi...@apache.org> on 2023/01/05 01:34:08 UTC, 4 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39396: [SPARK-41825][CONNECT][PYTHON] Enable doctests related to `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2023/01/05 01:36:18 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39268: [SPARK-41752][SQL][UI] Group nested executions under the root execution - posted by GitBox <gi...@apache.org> on 2023/01/05 01:53:40 UTC, 2 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #39383: [SPARK-41780][SQL] Should throw INVALID_PARAMETER_VALUE when the parameters `regexp` in regexp_replace is invalid - posted by GitBox <gi...@apache.org> on 2023/01/05 02:13:01 UTC, 0 replies.
- [GitHub] [spark] AngersZhuuuu commented on pull request #36700: [SPARK-39318][SQL] Remove tpch-plan-stability WithStats golden files - posted by GitBox <gi...@apache.org> on 2023/01/05 02:15:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39391: [SPARK-41883][BUILD] Upgrade dropwizard metrics 4.2.15 - posted by GitBox <gi...@apache.org> on 2023/01/05 02:17:26 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39357: [SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for StreamingQueryProgressWrapper - posted by GitBox <gi...@apache.org> on 2023/01/05 02:18:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39385: [SPARK-41882][CORE][SQL][UI] Add tests for `SQLAppStatusStore` with RocksDB backend and fix some bugs - posted by GitBox <gi...@apache.org> on 2023/01/05 02:19:12 UTC, 3 replies.
- [GitHub] [spark] techaddict commented on pull request #39393: [SPARK-41871][CONNECT] DataFrame hint parameter can be str, list, float or int - posted by GitBox <gi...@apache.org> on 2023/01/05 02:23:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39393: [SPARK-41871][CONNECT] DataFrame hint parameter can be str, list, float or int - posted by GitBox <gi...@apache.org> on 2023/01/05 02:39:53 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38163: [SPARK-40711][SQL] Add spill size metrics for window - posted by GitBox <gi...@apache.org> on 2023/01/05 02:41:34 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on a diff in pull request #39393: [SPARK-41871][CONNECT] DataFrame hint parameter can be str, list, float or int - posted by GitBox <gi...@apache.org> on 2023/01/05 02:43:34 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39260: [SPARK-41579][SQL] Assign name to _LEGACY_ERROR_TEMP_1249 - posted by GitBox <gi...@apache.org> on 2023/01/05 03:05:51 UTC, 2 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39397: [MINOR] fix typos - posted by GitBox <gi...@apache.org> on 2023/01/05 03:09:44 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on pull request #39397: [MINOR] fix typos - posted by GitBox <gi...@apache.org> on 2023/01/05 03:10:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39396: [SPARK-41825][CONNECT][PYTHON] Enable doctests related to `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2023/01/05 03:19:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39396: [SPARK-41825][CONNECT][PYTHON] Enable doctests related to `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2023/01/05 03:19:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39396: [SPARK-41825][CONNECT][PYTHON] Enable doctests related to `DataFrame.show` - posted by GitBox <gi...@apache.org> on 2023/01/05 03:32:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39398: [SPARK-41829][CONNECT][PYTHON] Add the missing ordering parameter in `Sort` and `sortWithinPartitions` - posted by GitBox <gi...@apache.org> on 2023/01/05 03:39:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39399: [SPARK-41890][CORE][SQL][UI] Reduce `toSeq` in `RDDOperationGraphWrapperSerializer`/`SparkPlanGraphWrapperSerializer` for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/05 04:06:35 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39226: [SPARK-41694][CORE] Isolate RocksDB path for Live UI and automatically cleanup when `SparkContext.stop()` - posted by GitBox <gi...@apache.org> on 2023/01/05 04:32:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38163: [SPARK-40711][SQL] Add spill size metrics for window - posted by GitBox <gi...@apache.org> on 2023/01/05 04:38:31 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #36700: [SPARK-39318][SQL] Remove tpch-plan-stability WithStats golden files - posted by GitBox <gi...@apache.org> on 2023/01/05 04:40:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #36700: [SPARK-39318][SQL] Remove tpch-plan-stability WithStats golden files - posted by GitBox <gi...@apache.org> on 2023/01/05 04:40:52 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39400: [SPARK-41891][CONNECT][TESTS] Enable test_add_months_function, test_array_repeat, test_dayofweek, test_first_last_ignorenulls, test_function_parity, test_inline, test_window_time, test_reciprocal_trig_functions - posted by GitBox <gi...@apache.org> on 2023/01/05 04:41:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39395: [SQL] Use foldLeft for DeduplicateRelations - posted by GitBox <gi...@apache.org> on 2023/01/05 04:42:08 UTC, 0 replies.
- [GitHub] [spark] dengziming commented on pull request #39388: [SPARK-41354][CONNECT][PYTHON] implement RepartitionByExpression - posted by GitBox <gi...@apache.org> on 2023/01/05 04:43:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39398: [SPARK-41829][CONNECT][PYTHON] Add the missing ordering parameter in `Sort` and `sortWithinPartitions` - posted by GitBox <gi...@apache.org> on 2023/01/05 05:03:38 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39393: [SPARK-41871][CONNECT] DataFrame hint parameter can be str, float or int - posted by GitBox <gi...@apache.org> on 2023/01/05 05:32:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39393: [SPARK-41871][CONNECT] DataFrame hint parameter can be str, float or int - posted by GitBox <gi...@apache.org> on 2023/01/05 05:33:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39397: [MINOR][CONNECT] Fix typos in connect/plan.py - posted by GitBox <gi...@apache.org> on 2023/01/05 05:34:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39397: [MINOR][CONNECT] Fix typos in connect/plan.py - posted by GitBox <gi...@apache.org> on 2023/01/05 05:34:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39400: [SPARK-41891][CONNECT][TESTS] Enable test_add_months_function, test_array_repeat, test_dayofweek, test_first_last_ignorenulls, test_function_parity, test_inline, test_window_time, test_reciprocal_trig_functions - posted by GitBox <gi...@apache.org> on 2023/01/05 05:40:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39378: [SPARK-41821][CONNECT][PYTHON] Fix doc test for DataFrame.describe - posted by GitBox <gi...@apache.org> on 2023/01/05 05:43:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39378: [SPARK-41821][CONNECT][PYTHON] Fix doc test for DataFrame.describe - posted by GitBox <gi...@apache.org> on 2023/01/05 05:44:07 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39281: [SPARK-41576][SQL] Assign name to _LEGACY_ERROR_TEMP_2051 - posted by GitBox <gi...@apache.org> on 2023/01/05 05:47:19 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39281: [SPARK-41576][SQL] Assign name to _LEGACY_ERROR_TEMP_2051 - posted by GitBox <gi...@apache.org> on 2023/01/05 05:47:59 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39305: [SPARK-41580][SQL] Assign name to _LEGACY_ERROR_TEMP_2137 - posted by GitBox <gi...@apache.org> on 2023/01/05 05:57:49 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39305: [SPARK-41580][SQL] Assign name to _LEGACY_ERROR_TEMP_2137 - posted by GitBox <gi...@apache.org> on 2023/01/05 05:58:24 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39389: [SPARK-41574][SQL] Assign name to _LEGACY_ERROR_TEMP_2009 - posted by GitBox <gi...@apache.org> on 2023/01/05 06:05:08 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39385: [SPARK-41882][CORE][SQL][UI] Add tests for `SQLAppStatusStore` with RocksDB backend and fix some bugs - posted by GitBox <gi...@apache.org> on 2023/01/05 06:14:48 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39377: [SPARK-41867][SQL] Selective predicate should respect InMemoryRelation - posted by GitBox <gi...@apache.org> on 2023/01/05 06:42:32 UTC, 7 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #39131: [SPARK-41162][SQL] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/05 06:44:07 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38163: [SPARK-40711][SQL] Add spill size metrics for window - posted by GitBox <gi...@apache.org> on 2023/01/05 06:46:41 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39258: [SPARK-41572][SQL] Assign name to _LEGACY_ERROR_TEMP_2149 - posted by GitBox <gi...@apache.org> on 2023/01/05 06:57:43 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39131: [SPARK-41162][SQL] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/05 06:58:10 UTC, 1 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by GitBox <gi...@apache.org> on 2023/01/05 07:04:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39398: [SPARK-41829][CONNECT][PYTHON] Add the missing ordering parameter in `Sort` and `sortWithinPartitions` - posted by GitBox <gi...@apache.org> on 2023/01/05 07:26:38 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39401: [SPARK-41893][BUILD] Publish SBOM artifacts - posted by GitBox <gi...@apache.org> on 2023/01/05 07:29:45 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39402: [SPARK-41889][SQL] Attach root cause to invalidPatternError - posted by GitBox <gi...@apache.org> on 2023/01/05 07:31:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39401: [SPARK-41893][BUILD] Publish SBOM artifacts - posted by GitBox <gi...@apache.org> on 2023/01/05 07:38:36 UTC, 8 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39394: [SPARK-41575][SQL] Assign name to _LEGACY_ERROR_TEMP_2054 - posted by GitBox <gi...@apache.org> on 2023/01/05 07:52:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39400: [SPARK-41891][CONNECT][TESTS] Enable test_add_months_function, test_array_repeat, test_dayofweek, test_first_last_ignorenulls, test_inline, test_window_time, test_reciprocal_trig_functions - posted by GitBox <gi...@apache.org> on 2023/01/05 07:59:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39400: [SPARK-41891][CONNECT][TESTS] Enable test_add_months_function, test_array_repeat, test_dayofweek, test_first_last_ignorenulls, test_inline, test_window_time, test_reciprocal_trig_functions - posted by GitBox <gi...@apache.org> on 2023/01/05 07:59:27 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #39402: [SPARK-41889][SQL] Attach root cause to invalidPatternError - posted by GitBox <gi...@apache.org> on 2023/01/05 08:01:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39403: [SPARK-41830][CONNECT][PYTHON] Make `DataFrame.sample` accept the same parameters as PySpark - posted by GitBox <gi...@apache.org> on 2023/01/05 08:03:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39404: [SPARK-41827][CONNECT][PYTHON] Make `GroupBy` accept column list - posted by GitBox <gi...@apache.org> on 2023/01/05 08:17:13 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39402: [SPARK-41889][SQL] Attach root cause to invalidPatternError - posted by GitBox <gi...@apache.org> on 2023/01/05 08:23:37 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39405: [SPARK-41831][CONNECT][PYTHON] Make `DataFrame.select` accept column list - posted by GitBox <gi...@apache.org> on 2023/01/05 08:25:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39406: [SPARK-41894][SQL][TESTS] Add new action to `AsyncProgressTrackingMicroBatchExecutionSuite#testAsyncWriteErrorsPermissionsIssue` to restore the write permission of `commitDir`. - posted by GitBox <gi...@apache.org> on 2023/01/05 08:29:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39407: [SPARK-41842][CONNECT][PYTHON][TESTS] Enable doctests for time functions - posted by GitBox <gi...@apache.org> on 2023/01/05 08:33:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39407: [SPARK-41842][CONNECT][PYTHON][TESTS] Enable doctests for time functions - posted by GitBox <gi...@apache.org> on 2023/01/05 08:33:48 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39406: [SPARK-41894][SS][TESTS] Add new action to `testAsyncWriteErrorsPermissionsIssue` to restore the write permission of `commitDir`. - posted by GitBox <gi...@apache.org> on 2023/01/05 08:47:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39406: [SPARK-41894][SS][TESTS] Add new action to `testAsyncWriteErrorsPermissionsIssue` to restore the write permission of `commitDir`. - posted by GitBox <gi...@apache.org> on 2023/01/05 08:49:35 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by GitBox <gi...@apache.org> on 2023/01/05 08:50:25 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39315: [SPARK-41790][SQL] Set TRANSFORM reader and writer's format correctly - posted by GitBox <gi...@apache.org> on 2023/01/05 08:52:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39315: [SPARK-41790][SQL] Set TRANSFORM reader and writer's format correctly - posted by GitBox <gi...@apache.org> on 2023/01/05 08:52:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39406: [SPARK-41894][SS][TESTS] Restore the write permission of `commitDir` after run `testAsyncWriteErrorsPermissionsIssue` - posted by GitBox <gi...@apache.org> on 2023/01/05 09:10:22 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39403: [SPARK-41830][CONNECT][PYTHON] Make `DataFrame.sample` accept the same parameters as PySpark - posted by GitBox <gi...@apache.org> on 2023/01/05 10:18:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39403: [SPARK-41830][CONNECT][PYTHON] Make `DataFrame.sample` accept the same parameters as PySpark - posted by GitBox <gi...@apache.org> on 2023/01/05 10:19:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39407: [SPARK-41842][CONNECT][PYTHON][TESTS] Enable doctests for time functions - posted by GitBox <gi...@apache.org> on 2023/01/05 10:20:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39407: [SPARK-41842][CONNECT][PYTHON][TESTS] Enable doctests for time functions - posted by GitBox <gi...@apache.org> on 2023/01/05 10:20:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39131: [SPARK-41162][SQL] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/05 10:55:27 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #39402: [SPARK-41889][SQL] Attach root cause to invalidPatternError - posted by GitBox <gi...@apache.org> on 2023/01/05 11:28:15 UTC, 3 replies.
- [GitHub] [spark] yaooqinn commented on pull request #39343: [SPARK-41816][SQL] Not close filesystem when log out ThriftServer - posted by GitBox <gi...@apache.org> on 2023/01/05 11:31:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39314: [SPARK-41791] Add new metadata types - posted by GitBox <gi...@apache.org> on 2023/01/05 11:37:49 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39314: [SPARK-41791] Add new file source metadata column types - posted by GitBox <gi...@apache.org> on 2023/01/05 11:40:18 UTC, 0 replies.
- [GitHub] [spark] olaky opened a new pull request, #39408: [SPARK-41896] Filtering by row index returns empty results - posted by GitBox <gi...@apache.org> on 2023/01/05 11:50:40 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39334: [SPARK-41806][SQL] Use AppendData.byName for SQL INSERT INTO by name for DSV2 - posted by GitBox <gi...@apache.org> on 2023/01/05 12:16:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39399: [SPARK-41890][CORE][SQL][UI] Reduce `toSeq` in `RDDOperationGraphWrapperSerializer`/`SparkPlanGraphWrapperSerializer` for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/05 12:30:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39408: [SPARK-41896] Filtering by row index returns empty results - posted by GitBox <gi...@apache.org> on 2023/01/05 12:32:52 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39409: [SPARK-41162][SQL][3.3] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/05 12:33:41 UTC, 0 replies.
- [GitHub] [spark] olaky commented on pull request #39408: [SPARK-41896] Filtering by row index returns empty results - posted by GitBox <gi...@apache.org> on 2023/01/05 13:18:32 UTC, 1 replies.
- [GitHub] [spark] peter-toth commented on pull request #38034: [SPARK-40599][SQL] Add multiTransform methods to TreeNode to generate alternatives - posted by GitBox <gi...@apache.org> on 2023/01/05 13:19:54 UTC, 3 replies.
- [GitHub] [spark] srowen commented on pull request #39401: [SPARK-41893][BUILD] Publish SBOM artifacts - posted by GitBox <gi...@apache.org> on 2023/01/05 13:29:58 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39391: [SPARK-41883][BUILD] Upgrade dropwizard metrics 4.2.15 - posted by GitBox <gi...@apache.org> on 2023/01/05 13:32:12 UTC, 0 replies.
- [GitHub] [spark] olaky commented on a diff in pull request #39408: [SPARK-41896] Filtering by row index returns empty results - posted by GitBox <gi...@apache.org> on 2023/01/05 13:53:25 UTC, 1 replies.
- [GitHub] [spark] ivoson opened a new pull request, #39410: [SPARK-41848][CORE] Fixing task over-scheduled with TaskResourceProfile - posted by GitBox <gi...@apache.org> on 2023/01/05 14:30:03 UTC, 0 replies.
- [GitHub] [spark] ivoson commented on pull request #39410: [SPARK-41848][CORE] Fixing task over-scheduled with TaskResourceProfile - posted by GitBox <gi...@apache.org> on 2023/01/05 14:30:58 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39410: [SPARK-41848][CORE] Fixing task over-scheduled with TaskResourceProfile - posted by GitBox <gi...@apache.org> on 2023/01/05 14:36:04 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39409: [SPARK-41162][SQL][3.3] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/05 14:36:08 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39408: [SPARK-41896] Filtering by row index returns empty results - posted by GitBox <gi...@apache.org> on 2023/01/05 14:36:10 UTC, 0 replies.
- [GitHub] [spark] jchen5 commented on pull request #38432: [SPARK-36124][SQL][WIP] Support subqueries with correlation through set operators - posted by GitBox <gi...@apache.org> on 2023/01/05 15:40:15 UTC, 0 replies.
- [GitHub] [spark] jchen5 closed pull request #38432: [SPARK-36124][SQL][WIP] Support subqueries with correlation through set operators - posted by GitBox <gi...@apache.org> on 2023/01/05 15:40:16 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39411: [SPARK-41162][SQL][3.1] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/05 15:49:55 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39402: [SPARK-41889][SQL] Attach root cause to invalidPatternError - posted by GitBox <gi...@apache.org> on 2023/01/05 17:01:01 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39394: [SPARK-41575][SQL] Assign name to _LEGACY_ERROR_TEMP_2054 - posted by GitBox <gi...@apache.org> on 2023/01/05 17:04:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39411: [SPARK-41162][SQL][3.1] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/05 17:12:55 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39409: [SPARK-41162][SQL][3.3] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/05 17:16:01 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39131: [SPARK-41162][SQL] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/05 17:19:29 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39412: [SPARK-41892][CONNECT][TESTS] pyspark.sql.tests.test_functions - Add JIRAs or messages for skipped messages - posted by GitBox <gi...@apache.org> on 2023/01/05 18:00:15 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39385: [SPARK-41882][CORE][SQL][UI] Add tests for `SQLAppStatusStore` with RocksDB backend and fix some bugs - posted by GitBox <gi...@apache.org> on 2023/01/05 18:12:13 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39385: [SPARK-41882][CORE][SQL][UI] Add tests for `SQLAppStatusStore` with RocksDB backend and fix some bugs - posted by GitBox <gi...@apache.org> on 2023/01/05 18:12:50 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39413: [SPARK-41849][CONNECT] Implement DataFrameReader.text - posted by GitBox <gi...@apache.org> on 2023/01/05 18:40:30 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39226: [SPARK-41694][CORE] Isolate RocksDB path for Live UI and automatically cleanup when `SparkContext.stop()` - posted by GitBox <gi...@apache.org> on 2023/01/05 18:51:10 UTC, 0 replies.
- [GitHub] [spark] amaliujia opened a new pull request, #39414: [SPARK-41912][SQL] Subquery should not validate CTE - posted by GitBox <gi...@apache.org> on 2023/01/05 19:07:17 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on pull request #39414: [SPARK-41912][SQL] Subquery should not validate CTE - posted by GitBox <gi...@apache.org> on 2023/01/05 19:07:32 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39402: [SPARK-41889][SQL] Attach root cause to invalidPatternError - posted by GitBox <gi...@apache.org> on 2023/01/05 20:50:43 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2023/01/05 21:07:09 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #39401: [SPARK-41893][BUILD] Publish SBOM artifacts - posted by GitBox <gi...@apache.org> on 2023/01/05 21:07:54 UTC, 1 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by GitBox <gi...@apache.org> on 2023/01/05 21:12:29 UTC, 2 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38002: [WIP][Do not merge] Spark 40465 refactor - posted by GitBox <gi...@apache.org> on 2023/01/06 00:19:43 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37899: [SPARK-40455][CORE]Abort result stage directly when it failed caused by FetchFailedException - posted by GitBox <gi...@apache.org> on 2023/01/06 00:19:45 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36052: [SPARK-38777][YARN] Add `bin/spark-submit --kill / --status` support for yarn - posted by GitBox <gi...@apache.org> on 2023/01/06 00:19:46 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #35764: [SPARK-38444][SQL]Automatically calculate the upper and lower bounds of partitions when no specified partition related params - posted by GitBox <gi...@apache.org> on 2023/01/06 00:19:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39401: [SPARK-41893][BUILD] Publish SBOM artifacts - posted by GitBox <gi...@apache.org> on 2023/01/06 00:22:56 UTC, 0 replies.
- [GitHub] [spark] allisonport-db commented on pull request #39334: [SPARK-41806][SQL] Use AppendData.byName for SQL INSERT INTO by name for DSV2 - posted by GitBox <gi...@apache.org> on 2023/01/06 00:28:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39412: [SPARK-41892][CONNECT][TESTS] pyspark.sql.tests.test_functions - Add JIRAs or messages for skipped messages - posted by GitBox <gi...@apache.org> on 2023/01/06 00:42:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39412: [SPARK-41892][CONNECT][TESTS] pyspark.sql.tests.test_functions - Add JIRAs or messages for skipped messages - posted by GitBox <gi...@apache.org> on 2023/01/06 00:42:52 UTC, 0 replies.
- [GitHub] [spark] rithwik-db commented on pull request #39146: [WIP][SPARK-41589][PYTHON][ML] PyTorch Distributor Baseline API Changes - posted by GitBox <gi...@apache.org> on 2023/01/06 00:44:40 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39146: [WIP][SPARK-41589][PYTHON][ML] PyTorch Distributor Baseline API Changes - posted by GitBox <gi...@apache.org> on 2023/01/06 00:54:35 UTC, 18 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39146: [WIP][SPARK-41589][PYTHON][ML] PyTorch Distributor Baseline API Changes - posted by GitBox <gi...@apache.org> on 2023/01/06 01:07:27 UTC, 1 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39415: [SPARK-41895][SS][UI] Add tests for streaming UI with RocksDB backend - posted by GitBox <gi...@apache.org> on 2023/01/06 01:08:07 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39416: Revert "[SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for StreamingQueryProgressWrapper" - posted by GitBox <gi...@apache.org> on 2023/01/06 01:14:09 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39416: Revert "[SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for StreamingQueryProgressWrapper" - posted by GitBox <gi...@apache.org> on 2023/01/06 01:14:49 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39406: [SPARK-41894][SS][TESTS] Restore the write permission of `commitDir` after run `testAsyncWriteErrorsPermissionsIssue` - posted by GitBox <gi...@apache.org> on 2023/01/06 01:17:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39413: [SPARK-41849][CONNECT] Implement DataFrameReader.text - posted by GitBox <gi...@apache.org> on 2023/01/06 01:18:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39413: [SPARK-41849][CONNECT] Implement DataFrameReader.text - posted by GitBox <gi...@apache.org> on 2023/01/06 01:19:07 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #39406: [SPARK-41894][SS][TESTS] Restore the write permission of `commitDir` after run `testAsyncWriteErrorsPermissionsIssue` - posted by GitBox <gi...@apache.org> on 2023/01/06 01:21:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39408: [SPARK-41896][SQL] Filtering by row index returns empty results - posted by GitBox <gi...@apache.org> on 2023/01/06 01:28:54 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39404: [SPARK-41827][CONNECT][PYTHON] Make `GroupBy` accept column list - posted by GitBox <gi...@apache.org> on 2023/01/06 01:31:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39404: [SPARK-41827][CONNECT][PYTHON] Make `GroupBy` accept column list - posted by GitBox <gi...@apache.org> on 2023/01/06 01:31:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39118: [SPARK-41567][BUILD] Move configuration of `versions-maven-plugin` to parent pom - posted by GitBox <gi...@apache.org> on 2023/01/06 01:38:13 UTC, 0 replies.
- [GitHub] [spark] zhouyejoe commented on pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side read metrics - posted by GitBox <gi...@apache.org> on 2023/01/06 01:44:18 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on pull request #39410: [SPARK-41848][CORE] Fixing task over-scheduled with TaskResourceProfile - posted by GitBox <gi...@apache.org> on 2023/01/06 02:05:58 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39417: [SPARK-41831][CONNECT] DataFrame.select to take a single list of columns - posted by GitBox <gi...@apache.org> on 2023/01/06 02:30:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39417: [SPARK-41831][CONNECT] DataFrame.select to take a single list of columns - posted by GitBox <gi...@apache.org> on 2023/01/06 02:30:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39417: [SPARK-41831][CONNECT] DataFrame.select to take a single list of columns - posted by GitBox <gi...@apache.org> on 2023/01/06 02:40:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39334: [SPARK-41806][SQL] Use AppendData.byName for SQL INSERT INTO by name for DSV2 - posted by GitBox <gi...@apache.org> on 2023/01/06 02:42:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39334: [SPARK-41806][SQL] Use AppendData.byName for SQL INSERT INTO by name for DSV2 - posted by GitBox <gi...@apache.org> on 2023/01/06 02:42:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39418: [SPARK-41869][CONNECT] Reject single string in dropDuplicates - posted by GitBox <gi...@apache.org> on 2023/01/06 02:46:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39418: [SPARK-41869][CONNECT] Reject single string in dropDuplicates - posted by GitBox <gi...@apache.org> on 2023/01/06 02:46:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39367: [SPARK-41861][SQL] Make v2 ScanBuilders' build() return typed scan - posted by GitBox <gi...@apache.org> on 2023/01/06 02:46:26 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39367: [SPARK-41861][SQL] Make v2 ScanBuilders' build() return typed scan - posted by GitBox <gi...@apache.org> on 2023/01/06 02:46:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39416: Revert "[SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for StreamingQueryProgressWrapper" - posted by GitBox <gi...@apache.org> on 2023/01/06 02:49:11 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39419: [SPARK-41840][CONNECT][TESTS] Remove the invalid JIRA in the comment. - posted by GitBox <gi...@apache.org> on 2023/01/06 02:54:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39419: [SPARK-41840][CONNECT][TESTS] Remove the invalid JIRA in the comment. - posted by GitBox <gi...@apache.org> on 2023/01/06 02:55:05 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39118: [SPARK-41567][BUILD] Move configuration of `versions-maven-plugin` to parent pom - posted by GitBox <gi...@apache.org> on 2023/01/06 02:58:34 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39416: Revert "[SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for StreamingQueryProgressWrapper" - posted by GitBox <gi...@apache.org> on 2023/01/06 03:03:42 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39420: [SPARK-41905][CONNECT] Support name as strings in slice - posted by GitBox <gi...@apache.org> on 2023/01/06 03:06:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39420: [SPARK-41905][CONNECT] Support name as strings in slice - posted by GitBox <gi...@apache.org> on 2023/01/06 03:06:54 UTC, 1 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #39410: [SPARK-41848][CORE] Fixing task over-scheduled with TaskResourceProfile - posted by GitBox <gi...@apache.org> on 2023/01/06 03:08:13 UTC, 6 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39421: [SPARK-41906][CONNECT][TESTS] Reenable rand test in Spark Connect. - posted by GitBox <gi...@apache.org> on 2023/01/06 03:13:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39421: [SPARK-41906][CONNECT][TESTS] Reenable rand test in Spark Connect. - posted by GitBox <gi...@apache.org> on 2023/01/06 03:13:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39405: [SPARK-41831][CONNECT][PYTHON] Make `DataFrame.select` accept column list - posted by GitBox <gi...@apache.org> on 2023/01/06 03:22:11 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39405: [SPARK-41831][CONNECT][PYTHON] Make `DataFrame.select` accept column list - posted by GitBox <gi...@apache.org> on 2023/01/06 03:22:42 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39410: [SPARK-41848][CORE] Fixing task over-scheduled with TaskResourceProfile - posted by GitBox <gi...@apache.org> on 2023/01/06 03:25:40 UTC, 4 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39133: [SPARK-41595][SQL] Support generator function explode/explode_outer in the FROM clause - posted by GitBox <gi...@apache.org> on 2023/01/06 03:27:19 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39414: [SPARK-41912][SQL] Subquery should not validate CTE - posted by GitBox <gi...@apache.org> on 2023/01/06 03:30:46 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39414: [SPARK-41912][SQL] Subquery should not validate CTE - posted by GitBox <gi...@apache.org> on 2023/01/06 03:30:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39409: [SPARK-41162][SQL][3.3] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/06 03:32:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39409: [SPARK-41162][SQL][3.3] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/06 03:33:51 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39422: [SPARK-41875][CONNECT][PYTHON] Add test cases for `Dataset.to()` - posted by GitBox <gi...@apache.org> on 2023/01/06 03:34:06 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #39146: [WIP][SPARK-41589][PYTHON][ML] PyTorch Distributor Baseline API Changes - posted by GitBox <gi...@apache.org> on 2023/01/06 03:35:20 UTC, 1 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39423: [SPARK-41921][CONNECT][TESTS] Enable doctests in connect.column and connect.functions - posted by GitBox <gi...@apache.org> on 2023/01/06 03:39:46 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #39146: [WIP][SPARK-41589][PYTHON][ML] PyTorch Distributor Baseline API Changes - posted by GitBox <gi...@apache.org> on 2023/01/06 03:40:55 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38959: SPARK-41415: SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/06 03:41:46 UTC, 6 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37638: [SPARK-33573][SHUFFLE][YARN] Shuffle server side metrics for Push-based shuffle - posted by GitBox <gi...@apache.org> on 2023/01/06 03:49:22 UTC, 3 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39062: [SPARK-41516] [SQL] Allow jdbc dialects to override the query used to create a table - posted by GitBox <gi...@apache.org> on 2023/01/06 03:50:32 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #39384: [SPARK-40307][PYTHON] Introduce Arrow-optimized Python UDFs - posted by GitBox <gi...@apache.org> on 2023/01/06 04:10:54 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39277: [SPARK-41708][SQL] Pull v1write information to `WriteFiles` - posted by GitBox <gi...@apache.org> on 2023/01/06 04:13:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39277: [SPARK-41708][SQL] Pull v1write information to `WriteFiles` - posted by GitBox <gi...@apache.org> on 2023/01/06 04:13:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38358: [SPARK-40588] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2023/01/06 04:18:05 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39416: Revert "[SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for StreamingQueryProgressWrapper" - posted by GitBox <gi...@apache.org> on 2023/01/06 04:29:03 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 opened a new pull request, #39424: Make stage scheduling support local-cluster mode - posted by GitBox <gi...@apache.org> on 2023/01/06 04:29:19 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39415: [SPARK-41895][SS][UI] Add tests for streaming UI with RocksDB backend - posted by GitBox <gi...@apache.org> on 2023/01/06 04:29:55 UTC, 3 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39425: [SPARK-41538][FollowUp][TESTS] Move a metadata column test case to MetadataColumnSuite - posted by GitBox <gi...@apache.org> on 2023/01/06 05:06:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39384: [SPARK-40307][PYTHON] Introduce Arrow-optimized Python UDFs - posted by GitBox <gi...@apache.org> on 2023/01/06 05:19:31 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39394: [SPARK-41575][SQL] Assign name to _LEGACY_ERROR_TEMP_2054 - posted by GitBox <gi...@apache.org> on 2023/01/06 05:27:42 UTC, 4 replies.
- [GitHub] [spark] itholic commented on pull request #39387: [SPARK-41586][PYTHON] Introduce `pyspark.errors` and error classes for PySpark. - posted by GitBox <gi...@apache.org> on 2023/01/06 05:36:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39408: [SPARK-41896][SQL] Filtering by row index returns empty results - posted by GitBox <gi...@apache.org> on 2023/01/06 05:38:14 UTC, 13 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39418: [SPARK-41869][CONNECT] Reject single string in dropDuplicates - posted by GitBox <gi...@apache.org> on 2023/01/06 05:41:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39418: [SPARK-41869][CONNECT] Reject single string in dropDuplicates - posted by GitBox <gi...@apache.org> on 2023/01/06 05:42:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39423: [SPARK-41921][CONNECT][TESTS] Enable doctests in connect.column and connect.functions - posted by GitBox <gi...@apache.org> on 2023/01/06 05:44:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39421: [SPARK-41906][CONNECT][TESTS] Reenable rand test in Spark Connect. - posted by GitBox <gi...@apache.org> on 2023/01/06 05:44:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39423: [SPARK-41921][CONNECT][TESTS] Enable doctests in connect.column and connect.functions - posted by GitBox <gi...@apache.org> on 2023/01/06 05:44:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39421: [SPARK-41906][CONNECT][TESTS] Reenable rand test in Spark Connect. - posted by GitBox <gi...@apache.org> on 2023/01/06 05:44:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39420: [SPARK-41905][CONNECT] Support name as strings in slice - posted by GitBox <gi...@apache.org> on 2023/01/06 05:45:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39419: [SPARK-41840][CONNECT][TESTS] Remove the invalid JIRA in the comment. - posted by GitBox <gi...@apache.org> on 2023/01/06 05:46:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39415: [SPARK-41895][SS][UI] Add tests for streaming UI with RocksDB backend - posted by GitBox <gi...@apache.org> on 2023/01/06 05:50:46 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39377: [SPARK-41867][SQL] Selective predicate should respect InMemoryRelation - posted by GitBox <gi...@apache.org> on 2023/01/06 06:39:59 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39426: [SPARK-41455][CONNECT][PYTHON] Make `DataFrame.collect` discard the timezone info - posted by GitBox <gi...@apache.org> on 2023/01/06 07:17:33 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39388: [SPARK-41354][CONNECT][PYTHON] implement RepartitionByExpression - posted by GitBox <gi...@apache.org> on 2023/01/06 07:18:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39368: [SPARK-28764][CORE][TEST] Remove writePartitionedFile in ExternalSorter - posted by GitBox <gi...@apache.org> on 2023/01/06 08:00:14 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #37638: [SPARK-33573][SHUFFLE][YARN] Shuffle server side metrics for Push-based shuffle - posted by GitBox <gi...@apache.org> on 2023/01/06 08:02:37 UTC, 1 replies.
- [GitHub] [spark] mridulm closed pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side read metrics - posted by GitBox <gi...@apache.org> on 2023/01/06 08:04:48 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39425: [SPARK-41538][FollowUp][TESTS] Move a metadata column test case to MetadataColumnSuite - posted by GitBox <gi...@apache.org> on 2023/01/06 08:15:11 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39425: [SPARK-41538][FollowUp][TESTS] Move a metadata column test case to MetadataColumnSuite - posted by GitBox <gi...@apache.org> on 2023/01/06 08:15:39 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39368: [SPARK-28764][CORE][TEST] Remove writePartitionedFile in ExternalSorter - posted by GitBox <gi...@apache.org> on 2023/01/06 08:26:55 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39427: [SPARK-41922][CONNECT][PYTHON] Implement `DataFrame.semanticHash` - posted by GitBox <gi...@apache.org> on 2023/01/06 09:00:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39377: [SPARK-41867][SQL] Selective predicate should respect InMemoryRelation - posted by GitBox <gi...@apache.org> on 2023/01/06 09:05:55 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #39428: [SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles - posted by GitBox <gi...@apache.org> on 2023/01/06 09:07:10 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39428: [SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles - posted by GitBox <gi...@apache.org> on 2023/01/06 09:07:56 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39428: [SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles - posted by GitBox <gi...@apache.org> on 2023/01/06 09:08:16 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39333: [SPARK-41805][SQL] Reuse expressions in WindowSpecDefinition - posted by GitBox <gi...@apache.org> on 2023/01/06 09:18:52 UTC, 5 replies.
- [GitHub] [spark] beliefer commented on pull request #39422: [SPARK-41875][CONNECT][PYTHON] Add test cases for `Dataset.to()` - posted by GitBox <gi...@apache.org> on 2023/01/06 09:20:29 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39097: Implement code generation for to_csv function (doGenCode) - posted by GitBox <gi...@apache.org> on 2023/01/06 09:24:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39097: Implement code generation for to_csv function (doGenCode) - posted by GitBox <gi...@apache.org> on 2023/01/06 09:25:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by GitBox <gi...@apache.org> on 2023/01/06 09:37:18 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39422: [SPARK-41875][CONNECT][PYTHON] Add test cases for `Dataset.to()` - posted by GitBox <gi...@apache.org> on 2023/01/06 09:43:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39426: [SPARK-41455][CONNECT][PYTHON] Make `DataFrame.collect` discard the timezone info - posted by GitBox <gi...@apache.org> on 2023/01/06 09:44:52 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39429: [SPARK-41874][CONNECT][PYTHON] Implement `DataFrame.sameSemantics` - posted by GitBox <gi...@apache.org> on 2023/01/06 10:04:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39428: [SPARK-41765][SQL] Pull out v1 write metrics to WriteFiles - posted by GitBox <gi...@apache.org> on 2023/01/06 10:24:44 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39384: [SPARK-40307][PYTHON] Introduce Arrow-optimized Python UDFs - posted by GitBox <gi...@apache.org> on 2023/01/06 10:25:27 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39383: [SPARK-41780][SQL] Should throw INVALID_PARAMETER_VALUE when the parameters `regexp` in regexp_replace is invalid - posted by GitBox <gi...@apache.org> on 2023/01/06 10:25:30 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39381: [SPARK-41554] fix changing of Decimal scale when scale decreased by m… - posted by GitBox <gi...@apache.org> on 2023/01/06 10:25:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39430: [SPARK-41923][CONNECT][PYTHON] Add `DataFrame.writeTo` to the unsupported list - posted by GitBox <gi...@apache.org> on 2023/01/06 10:29:02 UTC, 0 replies.
- [GitHub] [spark] roczei commented on pull request #38828: [SPARK-35084][CORE] Spark 3: supporting --packages in k8s cluster mode - posted by GitBox <gi...@apache.org> on 2023/01/06 10:30:24 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39408: [SPARK-41896][SQL] Filtering by row index returns empty results - posted by GitBox <gi...@apache.org> on 2023/01/06 11:08:47 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on a diff in pull request #39097: Implement code generation for to_csv function (doGenCode) - posted by GitBox <gi...@apache.org> on 2023/01/06 11:23:34 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39422: [SPARK-41875][CONNECT][PYTHON] Add test cases for `Dataset.to()` - posted by GitBox <gi...@apache.org> on 2023/01/06 11:24:29 UTC, 2 replies.
- [GitHub] [spark] bogdanghit commented on pull request #39343: [SPARK-41816][SQL] Not close filesystem when log out ThriftServer - posted by GitBox <gi...@apache.org> on 2023/01/06 11:30:38 UTC, 0 replies.
- [GitHub] [spark] codecov-commenter commented on pull request #39368: [SPARK-28764][CORE][TEST] Remove writePartitionedFile in ExternalSorter - posted by GitBox <gi...@apache.org> on 2023/01/06 11:41:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39312: [SPARK-41788][SQL] Move InsertIntoStatement to basicLogicalOperators - posted by GitBox <gi...@apache.org> on 2023/01/06 12:17:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39368: [SPARK-28764][CORE][TEST] Remove writePartitionedFile in ExternalSorter - posted by GitBox <gi...@apache.org> on 2023/01/06 12:19:04 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39430: [SPARK-41923][CONNECT][PYTHON] Add `DataFrame.writeTo` to the unsupported list - posted by GitBox <gi...@apache.org> on 2023/01/06 12:28:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39430: [SPARK-41923][CONNECT][PYTHON] Add `DataFrame.writeTo` to the unsupported list - posted by GitBox <gi...@apache.org> on 2023/01/06 12:28:36 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39431: [SPARK-41914] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2023/01/06 12:32:40 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39431: [SPARK-41914][SQL] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2023/01/06 12:36:05 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39432: [SPARK-41924][CONNECT][PYTHON] Make StructType support metadata and Implement `DataFrame.withMetadata` - posted by GitBox <gi...@apache.org> on 2023/01/06 13:05:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39431: [SPARK-41914][SQL] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2023/01/06 13:57:49 UTC, 4 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #39431: [SPARK-41914][SQL] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2023/01/06 14:11:35 UTC, 3 replies.
- [GitHub] [spark] srowen commented on pull request #39326: [SPARK-41800][BUILD] Upgrade commons-compress to 1.22 - posted by GitBox <gi...@apache.org> on 2023/01/06 14:18:03 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39326: [SPARK-41800][BUILD] Upgrade commons-compress to 1.22 - posted by GitBox <gi...@apache.org> on 2023/01/06 14:18:04 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39399: [SPARK-41890][CORE][SQL][UI] Reduce `toSeq` in `RDDOperationGraphWrapperSerializer`/`SparkPlanGraphWrapperSerializer` for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/06 14:18:41 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39399: [SPARK-41890][CORE][SQL][UI] Reduce `toSeq` in `RDDOperationGraphWrapperSerializer`/`SparkPlanGraphWrapperSerializer` for Scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/06 14:18:48 UTC, 0 replies.
- [GitHub] [spark] olaky commented on a diff in pull request #39408: [SPARK-41896][SQL] Filtering by row index returns empty results - posted by GitBox <gi...@apache.org> on 2023/01/06 14:50:24 UTC, 10 replies.
- [GitHub] [spark] tgravescs commented on pull request #39410: [SPARK-41848][CORE] Fixing task over-scheduled with TaskResourceProfile - posted by GitBox <gi...@apache.org> on 2023/01/06 15:43:40 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #39289: [SPARK-41769][MINOR] Remove useless semicolons - posted by GitBox <gi...@apache.org> on 2023/01/06 16:30:13 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on pull request #39427: [SPARK-41922][CONNECT][PYTHON] Implement `DataFrame.semanticHash` - posted by GitBox <gi...@apache.org> on 2023/01/06 16:33:26 UTC, 1 replies.
- [GitHub] [spark] techaddict commented on pull request #39429: [SPARK-41874][CONNECT][PYTHON] Implement `DataFrame.sameSemantics` - posted by GitBox <gi...@apache.org> on 2023/01/06 16:35:09 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/06 17:56:37 UTC, 2 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39433: [SPARK-41898][CONNECT][PYTHON] Window.rowsBetween, Window.rangeBetween parameters typechecking parity with pyspark - posted by GitBox <gi...@apache.org> on 2023/01/06 18:24:05 UTC, 0 replies.
- [GitHub] [spark] Obbay2 commented on pull request #35709: [SPARK-38389][SQL] Add the `DATEDIFF()` and `DATE_DIFF()` aliases for `TIMESTAMPDIFF()` - posted by GitBox <gi...@apache.org> on 2023/01/06 18:24:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39434: [SPARK-41925][SQL] Enable `spark.sql.orc.enableNestedColumnVectorizedReader` by default - posted by GitBox <gi...@apache.org> on 2023/01/06 19:11:42 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39415: [SPARK-41895][SS][UI] Add tests for streaming UI with RocksDB backend - posted by GitBox <gi...@apache.org> on 2023/01/06 19:17:03 UTC, 0 replies.
- [GitHub] [spark] tedyu closed pull request #39395: [SQL] Use foldLeft for DeduplicateRelations - posted by GitBox <gi...@apache.org> on 2023/01/06 20:48:54 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 commented on a diff in pull request #38959: SPARK-41415: SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/06 21:49:29 UTC, 11 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39434: [SPARK-41925][SQL] Enable `spark.sql.orc.enableNestedColumnVectorizedReader` by default - posted by GitBox <gi...@apache.org> on 2023/01/06 22:16:27 UTC, 2 replies.
- [GitHub] [spark] otterc commented on a diff in pull request #38959: SPARK-41415: SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/06 22:21:32 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39434: [SPARK-41925][SQL] Enable `spark.sql.orc.enableNestedColumnVectorizedReader` by default - posted by GitBox <gi...@apache.org> on 2023/01/06 22:26:10 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38007: [SPARK-40566][SQL] Add showIndex function - posted by GitBox <gi...@apache.org> on 2023/01/07 00:18:30 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38002: [WIP][Do not merge] Spark 40465 refactor - posted by GitBox <gi...@apache.org> on 2023/01/07 00:18:32 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #35764: [SPARK-38444][SQL]Automatically calculate the upper and lower bounds of partitions when no specified partition related params - posted by GitBox <gi...@apache.org> on 2023/01/07 00:18:33 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39432: [SPARK-41924][CONNECT][PYTHON] Make StructType support metadata and Implement `DataFrame.withMetadata` - posted by GitBox <gi...@apache.org> on 2023/01/07 01:10:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39432: [SPARK-41924][CONNECT][PYTHON] Make StructType support metadata and Implement `DataFrame.withMetadata` - posted by GitBox <gi...@apache.org> on 2023/01/07 01:10:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39433: [SPARK-41898][CONNECT][PYTHON] Window.rowsBetween, Window.rangeBetween parameters typechecking parity with pyspark - posted by GitBox <gi...@apache.org> on 2023/01/07 01:18:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39433: [SPARK-41898][CONNECT][PYTHON] Window.rowsBetween, Window.rangeBetween parameters typechecking parity with pyspark - posted by GitBox <gi...@apache.org> on 2023/01/07 01:18:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39422: [SPARK-41875][CONNECT][PYTHON] Add test cases for `Dataset.to()` - posted by GitBox <gi...@apache.org> on 2023/01/07 01:23:26 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39429: [SPARK-41874][CONNECT][PYTHON] Implement `DataFrame.sameSemantics` - posted by GitBox <gi...@apache.org> on 2023/01/07 01:33:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39427: [SPARK-41922][CONNECT][PYTHON] Implement `DataFrame.semanticHash` - posted by GitBox <gi...@apache.org> on 2023/01/07 01:37:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39422: [SPARK-41875][CONNECT][PYTHON] Add test cases for `Dataset.to()` - posted by GitBox <gi...@apache.org> on 2023/01/07 01:44:09 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39422: [SPARK-41875][CONNECT][PYTHON] Add test cases for `Dataset.to()` - posted by GitBox <gi...@apache.org> on 2023/01/07 01:44:33 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39435: [SPARK-41926][UI][TESTS] add GA Job - posted by GitBox <gi...@apache.org> on 2023/01/07 02:05:28 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39436: [SPARK-41824][CONNECT][PYTHON] Ingore the doctest for explain of connect - posted by GitBox <gi...@apache.org> on 2023/01/07 02:16:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39427: [SPARK-41922][CONNECT][PYTHON] Implement `DataFrame.semanticHash` - posted by GitBox <gi...@apache.org> on 2023/01/07 02:18:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39437: [SPARK-41927][CONNECT][PYTHON] Add the unsupported list for `GroupedData` - posted by GitBox <gi...@apache.org> on 2023/01/07 02:19:22 UTC, 0 replies.
- [GitHub] [spark] techaddict closed pull request #39427: [SPARK-41922][CONNECT][PYTHON] Implement `DataFrame.semanticHash` - posted by GitBox <gi...@apache.org> on 2023/01/07 02:21:12 UTC, 0 replies.
- [GitHub] [spark] techaddict closed pull request #39429: [SPARK-41874][CONNECT][PYTHON] Implement `DataFrame.sameSemantics` - posted by GitBox <gi...@apache.org> on 2023/01/07 02:23:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39438: [SPARK-41928][CONNECT][PYTHON] Add the unsupported list for `functions` - posted by GitBox <gi...@apache.org> on 2023/01/07 02:29:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39439: [SPARK-41929][CONNECT][PYTHON] Add function `array_compact` - posted by GitBox <gi...@apache.org> on 2023/01/07 02:37:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39440: [SPARK-41930][INFRA] Remove `branch-3.1` from publish_snapshot GitHub Action job - posted by GitBox <gi...@apache.org> on 2023/01/07 03:32:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39440: [SPARK-41930][INFRA] Remove `branch-3.1` from publish_snapshot GitHub Action job - posted by GitBox <gi...@apache.org> on 2023/01/07 03:39:30 UTC, 1 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #39333: [SPARK-41805][SQL] Reuse expressions in WindowSpecDefinition - posted by GitBox <gi...@apache.org> on 2023/01/07 03:51:37 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39439: [SPARK-41929][CONNECT][PYTHON] Add function `array_compact` - posted by GitBox <gi...@apache.org> on 2023/01/07 04:49:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39440: [SPARK-41930][INFRA] Remove `branch-3.1` from publish_snapshot GitHub Action job - posted by GitBox <gi...@apache.org> on 2023/01/07 04:50:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39439: [SPARK-41929][CONNECT][PYTHON] Add function `array_compact` - posted by GitBox <gi...@apache.org> on 2023/01/07 04:50:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39438: [SPARK-41928][CONNECT][PYTHON] Add the unsupported list for `functions` - posted by GitBox <gi...@apache.org> on 2023/01/07 04:50:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39438: [SPARK-41928][CONNECT][PYTHON] Add the unsupported list for `functions` - posted by GitBox <gi...@apache.org> on 2023/01/07 04:51:24 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39437: [SPARK-41927][CONNECT][PYTHON] Add the unsupported list for `GroupedData` - posted by GitBox <gi...@apache.org> on 2023/01/07 04:51:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39436: [SPARK-41824][CONNECT][PYTHON] Ingore the doctest for explain of connect - posted by GitBox <gi...@apache.org> on 2023/01/07 04:51:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39437: [SPARK-41927][CONNECT][PYTHON] Add the unsupported list for `GroupedData` - posted by GitBox <gi...@apache.org> on 2023/01/07 04:52:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39436: [SPARK-41824][CONNECT][PYTHON] Ingore the doctest for explain of connect - posted by GitBox <gi...@apache.org> on 2023/01/07 04:52:39 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2023/01/07 05:31:17 UTC, 31 replies.
- [GitHub] [spark] mridulm commented on pull request #38959: SPARK-41415: SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/07 05:47:55 UTC, 7 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39436: [SPARK-41824][CONNECT][PYTHON] Ingore the doctest for explain of connect - posted by GitBox <gi...@apache.org> on 2023/01/07 05:56:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39441: [WIP][SPARK-41933][CONNECT] Provide local mode that automatically starts the server - posted by GitBox <gi...@apache.org> on 2023/01/07 06:28:00 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39442: [SPARK-41934][CONNECT][PYTHON] Add the unsupported function list for `session` - posted by GitBox <gi...@apache.org> on 2023/01/07 06:29:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39443: [SPARK-41935][INFRA] Skip snapshot check and transfer progress log during publishing snapshots - posted by GitBox <gi...@apache.org> on 2023/01/07 06:57:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39443: [SPARK-41935][INFRA] Skip snapshot check and transfer progress log during publishing snapshots - posted by GitBox <gi...@apache.org> on 2023/01/07 07:13:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39441: [WIP][SPARK-41933][CONNECT] Provide local mode that automatically starts the server - posted by GitBox <gi...@apache.org> on 2023/01/07 07:18:18 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39282: [SPARK-41581][SQL] Assign name to _LEGACY_ERROR_TEMP_1230 - posted by GitBox <gi...@apache.org> on 2023/01/07 08:58:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39444: [SPARK-41880][CONNECT][PYTHON] Make function `from_json` accept non-literal schema - posted by GitBox <gi...@apache.org> on 2023/01/07 09:24:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39445: [SPARK-41936][CONNECT][PYTHON] Make `withMetadata` reuse the `withColumns` proto - posted by GitBox <gi...@apache.org> on 2023/01/07 09:54:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39402: [SPARK-41889][SQL] Attach root cause to invalidPatternError & refactor error classes INVALID_PARAMETER_VALUE - posted by GitBox <gi...@apache.org> on 2023/01/07 10:47:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39441: [SPARK-41933][CONNECT] Provide local mode that automatically starts the server - posted by GitBox <gi...@apache.org> on 2023/01/07 10:51:50 UTC, 7 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39441: [SPARK-41933][CONNECT] Provide local mode that automatically starts the server - posted by GitBox <gi...@apache.org> on 2023/01/07 10:58:19 UTC, 2 replies.
- [GitHub] [spark] beliefer commented on pull request #39436: [SPARK-41824][CONNECT][PYTHON] Ingore the doctest for explain of connect - posted by GitBox <gi...@apache.org> on 2023/01/07 12:09:20 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #39402: [SPARK-41889][SQL] Attach root cause to invalidPatternError & refactor error classes INVALID_PARAMETER_VALUE - posted by GitBox <gi...@apache.org> on 2023/01/07 12:16:13 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39446: [MINOR][SQL] Remove unnecessary method in QueryCompilationErrors & QueryExecutionErrors and related error classes - posted by GitBox <gi...@apache.org> on 2023/01/07 12:27:04 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #39446: [MINOR][SQL] Remove unnecessary method & related error classes in QueryCompilationErrors & QueryExecutionErrors - posted by GitBox <gi...@apache.org> on 2023/01/07 12:29:45 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39446: [MINOR][SQL] Remove unnecessary method & related error classes in QueryCompilationErrors & QueryExecutionErrors - posted by GitBox <gi...@apache.org> on 2023/01/07 13:27:20 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng opened a new pull request, #39447: [SPARK-37303][SQL][TEST] ALTER TABLE [TABLE_NAME] REPLACE COLUMNS works correctly for v2 tables - posted by GitBox <gi...@apache.org> on 2023/01/07 15:00:43 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #39447: [SPARK-37303][SQL][TEST] ALTER TABLE [TABLE_NAME] REPLACE COLUMNS works correctly for v2 tables - posted by GitBox <gi...@apache.org> on 2023/01/07 15:30:26 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng closed pull request #39447: [SPARK-37303][SQL][TEST] ALTER TABLE [TABLE_NAME] REPLACE COLUMNS works correctly for v2 tables - posted by GitBox <gi...@apache.org> on 2023/01/07 15:51:37 UTC, 0 replies.
- [GitHub] [spark] adrian-wang commented on pull request #39343: [SPARK-41816][SQL] Not close filesystem when log out ThriftServer - posted by GitBox <gi...@apache.org> on 2023/01/07 17:27:24 UTC, 0 replies.
- [GitHub] [spark] adrian-wang closed pull request #39343: [SPARK-41816][SQL] Not close filesystem when log out ThriftServer - posted by GitBox <gi...@apache.org> on 2023/01/07 17:27:24 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng opened a new pull request, #39448: [CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/07 18:29:19 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on a diff in pull request #39448: [CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/07 18:30:50 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #39441: [SPARK-41933][CONNECT] Provide local mode that automatically starts the server - posted by GitBox <gi...@apache.org> on 2023/01/07 19:04:19 UTC, 2 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #39449: [SPARK-40688][SQL] Support data masking built-in function 'mask_first_n' - posted by GitBox <gi...@apache.org> on 2023/01/07 21:34:45 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39450: [SPARK-41897][CONNECT][TESTS] Enable tests with error mismatch in connect/test_parity_functions.py - posted by GitBox <gi...@apache.org> on 2023/01/07 23:09:01 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39431: [SPARK-41914][SQL] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2023/01/07 23:48:36 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38007: [SPARK-40566][SQL] Add showIndex function - posted by GitBox <gi...@apache.org> on 2023/01/08 00:20:59 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36588: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2023/01/08 00:21:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39450: [SPARK-41897][CONNECT][TESTS] Enable tests with error mismatch in connect/test_parity_functions.py - posted by GitBox <gi...@apache.org> on 2023/01/08 00:52:50 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39387: [SPARK-41586][PYTHON] Introduce `pyspark.errors` and error classes for PySpark. - posted by GitBox <gi...@apache.org> on 2023/01/08 01:12:37 UTC, 18 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39444: [SPARK-41880][CONNECT][PYTHON] Make function `from_json` accept non-literal schema - posted by GitBox <gi...@apache.org> on 2023/01/08 01:16:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39444: [SPARK-41880][CONNECT][PYTHON] Make function `from_json` accept non-literal schema - posted by GitBox <gi...@apache.org> on 2023/01/08 01:16:50 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39445: [SPARK-41936][CONNECT][PYTHON] Make `withMetadata` reuse the `withColumns` proto - posted by GitBox <gi...@apache.org> on 2023/01/08 01:17:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39445: [SPARK-41936][CONNECT][PYTHON] Make `withMetadata` reuse the `withColumns` proto - posted by GitBox <gi...@apache.org> on 2023/01/08 01:18:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39446: [MINOR][SQL] Remove unnecessary method & related error classes in QueryCompilationErrors & QueryExecutionErrors - posted by GitBox <gi...@apache.org> on 2023/01/08 01:25:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39446: [MINOR][SQL] Remove unnecessary method & related error classes in QueryCompilationErrors & QueryExecutionErrors - posted by GitBox <gi...@apache.org> on 2023/01/08 01:25:32 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39451: [SPARK-41832][CONNECT][PYTHON] Fix `DataFrame.unionByName`, add allow_missing_columns - posted by GitBox <gi...@apache.org> on 2023/01/08 01:45:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39452: [SPARK-41899][CONNECT][PYTHON] `createDataFrame` should respect user provided DDL schema - posted by GitBox <gi...@apache.org> on 2023/01/08 02:27:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39442: [SPARK-41934][CONNECT][PYTHON] Add the unsupported function list for `session` - posted by GitBox <gi...@apache.org> on 2023/01/08 02:29:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39442: [SPARK-41934][CONNECT][PYTHON] Add the unsupported function list for `session` - posted by GitBox <gi...@apache.org> on 2023/01/08 02:29:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39451: [SPARK-41832][CONNECT][PYTHON] Fix `DataFrame.unionByName`, add allow_missing_columns - posted by GitBox <gi...@apache.org> on 2023/01/08 02:49:27 UTC, 2 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39453: [SPARK-41938][BUILD] Upgrade sbt to 1.8.2 - posted by GitBox <gi...@apache.org> on 2023/01/08 03:37:17 UTC, 0 replies.
- [GitHub] [spark] atalv opened a new pull request, #39454: [SPARK-41937][R] fix error in R (>= 4.2.0) for SparkR datetime column comparing with Sys.time() - posted by GitBox <gi...@apache.org> on 2023/01/08 03:37:22 UTC, 0 replies.
- [GitHub] [spark] amaliujia commented on a diff in pull request #39451: [SPARK-41832][CONNECT][PYTHON] Fix `DataFrame.unionByName`, add allow_missing_columns - posted by GitBox <gi...@apache.org> on 2023/01/08 03:37:47 UTC, 3 replies.
- [GitHub] [spark] amaliujia commented on pull request #39451: [SPARK-41832][CONNECT][PYTHON] Fix `DataFrame.unionByName`, add allow_missing_columns - posted by GitBox <gi...@apache.org> on 2023/01/08 03:42:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39441: [SPARK-41933][CONNECT] Provide local mode that automatically starts the server - posted by GitBox <gi...@apache.org> on 2023/01/08 03:48:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39454: [SPARK-41937][R] fix error in R (>= 4.2.0) for SparkR datetime column comparing with Sys.time() - posted by GitBox <gi...@apache.org> on 2023/01/08 04:55:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39455: [SPARK-41939][CONNECT][PYTHON] Add the unsupported list for `catalog` functions - posted by GitBox <gi...@apache.org> on 2023/01/08 05:01:19 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39452: [SPARK-41899][CONNECT][PYTHON] `createDataFrame` should respect user provided DDL schema - posted by GitBox <gi...@apache.org> on 2023/01/08 05:02:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39452: [SPARK-41899][CONNECT][PYTHON] `createDataFrame` should respect user provided DDL schema - posted by GitBox <gi...@apache.org> on 2023/01/08 05:03:20 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39448: [CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/08 05:08:50 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39456: [SPARK-41904][CONNECT][PYTHON] Fix Function `nth_value` functions output - posted by GitBox <gi...@apache.org> on 2023/01/08 05:25:13 UTC, 0 replies.
- [GitHub] [spark] atalv commented on pull request #39454: [SPARK-41937][R] fix error in R (>= 4.2.0) for SparkR datetime column comparing with Sys.time() - posted by GitBox <gi...@apache.org> on 2023/01/08 05:33:15 UTC, 0 replies.
- [GitHub] [spark] madwed-stripe commented on pull request #37551: [SPARK-38591][SQL] Add sortWithinGroups to KeyValueGroupedDataset - posted by GitBox <gi...@apache.org> on 2023/01/08 05:40:53 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39280: [SPARK-41766][CORE] Handle decommission request sent before executor registration - posted by GitBox <gi...@apache.org> on 2023/01/08 05:49:10 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39402: [SPARK-41889][SQL] Attach root cause to invalidPatternError & refactor error classes INVALID_PARAMETER_VALUE - posted by GitBox <gi...@apache.org> on 2023/01/08 07:30:15 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39402: [SPARK-41889][SQL] Attach root cause to invalidPatternError & refactor error classes INVALID_PARAMETER_VALUE - posted by GitBox <gi...@apache.org> on 2023/01/08 07:36:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39441: [SPARK-41933][CONNECT] Provide local mode that automatically starts the server - posted by GitBox <gi...@apache.org> on 2023/01/08 07:49:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39455: [SPARK-41939][CONNECT][PYTHON] Add the unsupported list for `catalog` functions - posted by GitBox <gi...@apache.org> on 2023/01/08 07:51:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39455: [SPARK-41939][CONNECT][PYTHON] Add the unsupported list for `catalog` functions - posted by GitBox <gi...@apache.org> on 2023/01/08 07:51:39 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on a diff in pull request #39451: [SPARK-41832][CONNECT][PYTHON] Fix `DataFrame.unionByName`, add allow_missing_columns - posted by GitBox <gi...@apache.org> on 2023/01/08 08:24:41 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on a diff in pull request #39450: [SPARK-41897][CONNECT][TESTS] Enable tests with error mismatch in connect/test_parity_functions.py - posted by GitBox <gi...@apache.org> on 2023/01/08 08:25:56 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #39457: [SPARK-41940][SQL] Infer IsNotNull constraints for complex join expressions - posted by GitBox <gi...@apache.org> on 2023/01/08 09:05:30 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39332: [WIP][SPARK-40822][SQL] Stable derived column aliases - posted by GitBox <gi...@apache.org> on 2023/01/08 11:51:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39406: [SPARK-41894][SS][TESTS] Restore the write permission of `commitDir` after run `testAsyncWriteErrorsPermissionsIssue` - posted by GitBox <gi...@apache.org> on 2023/01/08 13:10:10 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39458: [SPARK-41941][BUILD] Upgrade `scalatest` related test dependencies to 3.2.15 - posted by GitBox <gi...@apache.org> on 2023/01/08 13:20:22 UTC, 0 replies.
- [GitHub] [spark] ivoson commented on a diff in pull request #39410: [SPARK-41848][CORE] Fixing task over-scheduled with TaskResourceProfile - posted by GitBox <gi...@apache.org> on 2023/01/08 14:55:34 UTC, 5 replies.
- [GitHub] [spark] ivoson opened a new pull request, #39459: [WIP][SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache. - posted by GitBox <gi...@apache.org> on 2023/01/08 16:29:37 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #39448: [CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/08 17:02:12 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #39457: [WIP][SPARK-41940][SQL] Infer IsNotNull constraints for complex join expressions - posted by GitBox <gi...@apache.org> on 2023/01/08 17:34:17 UTC, 0 replies.
- [GitHub] [spark] wankunde closed pull request #39457: [WIP][SPARK-41940][SQL] Infer IsNotNull constraints for complex join expressions - posted by GitBox <gi...@apache.org> on 2023/01/08 17:34:32 UTC, 0 replies.
- [GitHub] [spark] warrenzhu25 commented on a diff in pull request #39280: [SPARK-41766][CORE] Handle decommission request sent before executor registration - posted by GitBox <gi...@apache.org> on 2023/01/08 19:36:03 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #38879: [SPARK-41362][CONNECT][PYTHON] Better error messages for invalid argument types. - posted by GitBox <gi...@apache.org> on 2023/01/08 22:04:04 UTC, 0 replies.
- [GitHub] [spark] grundprinzip closed pull request #38879: [SPARK-41362][CONNECT][PYTHON] Better error messages for invalid argument types. - posted by GitBox <gi...@apache.org> on 2023/01/08 22:04:04 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #39091: [SPARK-41527][CONNECT][PYTHON] Implement `DataFrame.observe` - posted by GitBox <gi...@apache.org> on 2023/01/08 22:06:06 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/08 22:09:16 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39456: [SPARK-41904][CONNECT][PYTHON] Fix Function `nth_value` functions output - posted by GitBox <gi...@apache.org> on 2023/01/08 22:12:14 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37759: [SPARK-40306][SQL]Support more than Integer.MAX_VALUE of the same join key - posted by GitBox <gi...@apache.org> on 2023/01/09 00:19:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #36850: [SPARK-39069][SQL] Enhance ConstantPropagation to replace constants in inequality predicates - posted by GitBox <gi...@apache.org> on 2023/01/09 00:19:55 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36588: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2023/01/09 00:19:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39454: [SPARK-41937][R] Fix error in R (>= 4.2.0) for SparkR datetime column comparing with Sys.time() - posted by GitBox <gi...@apache.org> on 2023/01/09 00:40:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39454: [SPARK-41937][R] Fix error in R (>= 4.2.0) for SparkR datetime column comparing with Sys.time() - posted by GitBox <gi...@apache.org> on 2023/01/09 00:40:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39453: [SPARK-41938][BUILD] Upgrade sbt to 1.8.2 - posted by GitBox <gi...@apache.org> on 2023/01/09 00:41:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39453: [SPARK-41938][BUILD] Upgrade sbt to 1.8.2 - posted by GitBox <gi...@apache.org> on 2023/01/09 00:42:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39456: [SPARK-41904][CONNECT][PYTHON] Fix Function `nth_value` functions output - posted by GitBox <gi...@apache.org> on 2023/01/09 00:43:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39448: [SPARK-41943][CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/09 00:45:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39368: [SPARK-28764][CORE][TEST] Remove writePartitionedFile in ExternalSorter - posted by GitBox <gi...@apache.org> on 2023/01/09 00:50:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39368: [SPARK-28764][CORE][TEST] Remove writePartitionedFile in ExternalSorter - posted by GitBox <gi...@apache.org> on 2023/01/09 00:51:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39458: [SPARK-41941][BUILD] Upgrade `scalatest` related test dependencies to 3.2.15 - posted by GitBox <gi...@apache.org> on 2023/01/09 01:54:30 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39431: [SPARK-41914][SQL] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2023/01/09 02:04:14 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39394: [SPARK-41575][SQL] Assign name to _LEGACY_ERROR_TEMP_2054 - posted by GitBox <gi...@apache.org> on 2023/01/09 02:15:36 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39389: [SPARK-41574][SQL] Update `_LEGACY_ERROR_TEMP_2009` as `INTERNAL_ERROR`. - posted by GitBox <gi...@apache.org> on 2023/01/09 02:20:34 UTC, 1 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39406: [SPARK-41894][SS][TESTS] Restore the write permission of `commitDir` after run `testAsyncWriteErrorsPermissionsIssue` - posted by GitBox <gi...@apache.org> on 2023/01/09 02:21:44 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #39406: [SPARK-41894][SS][TESTS] Restore the write permission of `commitDir` after run `testAsyncWriteErrorsPermissionsIssue` - posted by GitBox <gi...@apache.org> on 2023/01/09 02:22:33 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39282: [SPARK-41581][SQL] Update `_LEGACY_ERROR_TEMP_1230` as `INTERNAL_ERROR` - posted by GitBox <gi...@apache.org> on 2023/01/09 02:23:13 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #39457: [WIP][SPARK-41940][SQL] Infer IsNotNull constraints for complex join expressions - posted by GitBox <gi...@apache.org> on 2023/01/09 02:55:52 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #39460: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2023/01/09 03:20:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #35709: [SPARK-38389][SQL] Add the `DATEDIFF()` and `DATE_DIFF()` aliases for `TIMESTAMPDIFF()` - posted by GitBox <gi...@apache.org> on 2023/01/09 03:21:15 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on a diff in pull request #39448: [SPARK-41943][CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/09 03:49:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #38163: [SPARK-40711][SQL] Add spill size metrics for window - posted by GitBox <gi...@apache.org> on 2023/01/09 04:44:38 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39458: [SPARK-41941][BUILD] Upgrade `scalatest` related test dependencies to 3.2.15 - posted by GitBox <gi...@apache.org> on 2023/01/09 05:21:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39458: [SPARK-41941][BUILD] Upgrade `scalatest` related test dependencies to 3.2.15 - posted by GitBox <gi...@apache.org> on 2023/01/09 05:22:18 UTC, 0 replies.
- [GitHub] [spark] zhouyejoe commented on pull request #39448: [SPARK-41943][CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/09 05:28:58 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39333: [SPARK-41805][SQL] Reuse expressions in WindowSpecDefinition - posted by GitBox <gi...@apache.org> on 2023/01/09 05:34:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39333: [SPARK-41805][SQL] Reuse expressions in WindowSpecDefinition - posted by GitBox <gi...@apache.org> on 2023/01/09 05:34:13 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39461: [SPARK-41945][CONNECT][PYTHON] Python: connect client lost column data with pyarrow.Table.to_pylist - posted by GitBox <gi...@apache.org> on 2023/01/09 05:42:59 UTC, 0 replies.
- [GitHub] [spark] dengziming commented on pull request #39388: [SPARK-41354][CONNECT][PYTHON] Implement RepartitionByExpression - posted by GitBox <gi...@apache.org> on 2023/01/09 06:22:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39462: [SPARK-41879][CONNECT][PYTHON] Make `DataFrame.collect` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/09 06:29:40 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #39448: [SPARK-41943][CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/09 06:33:45 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39463: [SPARK-41944][CONNECT] Pass configurations when local remote mode is on - posted by GitBox <gi...@apache.org> on 2023/01/09 06:34:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39463: [SPARK-41944][CONNECT] Pass configurations when local remote mode is on - posted by GitBox <gi...@apache.org> on 2023/01/09 06:34:20 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39456: [SPARK-41904][CONNECT][PYTHON] Fix Function `nth_value` functions output - posted by GitBox <gi...@apache.org> on 2023/01/09 06:43:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39410: [SPARK-41848][CORE] Fixing task over-scheduled with TaskResourceProfile - posted by GitBox <gi...@apache.org> on 2023/01/09 06:59:53 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39388: [SPARK-41354][CONNECT][PYTHON] Implement RepartitionByExpression - posted by GitBox <gi...@apache.org> on 2023/01/09 07:09:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39388: [SPARK-41354][CONNECT][PYTHON] Implement RepartitionByExpression - posted by GitBox <gi...@apache.org> on 2023/01/09 07:10:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39461: [WIP][SPARK-41945][CONNECT][PYTHON] Python: connect client lost column data with pyarrow.Table.to_pylist - posted by GitBox <gi...@apache.org> on 2023/01/09 07:18:58 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39461: [WIP][SPARK-41945][CONNECT][PYTHON] Python: connect client lost column data with pyarrow.Table.to_pylist - posted by GitBox <gi...@apache.org> on 2023/01/09 07:20:35 UTC, 7 replies.
- [GitHub] [spark] itholic opened a new pull request, #39464: [SPARK-41947][CORE][DOCS] Update the contents of error class guidelines - posted by GitBox <gi...@apache.org> on 2023/01/09 07:27:28 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39282: [SPARK-41581][SQL] Update `_LEGACY_ERROR_TEMP_1230` as `INTERNAL_ERROR` - posted by GitBox <gi...@apache.org> on 2023/01/09 07:41:21 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39282: [SPARK-41581][SQL] Update `_LEGACY_ERROR_TEMP_1230` as `INTERNAL_ERROR` - posted by GitBox <gi...@apache.org> on 2023/01/09 07:41:59 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39464: [SPARK-41947][CORE][DOCS] Update the contents of error class guidelines - posted by GitBox <gi...@apache.org> on 2023/01/09 07:46:07 UTC, 0 replies.
- [GitHub] [spark] WangGuangxin commented on a diff in pull request #38877: [SPARK-41361] [SQL] Invalid call toAttribute on unresolved object exception caused by WidenSetOperationTypes - posted by GitBox <gi...@apache.org> on 2023/01/09 07:48:07 UTC, 1 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #39459: [WIP][SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by GitBox <gi...@apache.org> on 2023/01/09 08:03:43 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39462: [SPARK-41879][CONNECT][PYTHON] Make `DataFrame.collect` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/09 08:06:51 UTC, 8 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39461: [WIP][SPARK-41945][CONNECT][PYTHON] Python: connect client lost column data with pyarrow.Table.to_pylist - posted by GitBox <gi...@apache.org> on 2023/01/09 08:11:29 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39383: [SPARK-41780][SQL] Should throw INVALID_PARAMETER_VALUE.PATTERN when the parameters `regexp` is invalid - posted by GitBox <gi...@apache.org> on 2023/01/09 08:37:26 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39383: [SPARK-41780][SQL] Should throw INVALID_PARAMETER_VALUE.PATTERN when the parameters `regexp` is invalid - posted by GitBox <gi...@apache.org> on 2023/01/09 08:38:05 UTC, 0 replies.
- [GitHub] [spark] dengziming opened a new pull request, #39465: [MINOR] Fix wrong file name - posted by GitBox <gi...@apache.org> on 2023/01/09 08:38:29 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39466: [SPARK-41948][SQL] Fix NPE for error classes: CANNOT_PARSE_JSON_FIELD - posted by GitBox <gi...@apache.org> on 2023/01/09 08:40:34 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39461: [WIP][SPARK-41945][CONNECT][PYTHON] Python: connect client lost column data with pyarrow.Table.to_pylist - posted by GitBox <gi...@apache.org> on 2023/01/09 09:00:05 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39456: [SPARK-41904][CONNECT][PYTHON] Fix Function `nth_value` functions output - posted by GitBox <gi...@apache.org> on 2023/01/09 09:01:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39461: [SPARK-41945][CONNECT][PYTHON] Python: connect client lost column data with pyarrow.Table.to_pylist - posted by GitBox <gi...@apache.org> on 2023/01/09 09:02:29 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39461: [SPARK-41945][CONNECT][PYTHON] Python: connect client lost column data with pyarrow.Table.to_pylist - posted by GitBox <gi...@apache.org> on 2023/01/09 09:02:31 UTC, 1 replies.
- [GitHub] [spark] beliefer closed pull request #39456: [SPARK-41904][CONNECT][PYTHON] Fix Function `nth_value` functions output - posted by GitBox <gi...@apache.org> on 2023/01/09 09:14:54 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #39466: [SPARK-41948][SQL] Fix NPE for error classes: CANNOT_PARSE_JSON_FIELD - posted by GitBox <gi...@apache.org> on 2023/01/09 09:23:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39453: [SPARK-41938][BUILD] Upgrade sbt to 1.8.2 - posted by GitBox <gi...@apache.org> on 2023/01/09 09:45:16 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39461: [SPARK-41945][CONNECT][PYTHON] Python: connect client lost column data with pyarrow.Table.to_pylist - posted by GitBox <gi...@apache.org> on 2023/01/09 09:48:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39465: [MINOR] Fix wrong file name - posted by GitBox <gi...@apache.org> on 2023/01/09 10:16:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39465: [MINOR] Fix wrong file name - posted by GitBox <gi...@apache.org> on 2023/01/09 10:17:18 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39464: [SPARK-41947][CORE][DOCS] Update the contents of error class guidelines - posted by GitBox <gi...@apache.org> on 2023/01/09 12:02:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39461: [SPARK-41945][CONNECT][PYTHON] Python: connect client lost column data with pyarrow.Table.to_pylist - posted by GitBox <gi...@apache.org> on 2023/01/09 12:11:10 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39461: [SPARK-41945][CONNECT][PYTHON] Python: connect client lost column data with pyarrow.Table.to_pylist - posted by GitBox <gi...@apache.org> on 2023/01/09 12:12:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39463: [SPARK-41944][CONNECT] Pass configurations when local remote mode is on - posted by GitBox <gi...@apache.org> on 2023/01/09 12:42:41 UTC, 0 replies.
- [GitHub] [spark] Kimahriman commented on pull request #39448: [SPARK-41943][CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/09 12:48:36 UTC, 0 replies.
- [GitHub] [spark] codecov-commenter commented on pull request #39466: [SPARK-41948][SQL] Fix NPE for error classes: CANNOT_PARSE_JSON_FIELD - posted by GitBox <gi...@apache.org> on 2023/01/09 12:49:16 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on pull request #39424: [SPARK-41949][CORE][PYTHON] Make stage scheduling support local-cluster mode - posted by GitBox <gi...@apache.org> on 2023/01/09 12:54:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39467: [SPARK-41950][PS][BUILD] Pin mlflow to 2.0.X - posted by GitBox <gi...@apache.org> on 2023/01/09 13:05:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39467: [SPARK-41950][PS][BUILD] Pin mlflow to 2.0.X - posted by GitBox <gi...@apache.org> on 2023/01/09 13:06:09 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39468: [SPARK-41708][SQL][FOLLOWUP] WriteFiles should replace exprId using new query - posted by GitBox <gi...@apache.org> on 2023/01/09 13:21:18 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2023/01/09 13:29:14 UTC, 0 replies.
- [GitHub] [spark] ulysses-you closed pull request #39468: [SPARK-41708][SQL][FOLLOWUP] WriteFiles should replace exprId using new query - posted by GitBox <gi...@apache.org> on 2023/01/09 13:45:43 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #39468: [SPARK-41708][SQL][FOLLOWUP] WriteFiles should replace exprId using new query - posted by GitBox <gi...@apache.org> on 2023/01/09 13:45:46 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39468: [SPARK-41708][SQL][FOLLOWUP] WriteFiles should replace exprId using new query - posted by GitBox <gi...@apache.org> on 2023/01/09 13:47:36 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39468: [SPARK-41708][SQL][FOLLOWUP] WriteFiles should replace exprId using new query - posted by GitBox <gi...@apache.org> on 2023/01/09 13:52:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39332: [WIP][SPARK-40822][SQL] Stable derived column aliases - posted by GitBox <gi...@apache.org> on 2023/01/09 15:21:16 UTC, 5 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39460: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2023/01/09 15:25:36 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39459: [WIP][SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by GitBox <gi...@apache.org> on 2023/01/09 15:25:40 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #35709: [SPARK-38389][SQL] Add the `DATEDIFF()` and `DATE_DIFF()` aliases for `TIMESTAMPDIFF()` - posted by GitBox <gi...@apache.org> on 2023/01/09 15:36:25 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/09 15:42:07 UTC, 4 replies.
- [GitHub] [spark] techaddict commented on pull request #39451: [SPARK-41832][CONNECT][PYTHON] Fix `DataFrame.unionByName`, add allow_missing_columns - posted by GitBox <gi...@apache.org> on 2023/01/09 15:44:38 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #39469: [SPARK-XXXX][CONNECT][PYTHON] Truncating nanoseconds timestampsl - posted by GitBox <gi...@apache.org> on 2023/01/09 16:23:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39435: [SPARK-41926][UI][TESTS] Add Github action test job with RocksDB as UI backend - posted by GitBox <gi...@apache.org> on 2023/01/09 16:27:32 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39366: [SPARK-41860][SQL] Make AvroScanBuilder and JsonScanBuilder case classes - posted by GitBox <gi...@apache.org> on 2023/01/09 17:53:37 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39332: [WIP][SPARK-40822][SQL] Stable derived column aliases - posted by GitBox <gi...@apache.org> on 2023/01/09 19:09:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39366: [SPARK-41860][SQL] Make AvroScanBuilder and JsonScanBuilder case classes - posted by GitBox <gi...@apache.org> on 2023/01/09 19:26:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39467: [SPARK-41950][PS][BUILD] Pin mlflow to 2.0.X - posted by GitBox <gi...@apache.org> on 2023/01/09 19:27:33 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39460: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2023/01/09 19:58:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39460: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2023/01/09 20:01:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39410: [SPARK-41848][CORE] Fixing task over-scheduled with TaskResourceProfile - posted by GitBox <gi...@apache.org> on 2023/01/09 20:03:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39410: [SPARK-41848][CORE] Fixing task over-scheduled with TaskResourceProfile - posted by GitBox <gi...@apache.org> on 2023/01/09 20:04:40 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39464: [SPARK-41947][CORE][DOCS] Update the contents of error class guidelines - posted by GitBox <gi...@apache.org> on 2023/01/09 20:23:38 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39464: [SPARK-41947][CORE][DOCS] Update the contents of error class guidelines - posted by GitBox <gi...@apache.org> on 2023/01/09 20:24:19 UTC, 0 replies.
- [GitHub] [spark] SandishKumarHN commented on a diff in pull request #39039: [SPARK-40776][SQL][PROTOBUF][DOCS] Spark-Protobuf docs - posted by GitBox <gi...@apache.org> on 2023/01/09 20:31:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39470: [SPARK-41951][DOCS] Update SQL migration guide and documentations - posted by GitBox <gi...@apache.org> on 2023/01/09 21:58:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39470: [SPARK-41951][DOCS] Update SQL migration guide and documentations - posted by GitBox <gi...@apache.org> on 2023/01/09 22:00:17 UTC, 6 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39470: [SPARK-41951][DOCS] Update SQL migration guide and documentations - posted by GitBox <gi...@apache.org> on 2023/01/09 22:07:05 UTC, 2 replies.
- [GitHub] [spark] srielau commented on pull request #38146: [SPARK-40687][SQL] Support data masking built-in function 'mask' - posted by GitBox <gi...@apache.org> on 2023/01/09 22:10:20 UTC, 1 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39470: [SPARK-41951][DOCS] Update SQL migration guide and documentations - posted by GitBox <gi...@apache.org> on 2023/01/09 22:56:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39470: [SPARK-41951][DOCS] Update SQL migration guide and documentations - posted by GitBox <gi...@apache.org> on 2023/01/09 23:17:22 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39448: [SPARK-41943][CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/09 23:43:22 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39471: [SPARK-41432][UI][FOLLOWUP] Fix a bug in protobuf serializer of SparkPlanGraphNodeWrapper - posted by GitBox <gi...@apache.org> on 2023/01/09 23:54:17 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39471: [SPARK-41432][UI][FOLLOWUP] Fix a bug in protobuf serializer of SparkPlanGraphNodeWrapper - posted by GitBox <gi...@apache.org> on 2023/01/09 23:56:18 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37759: [SPARK-40306][SQL]Support more than Integer.MAX_VALUE of the same join key - posted by GitBox <gi...@apache.org> on 2023/01/10 00:20:26 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #36850: [SPARK-39069][SQL] Enhance ConstantPropagation to replace constants in inequality predicates - posted by GitBox <gi...@apache.org> on 2023/01/10 00:20:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39268: [SPARK-41752][SQL][UI] Group nested executions under the root execution - posted by GitBox <gi...@apache.org> on 2023/01/10 00:37:54 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39384: [SPARK-40307][PYTHON] Introduce Arrow-optimized Python UDFs - posted by GitBox <gi...@apache.org> on 2023/01/10 00:42:17 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39467: [SPARK-41950][PS][BUILD] Pin mlflow to 2.0.X - posted by GitBox <gi...@apache.org> on 2023/01/10 00:53:34 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39146: [SPARK-41589][PYTHON][ML] PyTorch Distributor Baseline API Changes - posted by GitBox <gi...@apache.org> on 2023/01/10 00:58:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39146: [SPARK-41589][PYTHON][ML] PyTorch Distributor Baseline API Changes - posted by GitBox <gi...@apache.org> on 2023/01/10 00:58:52 UTC, 0 replies.
- [GitHub] [spark] zhouyejoe commented on a diff in pull request #37638: [SPARK-33573][SHUFFLE][YARN] Shuffle server side metrics for Push-based shuffle - posted by GitBox <gi...@apache.org> on 2023/01/10 01:18:46 UTC, 3 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39470: [SPARK-41951][DOCS] Update SQL migration guide and documentations - posted by GitBox <gi...@apache.org> on 2023/01/10 01:28:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38865: [SPARK-41232][SQL][PYTHON] Adding array_append function - posted by GitBox <gi...@apache.org> on 2023/01/10 01:31:44 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #38865: [SPARK-41232][SQL][PYTHON] Adding array_append function - posted by GitBox <gi...@apache.org> on 2023/01/10 01:32:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39471: [SPARK-41432][UI][FOLLOWUP] Fix a bug in protobuf serializer of SparkPlanGraphNodeWrapper - posted by GitBox <gi...@apache.org> on 2023/01/10 02:32:47 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39472: [SPARK-41957][CONNECT][PYTHON] Enable the doctest for `DataFrame.hint` - posted by GitBox <gi...@apache.org> on 2023/01/10 02:34:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39470: [SPARK-41951][DOCS] Update SQL migration guide and documentations - posted by GitBox <gi...@apache.org> on 2023/01/10 02:34:54 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39472: [SPARK-41957][CONNECT][PYTHON] Enable the doctest for `DataFrame.hint` - posted by GitBox <gi...@apache.org> on 2023/01/10 02:36:38 UTC, 2 replies.
- [GitHub] [spark] huaxingao opened a new pull request, #39473: [SPARK-38173][SQL][3.2] Quoted column cannot be recognized correctly when quotedRegexColumnNames is true - posted by GitBox <gi...@apache.org> on 2023/01/10 02:39:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39448: [SPARK-41943][CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/10 02:40:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39448: [SPARK-41943][CORE] Use java api to create files and grant permissions - posted by GitBox <gi...@apache.org> on 2023/01/10 02:40:57 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39469: [SPARK-XXXX][CONNECT][PYTHON] Truncating nanoseconds timestampsl - posted by GitBox <gi...@apache.org> on 2023/01/10 02:42:23 UTC, 1 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #39460: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2023/01/10 02:48:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39467: [SPARK-41950][PS][BUILD] Pin scikit-learn to 1.1.X - posted by GitBox <gi...@apache.org> on 2023/01/10 03:16:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39467: [SPARK-41950][PS][BUILD] Pin scikit-learn to 1.1.X - posted by GitBox <gi...@apache.org> on 2023/01/10 03:17:02 UTC, 0 replies.
- [GitHub] [spark] Ngone51 opened a new pull request, #39474: [SPARK-41958][CORE] Disallow arbitrary custom classpath with proxy user in cluster mode - posted by GitBox <gi...@apache.org> on 2023/01/10 03:24:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39424: [SPARK-41949][CORE][PYTHON] Make stage scheduling support local-cluster mode - posted by GitBox <gi...@apache.org> on 2023/01/10 03:33:03 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39424: [SPARK-41949][CORE][PYTHON] Make stage scheduling support local-cluster mode - posted by GitBox <gi...@apache.org> on 2023/01/10 03:36:07 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39467: [SPARK-41950][PS][BUILD] Pin scikit-learn to 1.1.X - posted by GitBox <gi...@apache.org> on 2023/01/10 03:42:08 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #39312: [SPARK-41788][SQL] Move InsertIntoStatement to basicLogicalOperators - posted by GitBox <gi...@apache.org> on 2023/01/10 03:42:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39462: [SPARK-41879][CONNECT][PYTHON] Make `DataFrame.collect` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/10 04:09:10 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39471: [SPARK-41432][UI][FOLLOWUP] Fix a bug in protobuf serializer of SparkPlanGraphNodeWrapper - posted by GitBox <gi...@apache.org> on 2023/01/10 04:39:22 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #39475: [SPARK-41959][SQL] Improve v1 writes with empty2null - posted by GitBox <gi...@apache.org> on 2023/01/10 04:43:35 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39475: [SPARK-41959][SQL] Improve v1 writes with empty2null - posted by GitBox <gi...@apache.org> on 2023/01/10 04:44:48 UTC, 4 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39475: [SPARK-41959][SQL] Improve v1 writes with empty2null - posted by GitBox <gi...@apache.org> on 2023/01/10 04:45:55 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39431: [SPARK-41914][SQL] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2023/01/10 05:10:03 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39431: [SPARK-41914][SQL] FileFormatWriter materializes AQE plan before accessing outputOrdering - posted by GitBox <gi...@apache.org> on 2023/01/10 05:10:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39475: [SPARK-41959][SQL] Improve v1 writes with empty2null - posted by GitBox <gi...@apache.org> on 2023/01/10 05:12:23 UTC, 1 replies.
- [GitHub] [spark] harupy commented on pull request #39467: [SPARK-41950][PS][BUILD] Pin scikit-learn to 1.1.X - posted by GitBox <gi...@apache.org> on 2023/01/10 05:25:47 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39476: [SPARK-41907][CONNECT][PYTHON] Because the plan is different from pyspark, so the result of sampleby is not determined. - posted by GitBox <gi...@apache.org> on 2023/01/10 05:40:01 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39476: [SPARK-41907][CONNECT][PYTHON] Because the plan is different from pyspark, so the result of sampleby is not determined. - posted by GitBox <gi...@apache.org> on 2023/01/10 05:51:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39472: [SPARK-41957][CONNECT][PYTHON] Enable the doctest for `DataFrame.hint` - posted by GitBox <gi...@apache.org> on 2023/01/10 05:58:09 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39472: [SPARK-41957][CONNECT][PYTHON] Enable the doctest for `DataFrame.hint` - posted by GitBox <gi...@apache.org> on 2023/01/10 05:58:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39477: [SPARK-41872][CONNECT][PYTHON][TESTS] Enable tests `test_fillna` and `test_replace` - posted by GitBox <gi...@apache.org> on 2023/01/10 06:06:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39133: [SPARK-41595][SQL] Support generator function explode/explode_outer in the FROM clause - posted by GitBox <gi...@apache.org> on 2023/01/10 06:08:23 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39133: [SPARK-41595][SQL] Support generator function explode/explode_outer in the FROM clause - posted by GitBox <gi...@apache.org> on 2023/01/10 06:09:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39478: [SPARK-41934][CONNECT][PYTHON][FOLLOWUP] Add `Session.readStream` to the unsupported list - posted by GitBox <gi...@apache.org> on 2023/01/10 07:17:13 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39332: [WIP][SPARK-40822][SQL] Stable derived column aliases - posted by GitBox <gi...@apache.org> on 2023/01/10 07:19:34 UTC, 3 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #39475: [SPARK-41959][SQL] Improve v1 writes with empty2null - posted by GitBox <gi...@apache.org> on 2023/01/10 07:26:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39475: [SPARK-41959][SQL] Improve v1 writes with empty2null - posted by GitBox <gi...@apache.org> on 2023/01/10 07:59:11 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #39479: [SPARK-41961][SQL] Support table-valued functions with LATERAL - posted by GitBox <gi...@apache.org> on 2023/01/10 08:09:51 UTC, 0 replies.
- [GitHub] [spark] sigmod commented on a diff in pull request #39479: [SPARK-41961][SQL] Support table-valued functions with LATERAL - posted by GitBox <gi...@apache.org> on 2023/01/10 08:18:43 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39480: [SPARK-41960][SQL] Assign name to _LEGACY_ERROR_TEMP_1056 - posted by GitBox <gi...@apache.org> on 2023/01/10 08:23:21 UTC, 0 replies.
- [GitHub] [spark] shuyouZZ opened a new pull request, #39481: [SPARK][SQL] Update the import order of scala package in class `SpecificParquetRecordReaderBase` - posted by GitBox <gi...@apache.org> on 2023/01/10 08:33:28 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #38312: [SPARK-40819][SQL] Timestamp nanos behaviour regression - posted by GitBox <gi...@apache.org> on 2023/01/10 08:53:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39477: [SPARK-41872][CONNECT][PYTHON][TESTS] Enable tests `test_fillna` and `test_replace` - posted by GitBox <gi...@apache.org> on 2023/01/10 08:57:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39477: [SPARK-41872][CONNECT][PYTHON][TESTS] Enable tests `test_fillna` and `test_replace` - posted by GitBox <gi...@apache.org> on 2023/01/10 08:57:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39478: [SPARK-41934][CONNECT][PYTHON][FOLLOWUP] Add `Session.readStream` to the unsupported list - posted by GitBox <gi...@apache.org> on 2023/01/10 09:02:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39478: [SPARK-41934][CONNECT][PYTHON][FOLLOWUP] Add `Session.readStream` to the unsupported list - posted by GitBox <gi...@apache.org> on 2023/01/10 09:03:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39474: [SPARK-41958][CORE] Disallow arbitrary custom classpath with proxy user in cluster mode - posted by GitBox <gi...@apache.org> on 2023/01/10 09:05:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39474: [SPARK-41958][CORE] Disallow arbitrary custom classpath with proxy user in cluster mode - posted by GitBox <gi...@apache.org> on 2023/01/10 09:06:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39482: [SPARK-41877][CONNECT][TESTS] Reenable unpivot test and separate negative test - posted by GitBox <gi...@apache.org> on 2023/01/10 09:32:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39482: [SPARK-41877][CONNECT][TESTS] Reenable unpivot test and separate negative test - posted by GitBox <gi...@apache.org> on 2023/01/10 09:32:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39482: [SPARK-41877][CONNECT][TESTS] Reenable unpivot test and separate negative test - posted by GitBox <gi...@apache.org> on 2023/01/10 09:33:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39473: [SPARK-38173][SQL][3.2] Quoted column cannot be recognized correctly when quotedRegexColumnNames is true - posted by GitBox <gi...@apache.org> on 2023/01/10 10:26:17 UTC, 1 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39483: [SPARK-41886][CONNECT][PYTHON] `DataFrame.intersect` doctest output has different order - posted by GitBox <gi...@apache.org> on 2023/01/10 11:00:25 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39476: [SPARK-41907][CONNECT][PYTHON][TESTS] Because the plan is different from pyspark, so the result of sampleby is not determined. - posted by GitBox <gi...@apache.org> on 2023/01/10 11:01:31 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39484: [SPARK-41964][CONNECT][PYTHON] Add the list of unsupported IO functions - posted by GitBox <gi...@apache.org> on 2023/01/10 11:15:02 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39485: [SPARK-41965][PYTHON][DOCS] Add `DataFrameWriterV2` to PySpark API references - posted by GitBox <gi...@apache.org> on 2023/01/10 11:27:07 UTC, 0 replies.
- [GitHub] [spark] LorenzoMartini commented on pull request #36918: [SQL][SPARK-39528] Use V2 Filter in SupportsRuntimeFiltering - posted by GitBox <gi...@apache.org> on 2023/01/10 11:35:40 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39486: [SPARK-41966][PYTHON][DOCS] Add `CharType` and `TimestampNTZType` to PySpark API references - posted by GitBox <gi...@apache.org> on 2023/01/10 11:35:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39476: [SPARK-41907][CONNECT][PYTHON][TESTS] Update and enable test `test_sampleby` - posted by GitBox <gi...@apache.org> on 2023/01/10 11:48:31 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39476: [SPARK-41907][CONNECT][PYTHON][TESTS] Update and enable test `test_sampleby` - posted by GitBox <gi...@apache.org> on 2023/01/10 11:49:52 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39469: [SPARK-XXXX][CONNECT][PYTHON] Truncating nanoseconds timestampsl - posted by GitBox <gi...@apache.org> on 2023/01/10 12:03:53 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39268: [SPARK-41752][SQL][UI] Group nested executions under the root execution - posted by GitBox <gi...@apache.org> on 2023/01/10 12:51:42 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39268: [SPARK-41752][SQL][UI] Group nested executions under the root execution - posted by GitBox <gi...@apache.org> on 2023/01/10 12:53:04 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #39424: [SPARK-41949][CORE][PYTHON] Make stage scheduling support local-cluster mode - posted by GitBox <gi...@apache.org> on 2023/01/10 13:06:07 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39471: [SPARK-41432][UI][FOLLOWUP] Fix a bug in protobuf serializer of SparkPlanGraphNodeWrapper - posted by GitBox <gi...@apache.org> on 2023/01/10 13:16:08 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39469: [SPARK-XXXX][CONNECT][PYTHON] Truncating nanoseconds timestampsl - posted by GitBox <gi...@apache.org> on 2023/01/10 13:18:02 UTC, 3 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39394: [SPARK-41575][SQL] Assign name to _LEGACY_ERROR_TEMP_2054 - posted by GitBox <gi...@apache.org> on 2023/01/10 13:24:45 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39394: [SPARK-41575][SQL] Assign name to _LEGACY_ERROR_TEMP_2054 - posted by GitBox <gi...@apache.org> on 2023/01/10 13:25:27 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39389: [SPARK-41574][SQL] Update `_LEGACY_ERROR_TEMP_2009` as `INTERNAL_ERROR`. - posted by GitBox <gi...@apache.org> on 2023/01/10 13:28:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39481: [MINOR][SQL] Update the import order of scala package in class `SpecificParquetRecordReaderBase` - posted by GitBox <gi...@apache.org> on 2023/01/10 13:36:43 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39468: [SPARK-41708][SQL][FOLLOWUP] WriteFiles should replace exprId using new query - posted by GitBox <gi...@apache.org> on 2023/01/10 14:06:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39468: [SPARK-41708][SQL][FOLLOWUP] WriteFiles should replace exprId using new query - posted by GitBox <gi...@apache.org> on 2023/01/10 14:07:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/10 14:43:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/10 14:43:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39482: [SPARK-41877][CONNECT][TESTS] Reenable unpivot test and separate negative test - posted by GitBox <gi...@apache.org> on 2023/01/10 14:45:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39482: [SPARK-41877][CONNECT][TESTS] Reenable unpivot test and separate negative test - posted by GitBox <gi...@apache.org> on 2023/01/10 14:45:30 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39483: [SPARK-41886][CONNECT][PYTHON] `DataFrame.intersect` doctest output has different order - posted by GitBox <gi...@apache.org> on 2023/01/10 14:47:25 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39483: [SPARK-41886][CONNECT][PYTHON] `DataFrame.intersect` doctest output has different order - posted by GitBox <gi...@apache.org> on 2023/01/10 14:48:00 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39471: [SPARK-41432][UI][FOLLOWUP] Fix a bug in protobuf serializer of SparkPlanGraphNodeWrapper - posted by GitBox <gi...@apache.org> on 2023/01/10 14:49:04 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39487: [SPARK-41968][CORE][SQL] Refactor `ProtobufSerDe` to `ProtobufSerDe[T]` - posted by GitBox <gi...@apache.org> on 2023/01/10 16:26:05 UTC, 2 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38376: [SPARK-40817] [Kubernetes] Do not discard remote user-specified files when launching Spark jobs on Kubernetes - posted by GitBox <gi...@apache.org> on 2023/01/10 16:58:52 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39471: [SPARK-41432][UI][FOLLOWUP] Fix a bug in protobuf serializer of SparkPlanGraphNodeWrapper - posted by GitBox <gi...@apache.org> on 2023/01/10 17:24:20 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #39449: [SPARK-40688][SQL] Support data masking built-in function 'mask_first_n' - posted by GitBox <gi...@apache.org> on 2023/01/10 17:47:50 UTC, 2 replies.
- [GitHub] [spark] databricks-david-lewis opened a new pull request, #39488: WIP SparkPath - posted by GitBox <gi...@apache.org> on 2023/01/10 17:57:19 UTC, 1 replies.
- [GitHub] [spark] otterc commented on a diff in pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2023/01/10 18:03:39 UTC, 5 replies.
- [GitHub] [spark] databricks-david-lewis closed pull request #39488: WIP SparkPath - posted by GitBox <gi...@apache.org> on 2023/01/10 18:28:39 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #38146: [SPARK-40687][SQL] Support data masking built-in function 'mask' - posted by GitBox <gi...@apache.org> on 2023/01/10 18:54:29 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on pull request #38146: [SPARK-40687][SQL] Support data masking built-in function 'mask' - posted by GitBox <gi...@apache.org> on 2023/01/10 19:05:21 UTC, 1 replies.
- [GitHub] [spark] otterc commented on a diff in pull request #37638: [SPARK-33573][SHUFFLE][YARN] Shuffle server side metrics for Push-based shuffle - posted by GitBox <gi...@apache.org> on 2023/01/10 19:16:51 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #39479: [SPARK-41961][SQL] Support table-valued functions with LATERAL - posted by GitBox <gi...@apache.org> on 2023/01/10 19:39:48 UTC, 1 replies.
- [GitHub] [spark] huaxingao commented on pull request #39473: [SPARK-38173][SQL][3.2] Quoted column cannot be recognized correctly when quotedRegexColumnNames is true - posted by GitBox <gi...@apache.org> on 2023/01/10 19:53:57 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39473: [SPARK-38173][SQL][3.2] Quoted column cannot be recognized correctly when quotedRegexColumnNames is true - posted by GitBox <gi...@apache.org> on 2023/01/10 20:59:53 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #38428: [SPARK-40912][CORE][WIP] Overhead of Exceptions in KryoDeserializationStream - posted by GitBox <gi...@apache.org> on 2023/01/10 21:16:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38428: [SPARK-40912][CORE][WIP] Overhead of Exceptions in KryoDeserializationStream - posted by GitBox <gi...@apache.org> on 2023/01/10 21:19:10 UTC, 0 replies.
- [GitHub] [spark] databricks-david-lewis commented on pull request #39488: [SPARK-41970] Introduce SparkPath for typesafety - posted by GitBox <gi...@apache.org> on 2023/01/10 22:00:30 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39488: [SPARK-41970] Introduce SparkPath for typesafety - posted by GitBox <gi...@apache.org> on 2023/01/10 22:01:38 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #35969: [SPARK-38651][SQL] Add configuration to support writing out empty schemas in supported filebased datasources - posted by GitBox <gi...@apache.org> on 2023/01/10 23:41:55 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39488: [SPARK-41970] Introduce SparkPath for typesafety - posted by GitBox <gi...@apache.org> on 2023/01/10 23:42:41 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39489: [SPARK-41752][SQL][FOLLOW-UP] Fix Protobuf serializer for SQLExecutionUIData - posted by GitBox <gi...@apache.org> on 2023/01/10 23:56:33 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39489: [SPARK-41752][SQL][FOLLOW-UP] Fix Protobuf serializer for SQLExecutionUIData - posted by GitBox <gi...@apache.org> on 2023/01/10 23:57:52 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39489: [SPARK-41752][SQL][FOLLOW-UP] Fix Protobuf serializer for SQLExecutionUIData - posted by GitBox <gi...@apache.org> on 2023/01/10 23:58:11 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39484: [SPARK-41964][CONNECT][PYTHON] Add the list of unsupported IO functions - posted by GitBox <gi...@apache.org> on 2023/01/11 00:22:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39484: [SPARK-41964][CONNECT][PYTHON] Add the list of unsupported IO functions - posted by GitBox <gi...@apache.org> on 2023/01/11 00:23:01 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39476: [SPARK-41907][CONNECT][PYTHON][TESTS] Update and enable test `test_sampleby` - posted by GitBox <gi...@apache.org> on 2023/01/11 00:45:27 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39483: [SPARK-41886][CONNECT][PYTHON] `DataFrame.intersect` doctest output has different order - posted by GitBox <gi...@apache.org> on 2023/01/11 00:47:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39486: [SPARK-41966][PYTHON][DOCS] Add `CharType` and `TimestampNTZType` to PySpark API references - posted by GitBox <gi...@apache.org> on 2023/01/11 01:09:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39486: [SPARK-41966][PYTHON][DOCS] Add `CharType` and `TimestampNTZType` to PySpark API references - posted by GitBox <gi...@apache.org> on 2023/01/11 01:09:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39485: [SPARK-41965][PYTHON][DOCS] Add `DataFrameWriterV2` to PySpark API references - posted by GitBox <gi...@apache.org> on 2023/01/11 01:09:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39485: [SPARK-41965][PYTHON][DOCS] Add `DataFrameWriterV2` to PySpark API references - posted by GitBox <gi...@apache.org> on 2023/01/11 01:10:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39490: [SPARK-41589][PYTHON][ML][BUILD] Add pyspark.ml.torch to setup.py - posted by GitBox <gi...@apache.org> on 2023/01/11 01:18:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39490: [SPARK-41589][PYTHON][ML][BUILD] Add pyspark.ml.torch to setup.py - posted by GitBox <gi...@apache.org> on 2023/01/11 01:18:56 UTC, 0 replies.
- [GitHub] [spark] rithwik-db commented on pull request #39490: [SPARK-41589][PYTHON][ML][BUILD] Add pyspark.ml.torch to setup.py - posted by GitBox <gi...@apache.org> on 2023/01/11 01:19:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39491: [SPARK-41887][CONNECT][PYTHON] Make `DataFrame.hint` accept list typed parameter - posted by GitBox <gi...@apache.org> on 2023/01/11 01:20:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39491: [SPARK-41887][CONNECT][PYTHON] Make `DataFrame.hint` accept list typed parameter - posted by GitBox <gi...@apache.org> on 2023/01/11 01:26:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39490: [SPARK-41589][PYTHON][ML][BUILD][FOLLOW-UP] Add pyspark.ml.torch to setup.py - posted by GitBox <gi...@apache.org> on 2023/01/11 01:27:41 UTC, 2 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39492: [SPARK-41876][CONNECT][PYTHON] Connect can not test with `test_dataframe` due to `AnalysisException` coverted to `SparkConnectAnalysisException` - posted by GitBox <gi...@apache.org> on 2023/01/11 02:18:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39485: [SPARK-41965][PYTHON][DOCS] Add `DataFrameWriterV2` to PySpark API references - posted by GitBox <gi...@apache.org> on 2023/01/11 02:31:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39485: [SPARK-41965][PYTHON][DOCS] Add `DataFrameWriterV2` to PySpark API references - posted by GitBox <gi...@apache.org> on 2023/01/11 02:33:07 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39492: [SPARK-41876][CONNECT][PYTHON] Connect can not test with `test_dataframe` due to `AnalysisException` coverted to `SparkConnectAnalysisException` - posted by GitBox <gi...@apache.org> on 2023/01/11 02:36:37 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39492: [SPARK-41876][CONNECT][PYTHON] Connect can not test with `test_dataframe` due to `AnalysisException` coverted to `SparkConnectAnalysisException` - posted by GitBox <gi...@apache.org> on 2023/01/11 02:41:18 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39488: [SPARK-41970] Introduce SparkPath for typesafety - posted by GitBox <gi...@apache.org> on 2023/01/11 02:53:04 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39492: [SPARK-41876][CONNECT][PYTHON] Connect can not test with `test_dataframe` due to `AnalysisException` coverted to `SparkConnectAnalysisException` - posted by GitBox <gi...@apache.org> on 2023/01/11 02:54:24 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39488: [SPARK-41970] Introduce SparkPath for typesafety - posted by GitBox <gi...@apache.org> on 2023/01/11 02:55:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39469: [SPARK-XXXX][CONNECT][PYTHON] Truncating nanoseconds timestampsl - posted by GitBox <gi...@apache.org> on 2023/01/11 03:00:30 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39462: [SPARK-41879][CONNECT][PYTHON] Make `DataFrame.collect` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/11 03:12:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39492: [SPARK-41876][CONNECT][PYTHON] `test_dataframe` should catch both `AnalysisException` and `SparkConnectAnalysisException` - posted by GitBox <gi...@apache.org> on 2023/01/11 03:15:21 UTC, 1 replies.
- [GitHub] [spark] Yikun commented on pull request #39490: [SPARK-41589][PYTHON][ML][BUILD][FOLLOW-UP] Add pyspark.ml.torch to setup.py - posted by GitBox <gi...@apache.org> on 2023/01/11 03:15:26 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39492: [SPARK-41876][CONNECT][PYTHON] `test_dataframe` should catch both `AnalysisException` and `SparkConnectAnalysisException` - posted by GitBox <gi...@apache.org> on 2023/01/11 03:21:02 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39462: [SPARK-41879][CONNECT][PYTHON] Make `DataFrame.collect` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/11 03:36:04 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39493: [SPARK-41965][PYTHON][DOCS][WIP] Add DataFrameWriterV2 to PySpark API references - posted by GitBox <gi...@apache.org> on 2023/01/11 03:38:19 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39489: [SPARK-41752][SQL][FOLLOW-UP] Fix Protobuf serializer for SQLExecutionUIData - posted by GitBox <gi...@apache.org> on 2023/01/11 03:39:42 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39494: [SPARK-41972][TESTS] Fix a flaky test in StreamingQueryStatusListenerSuite - posted by GitBox <gi...@apache.org> on 2023/01/11 04:42:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39490: [SPARK-41589][PYTHON][ML][BUILD][FOLLOW-UP] Add pyspark.ml.torch to setup.py - posted by GitBox <gi...@apache.org> on 2023/01/11 04:59:17 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39495: [SPARK-41973][SQL] Assign name to _LEGACY_ERROR_TEMP_1311 - posted by GitBox <gi...@apache.org> on 2023/01/11 05:26:58 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39495: [SPARK-41973][SQL] Assign name to _LEGACY_ERROR_TEMP_1311 - posted by GitBox <gi...@apache.org> on 2023/01/11 05:27:40 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39480: [SPARK-41960][SQL] Assign name to _LEGACY_ERROR_TEMP_1056 - posted by GitBox <gi...@apache.org> on 2023/01/11 05:27:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39494: [SPARK-41972][TESTS] Fix a flaky test in StreamingQueryStatusListenerSuite - posted by GitBox <gi...@apache.org> on 2023/01/11 05:29:08 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #35982: [SPARK-38336][SQL] Support INSERT INTO commands into tables with DEFAULT columns - posted by GitBox <gi...@apache.org> on 2023/01/11 05:30:33 UTC, 3 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39494: [SPARK-41972][TESTS] Fix a flaky test in StreamingQueryStatusListenerSuite - posted by GitBox <gi...@apache.org> on 2023/01/11 05:31:03 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39496: [SPARK-41974][SQL] Turn `INCORRECT_END_OFFSET` into `INTERNAL_ERROR` - posted by GitBox <gi...@apache.org> on 2023/01/11 05:43:44 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39496: [SPARK-41974][SQL] Turn `INCORRECT_END_OFFSET` into `INTERNAL_ERROR` - posted by GitBox <gi...@apache.org> on 2023/01/11 05:44:33 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #39487: [SPARK-41968][CORE][SQL] Refactor `ProtobufSerDe` to `ProtobufSerDe[T]` - posted by GitBox <gi...@apache.org> on 2023/01/11 06:13:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39494: [SPARK-41972][TESTS] Fix a flaky test in StreamingQueryStatusListenerSuite - posted by GitBox <gi...@apache.org> on 2023/01/11 06:20:51 UTC, 1 replies.
- [GitHub] [spark] allisonport-db commented on a diff in pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by GitBox <gi...@apache.org> on 2023/01/11 06:22:49 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39487: [SPARK-41968][CORE][SQL] Refactor `ProtobufSerDe` to `ProtobufSerDe[T]` - posted by GitBox <gi...@apache.org> on 2023/01/11 06:28:04 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39487: [SPARK-41968][CORE][SQL] Refactor `ProtobufSerDe` to `ProtobufSerDe[T]` - posted by GitBox <gi...@apache.org> on 2023/01/11 06:32:24 UTC, 9 replies.
- [GitHub] [spark] itholic opened a new pull request, #39497: [SPARK-41975][SQL] Improve error message for `INDEX_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2023/01/11 07:12:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39494: [SPARK-41972][TESTS] Fix a flaky test in StreamingQueryStatusListenerSuite - posted by GitBox <gi...@apache.org> on 2023/01/11 07:13:17 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39498: [SPARK-41975][SQL] Improve error message for `INDEX_NOT_FOUND` - posted by GitBox <gi...@apache.org> on 2023/01/11 07:32:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39499: [SPARK-41977][SPARK-41978][CONNECT] SparkSession.range to take float as arguments - posted by GitBox <gi...@apache.org> on 2023/01/11 07:36:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39188: [SPARK-41591][PYTHON][ML] Training PyTorch Files on Single Node Multi GPU - posted by GitBox <gi...@apache.org> on 2023/01/11 07:45:30 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39500: [SPARK-41980][CONNECT][TESTS] Enable test_functions_broadcast in functions parity test - posted by GitBox <gi...@apache.org> on 2023/01/11 07:54:33 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39481: [MINOR][SQL] Update the import order of scala package in class `SpecificParquetRecordReaderBase` - posted by GitBox <gi...@apache.org> on 2023/01/11 07:56:42 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39491: [SPARK-41887][CONNECT][PYTHON] Make `DataFrame.hint` accept list typed parameter - posted by GitBox <gi...@apache.org> on 2023/01/11 07:58:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39491: [SPARK-41887][CONNECT][PYTHON] Make `DataFrame.hint` accept list typed parameter - posted by GitBox <gi...@apache.org> on 2023/01/11 07:58:48 UTC, 0 replies.
- [GitHub] [spark] NarekDW opened a new pull request, #39501: [SPARK-41295][SQL] Rename the error classes - posted by GitBox <gi...@apache.org> on 2023/01/11 08:23:41 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #39502: [SPARK-41981][SQL] Collapse percentile functions if possible - posted by GitBox <gi...@apache.org> on 2023/01/11 08:59:09 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39503: [SPARK-41969][TESTS] Fix flaky test: StreamingQueryStatusListenerSuite.test small retained queries - posted by GitBox <gi...@apache.org> on 2023/01/11 09:12:40 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39504: [SPARK-41969][TESTS] Fix flaky test: StreamingQueryStatusListenerSuite.test small retained queries - posted by GitBox <gi...@apache.org> on 2023/01/11 09:12:41 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39504: [SPARK-41969][TESTS] Fix flaky test: StreamingQueryStatusListenerSuite.test small retained queries - posted by GitBox <gi...@apache.org> on 2023/01/11 09:19:03 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #39504: [SPARK-41969][TESTS] Fix flaky test: StreamingQueryStatusListenerSuite.test small retained queries - posted by GitBox <gi...@apache.org> on 2023/01/11 09:20:15 UTC, 1 replies.
- [GitHub] [spark] panbingkun closed pull request #39504: [SPARK-41969][TESTS] Fix flaky test: StreamingQueryStatusListenerSuite.test small retained queries - posted by GitBox <gi...@apache.org> on 2023/01/11 09:20:59 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39501: [SPARK-41295][SQL] Rename the error classes - posted by GitBox <gi...@apache.org> on 2023/01/11 09:21:08 UTC, 1 replies.
- [GitHub] [spark] rangareddy commented on a diff in pull request #38875: [SPARK-40988][SQL][TEST] Test case for insert partition should verify value - posted by GitBox <gi...@apache.org> on 2023/01/11 09:22:53 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #39494: [SPARK-41972][TESTS] Fix a flaky test in StreamingQueryStatusListenerSuite - posted by GitBox <gi...@apache.org> on 2023/01/11 09:26:33 UTC, 3 replies.
- [GitHub] [spark] panbingkun closed pull request #39503: [SPARK-41969][TESTS] Fix flaky test: StreamingQueryStatusListenerSuite.test small retained queries - posted by GitBox <gi...@apache.org> on 2023/01/11 09:35:16 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on a diff in pull request #39501: [SPARK-41295][SQL] Rename the error classes - posted by GitBox <gi...@apache.org> on 2023/01/11 09:51:15 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39505: [SPARK-41979][SQL] Add missing dots for error messages in error classes. - posted by GitBox <gi...@apache.org> on 2023/01/11 09:54:44 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2023/01/11 10:11:41 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39505: [SPARK-41979][SQL] Add missing dots for error messages in error classes. - posted by GitBox <gi...@apache.org> on 2023/01/11 10:20:42 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39506: [SPARK-41983][SQL] Rename & improve error message for `NULL_COMPARISON_RESULT` - posted by GitBox <gi...@apache.org> on 2023/01/11 10:43:43 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39505: [SPARK-41979][SQL] Add missing dots for error messages in error classes. - posted by GitBox <gi...@apache.org> on 2023/01/11 10:51:20 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #39501: [SPARK-41295][SQL] Rename the error classes - posted by GitBox <gi...@apache.org> on 2023/01/11 10:55:24 UTC, 2 replies.
- [GitHub] [spark] itholic opened a new pull request, #39507: [SPARK-41984][SQL] Rename & improve error message for `RESET_PERMISSION_TO_ORIGINAL` - posted by GitBox <gi...@apache.org> on 2023/01/11 11:11:54 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39508: [SPARK-41985][SQL] Centralize more column resolution rules - posted by GitBox <gi...@apache.org> on 2023/01/11 11:41:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39508: [SPARK-41985][SQL] Centralize more column resolution rules - posted by GitBox <gi...@apache.org> on 2023/01/11 11:44:29 UTC, 9 replies.
- [GitHub] [spark] wankunde commented on a diff in pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2023/01/11 11:51:00 UTC, 10 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39509: [SPARK-41635][SQL] Fix group by all error reporting - posted by GitBox <gi...@apache.org> on 2023/01/11 12:00:08 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39509: [SPARK-41635][SQL] Fix group by all error reporting - posted by GitBox <gi...@apache.org> on 2023/01/11 12:01:15 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39509: [SPARK-41635][SQL] Fix group by all error reporting - posted by GitBox <gi...@apache.org> on 2023/01/11 12:01:30 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39508: [SPARK-41985][SQL] Centralize more column resolution rules - posted by GitBox <gi...@apache.org> on 2023/01/11 12:02:33 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39510: [WIP][SPARK-41047][SQL] Improve docs for round - posted by GitBox <gi...@apache.org> on 2023/01/11 12:08:55 UTC, 0 replies.
- [GitHub] [spark] panbingkun closed pull request #39510: [WIP][SPARK-41047][SQL] Improve docs for round - posted by GitBox <gi...@apache.org> on 2023/01/11 12:09:36 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39511: [SPARK-41047][SQL] Improve docs for round - posted by GitBox <gi...@apache.org> on 2023/01/11 12:13:28 UTC, 0 replies.
- [GitHub] [spark] eejbyfeldt commented on pull request #38428: [SPARK-40912][CORE]Overhead of Exceptions in KryoDeserializationStream - posted by GitBox <gi...@apache.org> on 2023/01/11 12:26:55 UTC, 4 replies.
- [GitHub] [spark] wangyum opened a new pull request, #39512: [SPARK-41986][SQL] Introduce shuffle on SinglePartition - posted by GitBox <gi...@apache.org> on 2023/01/11 12:29:53 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #39512: [SPARK-41986][SQL] Introduce shuffle on SinglePartition - posted by GitBox <gi...@apache.org> on 2023/01/11 12:30:18 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #39512: [SPARK-41986][SQL] Introduce shuffle on SinglePartition - posted by GitBox <gi...@apache.org> on 2023/01/11 12:33:49 UTC, 4 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38875: [SPARK-40988][SQL][TEST] Test case for insert partition should verify value - posted by GitBox <gi...@apache.org> on 2023/01/11 12:39:44 UTC, 1 replies.
- [GitHub] [spark] wangyum opened a new pull request, #39513: [MINOR][SQL][YARN] Fix a typo: less then -> less than - posted by GitBox <gi...@apache.org> on 2023/01/11 12:48:06 UTC, 0 replies.
- [GitHub] [spark] VindhyaG commented on pull request #39190: [SPARK-41683][CORE] Fix issue of getting incorrect property numActiveStages in jobs API - posted by GitBox <gi...@apache.org> on 2023/01/11 12:56:01 UTC, 0 replies.
- [GitHub] [spark] wankunde commented on pull request #39460: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2023/01/11 13:05:03 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39469: [SPARK-XXXX][CONNECT][PYTHON] Truncating nanoseconds timestampsl - posted by GitBox <gi...@apache.org> on 2023/01/11 13:10:44 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39466: [WIP][SPARK-41948][SQL] Fix NPE for error classes: CANNOT_PARSE_JSON_FIELD - posted by GitBox <gi...@apache.org> on 2023/01/11 13:10:47 UTC, 0 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39514: [SPARK-41987][CONNECT][PYTHON] createDataFrame supports column with map type - posted by GitBox <gi...@apache.org> on 2023/01/11 13:11:36 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39514: [SPARK-41987][CONNECT][PYTHON] createDataFrame supports column with map type - posted by GitBox <gi...@apache.org> on 2023/01/11 13:25:23 UTC, 0 replies.
- [GitHub] [spark] rangareddy opened a new pull request, #39515: [SPARK-38743][SQL][TEST] Test the error class: MISSING_STATIC_PARTITION_COLUMNAdding test case for Missing Static Partition Column - posted by GitBox <gi...@apache.org> on 2023/01/11 13:37:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39499: [SPARK-41977][SPARK-41978][CONNECT] SparkSession.range to take float as arguments - posted by GitBox <gi...@apache.org> on 2023/01/11 15:23:42 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39499: [SPARK-41977][SPARK-41978][CONNECT] SparkSession.range to take float as arguments - posted by GitBox <gi...@apache.org> on 2023/01/11 15:24:04 UTC, 0 replies.
- [GitHub] [spark] zzzzming95 commented on pull request #38356: [SPARK-40885] `Sort` may not take effect when it is the last 'Transform' operator - posted by GitBox <gi...@apache.org> on 2023/01/11 16:24:42 UTC, 0 replies.
- [GitHub] [spark] soxofaan opened a new pull request, #39516: [SPARK-41989][PYTHON] Avoid breaking logging config from pyspark.pandas - posted by GitBox <gi...@apache.org> on 2023/01/11 16:26:11 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39513: [MINOR][SQL][YARN] Fix a typo: less then -> less than - posted by GitBox <gi...@apache.org> on 2023/01/11 17:20:43 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39513: [MINOR][SQL][YARN] Fix a typo: less then -> less than - posted by GitBox <gi...@apache.org> on 2023/01/11 17:20:45 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #39512: [SPARK-41986][SQL] Introduce shuffle on SinglePartition - posted by GitBox <gi...@apache.org> on 2023/01/11 17:21:43 UTC, 0 replies.
- [GitHub] [spark] databricks-david-lewis commented on a diff in pull request #39488: [SPARK-41970] Introduce SparkPath for typesafety - posted by GitBox <gi...@apache.org> on 2023/01/11 17:45:15 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39494: [SPARK-41972][TESTS] Fix a flaky test in StreamingQueryStatusListenerSuite - posted by GitBox <gi...@apache.org> on 2023/01/11 18:07:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38428: [SPARK-40912][CORE]Overhead of Exceptions in KryoDeserializationStream - posted by GitBox <gi...@apache.org> on 2023/01/11 18:11:22 UTC, 1 replies.
- [GitHub] [spark] rxin commented on a diff in pull request #39509: [SPARK-41635][SQL] Fix group by all error reporting - posted by GitBox <gi...@apache.org> on 2023/01/11 18:27:59 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #35982: [SPARK-38336][SQL] Support INSERT INTO commands into tables with DEFAULT columns - posted by GitBox <gi...@apache.org> on 2023/01/11 18:32:14 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #39496: [SPARK-41974][SQL] Turn `INCORRECT_END_OFFSET` into `INTERNAL_ERROR` - posted by GitBox <gi...@apache.org> on 2023/01/11 18:36:19 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39509: [SPARK-41635][SQL] Fix group by all error reporting - posted by GitBox <gi...@apache.org> on 2023/01/11 18:42:11 UTC, 2 replies.
- [GitHub] [spark] allisonwang-db commented on a diff in pull request #39375: [SPARK-36124][SQL] Support subqueries with correlation through UNION - posted by GitBox <gi...@apache.org> on 2023/01/11 18:55:57 UTC, 1 replies.
- [GitHub] [spark] jchen5 commented on a diff in pull request #39375: [SPARK-36124][SQL] Support subqueries with correlation through UNION - posted by GitBox <gi...@apache.org> on 2023/01/11 18:58:12 UTC, 4 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #39517: [SPARK-41993][SQL] Move RowEncoder to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/11 20:33:55 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39517: [SPARK-41993][SQL] Move RowEncoder to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/11 20:34:19 UTC, 2 replies.
- [GitHub] [spark] hvanhovell commented on a diff in pull request #39517: [SPARK-41993][SQL] Move RowEncoder to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/11 20:36:12 UTC, 7 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #39518: [SPARK-41991][SQL] `CheckOverflowInTableInsert` should accept ExpressionProxy as child - posted by GitBox <gi...@apache.org> on 2023/01/11 20:39:39 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #39511: [SPARK-41047][SQL] Improve docs for round - posted by GitBox <gi...@apache.org> on 2023/01/11 22:31:00 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39511: [SPARK-41047][SQL] Improve docs for round - posted by GitBox <gi...@apache.org> on 2023/01/11 22:39:35 UTC, 0 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #39479: [SPARK-41961][SQL] Support table-valued functions with LATERAL - posted by GitBox <gi...@apache.org> on 2023/01/11 23:00:38 UTC, 0 replies.
- [GitHub] [spark] eric-maynard opened a new pull request, #39519: [SPARK-41995] Accept non-foldable expressions in schema_of_json - posted by GitBox <gi...@apache.org> on 2023/01/11 23:21:29 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39487: [SPARK-41968][CORE][SQL] Refactor `ProtobufSerDe` to `ProtobufSerDe[T]` - posted by GitBox <gi...@apache.org> on 2023/01/11 23:23:22 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39188: [SPARK-41591][PYTHON][ML] Training PyTorch Files on Single Node Multi GPU - posted by GitBox <gi...@apache.org> on 2023/01/11 23:41:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39188: [SPARK-41591][PYTHON][ML] Training PyTorch Files on Single Node Multi GPU - posted by GitBox <gi...@apache.org> on 2023/01/11 23:42:13 UTC, 0 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #39267: [WIP][SPARK-41592][PYTHON][ML] Pytorch file Distributed Training - posted by GitBox <gi...@apache.org> on 2023/01/11 23:57:35 UTC, 12 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39519: [SPARK-41995] Accept non-foldable expressions in schema_of_json - posted by GitBox <gi...@apache.org> on 2023/01/12 00:07:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39519: [SPARK-41995][SQL] Accept non-foldable expressions in schema_of_json - posted by GitBox <gi...@apache.org> on 2023/01/12 00:09:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39519: [SPARK-41995][SQL] Accept non-foldable expressions in schema_of_json - posted by GitBox <gi...@apache.org> on 2023/01/12 00:09:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39516: [SPARK-41989][PYTHON] Avoid breaking logging config from pyspark.pandas - posted by GitBox <gi...@apache.org> on 2023/01/12 00:17:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39516: [SPARK-41989][PYTHON] Avoid breaking logging config from pyspark.pandas - posted by GitBox <gi...@apache.org> on 2023/01/12 00:18:02 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39518: [SPARK-41991][SQL] `CheckOverflowInTableInsert` should accept ExpressionProxy as child - posted by GitBox <gi...@apache.org> on 2023/01/12 00:29:48 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39499: [SPARK-41977][SPARK-41978][CONNECT] SparkSession.range to take float as arguments - posted by GitBox <gi...@apache.org> on 2023/01/12 00:50:00 UTC, 0 replies.
- [GitHub] [spark] erenavsarogullari commented on pull request #39037: [SPARK-41214][SQL] Fix AQE cache does not update plan and metrics - posted by GitBox <gi...@apache.org> on 2023/01/12 01:13:28 UTC, 0 replies.
- [GitHub] [spark] eric-maynard commented on pull request #39519: [SPARK-41995][SQL] Accept non-foldable expressions in schema_of_json - posted by GitBox <gi...@apache.org> on 2023/01/12 01:15:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39500: [SPARK-41980][CONNECT][TESTS] Enable test_functions_broadcast in functions parity test - posted by GitBox <gi...@apache.org> on 2023/01/12 01:31:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39500: [SPARK-41980][CONNECT][TESTS] Enable test_functions_broadcast in functions parity test - posted by GitBox <gi...@apache.org> on 2023/01/12 01:31:48 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39435: [SPARK-41926][UI][TESTS] Add Github action test job with RocksDB as UI backend - posted by GitBox <gi...@apache.org> on 2023/01/12 02:04:54 UTC, 0 replies.
- [GitHub] [spark] anishshri-db opened a new pull request, #39520: [SPARK-41996] Fix kafka test to verify lost partitions to account for slow Kafka operations - posted by GitBox <gi...@apache.org> on 2023/01/12 02:05:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39509: [SPARK-41635][SQL] Fix group by all error reporting - posted by GitBox <gi...@apache.org> on 2023/01/12 02:08:48 UTC, 0 replies.
- [GitHub] [spark] anishshri-db commented on pull request #39520: [SPARK-41996][SS] Fix kafka test to verify lost partitions to account for slow Kafka operations - posted by GitBox <gi...@apache.org> on 2023/01/12 02:10:38 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39479: [SPARK-41961][SQL] Support table-valued functions with LATERAL - posted by GitBox <gi...@apache.org> on 2023/01/12 02:15:29 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39521: [SPARK-41887][CONNECT][TESTS][FOLLOW-UP] Enable test_extended_hint_types test case - posted by GitBox <gi...@apache.org> on 2023/01/12 02:21:42 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39505: [WIP][SPARK-41979][SQL] Add missing dots for error messages in error classes. - posted by GitBox <gi...@apache.org> on 2023/01/12 02:40:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39522: [SPARK-41998][CONNECT][TESTS] Reeuse pyspark.sql.tests.test_readwriter test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 02:52:38 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39512: [SPARK-41986][SQL] Introduce shuffle on SinglePartition - posted by GitBox <gi...@apache.org> on 2023/01/12 02:54:48 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39517: [SPARK-41993][SQL] Move RowEncoder to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/12 03:04:04 UTC, 6 replies.
- [GitHub] [spark] rangareddy commented on pull request #39515: [SPARK-38743][SQL][TEST] Test the error class: MISSING_STATIC_PARTITION_COLUMNAdding test case for Missing Static Partition Column - posted by GitBox <gi...@apache.org> on 2023/01/12 03:08:53 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39523: [SPARK-42003][SQL] Reduce duplicate code in ResolveGroupByAll - posted by GitBox <gi...@apache.org> on 2023/01/12 03:22:13 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39524: [WIP][SPARK-41990][SQL] Fix bug for FieldReference - posted by GitBox <gi...@apache.org> on 2023/01/12 03:22:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39523: [SPARK-42003][SQL] Reduce duplicate code in ResolveGroupByAll - posted by GitBox <gi...@apache.org> on 2023/01/12 03:30:06 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39520: [SPARK-41996][SQL][SS] Fix kafka test to verify lost partitions to account for slow Kafka operations - posted by GitBox <gi...@apache.org> on 2023/01/12 03:49:06 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #39520: [SPARK-41996][SQL][SS] Fix kafka test to verify lost partitions to account for slow Kafka operations - posted by GitBox <gi...@apache.org> on 2023/01/12 03:50:11 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39037: [SPARK-41214][SQL] Fix AQE cache does not update plan and metrics - posted by GitBox <gi...@apache.org> on 2023/01/12 04:38:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39525: [SPARK-42007][CONNECT][TESTS] Reuse pyspark.sql.tests.test_group test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 05:02:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39526: [SPARK-42008][CONNECT][TESTS] Reuse pyspark.sql.tests.test_datasources test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 05:13:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39527: [SPARK-42009][CONNECT][TESTS] Reuse pyspark.sql.tests.test_serde test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 05:26:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39528: [SPARK-42010][CONNECT][TESTS] Reuse pyspark.sql.tests.test_column test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 05:43:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39529: [SPARK-42019][CONNECT][TESTS] Reuse pyspark.sql.tests.test_types test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 06:10:15 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39487: [SPARK-41968][CORE][SQL] Refactor `ProtobufSerDe` to `ProtobufSerDe[T]` - posted by GitBox <gi...@apache.org> on 2023/01/12 06:21:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39521: [SPARK-41887][CONNECT][TESTS][FOLLOW-UP] Enable test_extended_hint_types test case - posted by GitBox <gi...@apache.org> on 2023/01/12 06:31:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39522: [SPARK-41998][CONNECT][TESTS] Reuse pyspark.sql.tests.test_readwriter test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 06:32:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39521: [SPARK-41887][CONNECT][TESTS][FOLLOW-UP] Enable test_extended_hint_types test case - posted by GitBox <gi...@apache.org> on 2023/01/12 06:32:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39522: [SPARK-41998][CONNECT][TESTS] Reuse pyspark.sql.tests.test_readwriter test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 06:32:48 UTC, 0 replies.
- [GitHub] [spark] asfgit closed pull request #37638: [SPARK-33573][SHUFFLE][YARN] Shuffle server side metrics for Push-based shuffle - posted by GitBox <gi...@apache.org> on 2023/01/12 06:53:33 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #39508: [SPARK-41985][SQL] Centralize more column resolution rules - posted by GitBox <gi...@apache.org> on 2023/01/12 07:15:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39525: [SPARK-42007][CONNECT][TESTS] Reuse pyspark.sql.tests.test_group test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 08:01:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39525: [SPARK-42007][CONNECT][TESTS] Reuse pyspark.sql.tests.test_group test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 08:01:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39530: [SPARK-42026][CORE] Protobuf serializer for `AppSummary` and `PoolData` - posted by GitBox <gi...@apache.org> on 2023/01/12 08:16:58 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #38799: [SPARK-37099][SQL] Introduce the group limit of Window for rank-based filter to optimize top-k computation - posted by GitBox <gi...@apache.org> on 2023/01/12 08:21:50 UTC, 1 replies.
- [GitHub] [spark] wankunde commented on pull request #39502: [SPARK-41981][SQL] Collapse percentile functions if possible - posted by GitBox <gi...@apache.org> on 2023/01/12 08:24:51 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39530: [SPARK-42026][CORE] Protobuf serializer for `AppSummary` and `PoolData` - posted by GitBox <gi...@apache.org> on 2023/01/12 08:28:12 UTC, 2 replies.
- [GitHub] [spark] wankunde commented on pull request #38496: [SPARK-40708][SQL] Auto update table statistics based on write metrics - posted by GitBox <gi...@apache.org> on 2023/01/12 08:29:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39526: [SPARK-42008][CONNECT][TESTS] Reuse pyspark.sql.tests.test_datasources test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 08:36:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39526: [SPARK-42008][CONNECT][TESTS] Reuse pyspark.sql.tests.test_datasources test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 08:36:51 UTC, 0 replies.
- [GitHub] [spark] soxofaan commented on a diff in pull request #39516: [SPARK-41989][PYTHON] Avoid breaking logging config from pyspark.pandas - posted by GitBox <gi...@apache.org> on 2023/01/12 08:54:29 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #39531: [SPARK-XXXX][CONNECT] Add Guava Shading rules to `connect-common` to avoid startup failure - posted by GitBox <gi...@apache.org> on 2023/01/12 09:02:02 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39523: [SPARK-42003][SQL] Reduce duplicate code in ResolveGroupByAll - posted by GitBox <gi...@apache.org> on 2023/01/12 09:03:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39523: [SPARK-42003][SQL] Reduce duplicate code in ResolveGroupByAll - posted by GitBox <gi...@apache.org> on 2023/01/12 09:04:00 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39037: [SPARK-41214][SQL] Fix AQE cache does not update plan and metrics - posted by GitBox <gi...@apache.org> on 2023/01/12 09:06:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39527: [SPARK-42009][CONNECT][TESTS] Reuse pyspark.sql.tests.test_serde test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 09:14:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39527: [SPARK-42009][CONNECT][TESTS] Reuse pyspark.sql.tests.test_serde test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 09:15:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39516: [SPARK-41989][PYTHON] Avoid breaking logging config from pyspark.pandas - posted by GitBox <gi...@apache.org> on 2023/01/12 09:24:40 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #39469: [SPARK-42028][CONNECT][PYTHON] Truncating nanoseconds timestampsl - posted by GitBox <gi...@apache.org> on 2023/01/12 09:41:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39531: [SPARK-42029][CONNECT] Add Guava Shading rules to `connect-common` to avoid startup failure - posted by GitBox <gi...@apache.org> on 2023/01/12 10:25:03 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39531: [SPARK-42029][CONNECT] Add Guava Shading rules to `connect-common` to avoid startup failure - posted by GitBox <gi...@apache.org> on 2023/01/12 10:25:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39528: [SPARK-42010][CONNECT][TESTS] Reuse pyspark.sql.tests.test_column test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 10:57:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39528: [SPARK-42010][CONNECT][TESTS] Reuse pyspark.sql.tests.test_column test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 10:58:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39514: [WIP][SPARK-41987][CONNECT][PYTHON] createDataFrame supports column with map type - posted by GitBox <gi...@apache.org> on 2023/01/12 11:01:47 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39531: [SPARK-42029][CONNECT] Add Guava Shading rules to `connect-common` to avoid startup failure - posted by GitBox <gi...@apache.org> on 2023/01/12 11:14:51 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39469: [SPARK-42028][CONNECT][PYTHON] Truncating nanoseconds timestampsl - posted by GitBox <gi...@apache.org> on 2023/01/12 11:23:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39469: [SPARK-42028][CONNECT][PYTHON] Truncating nanoseconds timestampsl - posted by GitBox <gi...@apache.org> on 2023/01/12 11:23:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39531: [SPARK-42029][CONNECT] Add Guava Shading rules to `connect-common` to avoid startup failure - posted by GitBox <gi...@apache.org> on 2023/01/12 11:28:16 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39529: [SPARK-42019][CONNECT][TESTS] Reuse pyspark.sql.tests.test_types test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 11:37:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39529: [SPARK-42019][CONNECT][TESTS] Reuse pyspark.sql.tests.test_types test cases - posted by GitBox <gi...@apache.org> on 2023/01/12 11:38:39 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39532: [SPARK-42030][CORE] Remove unused Constructor from RocksDB.TypeAliases and LevelDB.TypeAliases - posted by GitBox <gi...@apache.org> on 2023/01/12 11:46:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39532: [SPARK-42030][CORE] Remove unused Constructor from RocksDB.TypeAliases and LevelDB.TypeAliases - posted by GitBox <gi...@apache.org> on 2023/01/12 11:50:59 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #39384: [SPARK-40307][PYTHON] Introduce Arrow-optimized Python UDFs - posted by GitBox <gi...@apache.org> on 2023/01/12 12:24:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39524: [WIP][SPARK-41990][SQL] Fix bug for FieldReference - posted by GitBox <gi...@apache.org> on 2023/01/12 12:38:47 UTC, 0 replies.
- [GitHub] [spark] mcdull-zhang commented on a diff in pull request #38877: [SPARK-41361] [SQL] Invalid call toAttribute on unresolved object exception caused by WidenSetOperationTypes - posted by GitBox <gi...@apache.org> on 2023/01/12 12:55:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39533: [DON'T MERGE] Clean up `remove` methods that do not need override - posted by GitBox <gi...@apache.org> on 2023/01/12 13:11:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39534: [MINOR][CONNECT][DOCS] Fix typo in `connect/README.md` - posted by GitBox <gi...@apache.org> on 2023/01/12 13:42:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39534: [MINOR][CONNECT][DOCS] Fix typo in `connect/README.md` - posted by GitBox <gi...@apache.org> on 2023/01/12 13:42:47 UTC, 1 replies.
- [GitHub] [spark] wangyum commented on pull request #39460: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2023/01/12 13:58:42 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39524: [WIP][SPARK-41990][SQL] Fix bug for FieldReference - posted by GitBox <gi...@apache.org> on 2023/01/12 14:07:57 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39534: [MINOR][CONNECT][DOCS] Fix typo in `connect/README.md` - posted by GitBox <gi...@apache.org> on 2023/01/12 14:49:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39534: [MINOR][CONNECT][DOCS] Fix typo in `connect/README.md` - posted by GitBox <gi...@apache.org> on 2023/01/12 14:49:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39535: [SPARK-41746][SPARK-41838][SPARK-41837][SPARK-41835][SPARK-41836][SPARK-41847][CONNECT][PYTHON] Make `createDataFrame(rows/lists/tuples/dicts)` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/12 15:09:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39535: [SPARK-41746][SPARK-41838][SPARK-41837][SPARK-41835][SPARK-41836][SPARK-41847][CONNECT][PYTHON] Make `createDataFrame(rows/lists/tuples/dicts)` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/12 15:14:08 UTC, 2 replies.
- [GitHub] [spark] itholic commented on pull request #39507: [SPARK-41984][SQL] Rename & improve error message for `RESET_PERMISSION_TO_ORIGINAL` - posted by GitBox <gi...@apache.org> on 2023/01/12 15:16:47 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39506: [SPARK-41983][SQL] Rename & improve error message for `NULL_COMPARISON_RESULT` - posted by GitBox <gi...@apache.org> on 2023/01/12 15:16:51 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39524: [WIP][SPARK-41990][SQL] Fix bug for FieldReference - posted by GitBox <gi...@apache.org> on 2023/01/12 16:11:58 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39519: [SPARK-41995][SQL] Accept non-foldable expressions in schema_of_json - posted by GitBox <gi...@apache.org> on 2023/01/12 16:12:02 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #38828: [SPARK-35084][CORE] Spark 3: supporting --packages in k8s cluster mode - posted by GitBox <gi...@apache.org> on 2023/01/12 16:16:15 UTC, 0 replies.
- [GitHub] [spark] mengxr commented on a diff in pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2023/01/12 16:57:42 UTC, 1 replies.
- [GitHub] [spark] huaxingao commented on pull request #39533: [SPARK-42031][CORE][SQL] Clean up `remove` methods that do not need override - posted by GitBox <gi...@apache.org> on 2023/01/12 17:07:19 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39512: [SPARK-41986][SQL] Introduce shuffle on SinglePartition - posted by GitBox <gi...@apache.org> on 2023/01/12 18:21:25 UTC, 2 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39530: [SPARK-42026][CORE] Protobuf serializer for `AppSummary` and `PoolData` - posted by GitBox <gi...@apache.org> on 2023/01/12 19:03:16 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39530: [SPARK-42026][CORE] Protobuf serializer for `AppSummary` and `PoolData` - posted by GitBox <gi...@apache.org> on 2023/01/12 19:04:09 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39515: [SPARK-38743][SQL][TEST] Test the error class: MISSING_STATIC_PARTITION_COLUMN - posted by GitBox <gi...@apache.org> on 2023/01/12 20:21:11 UTC, 0 replies.
- [GitHub] [spark] huaxingao closed pull request #39533: [SPARK-42031][CORE][SQL] Clean up `remove` methods that do not need override - posted by GitBox <gi...@apache.org> on 2023/01/12 21:48:13 UTC, 0 replies.
- [GitHub] [spark] rednaxelafx commented on pull request #39518: [SPARK-41991][SQL] `CheckOverflowInTableInsert` should accept ExpressionProxy as child - posted by GitBox <gi...@apache.org> on 2023/01/12 21:50:49 UTC, 1 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #39267: [WIP][SPARK-41592][PYTHON][ML] Pytorch file Distributed Training - posted by GitBox <gi...@apache.org> on 2023/01/12 21:59:23 UTC, 5 replies.
- [GitHub] [spark] bersprockets commented on pull request #39518: [SPARK-41991][SQL] `CheckOverflowInTableInsert` should accept ExpressionProxy as child - posted by GitBox <gi...@apache.org> on 2023/01/12 22:15:42 UTC, 1 replies.
- [GitHub] [spark] rangadi opened a new pull request, #39536: [SQL][PROTOBUF] Fix how exception is handled in error reporting. - posted by GitBox <gi...@apache.org> on 2023/01/12 22:36:40 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #39536: [SQL][PROTOBUF] Fix how exception is handled in error reporting. - posted by GitBox <gi...@apache.org> on 2023/01/12 22:37:11 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #39536: [SQL][PROTOBUF] Fix how exception is handled in error reporting. - posted by GitBox <gi...@apache.org> on 2023/01/12 22:38:12 UTC, 0 replies.
- [GitHub] [spark] srielau opened a new pull request, #39537: [SPARK-41994] [DRAFT] Assign SQLSTATE's (1/?) - posted by GitBox <gi...@apache.org> on 2023/01/12 23:09:47 UTC, 0 replies.
- [GitHub] [spark] jerrypeng opened a new pull request, #39538: [SPARK-41596] Document the new feature "Async Progress Tracking" to Structured Streaming guide doc - posted by GitBox <gi...@apache.org> on 2023/01/12 23:24:55 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #38828: [SPARK-35084][CORE] Spark 3: supporting --packages in k8s cluster mode - posted by GitBox <gi...@apache.org> on 2023/01/12 23:36:10 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39539: [SPARK-42037][INFRA] Remove prefix in build environment variables - posted by GitBox <gi...@apache.org> on 2023/01/12 23:44:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39539: [SPARK-42037][INFRA] Remove `AMPLAB_` prefix in build environment variables - posted by GitBox <gi...@apache.org> on 2023/01/12 23:52:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39539: [SPARK-42037][INFRA] Remove `AMPLAB_` prefix in build environment variables - posted by GitBox <gi...@apache.org> on 2023/01/13 00:09:43 UTC, 0 replies.
- [GitHub] [spark] sunchao opened a new pull request, #39540: [SPARK-42039][SQL] SPJ: Remove Option in KeyGroupedPartitioning#partitionValuesOpt - posted by GitBox <gi...@apache.org> on 2023/01/13 00:35:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39518: [SPARK-41991][SQL] `CheckOverflowInTableInsert` should accept ExpressionProxy as child - posted by GitBox <gi...@apache.org> on 2023/01/13 00:41:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39435: [SPARK-41926][UI][TESTS] Add Github action test job with RocksDB as UI backend - posted by GitBox <gi...@apache.org> on 2023/01/13 01:01:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39435: [SPARK-41926][UI][TESTS] Add Github action test job with RocksDB as UI backend - posted by GitBox <gi...@apache.org> on 2023/01/13 01:01:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39514: [SPARK-41987][CONNECT][PYTHON] Connect API: createDataFrame should supports column with map type - posted by GitBox <gi...@apache.org> on 2023/01/13 01:13:30 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39532: [SPARK-42030][CORE] Remove unused Constructor from RocksDB.TypeAliases and LevelDB.TypeAliases - posted by GitBox <gi...@apache.org> on 2023/01/13 02:31:33 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #39532: [SPARK-42030][CORE] Remove unused Constructor from RocksDB.TypeAliases and LevelDB.TypeAliases - posted by GitBox <gi...@apache.org> on 2023/01/13 02:31:34 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #39541: [SPARK-42043][CONNECT] Scala Client Result with E2E Tests - posted by GitBox <gi...@apache.org> on 2023/01/13 02:41:53 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39498: [SPARK-41976][SQL] Improve error message for `INDEX_NOT_FOUND` - posted by GitBox <gi...@apache.org> on 2023/01/13 02:53:03 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #39497: [SPARK-41975][SQL] Improve error message for `INDEX_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2023/01/13 02:53:09 UTC, 0 replies.
- [GitHub] [spark] harupy opened a new pull request, #39542: Fix type hints that are incompatible with Python <= 3.8 - posted by GitBox <gi...@apache.org> on 2023/01/13 03:01:05 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39543: [SPARK-42044][SQL] Fix incorrect error message for `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by GitBox <gi...@apache.org> on 2023/01/13 03:04:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39544: [SPARK-42028][CONNECT][PYTHON][FOLLOW-UP] Uses the same logic with PySpark, and reeanbles skipped test - posted by GitBox <gi...@apache.org> on 2023/01/13 03:33:12 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39544: [SPARK-42028][CONNECT][PYTHON][FOLLOW-UP] Uses the same logic with PySpark, and reeanbles skipped test - posted by GitBox <gi...@apache.org> on 2023/01/13 03:35:15 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39542: [SPARK-41591][PYTHON][ML][FOLLOW-UP] Fix type hints that are incompatible with Python <= 3.8 - posted by GitBox <gi...@apache.org> on 2023/01/13 03:37:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39542: [SPARK-41591][PYTHON][ML][FOLLOW-UP] Fix type hints that are incompatible with Python <= 3.8 - posted by GitBox <gi...@apache.org> on 2023/01/13 03:37:31 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39533: [SPARK-42031][CORE][SQL] Clean up `remove` methods that do not need override - posted by GitBox <gi...@apache.org> on 2023/01/13 03:40:38 UTC, 0 replies.
- [GitHub] [spark] harupy commented on a diff in pull request #39188: [SPARK-41591][PYTHON][ML] Training PyTorch Files on Single Node Multi GPU - posted by GitBox <gi...@apache.org> on 2023/01/13 04:00:41 UTC, 7 replies.
- [GitHub] [spark] harupy commented on pull request #39188: [SPARK-41591][PYTHON][ML] Training PyTorch Files on Single Node Multi GPU - posted by GitBox <gi...@apache.org> on 2023/01/13 04:03:22 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39545: [SPARK-42042][CONNECT][PYTHON] `DataFrameReader` should support StructType schema - posted by GitBox <gi...@apache.org> on 2023/01/13 04:18:31 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39546: [SPARK-42045][SQL] ANSI SQL mode: Round/Bround should return an error on integer overflow - posted by GitBox <gi...@apache.org> on 2023/01/13 04:33:21 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #39540: [SPARK-42039][SQL] SPJ: Remove Option in KeyGroupedPartitioning#partitionValuesOpt - posted by GitBox <gi...@apache.org> on 2023/01/13 04:34:01 UTC, 1 replies.
- [GitHub] [spark] beliefer commented on pull request #39514: [SPARK-41987][CONNECT][PYTHON] Connect API: createDataFrame should supports column with map type - posted by GitBox <gi...@apache.org> on 2023/01/13 04:36:51 UTC, 0 replies.
- [GitHub] [spark] beliefer closed pull request #39514: [SPARK-41987][CONNECT][PYTHON] Connect API: createDataFrame should supports column with map type - posted by GitBox <gi...@apache.org> on 2023/01/13 04:36:51 UTC, 0 replies.
- [GitHub] [spark] rithwik-db commented on pull request #39542: [SPARK-41591][PYTHON][ML][FOLLOW-UP] Fix type hints that are incompatible with Python <= 3.8 - posted by GitBox <gi...@apache.org> on 2023/01/13 04:47:31 UTC, 0 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #39188: [SPARK-41591][PYTHON][ML] Training PyTorch Files on Single Node Multi GPU - posted by GitBox <gi...@apache.org> on 2023/01/13 04:51:25 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39535: [SPARK-41746][SPARK-41838][SPARK-41837][SPARK-41835][SPARK-41836][SPARK-41847][CONNECT][PYTHON] Make `createDataFrame(rows/lists/tuples/dicts)` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/13 04:51:59 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39535: [SPARK-41746][SPARK-41838][SPARK-41837][SPARK-41835][SPARK-41836][SPARK-41847][CONNECT][PYTHON] Make `createDataFrame(rows/lists/tuples/dicts)` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/13 04:53:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39546: [SPARK-42045][SQL] ANSI SQL mode: Round/Bround should return an error on integer overflow - posted by GitBox <gi...@apache.org> on 2023/01/13 04:55:16 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39547: [SPARK-42016][CONNECT][PYTHON] Enable tests related to the nested column - posted by GitBox <gi...@apache.org> on 2023/01/13 05:05:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39548: [SPARK-42014][CONNECT][PYTHON] Enable 2 tests in test_parity_serde - posted by GitBox <gi...@apache.org> on 2023/01/13 05:16:23 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39538: [SPARK-41596][SS][DOCS] Document the new feature "Async Progress Tracking" to Structured Streaming guide doc - posted by GitBox <gi...@apache.org> on 2023/01/13 05:19:37 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39539: [SPARK-42037][INFRA] Remove `AMPLAB_` prefix in build environment variables - posted by GitBox <gi...@apache.org> on 2023/01/13 05:22:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39545: [SPARK-42042][CONNECT][PYTHON] `DataFrameReader` should support StructType schema - posted by GitBox <gi...@apache.org> on 2023/01/13 05:23:57 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38496: [SPARK-40708][SQL] Auto update table statistics based on write metrics - posted by GitBox <gi...@apache.org> on 2023/01/13 05:57:55 UTC, 0 replies.
- [GitHub] [spark] anchovYu commented on a diff in pull request #39508: [SPARK-41985][SQL] Centralize more column resolution rules - posted by GitBox <gi...@apache.org> on 2023/01/13 06:03:18 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39544: [SPARK-42028][CONNECT][PYTHON][FOLLOW-UP] Uses the same logic with PySpark, and reeanbles skipped test - posted by GitBox <gi...@apache.org> on 2023/01/13 06:11:18 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39549: [SPARK-42046][TESTS] Add `connect-client-jvm` to connect module - posted by GitBox <gi...@apache.org> on 2023/01/13 06:13:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39549: [SPARK-42046][TESTS] Add `connect-client-jvm` to `connect` module - posted by GitBox <gi...@apache.org> on 2023/01/13 06:17:07 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39544: [SPARK-42028][CONNECT][PYTHON][FOLLOW-UP] Uses the same logic with PySpark, and reeanbles skipped test - posted by GitBox <gi...@apache.org> on 2023/01/13 06:20:30 UTC, 0 replies.
- [GitHub] [spark] rangadi opened a new pull request, #39550: [SQL][PROTOBUF] Add missing options for Protobuf functions - posted by GitBox <gi...@apache.org> on 2023/01/13 06:21:43 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #39550: [SQL][PROTOBUF] Add missing options for Protobuf functions - posted by GitBox <gi...@apache.org> on 2023/01/13 06:22:00 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #39550: [SQL][PROTOBUF] Add missing options for Protobuf functions - posted by GitBox <gi...@apache.org> on 2023/01/13 06:26:05 UTC, 0 replies.
- [GitHub] [spark] jerrypeng commented on pull request #39538: [SPARK-41596][SS][DOCS] Document the new feature "Async Progress Tracking" to Structured Streaming guide doc - posted by GitBox <gi...@apache.org> on 2023/01/13 06:27:51 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39539: [SPARK-42037][INFRA] Rename `AMPLAB_` to `SPARK_` in Jenkins build environment variables - posted by GitBox <gi...@apache.org> on 2023/01/13 06:35:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39539: [SPARK-42037][INFRA] Rename `AMPLAB_` to `SPARK_` in Jenkins build environment variables - posted by GitBox <gi...@apache.org> on 2023/01/13 06:35:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39551: [SPARK-42047][SPARK-41900][CONNECT][PYTHON] Literal should support numpy datatypes - posted by GitBox <gi...@apache.org> on 2023/01/13 06:39:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39551: [SPARK-42047][SPARK-41900][CONNECT][PYTHON] Literal should support numpy datatypes - posted by GitBox <gi...@apache.org> on 2023/01/13 06:40:56 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #39538: [SPARK-41596][SS][DOCS] Document the new feature "Async Progress Tracking" to Structured Streaming guide doc - posted by GitBox <gi...@apache.org> on 2023/01/13 06:48:57 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39545: [SPARK-42042][CONNECT][PYTHON] `DataFrameReader` should support StructType schema - posted by GitBox <gi...@apache.org> on 2023/01/13 06:56:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39547: [SPARK-42016][CONNECT][PYTHON] Enable tests related to the nested column - posted by GitBox <gi...@apache.org> on 2023/01/13 06:57:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39547: [SPARK-42016][CONNECT][PYTHON] Enable tests related to the nested column - posted by GitBox <gi...@apache.org> on 2023/01/13 06:58:20 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39475: [SPARK-41959][SQL] Improve v1 writes with empty2null - posted by GitBox <gi...@apache.org> on 2023/01/13 07:03:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39512: [SPARK-41986][SQL] Introduce shuffle on SinglePartition - posted by GitBox <gi...@apache.org> on 2023/01/13 07:10:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39512: [SPARK-41986][SQL] Introduce shuffle on SinglePartition - posted by GitBox <gi...@apache.org> on 2023/01/13 07:10:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39552: [SPARK-42001][CONNECT][PYTHON][TESTS] Enable two schema related reader tests - posted by GitBox <gi...@apache.org> on 2023/01/13 07:20:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39361: [SPARK-41822][CONNECT] Setup gRPC connection for Scala/JVM client - posted by GitBox <gi...@apache.org> on 2023/01/13 07:31:16 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39502: [SPARK-41981][SQL] Collapse percentile functions if possible - posted by GitBox <gi...@apache.org> on 2023/01/13 07:50:03 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39501: [SPARK-41295][SQL] Rename the error classes - posted by GitBox <gi...@apache.org> on 2023/01/13 07:50:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39549: [SPARK-42046][TESTS] Add `connect-client-jvm` to `connect` module and fix port failure - posted by GitBox <gi...@apache.org> on 2023/01/13 08:12:26 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39551: [SPARK-42047][SPARK-41900][CONNECT][PYTHON] Literal should support Numpy datatypes - posted by GitBox <gi...@apache.org> on 2023/01/13 08:35:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39552: [SPARK-42001][CONNECT][PYTHON][TESTS] Enable two schema related reader tests - posted by GitBox <gi...@apache.org> on 2023/01/13 08:39:47 UTC, 0 replies.
- [GitHub] [spark] simonvanderveldt commented on a diff in pull request #38518: [SPARK-33349][K8S] Reset the executor pods watcher when we receive a version changed from k8s - posted by GitBox <gi...@apache.org> on 2023/01/13 09:04:24 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39548: [SPARK-42014][CONNECT][PYTHON][TESTS] Enable 2 tests in test_parity_serde - posted by GitBox <gi...@apache.org> on 2023/01/13 09:06:47 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39548: [SPARK-42014][CONNECT][PYTHON][TESTS] Enable 2 tests in test_parity_serde - posted by GitBox <gi...@apache.org> on 2023/01/13 09:07:02 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #39524: [WIP][SPARK-41990][SQL] Fix bug for FieldReference - posted by GitBox <gi...@apache.org> on 2023/01/13 09:11:29 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39552: [SPARK-42001][CONNECT][PYTHON][TESTS] Enable two schema related reader tests - posted by GitBox <gi...@apache.org> on 2023/01/13 09:43:07 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39553: [SPARK-42041][CONNECT][PYTHON] DataFrameReader should support list of paths - posted by GitBox <gi...@apache.org> on 2023/01/13 10:38:08 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39554: [SPARK-42055][BUILD] Upgrade scalatest-maven-plugin to 2.2.0 - posted by GitBox <gi...@apache.org> on 2023/01/13 10:52:03 UTC, 0 replies.
- [GitHub] [spark] yaooqinn opened a new pull request, #39555: [SPARK-42051][SQL] Codegen Support for HiveGenericUDF - posted by GitBox <gi...@apache.org> on 2023/01/13 11:02:44 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on pull request #39555: [SPARK-42051][SQL] Codegen Support for HiveGenericUDF - posted by GitBox <gi...@apache.org> on 2023/01/13 11:10:08 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39549: [SPARK-42046][TESTS] Add `connect-client-jvm` to `connect` module and fix port failure - posted by GitBox <gi...@apache.org> on 2023/01/13 11:13:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39553: [SPARK-42041][SPARK-42013][CONNECT][PYTHON] DataFrameReader should support list of paths - posted by GitBox <gi...@apache.org> on 2023/01/13 11:23:02 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39553: [SPARK-42041][SPARK-42013][CONNECT][PYTHON] DataFrameReader should support list of paths - posted by GitBox <gi...@apache.org> on 2023/01/13 11:30:38 UTC, 0 replies.
- [GitHub] [spark] ocworld commented on pull request #38828: [SPARK-35084][CORE] Spark 3: supporting --packages in k8s client mode with the driver running inside a POD - posted by GitBox <gi...@apache.org> on 2023/01/13 11:41:14 UTC, 1 replies.
- [GitHub] [spark] ocworld commented on a diff in pull request #38828: [SPARK-35084][CORE] Spark 3: supporting --packages in k8s client mode with the driver running inside a POD - posted by GitBox <gi...@apache.org> on 2023/01/13 11:41:54 UTC, 1 replies.
- [GitHub] [spark] beregon87 commented on pull request #38376: [SPARK-40817] [Kubernetes] Do not discard remote user-specified files when launching Spark jobs on Kubernetes - posted by GitBox <gi...@apache.org> on 2023/01/13 13:04:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39524: [WIP][SPARK-41990][SQL] Fix bug for FieldReference - posted by GitBox <gi...@apache.org> on 2023/01/13 13:25:57 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #39460: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2023/01/13 13:45:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39555: [SPARK-42051][SQL] Codegen Support for HiveGenericUDF - posted by GitBox <gi...@apache.org> on 2023/01/13 14:08:28 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #39556: [WIP][SPARK-42049][SQL] Improve AliasAwareOutputExpression - posted by GitBox <gi...@apache.org> on 2023/01/13 14:40:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39556: [WIP][SPARK-42049][SQL] Improve AliasAwareOutputExpression - posted by GitBox <gi...@apache.org> on 2023/01/13 14:43:17 UTC, 14 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39556: [WIP][SPARK-42049][SQL] Improve AliasAwareOutputExpression - posted by GitBox <gi...@apache.org> on 2023/01/13 14:49:07 UTC, 11 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39549: [SPARK-42046][TESTS] Add `connect-client-jvm` to `connect` module and fix port failure - posted by GitBox <gi...@apache.org> on 2023/01/13 15:13:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39554: [SPARK-42055][BUILD] Upgrade scalatest-maven-plugin to 2.2.0 - posted by GitBox <gi...@apache.org> on 2023/01/13 15:16:03 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #38828: [SPARK-35084][CORE] Spark 3: supporting --packages in k8s client mode with the driver running inside a POD - posted by GitBox <gi...@apache.org> on 2023/01/13 15:30:32 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #39556: [WIP][SPARK-42049][SQL] Improve AliasAwareOutputExpression - posted by GitBox <gi...@apache.org> on 2023/01/13 15:37:57 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38034: [SPARK-40599][SQL] Add multiTransform methods to TreeNode to generate alternatives - posted by GitBox <gi...@apache.org> on 2023/01/13 15:48:41 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39556: [WIP][SPARK-42049][SQL] Improve AliasAwareOutputExpression - posted by GitBox <gi...@apache.org> on 2023/01/13 15:56:07 UTC, 0 replies.
- [GitHub] [spark] LorenzoMartini commented on pull request #32921: [SPARK-35779][SQL] Dynamic filtering for Data Source V2 - posted by GitBox <gi...@apache.org> on 2023/01/13 17:02:33 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39546: [SPARK-42045][SQL] ANSI SQL mode: Round/Bround should return an error on integer overflow - posted by GitBox <gi...@apache.org> on 2023/01/13 17:39:56 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39546: [SPARK-42045][SQL] ANSI SQL mode: Round/Bround should return an error on integer overflow - posted by GitBox <gi...@apache.org> on 2023/01/13 17:40:49 UTC, 0 replies.
- [GitHub] [spark] rangadi commented on pull request #39550: [SPARK-42056][SQL][PROTOBUF] Add missing options for Protobuf functions - posted by GitBox <gi...@apache.org> on 2023/01/13 18:05:30 UTC, 1 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39557: [SPARK-42045][SQL] ANSI SQL mode: Round/Bround should return an error on tiny/small/big integer overflow - posted by GitBox <gi...@apache.org> on 2023/01/13 18:48:34 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng opened a new pull request, #39558: [SPARK-41982] Partitions of type string should not be treated as numeric types - posted by GitBox <gi...@apache.org> on 2023/01/13 19:09:36 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38376: [SPARK-40817] [Kubernetes] Do not discard remote user-specified files when launching Spark jobs on Kubernetes - posted by GitBox <gi...@apache.org> on 2023/01/13 19:13:56 UTC, 3 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39536: [SPARK-42057][SQL][PROTOBUF] Fix how exception is handled in error reporting. - posted by GitBox <gi...@apache.org> on 2023/01/13 19:18:26 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38376: [SPARK-40817][K8S] `spark.files` should preserve remote files - posted by GitBox <gi...@apache.org> on 2023/01/13 19:22:56 UTC, 7 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #39039: [SPARK-40776][SQL][PROTOBUF][DOCS] Spark-Protobuf docs - posted by GitBox <gi...@apache.org> on 2023/01/13 19:24:56 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39559: [SPARK-42011][CONNECT][PYTHON] Implement DataFrameReader.csv - posted by GitBox <gi...@apache.org> on 2023/01/13 19:47:28 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39560: [MINOR][CONNECT][TESTS] Fix typos in tests/connect/test_connect_basic.py - posted by GitBox <gi...@apache.org> on 2023/01/13 19:52:05 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on a diff in pull request #39560: [MINOR][CONNECT][TESTS] Fix typos in tests/connect/test_connect_basic.py - posted by GitBox <gi...@apache.org> on 2023/01/13 19:52:59 UTC, 1 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #39536: [SPARK-42057][SQL][PROTOBUF] Fix how exception is handled in error reporting. - posted by GitBox <gi...@apache.org> on 2023/01/13 20:01:00 UTC, 2 replies.
- [GitHub] [spark] rmcyang commented on a diff in pull request #38959: SPARK-41415: SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/13 21:54:16 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39537: [SPARK-41994] [DRAFT] Assign SQLSTATE's (1/?) - posted by GitBox <gi...@apache.org> on 2023/01/13 22:32:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39559: [SPARK-42011][CONNECT][PYTHON] Implement DataFrameReader.csv - posted by GitBox <gi...@apache.org> on 2023/01/13 23:03:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39365: [SPARK-41859][SQL] CreateHiveTableAsSelectCommand should set the overwrite flag correctly - posted by GitBox <gi...@apache.org> on 2023/01/13 23:11:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39560: [MINOR][CONNECT][TESTS] Fix typos in tests/connect/test_connect_basic.py - posted by GitBox <gi...@apache.org> on 2023/01/13 23:18:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39560: [MINOR][CONNECT][TESTS] Fix typos in tests/connect/test_connect_basic.py - posted by GitBox <gi...@apache.org> on 2023/01/13 23:19:12 UTC, 0 replies.
- [GitHub] [spark] williamhyun opened a new pull request, #39561: [SPARK-42059][BUILD] Update ORC to 1.8.2 - posted by GitBox <gi...@apache.org> on 2023/01/14 00:03:25 UTC, 0 replies.
- [GitHub] [spark] williamhyun commented on pull request #39561: [SPARK-42059][BUILD] Update ORC to 1.8.2 - posted by GitBox <gi...@apache.org> on 2023/01/14 00:05:31 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39562: [SPARK-41964][CONNECT][PYTHON][FOLLOW-UP] Fix the jdbc writer not implemented Test - posted by GitBox <gi...@apache.org> on 2023/01/14 00:11:49 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on a diff in pull request #39559: [SPARK-42011][CONNECT][PYTHON] Implement DataFrameReader.csv - posted by GitBox <gi...@apache.org> on 2023/01/14 00:17:54 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38039: [SPARK-40603][SQL] Throw the original error from catalog implementations - posted by GitBox <gi...@apache.org> on 2023/01/14 00:18:39 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38024: [SPARK-40591][CORE][SQL] Fix data loss caused by ignoreCorruptFiles - posted by GitBox <gi...@apache.org> on 2023/01/14 00:18:40 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on pull request #39559: [SPARK-42011][CONNECT][PYTHON] Implement DataFrameReader.csv - posted by GitBox <gi...@apache.org> on 2023/01/14 00:21:44 UTC, 0 replies.
- [GitHub] [spark] hussein-awala opened a new pull request, #39563: [SPARK-42060] add new config to override driver/executor k8s containers names - posted by GitBox <gi...@apache.org> on 2023/01/14 00:35:50 UTC, 0 replies.
- [GitHub] [spark] huaxingao opened a new pull request, #39564: [SPARK-41990][SQL] Filrering by composite field name doesn't work - posted by GitBox <gi...@apache.org> on 2023/01/14 01:20:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39552: [SPARK-42001][CONNECT][PYTHON][TESTS] Update the related JIRA tickets of two DataFrameReader tests - posted by GitBox <gi...@apache.org> on 2023/01/14 01:47:14 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #39561: [SPARK-42059][BUILD] Update ORC to 1.8.2 - posted by GitBox <gi...@apache.org> on 2023/01/14 02:15:01 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #39564: [SPARK-41990][SQL] Filrering by composite field name doesn't work - posted by GitBox <gi...@apache.org> on 2023/01/14 02:36:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39561: [SPARK-42059][BUILD] Update ORC to 1.8.2 - posted by GitBox <gi...@apache.org> on 2023/01/14 02:57:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39561: [SPARK-42059][BUILD] Update ORC to 1.8.2 - posted by GitBox <gi...@apache.org> on 2023/01/14 02:57:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39479: [SPARK-41961][SQL] Support table-valued functions with LATERAL - posted by GitBox <gi...@apache.org> on 2023/01/14 03:09:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39479: [SPARK-41961][SQL] Support table-valued functions with LATERAL - posted by GitBox <gi...@apache.org> on 2023/01/14 03:10:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39493: [SPARK-41965][PYTHON][DOCS][WIP] Add DataFrameWriterV2 to PySpark API references - posted by GitBox <gi...@apache.org> on 2023/01/14 03:11:28 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39493: [SPARK-41965][PYTHON][DOCS][WIP] Add DataFrameWriterV2 to PySpark API references - posted by GitBox <gi...@apache.org> on 2023/01/14 03:11:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39551: [SPARK-42047][SPARK-41900][CONNECT][PYTHON] Literal should support Numpy datatypes - posted by GitBox <gi...@apache.org> on 2023/01/14 03:12:49 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39551: [SPARK-42047][SPARK-41900][CONNECT][PYTHON] Literal should support Numpy datatypes - posted by GitBox <gi...@apache.org> on 2023/01/14 03:13:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39552: [SPARK-42001][CONNECT][PYTHON][TESTS] Update the related JIRA tickets of two DataFrameReader tests - posted by GitBox <gi...@apache.org> on 2023/01/14 03:15:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39552: [SPARK-42001][CONNECT][PYTHON][TESTS] Update the related JIRA tickets of two DataFrameReader tests - posted by GitBox <gi...@apache.org> on 2023/01/14 03:16:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39565: [SPARK-42062][CONNECT][TESTS] Enforce scalafmt for connect-common - posted by GitBox <gi...@apache.org> on 2023/01/14 03:31:26 UTC, 0 replies.
- [GitHub] [spark] thejdeep commented on a diff in pull request #35969: [SPARK-38651][SQL] Add configuration to support writing out empty schemas in supported filebased datasources - posted by GitBox <gi...@apache.org> on 2023/01/14 03:41:34 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #35969: [SPARK-38651][SQL] Add configuration to support writing out empty schemas in supported filebased datasources - posted by GitBox <gi...@apache.org> on 2023/01/14 03:52:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #35969: [SPARK-38651][SQL] Add `spark.sql.legacy.allowEmptySchemaWrite` - posted by GitBox <gi...@apache.org> on 2023/01/14 03:55:06 UTC, 1 replies.
- [GitHub] [spark] thejdeep commented on pull request #35969: [SPARK-38651][SQL] Add `spark.sql.legacy.allowEmptySchemaWrite` - posted by GitBox <gi...@apache.org> on 2023/01/14 03:59:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37525: [WIP][SPARK-40086][SQL] Improve AliasAwareOutputPartitioning to take all aliases into account - posted by GitBox <gi...@apache.org> on 2023/01/14 04:01:11 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #35969: [SPARK-38651][SQL] Add `spark.sql.legacy.allowEmptySchemaWrite` - posted by GitBox <gi...@apache.org> on 2023/01/14 04:09:42 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38034: [SPARK-40599][SQL] Add multiTransform methods to TreeNode to generate alternatives - posted by GitBox <gi...@apache.org> on 2023/01/14 04:14:47 UTC, 10 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39408: [SPARK-41896][SQL] Filtering by row index returns empty results - posted by GitBox <gi...@apache.org> on 2023/01/14 04:29:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39408: [SPARK-41896][SQL] Filtering by row index returns empty results - posted by GitBox <gi...@apache.org> on 2023/01/14 04:29:34 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39536: [SPARK-42057][SQL][PROTOBUF] Fix how exception is handled in error reporting. - posted by GitBox <gi...@apache.org> on 2023/01/14 04:29:45 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39536: [SPARK-42057][SQL][PROTOBUF] Fix how exception is handled in error reporting. - posted by GitBox <gi...@apache.org> on 2023/01/14 04:30:13 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39556: [WIP][SPARK-42049][SQL] Improve AliasAwareOutputExpression - posted by GitBox <gi...@apache.org> on 2023/01/14 04:30:15 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39557: [SPARK-42045][SQL] ANSI SQL mode: Round/Bround should return an error on tiny/small/big integer overflow - posted by GitBox <gi...@apache.org> on 2023/01/14 04:30:34 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39557: [SPARK-42045][SQL] ANSI SQL mode: Round/Bround should return an error on tiny/small/big integer overflow - posted by GitBox <gi...@apache.org> on 2023/01/14 04:30:54 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39556: [SPARK-42049][SQL] Improve AliasAwareOutputExpression - posted by GitBox <gi...@apache.org> on 2023/01/14 04:33:44 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39131: [SPARK-41162][SQL] Fix anti- and semi-join for self-join with aggregations - posted by GitBox <gi...@apache.org> on 2023/01/14 04:34:09 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39565: [SPARK-42062][CONNECT][TESTS] Enforce scalafmt for connect-common - posted by GitBox <gi...@apache.org> on 2023/01/14 05:27:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39563: [SPARK-42060][K8S][WIP] add new config to override driver/executor k8s containers names - posted by GitBox <gi...@apache.org> on 2023/01/14 05:36:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39562: [SPARK-41964][CONNECT][PYTHON][FOLLOW-UP] Fix the jdbc writer not implemented Test - posted by GitBox <gi...@apache.org> on 2023/01/14 06:42:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39562: [SPARK-41964][CONNECT][PYTHON][FOLLOW-UP] Fix the jdbc writer not implemented Test - posted by GitBox <gi...@apache.org> on 2023/01/14 06:44:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39553: [SPARK-42041][SPARK-42013][CONNECT][PYTHON] DataFrameReader should support list of paths - posted by GitBox <gi...@apache.org> on 2023/01/14 06:44:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39559: [SPARK-42011][CONNECT][PYTHON] Implement DataFrameReader.csv - posted by GitBox <gi...@apache.org> on 2023/01/14 06:45:25 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39553: [SPARK-42041][SPARK-42013][CONNECT][PYTHON] DataFrameReader should support list of paths - posted by GitBox <gi...@apache.org> on 2023/01/14 06:45:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39564: [SPARK-41990][SQL] Filrering by composite field name doesn't work - posted by GitBox <gi...@apache.org> on 2023/01/14 06:49:03 UTC, 1 replies.
- [GitHub] [spark] imhunterand opened a new pull request, #39566: Patched()Fix Protobuf Java vulnerable to Uncontrolled Resource Consumption - posted by GitBox <gi...@apache.org> on 2023/01/14 07:13:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #35969: [SPARK-38651][SQL] Add `spark.sql.legacy.allowEmptySchemaWrite` - posted by GitBox <gi...@apache.org> on 2023/01/14 07:39:01 UTC, 1 replies.
- [GitHub] [spark] smallzhongfeng commented on pull request #39558: [SPARK-41982][SQL] Partitions of type string should not be treated as numeric types - posted by GitBox <gi...@apache.org> on 2023/01/14 07:45:58 UTC, 3 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39567: [SPARK-42012][CONNECT][PYTHON] Implement DataFrameReader.orc - posted by GitBox <gi...@apache.org> on 2023/01/14 08:02:46 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #39564: [SPARK-41990][SQL] Filrering by composite field name doesn't work - posted by GitBox <gi...@apache.org> on 2023/01/14 08:09:21 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #38034: [SPARK-40599][SQL] Add multiTransform methods to TreeNode to generate alternatives - posted by GitBox <gi...@apache.org> on 2023/01/14 08:12:56 UTC, 11 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #37525: [WIP][SPARK-40086][SQL] Improve AliasAwareOutputPartitioning to take all aliases into account - posted by GitBox <gi...@apache.org> on 2023/01/14 08:20:52 UTC, 1 replies.
- [GitHub] [spark] NarekDW commented on pull request #39501: [SPARK-41295][SQL] Rename the error classes - posted by GitBox <gi...@apache.org> on 2023/01/14 08:37:00 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39459: [WIP][SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by GitBox <gi...@apache.org> on 2023/01/14 08:42:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39451: [SPARK-41832][CONNECT][PYTHON] Fix `DataFrame.unionByName`, add allow_missing_columns - posted by GitBox <gi...@apache.org> on 2023/01/14 09:16:41 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39451: [SPARK-41832][CONNECT][PYTHON] Fix `DataFrame.unionByName`, add allow_missing_columns - posted by GitBox <gi...@apache.org> on 2023/01/14 09:17:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39568: [SPARK-41847][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable test `test_explode ` - posted by GitBox <gi...@apache.org> on 2023/01/14 09:25:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39569: [SPARK-42063][CORE] Register `byte[][]` to KyroSerializer - posted by GitBox <gi...@apache.org> on 2023/01/14 09:30:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39569: [SPARK-42063][CORE] Register `byte[][]` to KyroSerializer - posted by GitBox <gi...@apache.org> on 2023/01/14 09:33:38 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #35969: [SPARK-38651][SQL] Add `spark.sql.legacy.allowEmptySchemaWrite` - posted by GitBox <gi...@apache.org> on 2023/01/14 09:58:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39569: [SPARK-42063][CORE] Register `byte[][]` to KyroSerializer - posted by GitBox <gi...@apache.org> on 2023/01/14 10:00:01 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39570: [SPARK-41903][CONNECT][PYTHON] `Literal` should support 1-dim ndarray - posted by GitBox <gi...@apache.org> on 2023/01/14 10:21:11 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39570: [SPARK-41903][CONNECT][PYTHON] `Literal` should support 1-dim ndarray - posted by GitBox <gi...@apache.org> on 2023/01/14 10:21:42 UTC, 1 replies.
- [GitHub] [spark] asfgit closed pull request #37922: [SPARK-40480][SHUFFLE] Remove push-based shuffle data after query finished - posted by GitBox <gi...@apache.org> on 2023/01/14 10:24:26 UTC, 0 replies.
- [GitHub] [spark] hussein-awala commented on pull request #39563: [SPARK-42060][K8S][WIP] add new config to override driver/executor k8s containers names - posted by GitBox <gi...@apache.org> on 2023/01/14 11:02:22 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39563: [SPARK-42060][K8S][WIP] add new config to override driver/executor k8s containers names - posted by GitBox <gi...@apache.org> on 2023/01/14 11:15:57 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #39571: [SPARK-42064][SQL] Implement bloom filter join hint - posted by GitBox <gi...@apache.org> on 2023/01/14 13:28:05 UTC, 0 replies.
- [GitHub] [spark] wangyum closed pull request #39569: [SPARK-42063][CORE] Register `byte[][]` to KyroSerializer - posted by GitBox <gi...@apache.org> on 2023/01/14 13:46:13 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #39569: [SPARK-42063][CORE] Register `byte[][]` to KyroSerializer - posted by GitBox <gi...@apache.org> on 2023/01/14 13:47:03 UTC, 0 replies.
- [GitHub] [spark] Kimahriman opened a new pull request, #39572: [SPARK-39979][SQL] Add option to use large variable width vectors for arrow UDF operations - posted by GitBox <gi...@apache.org> on 2023/01/14 14:43:29 UTC, 0 replies.
- [GitHub] [spark] Kimahriman commented on pull request #39572: [SPARK-39979][SQL] Add option to use large variable width vectors for arrow UDF operations - posted by GitBox <gi...@apache.org> on 2023/01/14 14:45:34 UTC, 3 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #39573: [SPARK-42065][PYTHON][TESTS] Remove duplicated `test_freqItems` test - posted by GitBox <gi...@apache.org> on 2023/01/14 20:07:41 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #38376: [SPARK-40817][K8S] `spark.files` should preserve remote files - posted by GitBox <gi...@apache.org> on 2023/01/14 20:27:38 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on a diff in pull request #39488: [SPARK-41970] Introduce SparkPath for typesafety - posted by GitBox <gi...@apache.org> on 2023/01/14 21:08:04 UTC, 0 replies.
- [GitHub] [spark] tedyu opened a new pull request, #39574: [K8S][MINOR] Correct typo in log for decommission - posted by GitBox <gi...@apache.org> on 2023/01/14 21:32:11 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #39574: [K8S][MINOR] Correct typo in log for decommission - posted by GitBox <gi...@apache.org> on 2023/01/14 21:32:55 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39488: [SPARK-41970] Introduce SparkPath for typesafety - posted by GitBox <gi...@apache.org> on 2023/01/14 22:32:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39567: [SPARK-42012][CONNECT][PYTHON] Implement DataFrameReader.orc - posted by GitBox <gi...@apache.org> on 2023/01/14 23:44:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39574: [K8S][MINOR] Correct typo in log for decommission - posted by GitBox <gi...@apache.org> on 2023/01/14 23:45:34 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38073: [SPARK-40632][SQL] Do not inject runtime filter if join condition reference non simple expression - posted by GitBox <gi...@apache.org> on 2023/01/15 00:20:47 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38039: [SPARK-40603][SQL] Throw the original error from catalog implementations - posted by GitBox <gi...@apache.org> on 2023/01/15 00:20:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38024: [SPARK-40591][CORE][SQL] Fix data loss caused by ignoreCorruptFiles - posted by GitBox <gi...@apache.org> on 2023/01/15 00:20:49 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37996: [SPARK-40558][SQL] Add Reusable Exchange in Bloom creation side plan - posted by GitBox <gi...@apache.org> on 2023/01/15 00:20:50 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37822: [SPARK-40381][DEPLOY] Support standalone worker recommission - posted by GitBox <gi...@apache.org> on 2023/01/15 00:20:51 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37809: [SPARK-40355][SQL] Improve pushdown for orc & parquet when cast scenario - posted by GitBox <gi...@apache.org> on 2023/01/15 00:20:53 UTC, 0 replies.
- [GitHub] [spark] srielau opened a new pull request, #39575: [Spark-42058] [DRAFT] sqlstate 2/2 - posted by GitBox <gi...@apache.org> on 2023/01/15 00:34:27 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39575: [Spark-42058] [DRAFT] sqlstate 2/2 - posted by GitBox <gi...@apache.org> on 2023/01/15 00:35:31 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39573: [SPARK-42065][PYTHON][CONNECT][TESTS] Remove duplicated `test_freqItems` test - posted by GitBox <gi...@apache.org> on 2023/01/15 00:35:34 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39572: [SPARK-39979][SQL] Add option to use large variable width vectors for arrow UDF operations - posted by GitBox <gi...@apache.org> on 2023/01/15 00:35:37 UTC, 0 replies.
- [GitHub] [spark] attilapiros closed pull request #38828: [SPARK-35084][CORE] Fixing --packages in k8s client mode with the driver running inside a POD - posted by GitBox <gi...@apache.org> on 2023/01/15 01:30:10 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #38828: [SPARK-35084][CORE] Fixing --packages in k8s client mode with the driver running inside a POD - posted by GitBox <gi...@apache.org> on 2023/01/15 01:32:29 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39576: [WIP][SPARK-42067][CONNECT][BUILD] Upgrade buf from 1.11.0 to 1.12.0 - posted by GitBox <gi...@apache.org> on 2023/01/15 01:46:27 UTC, 0 replies.
- [GitHub] [spark] ocworld commented on pull request #38828: [SPARK-35084][CORE] Fixing --packages in k8s client mode with the driver running inside a POD - posted by GitBox <gi...@apache.org> on 2023/01/15 02:13:19 UTC, 1 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #39577: [SPARK-42070][SQL] Change the default value of argument of Mask udf from -1 to NULL - posted by GitBox <gi...@apache.org> on 2023/01/15 05:00:16 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #39577: [SPARK-42070][SQL] Change the default value of argument of Mask udf from -1 to NULL - posted by GitBox <gi...@apache.org> on 2023/01/15 05:33:35 UTC, 1 replies.
- [GitHub] [spark] mridulm closed pull request #38959: SPARK-41415: SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/15 05:59:09 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39577: [SPARK-42070][SQL] Change the default value of argument of Mask udf from -1 to NULL - posted by GitBox <gi...@apache.org> on 2023/01/15 06:21:09 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #39577: [SPARK-42070][SQL] Change the default value of argument of Mask udf from -1 to NULL - posted by GitBox <gi...@apache.org> on 2023/01/15 06:21:20 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39566: Patched()Fix Protobuf Java vulnerable to Uncontrolled Resource Consumption - posted by GitBox <gi...@apache.org> on 2023/01/15 06:50:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39564: [SPARK-41990][SQL] Use `FieldReference.column` instead of `apply` in V1 to V2 filter conversion - posted by GitBox <gi...@apache.org> on 2023/01/15 06:52:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39564: [SPARK-41990][SQL] Use `FieldReference.column` instead of `apply` in V1 to V2 filter conversion - posted by GitBox <gi...@apache.org> on 2023/01/15 06:53:34 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39578: [SPARK-42071][CORE] Register `scala.math.Ordering$Reverse` to KyroSerializer - posted by GitBox <gi...@apache.org> on 2023/01/15 06:58:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39578: [SPARK-42071][CORE] Register `scala.math.Ordering$Reverse` to KyroSerializer - posted by GitBox <gi...@apache.org> on 2023/01/15 06:59:30 UTC, 2 replies.
- [GitHub] [spark] huaxingao commented on pull request #39564: [SPARK-41990][SQL] Use `FieldReference.column` instead of `apply` in V1 to V2 filter conversion - posted by GitBox <gi...@apache.org> on 2023/01/15 07:33:58 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39578: [SPARK-42071][CORE] Register `scala.math.Ordering$Reverse` to KyroSerializer - posted by GitBox <gi...@apache.org> on 2023/01/15 08:16:51 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39578: [SPARK-42071][CORE] Register `scala.math.Ordering$Reverse` to KyroSerializer - posted by GitBox <gi...@apache.org> on 2023/01/15 09:10:27 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39558: [SPARK-41982][SQL] Partitions of type string should not be treated as numeric types - posted by GitBox <gi...@apache.org> on 2023/01/15 10:59:40 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39559: [SPARK-42011][CONNECT][PYTHON] Implement DataFrameReader.csv - posted by GitBox <gi...@apache.org> on 2023/01/15 11:08:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39573: [SPARK-42065][PYTHON][CONNECT][TESTS] Remove duplicated `test_freqItems` test - posted by GitBox <gi...@apache.org> on 2023/01/15 11:10:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39573: [SPARK-42065][PYTHON][CONNECT][TESTS] Remove duplicated `test_freqItems` test - posted by GitBox <gi...@apache.org> on 2023/01/15 11:10:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39570: [SPARK-41903][CONNECT][PYTHON] `Literal` should support 1-dim ndarray - posted by GitBox <gi...@apache.org> on 2023/01/15 11:14:51 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39568: [SPARK-41847][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable test `test_explode ` - posted by GitBox <gi...@apache.org> on 2023/01/15 11:15:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39568: [SPARK-41847][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable test `test_explode ` - posted by GitBox <gi...@apache.org> on 2023/01/15 11:15:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39579: [SPARK-42072][CORE] `core` module requires `javax.servlet-api` - posted by GitBox <gi...@apache.org> on 2023/01/15 11:50:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39572: [SPARK-39979][SQL] Add option to use large variable width vectors for arrow UDF operations - posted by GitBox <gi...@apache.org> on 2023/01/15 12:19:01 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39579: [SPARK-42072][CORE] `core` module requires `javax.servlet-api` - posted by GitBox <gi...@apache.org> on 2023/01/15 12:49:14 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39550: [SPARK-42056][SQL][PROTOBUF] Add missing options for Protobuf functions - posted by GitBox <gi...@apache.org> on 2023/01/15 14:07:36 UTC, 0 replies.
- [GitHub] [spark] tedyu opened a new pull request, #39580: [MINOR] Throw Timeout exception when SASL Request Retry is enabled - posted by GitBox <gi...@apache.org> on 2023/01/15 14:25:41 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #39580: [MINOR] [FOLLOW-UP] Throw Timeout exception when SASL Request Retry is enabled - posted by GitBox <gi...@apache.org> on 2023/01/15 14:26:42 UTC, 0 replies.
- [GitHub] [spark] ivoson commented on a diff in pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by GitBox <gi...@apache.org> on 2023/01/15 16:14:33 UTC, 10 replies.
- [GitHub] [spark] ivoson commented on pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by GitBox <gi...@apache.org> on 2023/01/15 16:15:48 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39541: [SPARK-42043][CONNECT] Scala Client Result with E2E Tests - posted by GitBox <gi...@apache.org> on 2023/01/15 16:15:54 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39498: [SPARK-41976][SQL] Improve error message for `INDEX_NOT_FOUND` - posted by GitBox <gi...@apache.org> on 2023/01/15 16:28:19 UTC, 1 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39581: [SPARK-42011][SPARK-42012][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable csv, orc tests in connect/test_parity_datasources.py - posted by GitBox <gi...@apache.org> on 2023/01/15 16:54:54 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39580: [MINOR] [FOLLOW-UP] Throw Timeout exception when SASL Request Retry is enabled - posted by GitBox <gi...@apache.org> on 2023/01/15 17:06:30 UTC, 1 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39582: [SPARK-41921][CONNECT][TESTS][FOLLOW-UP] Enable doctests in connect/column and connect/readwriter - posted by GitBox <gi...@apache.org> on 2023/01/15 17:28:06 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39583: [SPARK-42073][CONNECT][PYTHON][TESTS] Enable tests in common/test_parity_serde, common/test_parity_types - posted by GitBox <gi...@apache.org> on 2023/01/15 18:09:07 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39538: [SPARK-41596][SS][DOCS] Document the new feature "Async Progress Tracking" to Structured Streaming guide doc - posted by GitBox <gi...@apache.org> on 2023/01/15 18:20:50 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39537: [SPARK-41994] Assign SQLSTATE's (1/2) - posted by GitBox <gi...@apache.org> on 2023/01/15 18:20:52 UTC, 0 replies.
- [GitHub] [spark] tedyu closed pull request #39580: [MINOR] [FOLLOW-UP] Throw Timeout exception when SASL Request Retry is enabled - posted by GitBox <gi...@apache.org> on 2023/01/15 18:27:53 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #37525: [WIP][SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by GitBox <gi...@apache.org> on 2023/01/15 18:49:18 UTC, 6 replies.
- [GitHub] [spark] SandishKumarHN commented on a diff in pull request #39550: [SPARK-42056][SQL][PROTOBUF] Add missing options for Protobuf functions - posted by GitBox <gi...@apache.org> on 2023/01/15 19:33:43 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39584: [SPARK-42074][SQL] Enable KryoSerializer in TPCDSQueryBenchmark to enforce SQL class registration - posted by GitBox <gi...@apache.org> on 2023/01/15 23:23:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39584: [SPARK-42074][SQL] Enable `KryoSerializer` in `TPCDSQueryBenchmark` to enforce SQL class registration - posted by GitBox <gi...@apache.org> on 2023/01/15 23:43:04 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39584: [SPARK-42074][SQL] Enable `KryoSerializer` in `TPCDSQueryBenchmark` to enforce SQL class registration - posted by GitBox <gi...@apache.org> on 2023/01/15 23:44:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39584: [SPARK-42074][SQL] Enable `KryoSerializer` in `TPCDSQueryBenchmark` to enforce SQL class registration - posted by GitBox <gi...@apache.org> on 2023/01/16 00:15:24 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38092: [SPARK-40647][CORE] DAGScheduler should fail job until all related running tasks have been killed - posted by GitBox <gi...@apache.org> on 2023/01/16 00:19:39 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38073: [SPARK-40632][SQL] Do not inject runtime filter if join condition reference non simple expression - posted by GitBox <gi...@apache.org> on 2023/01/16 00:19:40 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37996: [SPARK-40558][SQL] Add Reusable Exchange in Bloom creation side plan - posted by GitBox <gi...@apache.org> on 2023/01/16 00:19:41 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37822: [SPARK-40381][DEPLOY] Support standalone worker recommission - posted by GitBox <gi...@apache.org> on 2023/01/16 00:19:42 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37809: [SPARK-40355][SQL] Improve pushdown for orc & parquet when cast scenario - posted by GitBox <gi...@apache.org> on 2023/01/16 00:19:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39581: [SPARK-42011][SPARK-42012][CONNECT][PYTHON][TESTS][FOLLOW-UP] Enable csv, orc tests in connect/test_parity_datasources.py - posted by GitBox <gi...@apache.org> on 2023/01/16 00:28:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39581: [SPARK-42011][SPARK-42012][CONNECT][PYTHON][TESTS][FOLLOW-UP] Enable csv, orc tests in connect/test_parity_datasources.py - posted by GitBox <gi...@apache.org> on 2023/01/16 00:28:37 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39582: [SPARK-41921][CONNECT][TESTS][FOLLOW-UP] Enable doctests in connect/column and connect/readwriter - posted by GitBox <gi...@apache.org> on 2023/01/16 00:29:07 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39582: [SPARK-41921][CONNECT][TESTS][FOLLOW-UP] Enable doctests in connect/column and connect/readwriter - posted by GitBox <gi...@apache.org> on 2023/01/16 00:33:13 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39583: [SPARK-42073][CONNECT][PYTHON][TESTS] Enable tests in common/test_parity_serde, common/test_parity_types - posted by GitBox <gi...@apache.org> on 2023/01/16 00:47:59 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #39571: [SPARK-42064][SQL] Implement bloom filter join hint - posted by GitBox <gi...@apache.org> on 2023/01/16 00:48:01 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39583: [SPARK-42073][CONNECT][PYTHON][TESTS] Enable tests in common/test_parity_serde, common/test_parity_types - posted by GitBox <gi...@apache.org> on 2023/01/16 00:48:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39570: [SPARK-41903][CONNECT][PYTHON] `Literal` should support 1-dim ndarray - posted by GitBox <gi...@apache.org> on 2023/01/16 00:50:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39387: [SPARK-41586][PYTHON] Introduce `pyspark.errors` and error classes for PySpark. - posted by GitBox <gi...@apache.org> on 2023/01/16 01:22:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39387: [SPARK-41586][PYTHON] Introduce `pyspark.errors` and error classes for PySpark. - posted by GitBox <gi...@apache.org> on 2023/01/16 01:22:57 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #39585: [WIP] Unregistered Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/16 01:58:23 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #37525: [WIP][SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by GitBox <gi...@apache.org> on 2023/01/16 02:10:48 UTC, 1 replies.
- [GitHub] [spark] tedyu opened a new pull request, #39586: [MINOR] Drop saslTimeoutSeen - posted by GitBox <gi...@apache.org> on 2023/01/16 02:32:54 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39571: [SPARK-42064][SQL] Implement bloom filter join hint - posted by GitBox <gi...@apache.org> on 2023/01/16 02:38:29 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #38376: [SPARK-40817][K8S] `spark.files` should preserve remote files - posted by GitBox <gi...@apache.org> on 2023/01/16 02:39:21 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #38376: [SPARK-40817][K8S] `spark.files` should preserve remote files - posted by GitBox <gi...@apache.org> on 2023/01/16 02:43:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39584: [SPARK-42074][SQL] Enable `KryoSerializer` in `TPCDSQueryBenchmark` to enforce SQL class registration - posted by GitBox <gi...@apache.org> on 2023/01/16 02:46:09 UTC, 0 replies.
- [GitHub] [spark] yaooqinn commented on a diff in pull request #39555: [SPARK-42051][SQL] Codegen Support for HiveGenericUDF - posted by GitBox <gi...@apache.org> on 2023/01/16 02:51:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39564: [SPARK-41990][SQL] Use `FieldReference.column` instead of `apply` in V1 to V2 filter conversion - posted by GitBox <gi...@apache.org> on 2023/01/16 03:18:35 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39587: [SPARK-42076][CONNECT][PYTHON] Factor data conversion `arrow -> rows` out to `conversion.py` - posted by GitBox <gi...@apache.org> on 2023/01/16 03:19:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39588: [SPARK-42077][CONNECT][PYTHON] Literal should throw TypeError for unsupported DataType - posted by GitBox <gi...@apache.org> on 2023/01/16 03:27:00 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #35969: [SPARK-38651][SQL] Add `spark.sql.legacy.allowEmptySchemaWrite` - posted by GitBox <gi...@apache.org> on 2023/01/16 03:29:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37525: [WIP][SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by GitBox <gi...@apache.org> on 2023/01/16 03:40:42 UTC, 1 replies.
- [GitHub] [spark] tedyu closed pull request #39586: [MINOR] Drop saslTimeoutSeen - posted by GitBox <gi...@apache.org> on 2023/01/16 03:49:09 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39585: [WIP] Unregistered Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/16 04:19:51 UTC, 6 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39558: [SPARK-41982][SQL] Partitions of type string should not be treated as numeric types - posted by GitBox <gi...@apache.org> on 2023/01/16 04:43:23 UTC, 6 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39589: [MINOR][CORE][TESTS] Suppress unchecked warnings in OneForOneStreamManagerSuite - posted by GitBox <gi...@apache.org> on 2023/01/16 04:44:10 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39590: [SPARK-42079][CONNECT][PYTHON] Rename proto messages for `toDF` and `withColumnsRenamed` - posted by GitBox <gi...@apache.org> on 2023/01/16 04:46:28 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39591: [SPARK-42078][PYTHON] Migrate errors thrown by JVM into `PySparkException`. - posted by GitBox <gi...@apache.org> on 2023/01/16 04:59:06 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39587: [SPARK-42076][CONNECT][PYTHON] Factor data conversion `arrow -> rows` out to `conversion.py` - posted by GitBox <gi...@apache.org> on 2023/01/16 05:11:35 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39587: [SPARK-42076][CONNECT][PYTHON] Factor data conversion `arrow -> rows` out to `conversion.py` - posted by GitBox <gi...@apache.org> on 2023/01/16 05:11:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39589: [MINOR][CORE][TESTS] Suppress unchecked warnings in OneForOneStreamManagerSuite - posted by GitBox <gi...@apache.org> on 2023/01/16 05:13:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39589: [MINOR][CORE][TESTS] Suppress unchecked warnings in OneForOneStreamManagerSuite - posted by GitBox <gi...@apache.org> on 2023/01/16 05:13:51 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39588: [SPARK-42077][CONNECT][PYTHON] Literal should throw TypeError for unsupported DataType - posted by GitBox <gi...@apache.org> on 2023/01/16 05:14:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39588: [SPARK-42077][CONNECT][PYTHON] Literal should throw TypeError for unsupported DataType - posted by GitBox <gi...@apache.org> on 2023/01/16 05:14:58 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39592: [SPARK-42081][SQL] Improve the plan change validation - posted by GitBox <gi...@apache.org> on 2023/01/16 05:16:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39592: [SPARK-42081][SQL] Improve the plan change validation - posted by GitBox <gi...@apache.org> on 2023/01/16 05:17:28 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39593: [SPARK-42083][K8S] Make `ExecutorPodsAllocator` extendable - posted by GitBox <gi...@apache.org> on 2023/01/16 05:28:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39593: [SPARK-42083][K8S] Make `ExecutorPodsAllocator` extendable - posted by GitBox <gi...@apache.org> on 2023/01/16 05:28:25 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39594: [SPARK-42085][CONNECT][PYTHON] Make `from_arrow_schema` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/16 05:42:01 UTC, 0 replies.
- [GitHub] [spark] czxm opened a new pull request, #39595: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions in most cases - posted by GitBox <gi...@apache.org> on 2023/01/16 05:43:39 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #39585: [WIP] Unregistered Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/16 05:49:43 UTC, 10 replies.
- [GitHub] [spark] huaxingao commented on a diff in pull request #39564: [SPARK-41990][SQL] Use `FieldReference.column` instead of `apply` in V1 to V2 filter conversion - posted by GitBox <gi...@apache.org> on 2023/01/16 05:49:58 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39585: [WIP] Unregistered Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/16 05:52:03 UTC, 2 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39596: [SPARK-42084][SQL] Avoid leaking the qualified-access-only restriction - posted by GitBox <gi...@apache.org> on 2023/01/16 05:58:49 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39596: [SPARK-42084][SQL] Avoid leaking the qualified-access-only restriction - posted by GitBox <gi...@apache.org> on 2023/01/16 05:59:48 UTC, 2 replies.
- [GitHub] [spark] huaxingao opened a new pull request, #39597: [SPARK-41990][SQL][FOLLOWUP] Add comments to explain why `FieldReference.column` is used when `ParseException` happens - posted by GitBox <gi...@apache.org> on 2023/01/16 06:00:27 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #39597: [SPARK-41990][SQL][FOLLOWUP] Add comments to explain why `FieldReference.column` is used when `ParseException` happens - posted by GitBox <gi...@apache.org> on 2023/01/16 06:00:58 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39597: [SPARK-41990][SQL][FOLLOWUP] Add comments to explain why `FieldReference.column` is used when `ParseException` happens - posted by GitBox <gi...@apache.org> on 2023/01/16 06:14:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39593: [SPARK-42083][K8S] Make `(Executor|StatefulSet)PodsAllocator` extendable - posted by GitBox <gi...@apache.org> on 2023/01/16 06:14:35 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39598: [SPARK-41708][SQL][FOLLOWUP] Override `equals` and `hashCode` to make `CatalogFileIndex` print the same results when using Scala 2.12 and 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/16 06:30:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39591: [SPARK-42078][PYTHON] Migrate errors thrown by JVM into `PySparkException`. - posted by GitBox <gi...@apache.org> on 2023/01/16 06:43:27 UTC, 9 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39599: [SPARK-42086][SQL][TESTS] Sort test cases in SQLQueryTestSuite - posted by GitBox <gi...@apache.org> on 2023/01/16 06:51:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39593: [SPARK-42083][K8S] Make `(Executor|StatefulSet)PodsAllocator` extendable - posted by GitBox <gi...@apache.org> on 2023/01/16 06:54:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39599: [SPARK-42086][SQL][TESTS] Sort test cases in SQLQueryTestSuite - posted by GitBox <gi...@apache.org> on 2023/01/16 06:55:11 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Override `equals` and `hashCode` of `TableIdentifier` to make `CatalogFileIndex` print the same results when using Scala 2.12 and 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/16 07:01:19 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Override `equals` and `hashCode` of `TableIdentifier` to make `CatalogFileIndex` print the same results when using Scala 2.12 and 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/16 07:08:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39600: [SPARK-42032][SPARK-41988][CONNECT][PYTHON][TESTS] `MapType` related doctests sort the dicts - posted by GitBox <gi...@apache.org> on 2023/01/16 07:09:47 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39600: [SPARK-42032][SPARK-41988][CONNECT][PYTHON][TESTS] `MapType` related doctests sort the dicts - posted by GitBox <gi...@apache.org> on 2023/01/16 07:10:22 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #37525: [WIP][SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by GitBox <gi...@apache.org> on 2023/01/16 07:11:50 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Override `equals` and `hashCode` of `TableIdentifier` to make `CatalogFileIndex` print the same results when using Scala 2.12 and 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/16 07:12:28 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #39556: [SPARK-42049][SQL] Improve AliasAwareOutputExpression - posted by GitBox <gi...@apache.org> on 2023/01/16 07:15:41 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Override `equals` and `hashCode` of `TableIdentifier` to make `CatalogFileIndex` print the same results when using Scala 2.12 and 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/16 07:24:36 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39601: [SPARK-42087][SQL][TESTS] Use `--no-same-owner` when `HiveExternalCatalogVersionsSuite` untars - posted by GitBox <gi...@apache.org> on 2023/01/16 07:41:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39602: [SPARK-42021][CONNECT][PYTHON] Make `createDataFrame` support `array.array` - posted by GitBox <gi...@apache.org> on 2023/01/16 07:56:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39602: [SPARK-42021][CONNECT][PYTHON] Make `createDataFrame` support `array.array` - posted by GitBox <gi...@apache.org> on 2023/01/16 07:57:40 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39594: [SPARK-42085][CONNECT][PYTHON] Make `from_arrow_schema` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/16 07:59:58 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39594: [SPARK-42085][CONNECT][PYTHON] Make `from_arrow_schema` support nested types - posted by GitBox <gi...@apache.org> on 2023/01/16 08:00:15 UTC, 0 replies.
- [GitHub] [spark] zekai-li opened a new pull request, #39603: [SPARK-42088][build]Improved setup.py adaptation for windows - posted by GitBox <gi...@apache.org> on 2023/01/16 08:08:38 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39599: [SPARK-42086][SQL][TESTS] Sort test cases in SQLQueryTestSuite - posted by GitBox <gi...@apache.org> on 2023/01/16 08:09:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39601: [SPARK-42087][SQL][TESTS] Use `--no-same-owner` when `HiveExternalCatalogVersionsSuite` untars - posted by GitBox <gi...@apache.org> on 2023/01/16 08:42:58 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Override `equals` and `hashCode` of `TableIdentifier` to make `CatalogFileIndex` print the same results when using Scala 2.12 and 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/16 08:51:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39597: [SPARK-41990][SQL][FOLLOWUP] Add comments to explain why `FieldReference.column` is used when `ParseException` happens - posted by GitBox <gi...@apache.org> on 2023/01/16 08:52:40 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #39597: [SPARK-41990][SQL][FOLLOWUP] Add comments to explain why `FieldReference.column` is used when `ParseException` happens - posted by GitBox <gi...@apache.org> on 2023/01/16 08:53:02 UTC, 0 replies.
- [GitHub] [spark] lfspace opened a new pull request, #39604: when spark job had ran in k8s is finished ,it register to shutdown ho… - posted by GitBox <gi...@apache.org> on 2023/01/16 08:53:59 UTC, 0 replies.
- [GitHub] [spark] lfspace commented on pull request #39604: when spark job had ran in k8s is finished ,it register to shutdown ho… - posted by GitBox <gi...@apache.org> on 2023/01/16 09:04:39 UTC, 0 replies.
- [GitHub] [spark] lfspace closed pull request #39604: when spark job had ran in k8s is finished ,it register to shutdown ho… - posted by GitBox <gi...@apache.org> on 2023/01/16 09:04:57 UTC, 0 replies.
- [GitHub] [spark] lfspace opened a new pull request, #39605: when spark job had ran in k8s is finished ,it register to shutdown ho… - posted by GitBox <gi...@apache.org> on 2023/01/16 09:06:43 UTC, 0 replies.
- [GitHub] [spark] dcoliversun commented on pull request #39306: [SPARK-41781][K8S] Add the ability to create pvc before creating driver/executor pod - posted by GitBox <gi...@apache.org> on 2023/01/16 09:10:23 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39600: [SPARK-42032][SPARK-41988][CONNECT][PYTHON][TESTS] `MapType` related doctests sort the dicts - posted by GitBox <gi...@apache.org> on 2023/01/16 09:11:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39600: [SPARK-42032][SPARK-41988][CONNECT][PYTHON][TESTS] `MapType` related doctests sort the dicts - posted by GitBox <gi...@apache.org> on 2023/01/16 09:12:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39601: [SPARK-42087][SQL][TESTS] Use `--no-same-owner` when `HiveExternalCatalogVersionsSuite` untars - posted by GitBox <gi...@apache.org> on 2023/01/16 09:33:19 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Add a new `replaceAll` to `SQLQueryTestHelper#replaceNotIncludedMsg` to remove hashCode - posted by GitBox <gi...@apache.org> on 2023/01/16 09:48:50 UTC, 1 replies.
- [GitHub] [spark] lival opened a new pull request, #39606: Automating Data Caching for Spark - posted by GitBox <gi...@apache.org> on 2023/01/16 10:03:35 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39517: [SPARK-41993][SQL] Move RowEncoder to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/16 10:03:44 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39517: [SPARK-41993][SQL] Move RowEncoder to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/16 10:04:45 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Add a new `replaceAll` to `SQLQueryTestHelper#replaceNotIncludedMsg` to remove hashCode - posted by GitBox <gi...@apache.org> on 2023/01/16 10:06:46 UTC, 0 replies.
- [GitHub] [spark] czxm commented on pull request #39595: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions in most cases - posted by GitBox <gi...@apache.org> on 2023/01/16 10:08:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Add a new `replaceAll` to `SQLQueryTestHelper#replaceNotIncludedMsg` to remove hashCode - posted by GitBox <gi...@apache.org> on 2023/01/16 10:21:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39603: [SPARK-42088][build]Improved setup.py adaptation for windows - posted by GitBox <gi...@apache.org> on 2023/01/16 11:02:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39603: [SPARK-42088][BUILD] Improve setup.py adaptation for Windows - posted by GitBox <gi...@apache.org> on 2023/01/16 11:03:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Add a new `replaceAll` to `SQLQueryTestHelper#replaceNotIncludedMsg` to remove `@hashCode` - posted by GitBox <gi...@apache.org> on 2023/01/16 11:26:24 UTC, 9 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39607: [SPARK-41902][CONNECT][PYTHON][TESTS] Enable test `test_map_functions ` - posted by GitBox <gi...@apache.org> on 2023/01/16 11:40:19 UTC, 0 replies.
- [GitHub] [spark] smallzhongfeng commented on a diff in pull request #39558: [SPARK-41982][SQL] Partitions of type string should not be treated as numeric types - posted by GitBox <gi...@apache.org> on 2023/01/16 11:43:46 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Add a new `replaceAll` to `SQLQueryTestHelper#replaceNotIncludedMsg` to remove `@hashCode` - posted by GitBox <gi...@apache.org> on 2023/01/16 11:48:41 UTC, 1 replies.
- [GitHub] [spark] tenglei closed pull request #38181: [SPARK-40720][CORE] Fix spark-ui jobs status not updating under high concurrency scenario - posted by GitBox <gi...@apache.org> on 2023/01/16 11:50:52 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on a diff in pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by GitBox <gi...@apache.org> on 2023/01/16 12:02:13 UTC, 2 replies.
- [GitHub] [spark] zekai-li commented on pull request #39603: [SPARK-42088][BUILD] Improve setup.py adaptation for Windows - posted by GitBox <gi...@apache.org> on 2023/01/16 12:09:45 UTC, 0 replies.
- [GitHub] [spark] olaky opened a new pull request, #39608: [SPARK-41896] Additional tests for _metadata filters - posted by GitBox <gi...@apache.org> on 2023/01/16 12:16:32 UTC, 0 replies.
- [GitHub] [spark] Ngone51 commented on pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by GitBox <gi...@apache.org> on 2023/01/16 12:18:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39603: [SPARK-42088][BUILD] Improve setup.py adaptation for Windows - posted by GitBox <gi...@apache.org> on 2023/01/16 12:27:25 UTC, 0 replies.
- [GitHub] [spark] ayushi-agarwal opened a new pull request, #39609: ll - posted by GitBox <gi...@apache.org> on 2023/01/16 12:55:05 UTC, 0 replies.
- [GitHub] [spark] ayushi-agarwal closed pull request #39609: ll - posted by GitBox <gi...@apache.org> on 2023/01/16 12:55:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Add a new `replaceAll` to `SQLQueryTestHelper#replaceNotIncludedMsg` to remove `@hashCode` - posted by GitBox <gi...@apache.org> on 2023/01/16 12:58:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Add a new `replaceAll` to `SQLQueryTestHelper#replaceNotIncludedMsg` to remove `@hashCode` - posted by GitBox <gi...@apache.org> on 2023/01/16 12:59:13 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39598: [SPARK-41708][SQL][FOLLOWUP] Add a new `replaceAll` to `SQLQueryTestHelper#replaceNotIncludedMsg` to remove `@hashCode` - posted by GitBox <gi...@apache.org> on 2023/01/16 12:59:16 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39537: [SPARK-41994] Assign SQLSTATE's (1/2) - posted by GitBox <gi...@apache.org> on 2023/01/16 13:05:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39537: [SPARK-41994] Assign SQLSTATE's (1/2) - posted by GitBox <gi...@apache.org> on 2023/01/16 13:06:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39460: [SPARK-39217][SQL] Makes DPP support the pruning side has Union - posted by GitBox <gi...@apache.org> on 2023/01/16 13:32:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39610: [SPARK-41708][SQL][FOLLOWUP] Override `toString` method of `FileIndex` - posted by GitBox <gi...@apache.org> on 2023/01/16 13:47:35 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39610: [SPARK-41708][SQL][FOLLOWUP] Override `toString` method of `FileIndex` - posted by GitBox <gi...@apache.org> on 2023/01/16 13:49:43 UTC, 3 replies.
- [GitHub] [spark] tedyu opened a new pull request, #39611: Introduce sasl retry count in RetryingBlockTransferor - posted by GitBox <gi...@apache.org> on 2023/01/16 13:51:26 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #39611: Introduce sasl retry count in RetryingBlockTransferor - posted by GitBox <gi...@apache.org> on 2023/01/16 13:52:35 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 closed pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2023/01/16 13:54:21 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #37734: [SPARK-40264][ML] add batch_infer_udf function to pyspark.ml.functions - posted by GitBox <gi...@apache.org> on 2023/01/16 13:55:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39610: [SPARK-41708][SQL][FOLLOWUP] Override `toString` method of `FileIndex` - posted by GitBox <gi...@apache.org> on 2023/01/16 15:14:31 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39612: [SPARK-42091][BUILD] Upgrade jetty to 9.4.50.v20221201 - posted by GitBox <gi...@apache.org> on 2023/01/16 16:05:47 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39613: [SPARK-42092][BUILD] Upgrade RoaringBitmap to 0.9.38 - posted by GitBox <gi...@apache.org> on 2023/01/16 16:14:49 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39611: [SPARK-42090] Introduce sasl retry count in RetryingBlockTransferor - posted by GitBox <gi...@apache.org> on 2023/01/16 17:03:11 UTC, 2 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39614: [SPARK-42002][WIP][CONNECT][PYTHON] Implement DataFrameWriterV2 - posted by GitBox <gi...@apache.org> on 2023/01/16 18:03:19 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #39615: [SPARK-42093][SQL] Move JavaTypeInference to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/16 18:12:18 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #39611: [SPARK-42090] Introduce sasl retry count in RetryingBlockTransferor - posted by GitBox <gi...@apache.org> on 2023/01/16 18:55:10 UTC, 2 replies.
- [GitHub] [spark] akpatnam25 commented on a diff in pull request #39611: [SPARK-42090] Introduce sasl retry count in RetryingBlockTransferor - posted by GitBox <gi...@apache.org> on 2023/01/16 21:32:22 UTC, 2 replies.
- [GitHub] [spark] tedyu commented on a diff in pull request #39611: [SPARK-42090] Introduce sasl retry count in RetryingBlockTransferor - posted by GitBox <gi...@apache.org> on 2023/01/16 22:20:11 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39610: [SPARK-41708][SQL][FOLLOWUP] Override `toString` method of `FileIndex` - posted by GitBox <gi...@apache.org> on 2023/01/16 23:53:26 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39610: [SPARK-41708][SQL][FOLLOWUP] Override `toString` method of `FileIndex` - posted by GitBox <gi...@apache.org> on 2023/01/16 23:53:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39590: [SPARK-42079][CONNECT][PYTHON] Rename proto messages for `toDF` and `withColumnsRenamed` - posted by GitBox <gi...@apache.org> on 2023/01/17 00:12:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39590: [SPARK-42079][CONNECT][PYTHON] Rename proto messages for `toDF` and `withColumnsRenamed` - posted by GitBox <gi...@apache.org> on 2023/01/17 00:12:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39602: [SPARK-42021][CONNECT][PYTHON] Make `createDataFrame` support `array.array` - posted by GitBox <gi...@apache.org> on 2023/01/17 00:13:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39607: [SPARK-41902][CONNECT][PYTHON][TESTS] Enable test `test_map_functions ` - posted by GitBox <gi...@apache.org> on 2023/01/17 00:14:11 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39602: [SPARK-42021][CONNECT][PYTHON] Make `createDataFrame` support `array.array` - posted by GitBox <gi...@apache.org> on 2023/01/17 00:14:17 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39607: [SPARK-41902][CONNECT][PYTHON][TESTS] Enable test `test_map_functions ` - posted by GitBox <gi...@apache.org> on 2023/01/17 00:14:48 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38092: [SPARK-40647][CORE] DAGScheduler should fail job until all related running tasks have been killed - posted by GitBox <gi...@apache.org> on 2023/01/17 00:19:53 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37641: [SPARK-40201][SQL][TESTS] Improve v1 write test coverage - posted by GitBox <gi...@apache.org> on 2023/01/17 00:19:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39612: [SPARK-42091][BUILD] Upgrade jetty to 9.4.50.v20221201 - posted by GitBox <gi...@apache.org> on 2023/01/17 00:37:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39612: [SPARK-42091][BUILD] Upgrade jetty to 9.4.50.v20221201 - posted by GitBox <gi...@apache.org> on 2023/01/17 00:38:01 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39603: [SPARK-42088][PYTHON][WINDOWS][BUILD] Improve setup.py adaptation for Windows - posted by GitBox <gi...@apache.org> on 2023/01/17 00:38:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39603: [SPARK-42088][PYTHON][WINDOWS][BUILD] Improve setup.py adaptation for Windows - posted by GitBox <gi...@apache.org> on 2023/01/17 00:38:30 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39517: [SPARK-41993][SQL] Move RowEncoder to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/17 01:00:32 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39576: [SPARK-42067][CONNECT][BUILD] Upgrade buf from 1.11.0 to 1.12.0 - posted by GitBox <gi...@apache.org> on 2023/01/17 01:05:20 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39576: [SPARK-42067][CONNECT][BUILD] Upgrade buf from 1.11.0 to 1.12.0 - posted by GitBox <gi...@apache.org> on 2023/01/17 01:05:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39616: [SPARK-41757][CONNECT] Fix string representation for Column class - posted by GitBox <gi...@apache.org> on 2023/01/17 01:13:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39617: [SPARK-41866][CONNECT][TESTS] Enable test_create_dataframe_from_array_of_long in dataframe parity test - posted by GitBox <gi...@apache.org> on 2023/01/17 01:18:39 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #39618: [SPARK-42095][CONNECT][PYTHON][TESTS] Fix gRPC check in tests - posted by GitBox <gi...@apache.org> on 2023/01/17 02:01:05 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #38034: [SPARK-40599][SQL] Add multiTransform methods to TreeNode to generate alternatives - posted by GitBox <gi...@apache.org> on 2023/01/17 02:01:28 UTC, 1 replies.
- [GitHub] [spark] rangadi commented on a diff in pull request #39550: [SPARK-42056][SQL][PROTOBUF] Add missing options for Protobuf functions - posted by GitBox <gi...@apache.org> on 2023/01/17 02:04:04 UTC, 5 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39610: [SPARK-41708][SQL][FOLLOWUP] Override `toString` method of `FileIndex` - posted by GitBox <gi...@apache.org> on 2023/01/17 02:09:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39612: [SPARK-42091][BUILD] Upgrade jetty to 9.4.50.v20221201 - posted by GitBox <gi...@apache.org> on 2023/01/17 02:10:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39613: [SPARK-42092][BUILD] Upgrade RoaringBitmap to 0.9.38 - posted by GitBox <gi...@apache.org> on 2023/01/17 02:31:01 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39575: [Spark-42058] SQLSTATE 2/2 - posted by GitBox <gi...@apache.org> on 2023/01/17 02:52:38 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39575: [Spark-42058] SQLSTATE 2/2 - posted by GitBox <gi...@apache.org> on 2023/01/17 02:53:03 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39591: [SPARK-42078][PYTHON] Migrate errors thrown by JVM into `PySparkException`. - posted by GitBox <gi...@apache.org> on 2023/01/17 03:01:34 UTC, 11 replies.
- [GitHub] [spark] mridulm commented on pull request #38428: [SPARK-40912][CORE]Overhead of Exceptions in KryoDeserializationStream - posted by GitBox <gi...@apache.org> on 2023/01/17 03:03:32 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39611: [SPARK-42090] Introduce sasl retry count in RetryingBlockTransferor - posted by GitBox <gi...@apache.org> on 2023/01/17 03:12:40 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39619: [SPARK-42089][CONNECT][PYTHON] Fix variable name issues in nested lambda functions - posted by GitBox <gi...@apache.org> on 2023/01/17 03:55:12 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39619: [SPARK-42089][CONNECT][PYTHON] Fix variable name issues in nested lambda functions - posted by GitBox <gi...@apache.org> on 2023/01/17 04:04:18 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39579: [SPARK-42072][CORE] `core` module requires `javax.servlet-api` - posted by GitBox <gi...@apache.org> on 2023/01/17 04:19:26 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39618: [SPARK-42095][CONNECT][PYTHON][TESTS] Fix gRPC check in tests - posted by GitBox <gi...@apache.org> on 2023/01/17 04:20:29 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39618: [SPARK-42095][CONNECT][PYTHON][TESTS] Fix gRPC check in tests - posted by GitBox <gi...@apache.org> on 2023/01/17 04:20:47 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39617: [SPARK-41866][CONNECT][TESTS] Enable test_create_dataframe_from_array_of_long in dataframe parity test - posted by GitBox <gi...@apache.org> on 2023/01/17 04:22:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39617: [SPARK-41866][CONNECT][TESTS] Enable test_create_dataframe_from_array_of_long in dataframe parity test - posted by GitBox <gi...@apache.org> on 2023/01/17 04:22:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39558: [SPARK-41982][SQL] Partitions of type string should not be treated as numeric types - posted by GitBox <gi...@apache.org> on 2023/01/17 04:44:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39558: [SPARK-41982][SQL] Partitions of type string should not be treated as numeric types - posted by GitBox <gi...@apache.org> on 2023/01/17 04:44:51 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39615: [SPARK-42093][SQL] Move JavaTypeInference to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/17 04:51:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39620: [SPARK-42096][CONNECT] Some code cleanup for `connect` module - posted by GitBox <gi...@apache.org> on 2023/01/17 05:05:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39619: [SPARK-42089][CONNECT][PYTHON] Fix variable name issues in nested lambda functions - posted by GitBox <gi...@apache.org> on 2023/01/17 05:07:39 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39498: [SPARK-41976][SQL] Improve error message for `INDEX_NOT_FOUND` - posted by GitBox <gi...@apache.org> on 2023/01/17 05:11:26 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39621: [SPARK-42097][CORE] Register SerializedLambda and BitSet to KryoSerializer - posted by GitBox <gi...@apache.org> on 2023/01/17 05:21:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39621: [SPARK-42097][CORE] Register `SerializedLambda` and `BitSet` to KryoSerializer - posted by GitBox <gi...@apache.org> on 2023/01/17 05:47:41 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39115: [SPARK-41563][SQL] Support partition filter in MSCK REPAIR TABLE statement - posted by GitBox <gi...@apache.org> on 2023/01/17 05:52:52 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39616: [SPARK-41757][SPARK-41901][CONNECT] Fix string representation for Column class - posted by GitBox <gi...@apache.org> on 2023/01/17 05:58:30 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39296: [SPARK-41757][CONNECT] Fixing String representation for Column class - posted by GitBox <gi...@apache.org> on 2023/01/17 05:59:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39616: [SPARK-41757][SPARK-41901][CONNECT] Fix string representation for Column class - posted by GitBox <gi...@apache.org> on 2023/01/17 05:59:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39621: [SPARK-42097][CORE] Register `SerializedLambda` and `BitSet` to KryoSerializer - posted by GitBox <gi...@apache.org> on 2023/01/17 06:11:36 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #39611: [SPARK-42090] Introduce sasl retry count in RetryingBlockTransferor - posted by GitBox <gi...@apache.org> on 2023/01/17 06:49:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39622: [SPARK-42099][SPARK-41845][CONNECT][PYTHON] Fix `count(*)` and `count(expr(*))` - posted by GitBox <gi...@apache.org> on 2023/01/17 06:52:54 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39622: [SPARK-42099][SPARK-41845][CONNECT][PYTHON] Fix `count(*)` and `count(expr(*))` - posted by GitBox <gi...@apache.org> on 2023/01/17 06:59:24 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39459: [SPARK-41497][CORE] Fixing accumulator undercount in the case of the retry task with rdd cache - posted by GitBox <gi...@apache.org> on 2023/01/17 07:15:23 UTC, 4 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39623: [SPARK-42100][SQL] Protect NPE triggered by `SQLExecutionUIData#description` in `SQLExecutionUIDataSerializer` - posted by GitBox <gi...@apache.org> on 2023/01/17 07:25:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39622: [SPARK-42099][SPARK-41845][CONNECT][PYTHON] Fix `count(*)`, `count(col(*))`, `count(expr(*))` - posted by GitBox <gi...@apache.org> on 2023/01/17 08:02:38 UTC, 1 replies.
- [GitHub] [spark] lival closed pull request #39606: Automating Data Caching for Spark - posted by GitBox <gi...@apache.org> on 2023/01/17 08:05:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39622: [SPARK-42099][SPARK-41845][CONNECT][PYTHON] Fix `count(*)`, `count(col(*))`, `count(expr(*))` - posted by GitBox <gi...@apache.org> on 2023/01/17 09:38:12 UTC, 5 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39623: [SPARK-42100][SQL] Protect NPE in `SQLExecutionUIDataSerializer#serialize` - posted by GitBox <gi...@apache.org> on 2023/01/17 09:44:48 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #39624: [SPARK-42101][SQL] Wrap InMemoryTableScanExec with QueryStage - posted by GitBox <gi...@apache.org> on 2023/01/17 10:27:46 UTC, 2 replies.
- [GitHub] [spark] ulysses-you closed pull request #39624: [SPARK-42101][SQL] Wrap InMemoryTableScanExec with QueryStage - posted by GitBox <gi...@apache.org> on 2023/01/17 10:30:29 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39625: [WIP][SPARK-42066][SQL] The DATATYPE_MISMATCH error class contains inappropriate and duplicating subclasses - posted by GitBox <gi...@apache.org> on 2023/01/17 11:14:13 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #39585: [WIP] Unregistered Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/17 12:49:01 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #38034: [SPARK-40599][SQL] Add multiTransform methods to TreeNode to generate alternatives - posted by GitBox <gi...@apache.org> on 2023/01/17 12:59:02 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39585: [WIP] Unregistered Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/17 13:53:16 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #37525: [WIP][SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by GitBox <gi...@apache.org> on 2023/01/17 13:53:28 UTC, 0 replies.
- [GitHub] [spark] lival opened a new pull request, #39626: An automatic caching solution for Spark - posted by GitBox <gi...@apache.org> on 2023/01/17 14:13:41 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #39267: [WIP][SPARK-41592][PYTHON][ML] Pytorch file Distributed Training - posted by GitBox <gi...@apache.org> on 2023/01/17 14:18:27 UTC, 8 replies.
- [GitHub] [spark] peter-toth commented on pull request #29210: [SPARK-24497][SQL] Support recursive SQL query - posted by GitBox <gi...@apache.org> on 2023/01/17 14:20:13 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on pull request #36506: [SPARK-25050][SQL] Avro: writing complex unions - posted by GitBox <gi...@apache.org> on 2023/01/17 14:53:46 UTC, 0 replies.
- [GitHub] [spark] hvanhovell opened a new pull request, #39627: [SPARK-41993][SQL] Move RowEncoder to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/17 16:12:20 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on a diff in pull request #39592: [SPARK-42081][SQL] Improve the plan change validation - posted by GitBox <gi...@apache.org> on 2023/01/17 16:25:44 UTC, 2 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39625: [SPARK-42066][SQL] The DATATYPE_MISMATCH error class contains inappropriate and duplicating subclasses - posted by GitBox <gi...@apache.org> on 2023/01/17 17:15:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39627: [SPARK-41993][SQL] Move RowEncoder to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/17 17:26:49 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39627: [SPARK-41993][SQL] Move RowEncoder to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/17 17:40:11 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39613: [SPARK-42092][BUILD] Upgrade RoaringBitmap to 0.9.38 - posted by GitBox <gi...@apache.org> on 2023/01/17 17:41:50 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39613: [SPARK-42092][BUILD] Upgrade RoaringBitmap to 0.9.38 - posted by GitBox <gi...@apache.org> on 2023/01/17 17:42:02 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #39577: [SPARK-42070][SQL] Change the default value of argument of Mask udf from -1 to NULL - posted by GitBox <gi...@apache.org> on 2023/01/17 17:48:17 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #38888: [SPARK-41405][SQL] Centralize the column resolution logic - posted by GitBox <gi...@apache.org> on 2023/01/17 17:52:46 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39577: [SPARK-42070][SQL] Change the default value of argument of Mask udf from -1 to NULL - posted by GitBox <gi...@apache.org> on 2023/01/17 17:57:13 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39613: [SPARK-42092][BUILD] Upgrade RoaringBitmap to 0.9.38 - posted by GitBox <gi...@apache.org> on 2023/01/17 18:05:43 UTC, 0 replies.
- [GitHub] [spark] erenavsarogullari commented on a diff in pull request #39037: [SPARK-41214][SQL] Fix AQE cache does not update plan and metrics - posted by GitBox <gi...@apache.org> on 2023/01/17 18:11:47 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #39592: [SPARK-42081][SQL] Improve the plan change validation - posted by GitBox <gi...@apache.org> on 2023/01/17 18:39:35 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39627: [SPARK-41993][SQL] Move RowEncoder to AgnosticEncoders - posted by GitBox <gi...@apache.org> on 2023/01/17 18:52:56 UTC, 0 replies.
- [GitHub] [spark] leewyang opened a new pull request, #39628: [SPARK-40264][ML] followup pydoc edits - posted by GitBox <gi...@apache.org> on 2023/01/17 19:08:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39540: [SPARK-42039][SQL] SPJ: Remove Option in KeyGroupedPartitioning#partitionValuesOpt - posted by GitBox <gi...@apache.org> on 2023/01/17 19:51:58 UTC, 0 replies.
- [GitHub] [spark] leewyang commented on pull request #39628: [SPARK-40264][ML] followup pydoc edits - posted by GitBox <gi...@apache.org> on 2023/01/17 21:09:29 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39608: [SPARK-41896][SQL][TESTS] Additional tests for _metadata filters - posted by GitBox <gi...@apache.org> on 2023/01/17 21:29:42 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39605: when spark job had ran in k8s is finished ,it register to shutdown ho… - posted by GitBox <gi...@apache.org> on 2023/01/17 21:29:46 UTC, 0 replies.
- [GitHub] [spark] rithwik-db opened a new pull request, #39629: [SPARK-42103][PYSPARK][ML] Added Instrumentation for PyTorch Distributor - posted by GitBox <gi...@apache.org> on 2023/01/17 21:46:39 UTC, 0 replies.
- [GitHub] [spark] lzlfred opened a new pull request, #39630: [SPARK-42061] mark expression InvokeLike and ExternalMapToCatalyst stateful - posted by GitBox <gi...@apache.org> on 2023/01/17 21:55:43 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39596: [SPARK-42084][SQL] Avoid leaking the qualified-access-only restriction - posted by GitBox <gi...@apache.org> on 2023/01/17 23:27:33 UTC, 1 replies.
- [GitHub] [spark] akpatnam25 opened a new pull request, #39631: Spark 41415 - posted by GitBox <gi...@apache.org> on 2023/01/17 23:30:32 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 closed pull request #39631: Spark 41415 - posted by GitBox <gi...@apache.org> on 2023/01/17 23:30:45 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39595: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions in most cases - posted by GitBox <gi...@apache.org> on 2023/01/17 23:34:33 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 opened a new pull request, #39632: Spark 41415 spark 42090 backport - posted by GitBox <gi...@apache.org> on 2023/01/17 23:45:58 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 commented on pull request #39632: SPARK-41415/SPARK-42090 Backport to 3.2 - posted by GitBox <gi...@apache.org> on 2023/01/17 23:46:54 UTC, 2 replies.
- [GitHub] [spark] sunchao opened a new pull request, #39633: [WIP][SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by GitBox <gi...@apache.org> on 2023/01/17 23:57:08 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 opened a new pull request, #39634: SPARK-41415/SPARK-42090 Backport to 3.3 - posted by GitBox <gi...@apache.org> on 2023/01/17 23:57:41 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 commented on pull request #39634: SPARK-41415/SPARK-42090 Backport to 3.3 - posted by GitBox <gi...@apache.org> on 2023/01/17 23:58:07 UTC, 1 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37915: [SPARK-40465][SQL] Refactor Decimal so as we can use other underlying implementation - posted by GitBox <gi...@apache.org> on 2023/01/18 00:20:02 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37819: [SPARK-40377][SQL] Allow customize maxBroadcastTableBytes - posted by GitBox <gi...@apache.org> on 2023/01/18 00:20:05 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37641: [SPARK-40201][SQL][TESTS] Improve v1 write test coverage - posted by GitBox <gi...@apache.org> on 2023/01/18 00:20:07 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #39635: [SPARK-41822][Connect]Run Scala client tests - posted by GitBox <gi...@apache.org> on 2023/01/18 00:48:18 UTC, 0 replies.
- [GitHub] [spark] zhenlineo closed pull request #39635: [SPARK-41822][Connect]Run Scala client tests - posted by GitBox <gi...@apache.org> on 2023/01/18 01:06:18 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #39538: [SPARK-41596][SS][DOCS] Document the new feature "Async Progress Tracking" to Structured Streaming guide doc - posted by GitBox <gi...@apache.org> on 2023/01/18 01:31:58 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39630: [SPARK-42061][SQL] mark expression InvokeLike and ExternalMapToCatalyst stateful - posted by GitBox <gi...@apache.org> on 2023/01/18 01:53:46 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39630: [SPARK-42061][SQL] mark expression InvokeLike and ExternalMapToCatalyst stateful - posted by GitBox <gi...@apache.org> on 2023/01/18 01:54:28 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39625: [SPARK-42066][SQL] The DATATYPE_MISMATCH error class contains inappropriate and duplicating subclasses - posted by GitBox <gi...@apache.org> on 2023/01/18 02:03:56 UTC, 7 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39628: [SPARK-40264][ML] followup pydoc edits - posted by GitBox <gi...@apache.org> on 2023/01/18 02:11:52 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39636: [WIP][SQL] Move conversion `COUNT(*) -> COUNT(1)` to Analyzer - posted by GitBox <gi...@apache.org> on 2023/01/18 02:35:14 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39622: [SPARK-42099][SPARK-41845][CONNECT][PYTHON] Fix `count(*)`, `count(col(*))`, `count(expr(*))` - posted by GitBox <gi...@apache.org> on 2023/01/18 02:38:20 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39585: [WIP] Unregistered Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/18 02:43:04 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39636: [WIP][SQL] Move `COUNT(*) -> COUNT(1)` to Analyzer - posted by GitBox <gi...@apache.org> on 2023/01/18 03:01:17 UTC, 0 replies.
- [GitHub] [spark] maryannxue commented on pull request #39624: [SPARK-42101][SQL] Wrap InMemoryTableScanExec with QueryStage - posted by GitBox <gi...@apache.org> on 2023/01/18 03:51:27 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39633: [WIP][SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by GitBox <gi...@apache.org> on 2023/01/18 03:53:29 UTC, 0 replies.
- [GitHub] [spark] rithwik-db opened a new pull request, #39637: [WIP] Integration testing - posted by GitBox <gi...@apache.org> on 2023/01/18 04:25:49 UTC, 0 replies.
- [GitHub] [spark] maryannxue commented on a diff in pull request #39624: [SPARK-42101][SQL] Wrap InMemoryTableScanExec with QueryStage - posted by GitBox <gi...@apache.org> on 2023/01/18 04:40:30 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39488: [SPARK-41970] Introduce SparkPath for typesafety - posted by GitBox <gi...@apache.org> on 2023/01/18 04:58:57 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39625: [SPARK-42066][SQL] The DATATYPE_MISMATCH error class contains inappropriate and duplicating subclasses - posted by GitBox <gi...@apache.org> on 2023/01/18 05:06:18 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37551: [SPARK-38591][SQL] Add sortWithinGroups to KeyValueGroupedDataset - posted by GitBox <gi...@apache.org> on 2023/01/18 05:14:14 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39625: [SPARK-42066][SQL] The DATATYPE_MISMATCH error class contains inappropriate and duplicating subclasses - posted by GitBox <gi...@apache.org> on 2023/01/18 05:36:00 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39036: [SPARK-41461][BUILD][CORE][CONNECT][PROTOBUF] Unify the environment variable of *_PROTOC_EXEC_PATH. - posted by GitBox <gi...@apache.org> on 2023/01/18 05:49:27 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39638: [SPARK-42082][SPARK-41598][PYTHON] Introduce `PySparkValueError` and `PySparkTypeError` - posted by GitBox <gi...@apache.org> on 2023/01/18 06:09:20 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39275: [SPARK-41759][CORE] Use `weakIntern` on string values in create new objects during deserialization - posted by GitBox <gi...@apache.org> on 2023/01/18 06:09:44 UTC, 2 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39638: [SPARK-42082][SPARK-41598][PYTHON] Introduce `PySparkValueError` and `PySparkTypeError` - posted by GitBox <gi...@apache.org> on 2023/01/18 06:13:19 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39623: [SPARK-42100][SQL][UI] Protect NPE in `SQLExecutionUIDataSerializer#serialize` - posted by GitBox <gi...@apache.org> on 2023/01/18 06:15:02 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39592: [SPARK-42081][SQL] Improve the plan change validation - posted by GitBox <gi...@apache.org> on 2023/01/18 06:21:40 UTC, 2 replies.
- [GitHub] [spark] WolverineJiang commented on pull request #39036: [SPARK-41485][BUILD][CORE][CONNECT][PROTOBUF] Unify the environment variable of *_PROTOC_EXEC_PATH. - posted by GitBox <gi...@apache.org> on 2023/01/18 06:24:05 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #37551: [SPARK-38591][SQL] Add sortWithinGroups to KeyValueGroupedDataset - posted by GitBox <gi...@apache.org> on 2023/01/18 06:34:24 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39591: [SPARK-42078][PYTHON] Migrate errors thrown by JVM into `PySparkException`. - posted by GitBox <gi...@apache.org> on 2023/01/18 06:48:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39591: [SPARK-42078][PYTHON] Migrate errors thrown by JVM into `PySparkException`. - posted by GitBox <gi...@apache.org> on 2023/01/18 06:49:32 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39639: [SPARK-42080][PYTHON] Add guideline for PySpark errors. - posted by GitBox <gi...@apache.org> on 2023/01/18 06:52:51 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39639: [SPARK-42080][PYTHON] Add guideline for PySpark errors. - posted by GitBox <gi...@apache.org> on 2023/01/18 06:54:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39639: [SPARK-42080][PYTHON] Add guideline for PySpark errors. - posted by GitBox <gi...@apache.org> on 2023/01/18 06:57:13 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39596: [SPARK-42084][SQL] Avoid leaking the qualified-access-only restriction - posted by GitBox <gi...@apache.org> on 2023/01/18 07:03:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39625: [SPARK-42066][SQL] The DATATYPE_MISMATCH error class contains inappropriate and duplicating subclasses - posted by GitBox <gi...@apache.org> on 2023/01/18 07:08:08 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39623: [SPARK-42100][SQL][UI] Protect NPE in `SQLExecutionUIDataSerializer#serialize` - posted by GitBox <gi...@apache.org> on 2023/01/18 07:32:48 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39623: [SPARK-42100][SQL][UI] Protect NPE in `SQLExecutionUIDataSerializer#serialize` - posted by GitBox <gi...@apache.org> on 2023/01/18 07:34:03 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #37551: [SPARK-38591][SQL] Add sortWithinGroups to KeyValueGroupedDataset - posted by GitBox <gi...@apache.org> on 2023/01/18 08:06:23 UTC, 2 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #37551: [SPARK-38591][SQL] Add sortWithinGroups to KeyValueGroupedDataset - posted by GitBox <gi...@apache.org> on 2023/01/18 08:19:49 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on pull request #39275: [SPARK-41759][CORE] Use `weakIntern` on string values in create new objects during deserialization - posted by GitBox <gi...@apache.org> on 2023/01/18 08:24:01 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37551: [SPARK-38591][SQL] Add sortWithinGroups to KeyValueGroupedDataset - posted by GitBox <gi...@apache.org> on 2023/01/18 08:24:42 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39596: [SPARK-42084][SQL] Avoid leaking the qualified-access-only restriction - posted by GitBox <gi...@apache.org> on 2023/01/18 08:48:12 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39640: [SPARK-38591][SQL] Add flatMapSortedGroups and cogroupSorted - posted by GitBox <gi...@apache.org> on 2023/01/18 08:53:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39634: SPARK-41415/SPARK-42090 Backport to 3.3 - posted by GitBox <gi...@apache.org> on 2023/01/18 08:57:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39495: [SPARK-41973][SQL] Assign name to _LEGACY_ERROR_TEMP_1311 - posted by GitBox <gi...@apache.org> on 2023/01/18 09:12:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39495: [SPARK-41973][SQL] Assign name to _LEGACY_ERROR_TEMP_1311 - posted by GitBox <gi...@apache.org> on 2023/01/18 09:13:49 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39496: [SPARK-41974][SQL] Turn `INCORRECT_END_OFFSET` into `INTERNAL_ERROR` - posted by GitBox <gi...@apache.org> on 2023/01/18 09:14:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39496: [SPARK-41974][SQL] Turn `INCORRECT_END_OFFSET` into `INTERNAL_ERROR` - posted by GitBox <gi...@apache.org> on 2023/01/18 09:14:30 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39497: [SPARK-41975][SQL] Improve error message for `INDEX_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2023/01/18 09:15:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39497: [SPARK-41975][SQL] Improve error message for `INDEX_ALREADY_EXISTS` - posted by GitBox <gi...@apache.org> on 2023/01/18 09:15:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39507: [SPARK-41984][SQL] Rename & improve error message for `RESET_PERMISSION_TO_ORIGINAL` - posted by GitBox <gi...@apache.org> on 2023/01/18 09:17:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39507: [SPARK-41984][SQL] Rename & improve error message for `RESET_PERMISSION_TO_ORIGINAL` - posted by GitBox <gi...@apache.org> on 2023/01/18 09:17:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39260: [SPARK-41579][SQL] Assign name to _LEGACY_ERROR_TEMP_1249 - posted by GitBox <gi...@apache.org> on 2023/01/18 09:19:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39260: [SPARK-41579][SQL] Assign name to _LEGACY_ERROR_TEMP_1249 - posted by GitBox <gi...@apache.org> on 2023/01/18 09:19:37 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39260: [SPARK-41579][SQL] Assign name to _LEGACY_ERROR_TEMP_1249 - posted by GitBox <gi...@apache.org> on 2023/01/18 09:19:51 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39389: [SPARK-41574][SQL] Update `_LEGACY_ERROR_TEMP_2009` as `INTERNAL_ERROR`. - posted by GitBox <gi...@apache.org> on 2023/01/18 09:20:41 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39389: [SPARK-41574][SQL] Update `_LEGACY_ERROR_TEMP_2009` as `INTERNAL_ERROR`. - posted by GitBox <gi...@apache.org> on 2023/01/18 09:20:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39258: [SPARK-41572][SQL] Assign name to _LEGACY_ERROR_TEMP_2149 - posted by GitBox <gi...@apache.org> on 2023/01/18 09:21:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39258: [SPARK-41572][SQL] Assign name to _LEGACY_ERROR_TEMP_2149 - posted by GitBox <gi...@apache.org> on 2023/01/18 09:22:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39480: [SPARK-41960][SQL] Assign name to _LEGACY_ERROR_TEMP_1056 - posted by GitBox <gi...@apache.org> on 2023/01/18 09:23:10 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39480: [SPARK-41960][SQL] Assign name to _LEGACY_ERROR_TEMP_1056 - posted by GitBox <gi...@apache.org> on 2023/01/18 09:23:25 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39506: [SPARK-41983][SQL] Rename & improve error message for `NULL_COMPARISON_RESULT` - posted by GitBox <gi...@apache.org> on 2023/01/18 09:24:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39506: [SPARK-41983][SQL] Rename & improve error message for `NULL_COMPARISON_RESULT` - posted by GitBox <gi...@apache.org> on 2023/01/18 09:24:52 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39640: [SPARK-38591][SQL] Add flatMapSortedGroups and cogroupSorted - posted by GitBox <gi...@apache.org> on 2023/01/18 10:07:17 UTC, 4 replies.
- [GitHub] [spark] nija-at opened a new pull request, #39641: [SPARK-42106] [Pyspark] Hide parameters when re-printing remote URL in REPL - posted by GitBox <gi...@apache.org> on 2023/01/18 10:35:06 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39596: [SPARK-42084][SQL] Avoid leaking the qualified-access-only restriction - posted by GitBox <gi...@apache.org> on 2023/01/18 10:44:43 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39641: [SPARK-42106] [Pyspark] Hide parameters when printing remote URL in REPL - posted by GitBox <gi...@apache.org> on 2023/01/18 10:57:58 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39640: [SPARK-38591][SQL] Add flatMapSortedGroups and cogroupSorted - posted by GitBox <gi...@apache.org> on 2023/01/18 11:01:33 UTC, 8 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #39640: [SPARK-38591][SQL] Add flatMapSortedGroups and cogroupSorted - posted by GitBox <gi...@apache.org> on 2023/01/18 11:33:06 UTC, 10 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39636: [SPARK-42108][SQL] Make Analyzer transform `Count(*)` into `Count(1)` - posted by GitBox <gi...@apache.org> on 2023/01/18 11:36:48 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39641: [SPARK-42106] [Pyspark] Hide parameters when printing remote URL in REPL - posted by GitBox <gi...@apache.org> on 2023/01/18 11:54:25 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39642: [SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for `StreamingQueryProgressWrapper` - posted by GitBox <gi...@apache.org> on 2023/01/18 12:17:41 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39638: [SPARK-42082][SPARK-41598][PYTHON] Introduce `PySparkValueError` and `PySparkTypeError` - posted by GitBox <gi...@apache.org> on 2023/01/18 12:42:52 UTC, 0 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #29210: [SPARK-24497][SQL] Support recursive SQL query - posted by GitBox <gi...@apache.org> on 2023/01/18 13:22:52 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #29210: [SPARK-24497][SQL] Support recursive SQL query - posted by GitBox <gi...@apache.org> on 2023/01/18 13:27:58 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39543: [SPARK-42044][SQL] Fix incorrect error message for `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by GitBox <gi...@apache.org> on 2023/01/18 13:48:31 UTC, 1 replies.
- [GitHub] [spark] nija-at commented on pull request #39641: [SPARK-42106] [Pyspark] Hide parameters when printing remote URL in REPL - posted by GitBox <gi...@apache.org> on 2023/01/18 14:15:33 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #29210: [WIP][SPARK-24497][SQL] Support recursive SQL query - posted by GitBox <gi...@apache.org> on 2023/01/18 14:16:27 UTC, 0 replies.
- [GitHub] [spark] peter-toth closed pull request #29210: [WIP][SPARK-24497][SQL] Support recursive SQL query - posted by GitBox <gi...@apache.org> on 2023/01/18 14:16:55 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39641: [SPARK-42106] [Pyspark] Hide parameters when printing remote URL in REPL - posted by GitBox <gi...@apache.org> on 2023/01/18 14:44:04 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39605: when spark job had ran in k8s is finished ,it register to shutdown ho… - posted by GitBox <gi...@apache.org> on 2023/01/18 14:44:21 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39634: SPARK-41415/SPARK-42090 Backport to 3.3 - posted by GitBox <gi...@apache.org> on 2023/01/18 14:44:58 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39275: [SPARK-41759][CORE] Use `weakIntern` on string values in create new objects during deserialization - posted by GitBox <gi...@apache.org> on 2023/01/18 14:49:50 UTC, 0 replies.
- [GitHub] [spark] ClownXC opened a new pull request, #39643: fix typo in BlockManager.scala: 'cheksum' ---> 'checksum' - posted by GitBox <gi...@apache.org> on 2023/01/18 16:21:41 UTC, 0 replies.
- [GitHub] [spark] RamakrishnaHande commented on pull request #26060: [SPARK-29400][CORE] Improve PrometheusResource to use labels - posted by GitBox <gi...@apache.org> on 2023/01/18 16:39:31 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #39633: [WIP][SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by GitBox <gi...@apache.org> on 2023/01/18 16:41:51 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #26060: [SPARK-29400][CORE] Improve PrometheusResource to use labels - posted by GitBox <gi...@apache.org> on 2023/01/18 16:53:08 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #38419: [SPARK-40945][SQL] Support built-in function to truncate numbers - posted by GitBox <gi...@apache.org> on 2023/01/18 17:26:23 UTC, 2 replies.
- [GitHub] [spark] dtenedor commented on pull request #38419: [SPARK-40945][SQL] Support built-in function to truncate numbers - posted by GitBox <gi...@apache.org> on 2023/01/18 18:03:31 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #38419: [SPARK-40945][SQL] Support built-in function to truncate numbers - posted by GitBox <gi...@apache.org> on 2023/01/18 18:36:15 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39577: [SPARK-42070][SQL] Change the default value of argument of Mask udf from -1 to NULL - posted by GitBox <gi...@apache.org> on 2023/01/18 19:11:58 UTC, 3 replies.
- [GitHub] [spark] akpatnam25 closed pull request #39634: SPARK-41415/SPARK-42090 Backport to 3.3 - posted by GitBox <gi...@apache.org> on 2023/01/18 19:25:25 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 opened a new pull request, #39644: [SPARK-41415] SASL Request Retries Backport to 3.3 - posted by GitBox <gi...@apache.org> on 2023/01/18 19:28:20 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 commented on pull request #39644: [SPARK-41415] SASL Request Retries Backport to 3.3 - posted by GitBox <gi...@apache.org> on 2023/01/18 19:28:55 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 closed pull request #39632: SPARK-41415/SPARK-42090 Backport to 3.2 - posted by GitBox <gi...@apache.org> on 2023/01/18 19:29:09 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 opened a new pull request, #39645: [SPARK-41415][3.2] SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/18 19:35:46 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 commented on pull request #39645: [SPARK-41415][3.2] SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/18 19:36:18 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39646: [SPARK-42109][BUILD] Upgrade Kafka to 3.3.2 - posted by GitBox <gi...@apache.org> on 2023/01/18 20:07:23 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #39375: [SPARK-36124][SQL] Support subqueries with correlation through UNION - posted by GitBox <gi...@apache.org> on 2023/01/18 20:39:36 UTC, 1 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #39577: [SPARK-42070][SQL] Change the default value of argument of Mask udf from -1 to NULL - posted by GitBox <gi...@apache.org> on 2023/01/18 20:42:24 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39646: [SPARK-42109][BUILD] Upgrade Kafka to 3.3.2 - posted by GitBox <gi...@apache.org> on 2023/01/18 21:16:35 UTC, 2 replies.
- [GitHub] [spark] dtenedor commented on pull request #39508: [SPARK-41985][SQL] Centralize more column resolution rules - posted by GitBox <gi...@apache.org> on 2023/01/18 22:16:36 UTC, 0 replies.
- [GitHub] [spark] chaoqin-li1123 opened a new pull request, #39647: [SPARK-42075] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/18 22:19:47 UTC, 0 replies.
- [GitHub] [spark] chaoqin-li1123 commented on pull request #39647: [SPARK-42075] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/18 22:20:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39648: [SPARK-42110][SQL][TESTS] Reduce the number of repetition in ParquetDeltaEncodingSuite.`random data test` - posted by GitBox <gi...@apache.org> on 2023/01/18 22:31:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39648: [SPARK-42110][SQL][TESTS] Reduce the number of repetition in ParquetDeltaEncodingSuite.`random data test` - posted by GitBox <gi...@apache.org> on 2023/01/18 22:36:37 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39646: [SPARK-42109][BUILD] Upgrade Kafka to 3.3.2 - posted by GitBox <gi...@apache.org> on 2023/01/18 22:38:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39648: [SPARK-42110][SQL][TESTS] Reduce the number of repetition in ParquetDeltaEncodingSuite.`random data test` - posted by GitBox <gi...@apache.org> on 2023/01/18 22:51:43 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #39585: [WIP] Scalar Inline Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/18 23:22:45 UTC, 7 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39647: [SPARK-42075][DSTREAM] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/18 23:54:04 UTC, 4 replies.
- [GitHub] [spark] HeartSaVioR commented on a diff in pull request #39647: [SPARK-42075][DSTREAM] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/19 00:04:52 UTC, 13 replies.
- [GitHub] [spark] chaoqin-li1123 commented on a diff in pull request #39647: [SPARK-42075][DSTREAM] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/19 00:07:27 UTC, 2 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #38419: [SPARK-40945][SQL] Support built-in function to truncate numbers - posted by GitBox <gi...@apache.org> on 2023/01/19 00:07:32 UTC, 35 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38159: [SPARK-40594][SQL] Eagerly release hashed relation in ShuffledHashJoin - posted by GitBox <gi...@apache.org> on 2023/01/19 00:19:58 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37915: [SPARK-40465][SQL] Refactor Decimal so as we can use other underlying implementation - posted by GitBox <gi...@apache.org> on 2023/01/19 00:20:01 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #37819: [SPARK-40377][SQL] Allow customize maxBroadcastTableBytes - posted by GitBox <gi...@apache.org> on 2023/01/19 00:20:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39267: [WIP][SPARK-41592][PYTHON][ML] Pytorch file Distributed Training - posted by GitBox <gi...@apache.org> on 2023/01/19 00:24:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39267: [SPARK-41592][PYTHON][ML] Pytorch file Distributed Training - posted by GitBox <gi...@apache.org> on 2023/01/19 00:24:55 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39641: [SPARK-42106][PYTHON] Hide parameters when printing remote URL in REPL - posted by GitBox <gi...@apache.org> on 2023/01/19 00:37:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39641: [SPARK-42106][PYTHON] Hide parameters when printing remote URL in REPL - posted by GitBox <gi...@apache.org> on 2023/01/19 00:37:57 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 commented on pull request #39644: [SPARK-41415][3.3] SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/19 01:03:50 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39488: [SPARK-41970] Introduce SparkPath for typesafety - posted by GitBox <gi...@apache.org> on 2023/01/19 02:04:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #39255: [DON'T MERGE][BUILD] Switch default protobuf-java version to 3.x - posted by GitBox <gi...@apache.org> on 2023/01/19 02:04:23 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39488: [SPARK-41970] Introduce SparkPath for typesafety - posted by GitBox <gi...@apache.org> on 2023/01/19 02:06:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39636: [SPARK-42108][SQL] Make Analyzer transform `Count(*)` into `Count(1)` - posted by GitBox <gi...@apache.org> on 2023/01/19 02:25:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39636: [SPARK-42108][SQL] Make Analyzer transform `Count(*)` into `Count(1)` - posted by GitBox <gi...@apache.org> on 2023/01/19 02:25:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39647: [SPARK-42075][DSTREAM] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/19 02:43:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39649: [SPARK-42111][SQL][TESTS] Mark `Orc*FilterSuite/OrcV*SchemaPruningSuite` as `ExtendedSQLTest` - posted by GitBox <gi...@apache.org> on 2023/01/19 02:54:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39649: [SPARK-42111][SQL][TESTS] Mark `Orc*FilterSuite/OrcV*SchemaPruningSuite` as `ExtendedSQLTest` - posted by GitBox <gi...@apache.org> on 2023/01/19 03:00:36 UTC, 3 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39275: [SPARK-41759][CORE] Use `weakIntern` on string values in create new objects during deserialization - posted by GitBox <gi...@apache.org> on 2023/01/19 03:09:29 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39275: [SPARK-41759][CORE] Use `weakIntern` on string values in create new objects during deserialization - posted by GitBox <gi...@apache.org> on 2023/01/19 03:13:39 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #37525: [SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by GitBox <gi...@apache.org> on 2023/01/19 03:15:09 UTC, 36 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #39299: [WIP][SPARK-41593][PYTHON][ML] Adding logging from executors - posted by GitBox <gi...@apache.org> on 2023/01/19 03:43:57 UTC, 16 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #39369: [WIP][SPARK-41775][PYTHON][ML] Adding support for PyForch functions - posted by GitBox <gi...@apache.org> on 2023/01/19 04:19:55 UTC, 1 replies.
- [GitHub] [spark] WeichenXu123 commented on pull request #39369: [WIP][SPARK-41775][PYTHON][ML] Adding support for PyForch functions - posted by GitBox <gi...@apache.org> on 2023/01/19 04:23:50 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #39637: [WIP][SPARK-41777][PYSPARK][ML] Integration testing for TorchDistributor - posted by GitBox <gi...@apache.org> on 2023/01/19 04:25:41 UTC, 1 replies.
- [GitHub] [spark] chaoqin-li1123 commented on pull request #39647: [SPARK-42075][DSTREAM] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/19 05:41:46 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39647: [SPARK-42075][DSTREAM] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/19 05:48:12 UTC, 2 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #38005: [SPARK-40550][SQL] DataSource V2: Handle DELETE commands for delta-based sources - posted by GitBox <gi...@apache.org> on 2023/01/19 06:34:52 UTC, 3 replies.
- [GitHub] [spark] peter-toth commented on a diff in pull request #37525: [SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by GitBox <gi...@apache.org> on 2023/01/19 07:21:06 UTC, 43 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39508: [SPARK-41985][SQL] Centralize more column resolution rules - posted by GitBox <gi...@apache.org> on 2023/01/19 07:53:48 UTC, 6 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39650: [SPARK-42112][SS] Add null check before `ContinuousWriteRDD#compute` function close dataWriter - posted by GitBox <gi...@apache.org> on 2023/01/19 08:28:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39650: [SPARK-42112][SQL][SS] Add null check before `ContinuousWriteRDD#compute` function close `dataWriter` - posted by GitBox <gi...@apache.org> on 2023/01/19 08:30:15 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39650: [SPARK-42112][SQL][SS] Add null check before `ContinuousWriteRDD#compute` function close `dataWriter` - posted by GitBox <gi...@apache.org> on 2023/01/19 08:34:48 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39651: [SPARK-42113][PS][INFRA] Upgrade pandas to 1.5.3 - posted by GitBox <gi...@apache.org> on 2023/01/19 09:22:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39642: [WIP][SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for `StreamingQueryProgressWrapper` - posted by GitBox <gi...@apache.org> on 2023/01/19 09:42:24 UTC, 1 replies.
- [GitHub] [spark] antonipp commented on a diff in pull request #38376: [SPARK-40817][K8S] `spark.files` should preserve remote files - posted by GitBox <gi...@apache.org> on 2023/01/19 09:45:34 UTC, 3 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #39652: [SPARK-40599][SQL] Relax multiTransform rule type to allow any kinds of Seq - posted by GitBox <gi...@apache.org> on 2023/01/19 11:21:18 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #39652: [SPARK-40599][SQL] Relax multiTransform rule type to allow any kinds of Seq - posted by GitBox <gi...@apache.org> on 2023/01/19 11:21:59 UTC, 0 replies.
- [GitHub] [spark] codecov-commenter commented on pull request #39647: [SPARK-42075][DSTREAM] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/19 11:37:30 UTC, 0 replies.
- [GitHub] [spark] panbingkun commented on a diff in pull request #39275: [SPARK-41759][CORE] Use `weakIntern` on string values in create new objects during deserialization - posted by GitBox <gi...@apache.org> on 2023/01/19 12:05:05 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39653: [SPARK-42115][SQL][PYTHON] Push down limit through Python UDFs - posted by GitBox <gi...@apache.org> on 2023/01/19 12:57:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39653: [SPARK-42115][SQL][PYTHON] Push down limit through Python UDFs - posted by GitBox <gi...@apache.org> on 2023/01/19 13:00:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39585: [WIP] Scalar Inline Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/19 13:19:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39649: [SPARK-42111][SQL][TESTS] Mark `Orc*FilterSuite/OrcV*SchemaPruningSuite` as `ExtendedSQLTest` - posted by GitBox <gi...@apache.org> on 2023/01/19 13:31:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39649: [SPARK-42111][SQL][TESTS] Mark `Orc*FilterSuite/OrcV*SchemaPruningSuite` as `ExtendedSQLTest` - posted by GitBox <gi...@apache.org> on 2023/01/19 13:31:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39639: [SPARK-42080][PYTHON][DOCS] Add guideline for PySpark errors - posted by GitBox <gi...@apache.org> on 2023/01/19 13:35:18 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39190: [SPARK-41683][CORE] Fix issue of getting incorrect property numActiveStages in jobs API - posted by GitBox <gi...@apache.org> on 2023/01/19 14:26:05 UTC, 4 replies.
- [GitHub] [spark] tedyu opened a new pull request, #39654: [SHUFFLE][MINOR] Include IOException in warning log of finalizeShuffleMerge - posted by GitBox <gi...@apache.org> on 2023/01/19 14:51:11 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #39654: [SHUFFLE][MINOR] Include IOException in warning log of finalizeShuffleMerge - posted by GitBox <gi...@apache.org> on 2023/01/19 14:51:21 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #37525: [SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by GitBox <gi...@apache.org> on 2023/01/19 15:39:22 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39652: [SPARK-40599][SQL] Relax multiTransform rule type to allow alternatives to be any kinds of Seq - posted by GitBox <gi...@apache.org> on 2023/01/19 15:54:06 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #39652: [SPARK-40599][SQL] Relax multiTransform rule type to allow alternatives to be any kinds of Seq - posted by GitBox <gi...@apache.org> on 2023/01/19 15:54:40 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39652: [SPARK-40599][SQL] Relax multiTransform rule type to allow alternatives to be any kinds of Seq - posted by GitBox <gi...@apache.org> on 2023/01/19 15:54:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39651: [SPARK-42113][PS][INFRA] Upgrade pandas to 1.5.3 - posted by GitBox <gi...@apache.org> on 2023/01/19 16:54:56 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #38005: [SPARK-40550][SQL] DataSource V2: Handle DELETE commands for delta-based sources - posted by GitBox <gi...@apache.org> on 2023/01/19 17:13:01 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39655: [SPARK-42116][SQL][TESTS] Mark `ColumnarBatchSuite` as `ExtendedSQLTest` - posted by GitBox <gi...@apache.org> on 2023/01/19 17:48:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39655: [SPARK-42116][SQL][TESTS] Mark `ColumnarBatchSuite` as `ExtendedSQLTest` - posted by GitBox <gi...@apache.org> on 2023/01/19 18:18:05 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39654: [SHUFFLE][MINOR] Include IOException in warning log of finalizeShuffleMerge - posted by GitBox <gi...@apache.org> on 2023/01/19 18:23:58 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on a diff in pull request #39654: [SHUFFLE][MINOR] Include IOException in warning log of finalizeShuffleMerge - posted by GitBox <gi...@apache.org> on 2023/01/19 18:26:33 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #38419: [SPARK-40945][SQL] Support built-in function to truncate numbers - posted by GitBox <gi...@apache.org> on 2023/01/19 18:29:40 UTC, 1 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38419: [SPARK-40945][SQL] Support built-in function to truncate numbers - posted by GitBox <gi...@apache.org> on 2023/01/19 18:41:49 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39654: [MINOR][SHUFFLE] Include IOException in warning log of finalizeShuffleMerge - posted by GitBox <gi...@apache.org> on 2023/01/19 18:49:15 UTC, 1 replies.
- [GitHub] [spark] tedyu commented on a diff in pull request #39654: [MINOR][SHUFFLE] Include IOException in warning log of finalizeShuffleMerge - posted by GitBox <gi...@apache.org> on 2023/01/19 18:57:06 UTC, 2 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #39637: [WIP][SPARK-41777][PYSPARK][ML] Integration testing for TorchDistributor - posted by GitBox <gi...@apache.org> on 2023/01/19 19:51:20 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #38005: [SPARK-40550][SQL] DataSource V2: Handle DELETE commands for delta-based sources - posted by GitBox <gi...@apache.org> on 2023/01/19 19:59:51 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39655: [SPARK-42116][SQL][TESTS] Mark `ColumnarBatchSuite` as `ExtendedSQLTest` - posted by GitBox <gi...@apache.org> on 2023/01/19 20:04:25 UTC, 0 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #39637: [SPARK-41777][PYSPARK][ML] Integration testing for TorchDistributor - posted by GitBox <gi...@apache.org> on 2023/01/19 20:20:07 UTC, 0 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #39299: [WIP][SPARK-41593][PYTHON][ML] Adding logging from executors - posted by GitBox <gi...@apache.org> on 2023/01/19 20:22:47 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39541: [SPARK-42043][CONNECT] Scala Client Result with E2E Tests - posted by GitBox <gi...@apache.org> on 2023/01/19 20:31:08 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #38005: [SPARK-40550][SQL] DataSource V2: Handle DELETE commands for delta-based sources - posted by GitBox <gi...@apache.org> on 2023/01/19 21:08:57 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39275: [SPARK-41759][CORE] Use `weakIntern` on string values in create new objects during deserialization - posted by GitBox <gi...@apache.org> on 2023/01/19 21:53:46 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #39541: [SPARK-42043][CONNECT] Scala Client Result with E2E Tests - posted by GitBox <gi...@apache.org> on 2023/01/19 22:00:44 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39541: [SPARK-42043][CONNECT] Scala Client Result with E2E Tests - posted by GitBox <gi...@apache.org> on 2023/01/19 22:00:51 UTC, 0 replies.
- [GitHub] [spark] aokolnychyi commented on pull request #32921: [SPARK-35779][SQL] Dynamic filtering for Data Source V2 - posted by GitBox <gi...@apache.org> on 2023/01/19 22:37:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39541: [SPARK-42043][CONNECT] Scala Client Result with E2E Tests - posted by GitBox <gi...@apache.org> on 2023/01/19 22:42:00 UTC, 5 replies.
- [GitHub] [spark] allisonwang-db opened a new pull request, #39656: [SPARK-42119][SQL] Add built-in table-valued functions inline and inline_outer - posted by GitBox <gi...@apache.org> on 2023/01/19 22:46:39 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39654: [MINOR][SHUFFLE] Include IOException in warning log of finalizeShuffleMerge - posted by GitBox <gi...@apache.org> on 2023/01/19 22:53:21 UTC, 2 replies.
- [GitHub] [spark] zhenlineo commented on pull request #39541: [SPARK-42043][CONNECT] Scala Client Result with E2E Tests - posted by GitBox <gi...@apache.org> on 2023/01/19 22:56:43 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39541: [SPARK-42043][CONNECT] Scala Client Result with E2E Tests - posted by GitBox <gi...@apache.org> on 2023/01/19 23:08:28 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38005: [SPARK-40550][SQL] DataSource V2: Handle DELETE commands for delta-based sources - posted by GitBox <gi...@apache.org> on 2023/01/19 23:19:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38005: [SPARK-40550][SQL] DataSource V2: Handle DELETE commands for delta-based sources - posted by GitBox <gi...@apache.org> on 2023/01/19 23:20:22 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #39541: [SPARK-42043][CONNECT] Scala Client Result with E2E Tests - posted by GitBox <gi...@apache.org> on 2023/01/19 23:21:19 UTC, 7 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38180: [SPARK-40719][SQL] `CTAS` should respect `TBLPROPERTIES` during execution - posted by GitBox <gi...@apache.org> on 2023/01/20 00:20:29 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #38053: [SPARK-40600] Support recursiveFileLookup for partitioned datasource - posted by GitBox <gi...@apache.org> on 2023/01/20 00:20:31 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] closed pull request #38159: [SPARK-40594][SQL] Eagerly release hashed relation in ShuffledHashJoin - posted by GitBox <gi...@apache.org> on 2023/01/20 00:20:31 UTC, 0 replies.
- [GitHub] [spark] github-actions[bot] commented on pull request #37360: [SPARK-39931][PYTHON][WIP] Improve applyInPandas performance for very small groups - posted by GitBox <gi...@apache.org> on 2023/01/20 00:20:33 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #39657: [SPARK-42123][SQL] Include column default values in DESCRIBE output - posted by GitBox <gi...@apache.org> on 2023/01/20 00:29:56 UTC, 0 replies.
- [GitHub] [spark] tedyu commented on pull request #39654: [MINOR][SHUFFLE] Include IOException in warning log of finalizeShuffleMerge - posted by GitBox <gi...@apache.org> on 2023/01/20 00:35:52 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38180: [SPARK-40719][SQL] `CTAS` should respect `TBLPROPERTIES` during execution - posted by GitBox <gi...@apache.org> on 2023/01/20 00:50:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39647: [SPARK-42075][DSTREAM] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/20 01:34:39 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #39658: [SPARK-42043][CONNECT] Fix connect jar finding issue in test for scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/20 01:37:13 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #39658: [SPARK-42043][CONNECT] Fix connect jar finding issue in test for scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/20 01:38:52 UTC, 0 replies.
- [GitHub] [spark] bersprockets opened a new pull request, #39659: [MINOR][TESTS][DOCS] Fix instructions for running `AvroWriteBenchmark` from sbt - posted by GitBox <gi...@apache.org> on 2023/01/20 01:42:27 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #39657: [SPARK-42123][SQL] Include column default values in DESCRIBE output - posted by GitBox <gi...@apache.org> on 2023/01/20 01:43:21 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39647: [SPARK-42075][DSTREAM] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/20 01:46:11 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39645: [SPARK-41415][3.2] SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/20 01:46:15 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39644: [SPARK-41415][3.3] SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/20 01:46:18 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39643: [Minor] fix typo in BlockManager.scala: 'cheksum' ---> 'checksum' - posted by GitBox <gi...@apache.org> on 2023/01/20 01:46:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39659: [MINOR][TESTS][DOCS] Fix instructions for running `AvroWriteBenchmark` from sbt - posted by GitBox <gi...@apache.org> on 2023/01/20 01:54:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39659: [MINOR][TESTS][DOCS] Fix instructions for running `AvroWriteBenchmark` from sbt - posted by GitBox <gi...@apache.org> on 2023/01/20 01:54:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39654: [MINOR][SHUFFLE] Include IOException in warning log of finalizeShuffleMerge - posted by GitBox <gi...@apache.org> on 2023/01/20 01:58:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39658: [SPARK-42043][CONNECT] Fix connect jar finding issue in test for scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/20 02:14:22 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39658: [SPARK-42043][CONNECT][TESTS] Fix connect jar finding issue for scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/20 02:17:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38376: [SPARK-40817][K8S] `spark.files` should preserve remote files - posted by GitBox <gi...@apache.org> on 2023/01/20 02:33:36 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #38376: [SPARK-40817][K8S] `spark.files` should preserve remote files - posted by GitBox <gi...@apache.org> on 2023/01/20 02:34:01 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39541: [SPARK-42043][CONNECT] Scala Client Result with E2E Tests - posted by GitBox <gi...@apache.org> on 2023/01/20 02:36:09 UTC, 1 replies.
- [GitHub] [spark] aokolnychyi commented on a diff in pull request #38004: [SPARK-40551][SQL] DataSource V2: Add APIs for delta-based row-level operations - posted by GitBox <gi...@apache.org> on 2023/01/20 02:51:41 UTC, 0 replies.
- [GitHub] [spark] ClownXC commented on pull request #39643: [Minor] fix typo in BlockManager.scala: 'cheksum' ---> 'checksum' - posted by GitBox <gi...@apache.org> on 2023/01/20 02:53:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39658: [SPARK-42043][CONNECT][TESTS] Fix connect jar finding issue for scala 2.13 - posted by GitBox <gi...@apache.org> on 2023/01/20 02:55:06 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39650: [SPARK-42112][SQL][SS] Add null check before `ContinuousWriteRDD#compute` function close `dataWriter` - posted by GitBox <gi...@apache.org> on 2023/01/20 03:24:04 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39585: [WIP] Scalar Inline Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/20 03:27:45 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39650: [SPARK-42112][SQL][SS] Add null check before `ContinuousWriteRDD#compute` function close `dataWriter` - posted by GitBox <gi...@apache.org> on 2023/01/20 03:31:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39633: [SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by GitBox <gi...@apache.org> on 2023/01/20 03:41:00 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39644: [SPARK-41415][3.3] SASL Request Retries - posted by GitBox <gi...@apache.org> on 2023/01/20 03:44:52 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39620: [SPARK-42096][CONNECT] Some code cleanup for `connect` module - posted by GitBox <gi...@apache.org> on 2023/01/20 03:46:32 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39640: [SPARK-38591][SQL] Add flatMapSortedGroups and cogroupSorted - posted by GitBox <gi...@apache.org> on 2023/01/20 03:52:03 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39643: [MINOR] Fix typo `cheksum` to `checksum` in `BlockManager` - posted by GitBox <gi...@apache.org> on 2023/01/20 04:01:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39643: [MINOR] Fix typo `cheksum` to `checksum` in `BlockManager` - posted by GitBox <gi...@apache.org> on 2023/01/20 04:01:55 UTC, 0 replies.
- [GitHub] [spark] wecharyu commented on pull request #39115: [SPARK-41563][SQL] Support partition filter in MSCK REPAIR TABLE statement - posted by GitBox <gi...@apache.org> on 2023/01/20 04:17:37 UTC, 1 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #39660: [SPARK-42128] Support TOP (N) for MS SQL Server dialect as an alternative to Limit pushdown - posted by GitBox <gi...@apache.org> on 2023/01/20 04:33:23 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #39660: [SPARK-42128][SQL] Support TOP (N) for MS SQL Server dialect as an alternative to Limit pushdown - posted by GitBox <gi...@apache.org> on 2023/01/20 04:35:19 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39660: [SPARK-42128][SQL] Support TOP (N) for MS SQL Server dialect as an alternative to Limit pushdown - posted by GitBox <gi...@apache.org> on 2023/01/20 04:38:21 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39661: [SPARK-41884][CONNECT] Support naive tuple as a nested row - posted by GitBox <gi...@apache.org> on 2023/01/20 04:44:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39661: [SPARK-41884][CONNECT] Support naive tuple as a nested row - posted by GitBox <gi...@apache.org> on 2023/01/20 04:45:16 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39620: [SPARK-42096][CONNECT] Some code cleanup for `connect` module - posted by GitBox <gi...@apache.org> on 2023/01/20 05:03:00 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39585: [SPARK-42124][PYTHON][CONNECT] Scalar Inline Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/20 05:18:14 UTC, 3 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39640: [SPARK-38591][SQL] Add flatMapSortedGroups and cogroupSorted - posted by GitBox <gi...@apache.org> on 2023/01/20 05:21:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39640: [SPARK-38591][SQL] Add flatMapSortedGroups and cogroupSorted - posted by GitBox <gi...@apache.org> on 2023/01/20 05:22:01 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39637: [SPARK-41777][PYSPARK][ML] Integration testing for TorchDistributor - posted by GitBox <gi...@apache.org> on 2023/01/20 05:57:53 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR opened a new pull request, #39662: [SPARK-42105][SS][DOCS] Reflect the change of SPARK-40925 to SS guide doc - posted by GitBox <gi...@apache.org> on 2023/01/20 06:08:23 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39663: [SPARK-42129][BUILD] Upgrade rocksdbjni to 7.9.2 - posted by GitBox <gi...@apache.org> on 2023/01/20 06:13:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39633: [SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by GitBox <gi...@apache.org> on 2023/01/20 06:20:19 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39661: [SPARK-41884][CONNECT] Support naive tuple as a nested row - posted by GitBox <gi...@apache.org> on 2023/01/20 06:26:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39661: [SPARK-41884][CONNECT] Support naive tuple as a nested row - posted by GitBox <gi...@apache.org> on 2023/01/20 06:27:20 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39577: [SPARK-42070][SQL] Change the default value of argument of Mask function from -1 to NULL - posted by GitBox <gi...@apache.org> on 2023/01/20 06:39:34 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39577: [SPARK-42070][SQL] Change the default value of argument of Mask function from -1 to NULL - posted by GitBox <gi...@apache.org> on 2023/01/20 06:40:14 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #39585: [SPARK-42124][PYTHON][CONNECT] Scalar Inline Python UDF in Spark Connect - posted by GitBox <gi...@apache.org> on 2023/01/20 06:42:21 UTC, 6 replies.
- [GitHub] [spark] HeartSaVioR commented on pull request #39662: [SPARK-42105][SS][DOCS] Reflect the change of SPARK-40925 to SS guide doc - posted by GitBox <gi...@apache.org> on 2023/01/20 06:45:38 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39543: [SPARK-42044][SQL] Fix incorrect error message for `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by GitBox <gi...@apache.org> on 2023/01/20 06:52:53 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39498: [SPARK-41976][SQL] Improve error message for `INDEX_NOT_FOUND` - posted by GitBox <gi...@apache.org> on 2023/01/20 06:53:47 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39498: [SPARK-41976][SQL] Improve error message for `INDEX_NOT_FOUND` - posted by GitBox <gi...@apache.org> on 2023/01/20 06:54:05 UTC, 0 replies.
- [GitHub] [spark] ggershinsky opened a new pull request, #39664: [SPARK-42114][SQL] Test of uniform parquet encryption - posted by GitBox <gi...@apache.org> on 2023/01/20 07:36:40 UTC, 0 replies.
- [GitHub] [spark] ggershinsky closed pull request #39664: [SPARK-42114][SQL] Test of uniform parquet encryption - posted by GitBox <gi...@apache.org> on 2023/01/20 07:40:17 UTC, 1 replies.
- [GitHub] [spark] ggershinsky commented on pull request #39664: [SPARK-42114][SQL] Test of uniform parquet encryption - posted by GitBox <gi...@apache.org> on 2023/01/20 07:42:39 UTC, 2 replies.
- [GitHub] [spark] ggershinsky opened a new pull request, #39665: [SPARK-42114][SQL] Test of uniform parquet encryption - posted by GitBox <gi...@apache.org> on 2023/01/20 07:55:26 UTC, 0 replies.
- [GitHub] [spark] ggershinsky commented on pull request #39665: [SPARK-42114][SQL] Test of uniform parquet encryption - posted by GitBox <gi...@apache.org> on 2023/01/20 07:59:35 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39629: [SPARK-42103][PYTHON][ML] Added Instrumentation for PyTorch Distributor - posted by GitBox <gi...@apache.org> on 2023/01/20 08:04:32 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39628: [SPARK-40264][ML][DOCS] Supplement docstring in pyspark.ml.functions.predict_batch_udf - posted by GitBox <gi...@apache.org> on 2023/01/20 08:04:37 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39626: An automatic caching solution for Spark - posted by GitBox <gi...@apache.org> on 2023/01/20 08:04:40 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #39662: [SPARK-42105][SS][DOCS] Reflect the change of SPARK-40925 to SS guide doc - posted by GitBox <gi...@apache.org> on 2023/01/20 08:04:42 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39666: [SPARK-42130][UI] Handle null string values in AccumulableInfo and ProcessSummary - posted by GitBox <gi...@apache.org> on 2023/01/20 08:10:15 UTC, 0 replies.
- [GitHub] [spark] HeartSaVioR closed pull request #39647: [SPARK-42075][DSTREAM] Deprecate DStream API - posted by GitBox <gi...@apache.org> on 2023/01/20 08:11:40 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39666: [SPARK-42130][UI] Handle null string values in AccumulableInfo and ProcessSummary - posted by GitBox <gi...@apache.org> on 2023/01/20 08:13:07 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39628: [SPARK-40264][ML][DOCS] Supplement docstring in pyspark.ml.functions.predict_batch_udf - posted by GitBox <gi...@apache.org> on 2023/01/20 08:13:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39665: [SPARK-42114][SQL] Test of uniform parquet encryption - posted by GitBox <gi...@apache.org> on 2023/01/20 08:14:48 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39660: [SPARK-42128][SQL] Support TOP (N) for MS SQL Server dialect as an alternative to Limit pushdown - posted by GitBox <gi...@apache.org> on 2023/01/20 08:26:44 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39666: [SPARK-42130][UI] Handle null string values in AccumulableInfo and ProcessSummary - posted by GitBox <gi...@apache.org> on 2023/01/20 08:34:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39642: [SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for `StreamingQueryProgressWrapper` - posted by GitBox <gi...@apache.org> on 2023/01/20 08:42:38 UTC, 5 replies.
- [GitHub] [spark] EnricoMi closed pull request #37551: [SPARK-38591][SQL] Add sortWithinGroups to KeyValueGroupedDataset - posted by GitBox <gi...@apache.org> on 2023/01/20 08:49:41 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39666: [SPARK-42130][UI] Handle null string values in AccumulableInfo and ProcessSummary - posted by GitBox <gi...@apache.org> on 2023/01/20 08:55:04 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39666: [SPARK-42130][UI] Handle null string values in AccumulableInfo and ProcessSummary - posted by GitBox <gi...@apache.org> on 2023/01/20 08:57:59 UTC, 3 replies.
- [GitHub] [spark] beliefer opened a new pull request, #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by GitBox <gi...@apache.org> on 2023/01/20 09:13:33 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39660: [SPARK-42128][SQL] Support TOP (N) for MS SQL Server dialect as an alternative to Limit pushdown - posted by GitBox <gi...@apache.org> on 2023/01/20 09:19:36 UTC, 5 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39661: [SPARK-41884][CONNECT] Support naive tuple as a nested row - posted by GitBox <gi...@apache.org> on 2023/01/20 09:23:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39665: [SPARK-42114][SQL][TESTS] Add uniform parquet encryption test case - posted by GitBox <gi...@apache.org> on 2023/01/20 09:53:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39665: [SPARK-42114][SQL][TESTS] Add uniform parquet encryption test case - posted by GitBox <gi...@apache.org> on 2023/01/20 09:55:25 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39666: [SPARK-42130][UI] Handle null string values in AccumulableInfo and ProcessSummary - posted by GitBox <gi...@apache.org> on 2023/01/20 09:57:33 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39664: [SPARK-42114][SQL] Test of uniform parquet encryption - posted by GitBox <gi...@apache.org> on 2023/01/20 09:59:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39541: [SPARK-42043][CONNECT] Scala Client Result with E2E Tests - posted by GitBox <gi...@apache.org> on 2023/01/20 10:07:29 UTC, 10 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39668: [WIP] Test 3.4.0 tagging - posted by GitBox <gi...@apache.org> on 2023/01/20 10:09:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39668: [WIP] Test 3.4.0 tagging - posted by GitBox <gi...@apache.org> on 2023/01/20 10:26:19 UTC, 0 replies.
- [GitHub] [spark] antonipp opened a new pull request, #39669: [SPARK-40817][K8S][3.3] `spark.files` should preserve remote files - posted by GitBox <gi...@apache.org> on 2023/01/20 10:42:48 UTC, 0 replies.
- [GitHub] [spark] antonipp opened a new pull request, #39670: [SPARK-40817][K8S][3.2] `spark.files` should preserve remote files - posted by GitBox <gi...@apache.org> on 2023/01/20 10:42:51 UTC, 0 replies.
- [GitHub] [spark] antonipp commented on pull request #38376: [SPARK-40817][K8S] `spark.files` should preserve remote files - posted by GitBox <gi...@apache.org> on 2023/01/20 10:45:35 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #39671: [SPARK-40303][DOCS] Recommends users to use JDK 8u362 and later versions - posted by GitBox <gi...@apache.org> on 2023/01/20 10:48:57 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39671: [SPARK-40303][DOCS] Recommends users to use JDK 8u362 and later versions - posted by GitBox <gi...@apache.org> on 2023/01/20 11:10:11 UTC, 0 replies.
- [GitHub] [spark] vicennial opened a new pull request, #39672: [SPARK-42133] Add basic Dataset API methods to Spark Connect Scala Client - posted by GitBox <gi...@apache.org> on 2023/01/20 11:15:45 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39673: [SPARK-42132][SQL] Deduplicate attributes in groupByKey.cogroup - posted by GitBox <gi...@apache.org> on 2023/01/20 11:17:43 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39673: [SPARK-42132][SQL] Deduplicate attributes in groupByKey.cogroup - posted by GitBox <gi...@apache.org> on 2023/01/20 11:21:31 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #39671: [SPARK-40303][DOCS] Recommends users to use JDK 8u362 and later versions - posted by GitBox <gi...@apache.org> on 2023/01/20 11:46:05 UTC, 0 replies.
- [GitHub] [spark] WeichenXu123 commented on a diff in pull request #39369: [SPARK-41775][PYTHON][ML] Adding support for PyForch functions - posted by GitBox <gi...@apache.org> on 2023/01/20 12:20:08 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39671: [SPARK-40303][DOCS] Deprecate old Java 8 versions prior to 8u362 - posted by GitBox <gi...@apache.org> on 2023/01/20 12:28:10 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39668: [WIP] Test 3.4.0 tagging - posted by GitBox <gi...@apache.org> on 2023/01/20 12:28:43 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39671: [SPARK-40303][DOCS] Deprecate old Java 8 versions prior to 8u362 - posted by GitBox <gi...@apache.org> on 2023/01/20 12:34:25 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39663: [SPARK-42129][BUILD] Upgrade rocksdbjni to 7.9.2 - posted by GitBox <gi...@apache.org> on 2023/01/20 12:35:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39674: [DON'T MERGE] Test remove SPARK_USE_CONC_INCR_GC - posted by GitBox <gi...@apache.org> on 2023/01/20 12:36:50 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39675: [MINOR][DOCS] Update the doc of arrow & kubernetes - posted by GitBox <gi...@apache.org> on 2023/01/20 12:41:55 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39674: [DON'T MERGE] Test remove SPARK_USE_CONC_INCR_GC - posted by GitBox <gi...@apache.org> on 2023/01/20 12:57:06 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39663: [SPARK-42129][BUILD] Upgrade rocksdbjni to 7.9.2 - posted by GitBox <gi...@apache.org> on 2023/01/20 13:01:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39674: [DON'T MERGE] Test remove SPARK_USE_CONC_INCR_GC - posted by GitBox <gi...@apache.org> on 2023/01/20 13:20:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39675: [MINOR][DOCS] Update the doc of arrow & kubernetes - posted by GitBox <gi...@apache.org> on 2023/01/20 13:22:13 UTC, 2 replies.
- [GitHub] [spark] peter-toth opened a new pull request, #39676: [SPARK-42134][SQL] Fix getPartitionFiltersAndDataFilters() to handle filters without referenced attributes - posted by GitBox <gi...@apache.org> on 2023/01/20 13:59:26 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #39676: [SPARK-42134][SQL] Fix getPartitionFiltersAndDataFilters() to handle filters without referenced attributes - posted by GitBox <gi...@apache.org> on 2023/01/20 14:02:36 UTC, 1 replies.
- [GitHub] [spark] ggershinsky commented on pull request #39665: [SPARK-42114][SQL][TESTS] Add uniform parquet encryption test case - posted by GitBox <gi...@apache.org> on 2023/01/20 14:16:35 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on a diff in pull request #39674: [DON'T MERGE] Test remove SPARK_USE_CONC_INCR_GC - posted by GitBox <gi...@apache.org> on 2023/01/20 14:30:29 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39660: [SPARK-42128][SQL] Support TOP (N) for MS SQL Server dialect as an alternative to Limit pushdown - posted by GitBox <gi...@apache.org> on 2023/01/20 15:23:54 UTC, 2 replies.
- [GitHub] [spark] itholic commented on pull request #39505: [SPARK-41979][SQL] Add missing dots for error messages in error classes. - posted by GitBox <gi...@apache.org> on 2023/01/20 15:56:08 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #39677: [SPARK-42043][CONNECT][TEST] Better env var and a few bug fixes - posted by GitBox <gi...@apache.org> on 2023/01/20 16:16:28 UTC, 0 replies.
- [GitHub] [spark] peter-toth commented on pull request #38038: [SPARK-42136] Refactor BroadcastHashJoinExec output partitioning calculation - posted by GitBox <gi...@apache.org> on 2023/01/20 16:22:51 UTC, 2 replies.
- [GitHub] [spark] zhenlineo commented on pull request #39677: [SPARK-42043][CONNECT][TEST] Better env var and a few bug fixes - posted by GitBox <gi...@apache.org> on 2023/01/20 16:27:53 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #39633: [SPARK-42038][SQL] SPJ: Support partially clustered distribution - posted by GitBox <gi...@apache.org> on 2023/01/20 16:54:30 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39671: [SPARK-40303][DOCS] Deprecate old Java 8 versions prior to 8u362 - posted by GitBox <gi...@apache.org> on 2023/01/20 17:04:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39677: [SPARK-42043][CONNECT][TEST][FOLLOWUP] Better env var and a few bug fixes - posted by GitBox <gi...@apache.org> on 2023/01/20 17:07:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39677: [SPARK-42043][CONNECT][TEST][FOLLOWUP] Better env var and a few bug fixes - posted by GitBox <gi...@apache.org> on 2023/01/20 17:08:23 UTC, 0 replies.
- [GitHub] [spark] lu-wang-dl commented on a diff in pull request #39369: [SPARK-41775][PYTHON][ML] Adding support for PyTorch functions - posted by GitBox <gi...@apache.org> on 2023/01/20 17:57:37 UTC, 0 replies.
- [GitHub] [spark] rithwik-db commented on a diff in pull request #39369: [SPARK-41775][PYTHON][ML] Adding support for PyTorch functions - posted by GitBox <gi...@apache.org> on 2023/01/20 17:59:27 UTC, 0 replies.
- [GitHub] [spark] RyanBerti commented on pull request #39678: [SPARK-16484][SQL] Add HyperLogLogPlusPlus sketch generator/evaluator/aggregator - posted by "RyanBerti (via GitHub)" <gi...@apache.org> on 2023/01/20 18:13:04 UTC, 1 replies.
- [GitHub] [spark] ggwiebe commented on pull request #39519: [SPARK-41995][SQL] Accept non-foldable expressions in schema_of_json - posted by "ggwiebe (via GitHub)" <gi...@apache.org> on 2023/01/20 18:38:15 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39657: [SPARK-42123][SQL] Include column default values in DESCRIBE output - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/20 19:04:15 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39666: [SPARK-42130][UI] Handle null string values in AccumulableInfo and ProcessSummary - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/20 21:10:51 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #39657: [SPARK-42123][SQL] Include column default values in DESCRIBE output - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/20 21:28:13 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39666: [SPARK-42130][UI] Handle null string values in AccumulableInfo and ProcessSummary - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/20 21:48:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39679: [SPARK-42137][CORE] Enable `spark.kryo.unsafe` by default - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/20 22:22:11 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39669: [SPARK-40817][K8S][3.3] `spark.files` should preserve remote files - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/20 22:40:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39670: [SPARK-40817][K8S][3.2] `spark.files` should preserve remote files - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/20 22:44:08 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39680: [SPARK-42138][UI] Handle null string values in JobData/TaskDataWrapper/ExecutorStageSummaryWrapper - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/20 23:31:07 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39680: [SPARK-42138][UI] Handle null string values in JobData/TaskDataWrapper/ExecutorStageSummaryWrapper - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/20 23:31:21 UTC, 1 replies.
- [GitHub] [spark] panbingkun commented on pull request #39675: [MINOR][DOCS] Update the doc of arrow & kubernetes - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/01/20 23:45:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39675: [MINOR][DOCS] Remove Python 3.9 and Apache Arrow warning comment - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 00:13:48 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #39678: [SPARK-16484][SQL] Add HyperLogLogPlusPlus sketch generator/evaluator/aggregator - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/21 00:34:52 UTC, 0 replies.
- [GitHub] [spark] tedyu closed pull request #39654: [MINOR][SHUFFLE] Include IOException in warning log of finalizeShuffleMerge - posted by "tedyu (via GitHub)" <gi...@apache.org> on 2023/01/21 00:54:19 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39679: [SPARK-42137][CORE] Enable `spark.kryo.unsafe` by default - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 01:17:54 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39679: [SPARK-42137][CORE] Enable `spark.kryo.unsafe` by default - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 01:36:42 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39622: [SPARK-42099][SPARK-41845][CONNECT][PYTHON] Fix `count(*)` and `count(col(*))` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/21 01:48:45 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39622: [SPARK-42099][SPARK-41845][CONNECT][PYTHON] Fix `count(*)` and `count(col(*))` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/21 01:49:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39680: [SPARK-42138][UI] Handle null string values in JobData/TaskDataWrapper/ExecutorStageSummaryWrapper - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 01:54:09 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39638: [SPARK-42082][SPARK-41598][PYTHON][CONNECT] Introduce `PySparkValueError` and `PySparkTypeError` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/21 01:59:43 UTC, 0 replies.
- [GitHub] [spark] joveyuan-db opened a new pull request, #39681: [SPARK-18011] Fix SparkR NA date serialization - posted by "joveyuan-db (via GitHub)" <gi...@apache.org> on 2023/01/21 02:31:20 UTC, 0 replies.
- [GitHub] [spark] huaxingao closed pull request #39676: [SPARK-42134][SQL] Fix getPartitionFiltersAndDataFilters() to handle filters without referenced attributes - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/01/21 02:35:45 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #39676: [SPARK-42134][SQL] Fix getPartitionFiltersAndDataFilters() to handle filters without referenced attributes - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/01/21 02:37:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39682: [SPARK-42139][CORE][SQL] Handle null string values in SQLExecutionUIData/SQLPlanMetric - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 02:55:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39638: [SPARK-42082][SPARK-41598][PYTHON][CONNECT] Introduce `PySparkValueError` and `PySparkTypeError` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 03:18:05 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39644: [SPARK-41415][3.3] SASL Request Retries - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/21 03:29:33 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39645: [SPARK-41415][3.2] SASL Request Retries - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/21 03:37:47 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #39644: [SPARK-41415][3.3] SASL Request Retries - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/21 03:37:48 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39682: [SPARK-42139][CORE][SQL] Handle null string values in SQLExecutionUIData/SparkPlanGraphWrapper/SQLPlanMetric - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 03:39:19 UTC, 4 replies.
- [GitHub] [spark] mridulm closed pull request #39645: [SPARK-41415][3.2] SASL Request Retries - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/21 03:39:31 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39645: [SPARK-41415][3.2] SASL Request Retries - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/21 03:39:31 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39654: [MINOR][SHUFFLE] Include IOException in warning log of finalizeShuffleMerge - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/21 03:46:26 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39682: [SPARK-42139][CORE][SQL] Handle null string values in SQLExecutionUIData/SparkPlanGraphWrapper/SQLPlanMetric - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/21 03:48:20 UTC, 1 replies.
- [GitHub] [spark] tedyu opened a new pull request, #39654: [MINOR][SHUFFLE] Include IOException in warning log of finalizeShuffleMerge - posted by "tedyu (via GitHub)" <gi...@apache.org> on 2023/01/21 03:56:42 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39683: [SPARK-42144][CORE][SQL] Handle null string values in StageDataWrapper/StreamBlockData/StreamingQueryData - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 04:59:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39683: [SPARK-42144][CORE][SQL] Handle null string values in StageDataWrapper/StreamBlockData/StreamingQueryData - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 05:00:36 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39680: [SPARK-42138][UI] Handle null string values in JobData/TaskDataWrapper/ExecutorStageSummaryWrapper - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 05:16:51 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #39678: [SPARK-16484][SQL] Add HyperLogLogPlusPlus sketch generator/evaluator/aggregator - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/01/21 05:34:07 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39680: [SPARK-42138][UI] Handle null string values in JobData/TaskDataWrapper/ExecutorStageSummaryWrapper - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/21 06:34:35 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39680: [SPARK-42138][UI] Handle null string values in JobData/TaskDataWrapper/ExecutorStageSummaryWrapper - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/21 06:35:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39550: [SPARK-42056][SQL][PROTOBUF] Add missing options for Protobuf functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 06:55:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39550: [SPARK-42056][SQL][PROTOBUF] Add missing options for Protobuf functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 06:55:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39677: [SPARK-42043][CONNECT][TEST][FOLLOWUP] Fix jar finding bug and use better env vars and time measurement - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 06:55:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39677: [SPARK-42043][CONNECT][TEST][FOLLOWUP] Fix jar finding bug and use better env vars and time measurement - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 06:56:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39684: [SPARK-42140][CORE] Handle null string values in ApplicationEnvironmentInfoWrapper/ApplicationInfoWrapper - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 07:06:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39684: [SPARK-42140][CORE] Handle null string values in ApplicationEnvironmentInfoWrapper/ApplicationInfoWrapper - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 07:07:49 UTC, 2 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39685: [SPARK-42142][UI] Handle null string values in CachedQuantile/ExecutorSummary/PoolData - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/21 07:07:53 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39685: [SPARK-42142][UI] Handle null string values in CachedQuantile/ExecutorSummary/PoolData - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/21 07:08:14 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39683: [SPARK-42144][CORE][SQL] Handle null string values in StageDataWrapper/StreamBlockData/StreamingQueryData - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/21 07:10:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39628: [SPARK-40264][ML][DOCS] Supplement docstring in pyspark.ml.functions.predict_batch_udf - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 07:13:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39628: [SPARK-40264][ML][DOCS] Supplement docstring in pyspark.ml.functions.predict_batch_udf - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 07:13:56 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39682: [SPARK-42139][CORE][SQL] Handle null string values in SQLExecutionUIData/SparkPlanGraphWrapper/SQLPlanMetric - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/21 07:14:24 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39299: [SPARK-41593][PYTHON][ML] Adding logging from executors - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 07:17:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39299: [SPARK-41593][PYTHON][ML] Adding logging from executors - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 07:18:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39637: [SPARK-41777][PYSPARK][ML] Integration testing for TorchDistributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 07:22:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39637: [SPARK-41777][PYSPARK][ML] Integration testing for TorchDistributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 07:22:43 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39637: [SPARK-41777][PYSPARK][ML] Integration testing for TorchDistributor - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 07:24:20 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39686: [SPARK-42143][UI] Handle null string values in RDDStorageInfo/RDDDataDistribution/RDDPartitionInfo - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/21 07:32:22 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39686: [SPARK-42143][UI] Handle null string values in RDDStorageInfo/RDDDataDistribution/RDDPartitionInfo - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/21 07:32:35 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39682: [SPARK-42139][CORE][SQL] Handle null string values in SQLExecutionUIData/SparkPlanGraphWrapper/SQLPlanMetric - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 07:56:58 UTC, 6 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39683: [SPARK-42144][CORE][SQL] Handle null string values in StageDataWrapper/StreamBlockData/StreamingQueryData - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 08:26:11 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39638: [SPARK-42082][SPARK-41598][PYTHON][CONNECT] Introduce `PySparkValueError` and `PySparkTypeError` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/21 08:50:58 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39543: [SPARK-42044][SQL] Fix incorrect error message for `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/21 09:14:36 UTC, 3 replies.
- [GitHub] [spark] yabola opened a new pull request, #39687: [SPARK-41470][Core] Relax constraints on Storage-Partitioned Join should assume InternalRow implements equals and hashCode - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/01/21 09:50:17 UTC, 0 replies.
- [GitHub] [spark] yabola commented on pull request #39687: [SPARK-41470][Core] Relax constraints on Storage-Partitioned Join should assume InternalRow implements equals and hashCode - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/01/21 09:53:29 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39688: [SPARK-42146][CORE] Refactor `Utils#setStringField` to make maven build pass when sql module use this method - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 10:14:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39689: [SPARK-42148][K8S][BUILD] Upgrade `kubernetes-client` to 6.4.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 10:51:10 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39688: [SPARK-42146][CORE] Refactor `Utils#setStringField` to make maven build pass when sql module use this method - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 11:07:49 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39674: [SPARK-42149][YARN] Remove the env `SPARK_USE_CONC_INCR_GC` used to enable CMS GC for Yarn AM - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 11:28:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39674: [SPARK-42149][YARN] Remove the env `SPARK_USE_CONC_INCR_GC` used to enable CMS GC for Yarn AM - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 11:36:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39638: [SPARK-42082][SPARK-41598][PYTHON][CONNECT] Introduce `PySparkValueError` and `PySparkTypeError` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 12:36:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39638: [SPARK-42082][SPARK-41598][PYTHON][CONNECT] Introduce `PySparkValueError` and `PySparkTypeError` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/21 12:36:31 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39190: [SPARK-41683][CORE] Fix issue of getting incorrect property numActiveStages in jobs API - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/21 15:28:06 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39654: [MINOR][SHUFFLE] Include IOException in warning log of finalizeShuffleMerge - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/21 15:29:15 UTC, 1 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39688: [SPARK-42146][CORE] Refactor `Utils#setStringField` to make maven build pass when sql module use this method - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/21 15:31:48 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39688: [SPARK-42146][CORE] Refactor `Utils#setStringField` to make maven build pass when sql module use this method - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 15:35:54 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39679: [SPARK-42137][CORE] Enable `spark.kryo.unsafe` by default - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 15:59:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39690: [SPARK-42150][K8S][DOCS] Upgrade Volcano to 1.7.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 16:08:55 UTC, 0 replies.
- [GitHub] [spark] wangyum opened a new pull request, #39691: [SPARK-31561][SQL] Add QUALIFY clause - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/01/21 16:14:15 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on pull request #39691: [SPARK-31561][SQL] Add QUALIFY clause - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/01/21 16:15:47 UTC, 1 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #39692: [SPARK-41629][CONNECT][FOLLOW] Enable access to SparkSession from Plugin - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/01/21 16:18:02 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #39692: [SPARK-41629][CONNECT][FOLLOW] Enable access to SparkSession from Plugin - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/01/21 16:53:22 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39693: [SPARK-41712][PYTHON][CONNECT] Migrate the Spark Connect errors into PySpark error framework. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/21 17:49:51 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39693: [SPARK-41712][PYTHON][CONNECT] Migrate the Spark Connect errors into PySpark error framework. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/21 18:00:20 UTC, 2 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39694: [SPARK-42152][BUILD] Use `_` instead of `-` in `shadedPattern` for relocation package name - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/21 18:43:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39689: [SPARK-42148][K8S][BUILD] Upgrade `kubernetes-client` to 6.4.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 20:04:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39689: [SPARK-42148][K8S][BUILD] Upgrade `kubernetes-client` to 6.4.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 20:07:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39685: [SPARK-42142][UI] Handle null string values in CachedQuantile/ExecutorSummary/PoolData - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 21:29:05 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39685: [SPARK-42142][UI] Handle null string values in CachedQuantile/ExecutorSummary/PoolData - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 21:29:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39686: [SPARK-42143][UI] Handle null string values in RDDStorageInfo/RDDDataDistribution/RDDPartitionInfo - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 21:29:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39686: [SPARK-42143][UI] Handle null string values in RDDStorageInfo/RDDDataDistribution/RDDPartitionInfo - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 21:30:14 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39690: [SPARK-42150][K8S][DOCS] Upgrade `Volcano` to 1.7.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 21:30:52 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39654: [MINOR][SHUFFLE] Include IOException in warning log of finalizeShuffleMerge - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 21:33:03 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39668: [WIP] Test 3.4.0 tagging - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/21 21:34:28 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39654: [MINOR][SHUFFLE] Include IOException in warning log of finalizeShuffleMerge - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/21 21:56:48 UTC, 0 replies.
- [GitHub] [spark] grundprinzip opened a new pull request, #39695: [SPARK-XXXX] SparkConnectClient supports RetryPolicies now - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/01/21 22:07:41 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39684: [SPARK-42140][CORE] Handle null string values in ApplicationEnvironmentInfoWrapper/ApplicationInfoWrapper - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/21 23:33:27 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39684: [SPARK-42140][CORE] Handle null string values in ApplicationEnvironmentInfoWrapper/ApplicationInfoWrapper - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/21 23:34:02 UTC, 0 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39696: [SPARK-42153][UI] Handle null string values in PairStrings/RDDOperationNode/RDDOperationClusterWrapper - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/22 00:21:39 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39696: [SPARK-42153][UI] Handle null string values in PairStrings/RDDOperationNode/RDDOperationClusterWrapper - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/22 00:21:53 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39690: [SPARK-42150][K8S][DOCS] Upgrade `Volcano` to 1.7.0 - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/22 00:25:12 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39690: [SPARK-42150][K8S][DOCS] Upgrade `Volcano` to 1.7.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/22 00:26:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39690: [SPARK-42150][K8S][DOCS] Upgrade `Volcano` to 1.7.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/22 00:35:12 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39697: [SPARK-42154][K8S][TESTS] Enable Volcano unit tests and integration tests in GitHub Action - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/22 00:50:58 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39692: [SPARK-41629][CONNECT][FOLLOW] Enable access to SparkSession from Plugin - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/22 01:53:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39692: [SPARK-41629][CONNECT][FOLLOW] Enable access to SparkSession from Plugin - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/22 01:53:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39585: [SPARK-42124][PYTHON][CONNECT] Scalar Inline Python UDF in Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/22 02:15:38 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39698: [SPARK-41283][CONNECT][PYTHON] Add `array_append` to Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/22 02:36:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39692: [SPARK-41629][CONNECT][FOLLOW] Enable access to SparkSession from Plugin - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/22 02:55:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39585: [SPARK-42124][PYTHON][CONNECT] Scalar Inline Python UDF in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/22 02:56:48 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39699: [SPARK-41772][CONNECT][PYTHON] Fix incorrect column name in `withField`'s doctest - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/22 03:14:04 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39688: [SPARK-42146][CORE] Refactor `Utils#setStringField` to make maven build pass when sql module use this method - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/22 03:45:04 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39688: [SPARK-42146][CORE] Refactor `Utils#setStringField` to make maven build pass when sql module use this method - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/22 03:45:42 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39190: [SPARK-41683][CORE] Fix issue of getting incorrect property numActiveStages in jobs API - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/22 03:54:32 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39674: [SPARK-42149][YARN] Remove the env `SPARK_USE_CONC_INCR_GC` used to enable CMS GC for Yarn AM - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/22 04:01:34 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #39674: [SPARK-42149][YARN] Remove the env `SPARK_USE_CONC_INCR_GC` used to enable CMS GC for Yarn AM - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/22 04:03:18 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39674: [SPARK-42149][YARN] Remove the env `SPARK_USE_CONC_INCR_GC` used to enable CMS GC for Yarn AM - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/22 04:04:25 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39696: [SPARK-42153][UI] Handle null string values in PairStrings/RDDOperationNode/RDDOperationClusterWrapper - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/22 04:09:34 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39698: [SPARK-41283][CONNECT][PYTHON] Add `array_append` to Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/22 09:50:46 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39698: [SPARK-41283][CONNECT][PYTHON] Add `array_append` to Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/22 09:51:09 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng closed pull request #39699: [SPARK-41772][CONNECT][PYTHON] Fix incorrect column name in `withField`'s doctest - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/22 09:53:56 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39699: [SPARK-41772][CONNECT][PYTHON] Fix incorrect column name in `withField`'s doctest - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/22 09:55:01 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39693: [SPARK-41712][PYTHON][CONNECT] Migrate the Spark Connect errors into PySpark error framework. - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/22 10:18:37 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39693: [SPARK-41712][PYTHON][CONNECT] Migrate the Spark Connect errors into PySpark error framework. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/22 10:50:31 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #39690: [SPARK-42150][K8S][DOCS] Upgrade `Volcano` to 1.7.0 - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/01/22 11:47:05 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39695: [SPARK-XXXX] SparkConnectClient supports RetryPolicies now - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/22 12:31:52 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39700: [SPARK-41490][SQL] Assign name to _LEGACY_ERROR_TEMP_2441 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/22 17:15:04 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #39695: [SPARK-42156] SparkConnectClient supports RetryPolicies now - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/01/22 18:05:35 UTC, 1 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39693: [SPARK-41712][PYTHON][CONNECT] Migrate the Spark Connect errors into PySpark error framework. - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/01/22 18:10:14 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39701: [SPARK-41489][SQL] Assign name to _LEGACY_ERROR_TEMP_2415 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/22 18:38:27 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39701: [SPARK-41489][SQL] Assign name to _LEGACY_ERROR_TEMP_2415 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/22 18:39:00 UTC, 1 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39687: [SPARK-41470][SQL] Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/22 18:49:23 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/22 20:06:51 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39697: [SPARK-42154][K8S][TESTS] Enable `Volcano` unit and integration tests in GitHub Action - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/22 20:11:56 UTC, 3 replies.
- [GitHub] [spark] Kimahriman commented on pull request #37616: [SPARK-40178][PYTHON][SQL] Fix partitioning hint parameters in PySpark - posted by "Kimahriman (via GitHub)" <gi...@apache.org> on 2023/01/22 20:18:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #38428: [SPARK-40912][CORE]Overhead of Exceptions in KryoDeserializationStream - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/22 20:21:35 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39702: [SPARK-41487][SQL] Assign name to _LEGACY_ERROR_TEMP_1020 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/22 20:25:34 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/22 20:45:19 UTC, 4 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39695: [SPARK-42156] SparkConnectClient supports RetryPolicies now - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/22 21:36:58 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39682: [SPARK-42139][CORE][SQL] Handle null string values in SQLExecutionUIData/SparkPlanGraphWrapper/SQLPlanMetric - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/22 21:48:27 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39682: [SPARK-42139][CORE][SQL] Handle null string values in SQLExecutionUIData/SparkPlanGraphWrapper/SQLPlanMetric - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/22 21:48:50 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39683: [SPARK-42144][CORE][SQL] Handle null string values in StageDataWrapper/StreamBlockData/StreamingQueryData - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/22 21:49:14 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39683: [SPARK-42144][CORE][SQL] Handle null string values in StageDataWrapper/StreamBlockData/StreamingQueryData - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/22 21:49:33 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39642: [SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for `StreamingQueryProgressWrapper` - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/22 21:51:34 UTC, 4 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39691: [SPARK-31561][SQL] Add QUALIFY clause - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/22 21:52:19 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39555: [SPARK-42051][SQL] Codegen Support for HiveGenericUDF - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/22 22:02:05 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #39660: [SPARK-42128][SQL] Support TOP (N) for MS SQL Server dialect as an alternative to Limit pushdown - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/01/22 22:17:08 UTC, 9 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/01/22 22:22:27 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39703: [SPARK-42157][CORE] spark.scheduler.mode=FAIR should provide FAIR scheduler - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/22 22:47:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39703: [SPARK-42157][CORE] `spark.scheduler.mode=FAIR` should provide FAIR scheduler - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/22 22:57:06 UTC, 5 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39681: [SPARK-18011] Fix SparkR NA date serialization - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/22 23:02:36 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39678: [SPARK-16484][SQL] Add HyperLogLogPlusPlus sketch generator/evaluator/aggregator - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/22 23:02:39 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39673: [SPARK-42132][SQL] Deduplicate attributes in groupByKey.cogroup - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/22 23:02:43 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39672: [SPARK-42133] Add basic Dataset API methods to Spark Connect Scala Client - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/22 23:02:48 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39697: [SPARK-42154][K8S][TESTS] Enable `Volcano` unit and integration tests in GitHub Action - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/01/22 23:14:43 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39697: [SPARK-42154][K8S][TESTS] Enable `Volcano` unit and integration tests in GitHub Action - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/22 23:28:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39697: [SPARK-42154][K8S][TESTS] Enable `Volcano` unit and integration tests in GitHub Action - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/22 23:29:37 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39660: [SPARK-42128][SQL] Support TOP (N) for MS SQL Server dialect as an alternative to Limit pushdown - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/23 01:08:15 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39657: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/23 01:08:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39704: [MINOR][DOCS] Add all supported resource managers in `Scheduling Within an Application` section - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/23 02:28:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39694: [SPARK-42152][BUILD][CORE][SQL][PYTHON][PROTOBUF] Use `_` instead of `-` for relocation package name - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/23 02:56:23 UTC, 2 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #39691: [SPARK-31561][SQL] Add QUALIFY clause - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/01/23 03:58:10 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39642: [SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for `StreamingQueryProgressWrapper` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/23 04:06:16 UTC, 8 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39695: [SPARK-42156] SparkConnectClient supports RetryPolicies now - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/01/23 04:30:04 UTC, 14 replies.
- [GitHub] [spark] itholic commented on pull request #39700: [SPARK-41490][SQL] Assign name to _LEGACY_ERROR_TEMP_2441 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/23 04:42:32 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39702: [SPARK-41487][SQL] Assign name to _LEGACY_ERROR_TEMP_1020 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/23 04:42:47 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39701: [SPARK-41489][SQL] Assign name to _LEGACY_ERROR_TEMP_2415 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/23 04:42:52 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39703: [SPARK-42157][CORE] `spark.scheduler.mode=FAIR` should provide FAIR scheduler - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/23 05:35:15 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39705: [SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/23 05:43:04 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39705: [SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/23 05:50:56 UTC, 0 replies.
- [GitHub] [spark] imhunterand commented on pull request #39566: Patched()Fix Protobuf Java vulnerable to Uncontrolled Resource Consumption - posted by "imhunterand (via GitHub)" <gi...@apache.org> on 2023/01/23 06:09:45 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39705: [SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 (and 1177) - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/23 06:19:53 UTC, 1 replies.
- [GitHub] [spark] purple-dude commented on pull request #30889: [SPARK-33398] Fix loading tree models prior to Spark 3.0 - posted by "purple-dude (via GitHub)" <gi...@apache.org> on 2023/01/23 06:41:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39555: [SPARK-42051][SQL] Codegen Support for HiveGenericUDF - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/23 06:45:18 UTC, 1 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39508: [SPARK-41985][SQL] Centralize more column resolution rules - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/01/23 06:53:48 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #39703: [SPARK-42157][CORE] `spark.scheduler.mode=FAIR` should provide FAIR scheduler - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/23 06:54:44 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39703: [SPARK-42157][CORE] `spark.scheduler.mode=FAIR` should provide FAIR scheduler - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/23 06:57:01 UTC, 8 replies.
- [GitHub] [spark] itholic opened a new pull request, #39706: [SPARK-42158][SQL] Integrate `_LEGACY_ERROR_TEMP_1003` into `FIELD_NOT_FOUND` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/23 07:30:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39369: [SPARK-41775][PYTHON][ML] Adding support for PyTorch functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/23 08:06:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39369: [SPARK-41775][PYTHON][ML] Adding support for PyTorch functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/23 08:07:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39694: [SPARK-42152][BUILD][CORE][SQL][PYTHON][PROTOBUF] Use `_` instead of `-` for relocation package name - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/23 08:09:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39694: [SPARK-42152][BUILD][CORE][SQL][PYTHON][PROTOBUF] Use `_` instead of `-` for relocation package name - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/23 08:09:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39693: [SPARK-41712][PYTHON][CONNECT] Migrate the Spark Connect errors into PySpark error framework. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/23 08:11:23 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39693: [SPARK-41712][PYTHON][CONNECT] Migrate the Spark Connect errors into PySpark error framework. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/23 08:13:09 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39505: [SPARK-41979][SQL] Add missing dots for error messages in error classes. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/23 08:16:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39505: [SPARK-41979][SQL] Add missing dots for error messages in error classes. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/23 08:17:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39543: [SPARK-42044][SQL] Fix incorrect error message for `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/23 08:17:17 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39706: [SPARK-42158][SQL] Integrate `_LEGACY_ERROR_TEMP_1003` into `FIELD_NOT_FOUND` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/23 10:25:24 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39706: [SPARK-42158][SQL] Integrate `_LEGACY_ERROR_TEMP_1003` into `FIELD_NOT_FOUND` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/23 11:39:58 UTC, 2 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39466: [SPARK-41948][SQL] Fix NPE for error classes: CANNOT_PARSE_JSON_FIELD - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/23 12:14:15 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39466: [SPARK-41948][SQL] Fix NPE for error classes: CANNOT_PARSE_JSON_FIELD - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/23 12:16:12 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39707: [SPARK-42161][BUILD] Upgrade arrow to 11.0.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/23 12:26:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39707: [WIP][SPARK-42161][BUILD] Upgrade arrow to 11.0.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/23 12:27:05 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39702: [SPARK-41487][SQL] Assign name to _LEGACY_ERROR_TEMP_1020 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/23 12:32:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39707: [WIP][SPARK-42161][BUILD] Upgrade arrow to 11.0.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/23 13:09:43 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/23 13:13:06 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39695: [SPARK-42156] SparkConnectClient supports RetryPolicies now - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/23 13:37:38 UTC, 9 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39701: [SPARK-41489][SQL] Assign name to _LEGACY_ERROR_TEMP_2415 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/23 14:02:25 UTC, 1 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39701: [SPARK-41489][SQL] Assign name to _LEGACY_ERROR_TEMP_2415 - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/01/23 14:46:09 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39705: [SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 (and 1177) - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/01/23 14:52:40 UTC, 2 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #39672: [SPARK-42133] Add basic Dataset API methods to Spark Connect Scala Client - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/01/23 15:17:57 UTC, 0 replies.
- [GitHub] [spark] tgravescs commented on pull request #39703: [SPARK-42157][CORE] `spark.scheduler.mode=FAIR` should provide FAIR scheduler - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2023/01/23 15:29:27 UTC, 0 replies.
- [GitHub] [spark] pboulos commented on a diff in pull request #39239: [SPARK-41730][PYTHON] Set tz to UTC while converting of timestamps to python's datetime - posted by "pboulos (via GitHub)" <gi...@apache.org> on 2023/01/23 15:38:04 UTC, 0 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #39672: [SPARK-42133] Add basic Dataset API methods to Spark Connect Scala Client - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/01/23 16:11:58 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #39687: [SPARK-41470][SQL] Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/01/23 16:35:26 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #39449: [SPARK-40688][SQL] Support data masking built-in function 'mask_first_n' - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/23 17:06:46 UTC, 2 replies.
- [GitHub] [spark] rangadi commented on pull request #39694: [SPARK-42152][BUILD][CORE][SQL][PYTHON][PROTOBUF] Use `_` instead of `-` for relocation package name - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2023/01/23 17:40:59 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #39449: [SPARK-40688][SQL] Support data masking built-in function 'mask_first_n' - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/23 17:41:47 UTC, 1 replies.
- [GitHub] [spark] robert3005 commented on pull request #38209: [SPARK-21195][CORE] Dynamically register metrics from sources as they are reported - posted by "robert3005 (via GitHub)" <gi...@apache.org> on 2023/01/23 18:07:00 UTC, 0 replies.
- [GitHub] [spark] sunchao opened a new pull request, #39708: [SPARK-41413][FOLLOWUP][SQL] More test coverage in KeyGroupedPartitioningSuite - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/01/23 18:43:24 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 opened a new pull request, #39709: [SPARK-42090][3.3] Introduce sasl retry count in RetryingBlockTransferor - posted by "akpatnam25 (via GitHub)" <gi...@apache.org> on 2023/01/23 18:54:03 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 commented on pull request #39709: [SPARK-42090][3.3] Introduce sasl retry count in RetryingBlockTransferor - posted by "akpatnam25 (via GitHub)" <gi...@apache.org> on 2023/01/23 18:54:26 UTC, 2 replies.
- [GitHub] [spark] akpatnam25 opened a new pull request, #39710: [SPARK-42090][3.2] Introduce sasl retry count in RetryingBlockTransferor - posted by "akpatnam25 (via GitHub)" <gi...@apache.org> on 2023/01/23 18:58:50 UTC, 0 replies.
- [GitHub] [spark] akpatnam25 commented on pull request #39710: [SPARK-42090][3.2] Introduce sasl retry count in RetryingBlockTransferor - posted by "akpatnam25 (via GitHub)" <gi...@apache.org> on 2023/01/23 18:59:20 UTC, 1 replies.
- [GitHub] [spark] allisonwang-db commented on pull request #39656: [SPARK-42119][SQL] Add built-in table-valued functions inline and inline_outer - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2023/01/23 19:22:46 UTC, 0 replies.
- [GitHub] [spark] RunyaoChen opened a new pull request, #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "RunyaoChen (via GitHub)" <gi...@apache.org> on 2023/01/23 19:42:46 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39694: [SPARK-42152][BUILD][CORE][SQL][PYTHON][PROTOBUF] Use `_` instead of `-` for relocation package name - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/23 20:03:36 UTC, 0 replies.
- [GitHub] [spark] SandishKumarHN commented on pull request #39694: [SPARK-42152][BUILD][CORE][SQL][PYTHON][PROTOBUF] Use `_` instead of `-` for relocation package name - posted by "SandishKumarHN (via GitHub)" <gi...@apache.org> on 2023/01/23 21:35:19 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #39708: [SPARK-41413][FOLLOWUP][SQL] More test coverage in KeyGroupedPartitioningSuite - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/01/23 21:54:15 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39585: [SPARK-42124][PYTHON][CONNECT] Scalar Inline Python UDF in Spark Connect - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/23 23:52:21 UTC, 1 replies.
- [GitHub] [spark] rithwik-db closed pull request #39629: [SPARK-42103][PYTHON][ML] Added Instrumentation for PyTorch Distributor - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/01/24 00:59:41 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #39712: [TODO][Connect] Scala Client Mima Compatibility Tests - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/01/24 01:40:23 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #39672: [SPARK-42133] Add basic Dataset API methods to Spark Connect Scala Client - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/01/24 02:10:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39713: [SPARK-42164][CORE] Register partitioned-table-related classes to KryoSerializer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 02:16:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39713: [SPARK-42164][CORE] Register partitioned-table-related classes to KryoSerializer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 02:28:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39700: [SPARK-41490][SQL] Assign name to _LEGACY_ERROR_TEMP_2441 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/24 03:46:23 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39713: [SPARK-42164][CORE] Register partitioned-table-related classes to KryoSerializer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 04:05:59 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39713: [SPARK-42164][CORE] Register partitioned-table-related classes to KryoSerializer - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 05:39:39 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39703: [SPARK-42157][CORE] `spark.scheduler.mode=FAIR` should provide FAIR scheduler - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 07:48:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39708: [SPARK-41413][FOLLOWUP][SQL][TESTS] More test coverage in KeyGroupedPartitioningSuite - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 07:51:12 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39708: [SPARK-41413][FOLLOWUP][SQL][TESTS] More test coverage in KeyGroupedPartitioningSuite - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 07:51:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39704: [MINOR][K8S][DOCS] Add all resource managers in `Scheduling Within an Application` section - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 07:52:45 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39704: [MINOR][K8S][DOCS] Add all resource managers in `Scheduling Within an Application` section - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 08:06:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #38518: [SPARK-33349][K8S] Reset the executor pods watcher when we receive a version changed from k8s - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 08:15:20 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39705: [SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 (and 1177) - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/24 08:21:59 UTC, 6 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39714: [SPARK-42166][K8S] Make `docker-image-tool.sh` usage message up-to-date - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 08:42:35 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39714: [SPARK-42166][K8S] Make `docker-image-tool.sh` usage message up-to-date - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 08:46:09 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39369: [SPARK-41775][PYTHON][ML] Adding support for PyTorch functions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 09:09:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39715: [SPARK-41775][PYTHON][FOLLOWU] Use pyspark.cloudpickle - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 09:12:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39637: [SPARK-41777][PYSPARK][ML] Integration testing for TorchDistributor - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 09:25:19 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39716: [SPARK-42167][INFRA] Improve GitHub Action job to stop on failures earlier - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 09:51:32 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39716: [SPARK-42167][INFRA] Improve GitHub Action `lint` job to stop on failures earlier - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 10:07:54 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39716: [SPARK-42167][INFRA] Improve GitHub Action `lint` job to stop on failures earlier - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/24 10:12:05 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39369: [SPARK-41775][PYTHON][ML] Adding support for PyTorch functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/24 10:15:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39715: [SPARK-41775][PYTHON][FOLLOWUP] Use `pyspark.cloudpickle` instead of `cloudpickle` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 10:20:58 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39714: [SPARK-42166][K8S] Make `docker-image-tool.sh` usage message up-to-date - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 10:23:37 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39715: [SPARK-41775][PYTHON][FOLLOWUP] Use `pyspark.cloudpickle` instead of `cloudpickle` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 10:26:30 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #39714: [SPARK-42166][K8S] Make `docker-image-tool.sh` usage message up-to-date - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/01/24 12:37:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39716: [SPARK-42167][INFRA] Improve GitHub Action `lint` job to stop on failures earlier - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/24 13:04:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39681: [SPARK-18011] Fix SparkR NA date serialization - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/24 13:06:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39681: [SPARK-18011] Fix SparkR NA date serialization - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/24 13:06:45 UTC, 0 replies.
- [GitHub] [spark] yeachan153 commented on pull request #38518: [SPARK-33349][K8S] Reset the executor pods watcher when we receive a version changed from k8s - posted by "yeachan153 (via GitHub)" <gi...@apache.org> on 2023/01/24 13:36:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39543: [SPARK-42044][SQL] Fix incorrect error message for `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/24 14:02:18 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39717: [SPARK-42168][3.3][SQL][PYTHON] Fix required child distribution of FlatMapCoGroupsInPandas (as in CoGroup) - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/01/24 14:05:31 UTC, 0 replies.
- [GitHub] [spark] cashmand opened a new pull request, #39718: [SPARK-42163] Fix schema pruning for non-foldable array index or map key - posted by "cashmand (via GitHub)" <gi...@apache.org> on 2023/01/24 15:03:05 UTC, 0 replies.
- [GitHub] [spark] cashmand commented on pull request #39718: [SPARK-42163] Fix schema pruning for non-foldable array index or map key - posted by "cashmand (via GitHub)" <gi...@apache.org> on 2023/01/24 15:03:46 UTC, 0 replies.
- [GitHub] [spark] NarekDW closed pull request #39097: [SPARK-42169] Implement code generation for to_csv function (StructsToCsv) - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/24 15:07:39 UTC, 0 replies.
- [GitHub] [spark] NarekDW opened a new pull request, #39719: [SPARK-42169] Implement code generation for to_csv function (StructsToCsv) - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/24 15:12:19 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on pull request #39097: [SPARK-42169] Implement code generation for to_csv function (StructsToCsv) - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/24 15:15:55 UTC, 0 replies.
- [GitHub] [spark] ocworld commented on pull request #32397: [WIP][SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode - posted by "ocworld (via GitHub)" <gi...@apache.org> on 2023/01/24 15:23:57 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39712: [TODO][Connect] Scala Client Mima Compatibility Tests - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/24 16:00:39 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/24 16:00:44 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39710: [SPARK-42090][3.2] Introduce sasl retry count in RetryingBlockTransferor - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/24 16:00:49 UTC, 0 replies.
- [GitHub] [spark] AmplabJenkins commented on pull request #39709: [SPARK-42090][3.3] Introduce sasl retry count in RetryingBlockTransferor - posted by "AmplabJenkins (via GitHub)" <gi...@apache.org> on 2023/01/24 16:00:53 UTC, 0 replies.
- [GitHub] [spark] NarekDW opened a new pull request, #39720: [SPARK-41500] [SQL] Year/Month Interval operations bug fix - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/24 16:37:33 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39566: Patched()Fix Protobuf Java vulnerable to Uncontrolled Resource Consumption - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/24 17:26:57 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39381: [SPARK-41554] fix changing of Decimal scale when scale decreased by m… - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/24 17:31:35 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39716: [SPARK-42167][INFRA] Improve GitHub Action `lint` job to stop on failures earlier - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 17:56:27 UTC, 0 replies.
- [GitHub] [spark] RunyaoChen commented on pull request #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "RunyaoChen (via GitHub)" <gi...@apache.org> on 2023/01/24 18:00:49 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39721: [SPARK-42171][PYSPARK] Enable `pyspark-errors` module test in GitHub Action - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 18:16:11 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39710: [SPARK-42090][3.2] Introduce sasl retry count in RetryingBlockTransferor - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/24 18:22:12 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #39710: [SPARK-42090][3.2] Introduce sasl retry count in RetryingBlockTransferor - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/24 18:22:13 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39709: [SPARK-42090][3.3] Introduce sasl retry count in RetryingBlockTransferor - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/24 18:23:52 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #39709: [SPARK-42090][3.3] Introduce sasl retry count in RetryingBlockTransferor - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/24 18:23:53 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39720: [SPARK-41500] [SQL] Year/Month Interval operations bug fix - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/24 18:26:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39387: [SPARK-41586][PYTHON] Introduce `pyspark.errors` and error classes for PySpark. - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 18:27:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39721: [SPARK-42171][PYSPARK] Enable `pyspark-errors` module test in GitHub Action - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 18:30:30 UTC, 1 replies.
- [GitHub] [spark] db-scnakandala opened a new pull request, #39722: [SPARK-42162] - posted by "db-scnakandala (via GitHub)" <gi...@apache.org> on 2023/01/24 18:47:07 UTC, 0 replies.
- [GitHub] [spark] NarekDW opened a new pull request, #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1020 - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/24 19:11:54 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/24 19:12:21 UTC, 0 replies.
- [GitHub] [spark] rithwik-db opened a new pull request, #39724: [SPARK-41775][PYTHON][FOLLOWUP] Fix stdout rerouting - posted by "rithwik-db (via GitHub)" <gi...@apache.org> on 2023/01/24 19:40:46 UTC, 0 replies.
- [GitHub] [spark] jbguerraz commented on pull request #32397: [WIP][SPARK-35084][CORE] Spark 3: supporting "--packages" in k8s cluster mode - posted by "jbguerraz (via GitHub)" <gi...@apache.org> on 2023/01/24 19:50:33 UTC, 0 replies.
- [GitHub] [spark] rmcyang opened a new pull request, #39725: [SPARK-33573][FOLLOW-UP] Increment ignoredBlockBytes when shuffle push blocks are late or colliding - posted by "rmcyang (via GitHub)" <gi...@apache.org> on 2023/01/24 20:14:33 UTC, 0 replies.
- [GitHub] [spark] rmcyang commented on pull request #39725: [SPARK-33573][FOLLOW-UP] Increment ignoredBlockBytes when shuffle push blocks are late or colliding - posted by "rmcyang (via GitHub)" <gi...@apache.org> on 2023/01/24 20:16:04 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39721: [SPARK-42171][PYSPARK][TESTS] Fix `pyspark-errors` module and enable it in GitHub Action - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 20:55:13 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on pull request #39657: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/24 20:59:46 UTC, 0 replies.
- [GitHub] [spark] dtenedor closed pull request #39657: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/24 20:59:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39721: [SPARK-42171][PYSPARK][TESTS] Fix `pyspark-errors` module and enable it in GitHub Action - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 21:08:47 UTC, 0 replies.
- [GitHub] [spark] dtenedor opened a new pull request, #39726: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output #39657 - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/24 21:10:44 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #39726: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output #39657 - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/24 21:14:43 UTC, 0 replies.
- [GitHub] [spark] otterc commented on a diff in pull request #39725: [SPARK-33573][FOLLOW-UP] Increment ignoredBlockBytes when shuffle push blocks are late or colliding - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/01/24 21:37:23 UTC, 7 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #39449: [SPARK-40688][SQL] Support data masking built-in function 'mask_first_n' - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/01/24 21:37:53 UTC, 11 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39726: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output #39657 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 21:51:09 UTC, 9 replies.
- [GitHub] [spark] rmcyang commented on a diff in pull request #39725: [SPARK-33573][FOLLOW-UP] Increment ignoredBlockBytes when shuffle push blocks are late or colliding - posted by "rmcyang (via GitHub)" <gi...@apache.org> on 2023/01/24 22:26:40 UTC, 6 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39727: Use scikit-learn instead of sklearn - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/24 22:49:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39721: [SPARK-42171][PYSPARK][TESTS] Fix `pyspark-errors` module and enable it in GitHub Action - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 00:05:23 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39375: [SPARK-36124][SQL] Support subqueries with correlation through UNION - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 00:12:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39375: [SPARK-36124][SQL] Support subqueries with correlation through UNION - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 00:12:49 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39724: [SPARK-41775][PYTHON][FOLLOWUP] Fix stdout rerouting - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 00:13:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39724: [SPARK-41775][PYTHON][FOLLOWUP] Fix stdout rerouting - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 00:13:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 00:23:50 UTC, 3 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39717: [SPARK-42168][3.3][SQL][PYTHON] Fix required child distribution of FlatMapCoGroupsInPandas (as in CoGroup) - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 00:27:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39727: [SPARK-42174][PYTHON][INFRA] Use `scikit-learn` instead of `sklearn` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 00:29:22 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39727: [SPARK-42174][PYTHON][INFRA] Use `scikit-learn` instead of `sklearn` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 00:29:37 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on a diff in pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/01/25 00:30:01 UTC, 10 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39656: [SPARK-42119][SQL] Add built-in table-valued functions inline and inline_outer - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 00:31:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39656: [SPARK-42119][SQL] Add built-in table-valued functions inline and inline_outer - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 00:31:46 UTC, 0 replies.
- [GitHub] [spark] holdenk opened a new pull request, #39728: [SPARK-42173][CORE] RpcAddress equality can fail - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/01/25 00:40:24 UTC, 0 replies.
- [GitHub] [spark] sadikovi opened a new pull request, #39729: [SPARK-42175] Fix cast of a boolean value to timestamp - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/01/25 00:56:08 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #39729: [SPARK-42175][SQL] Fix cast of a boolean value to timestamp - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/01/25 00:57:40 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39727: [SPARK-42174][PYTHON][INFRA] Use `scikit-learn` instead of `sklearn` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 00:58:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39728: [SPARK-42173][CORE] RpcAddress equality can fail - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 01:04:34 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on a diff in pull request #39728: [SPARK-42173][CORE] RpcAddress equality can fail - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/01/25 01:12:38 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39728: [SPARK-42173][CORE] RpcAddress equality can fail - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 01:14:53 UTC, 2 replies.
- [GitHub] [spark] itholic commented on pull request #39721: [SPARK-42171][PYSPARK][TESTS] Fix `pyspark-errors` module and enable it in GitHub Action - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/25 01:19:42 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #39726: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output #39657 - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/25 01:28:59 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39729: [SPARK-42175][SQL] Fix cast of a boolean value to timestamp - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 02:05:02 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on a diff in pull request #39729: [SPARK-42176][SQL] Fix cast of a boolean value to timestamp - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/01/25 02:16:32 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39730: [SPARK-42177][INFRA][3.4] Change master to brach-3.4 in GitHub Actions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 03:22:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39730: [SPARK-42177][INFRA][3.4] Change master to brach-3.4 in GitHub Actions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 03:24:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39731: [SPARK-42177][INFRA][3.4] Change master to brach-3.4 in GitHub Actions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 03:24:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39731: [SPARK-42177][INFRA][3.4] Change master to brach-3.4 in GitHub Actions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 03:25:00 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39729: [SPARK-42176][SQL] Fix cast of a boolean value to timestamp - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 03:32:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39729: [SPARK-42176][SQL] Fix cast of a boolean value to timestamp - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 03:32:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39731: [SPARK-42177][INFRA][3.4] Change master to branch-3.4 in GitHub Actions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 03:37:00 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/25 05:04:24 UTC, 13 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39732: [SPARK-42178][UI] Handle remaining null string values in ui protobuf serializer and add tests - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/25 05:40:18 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39732: [SPARK-42178][UI] Handle remaining null string values in ui protobuf serializer and add tests - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/25 05:40:36 UTC, 1 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39732: [SPARK-42178][UI] Handle remaining null string values in ui protobuf serializer and add tests - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/25 05:41:25 UTC, 3 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #39733: Setting version to 3.5.0-SNAPSHOT - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/25 05:59:15 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39732: [SPARK-42178][UI] Handle remaining null string values in ui protobuf serializer and add tests - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/25 06:04:51 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39734: [SPARK-41812][SPARK-41823][CONNECT][PYTHON] Fix ambiguous columns in Join - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/25 06:32:15 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39734: [SPARK-41812][SPARK-41823][CONNECT][PYTHON] Fix ambiguous columns in Join - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/25 06:33:43 UTC, 2 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39734: [SPARK-41812][SPARK-41823][CONNECT][PYTHON] Fix ambiguous columns issue in Join - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/25 06:45:14 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39702: [SPARK-41487][SQL] Assign name to _LEGACY_ERROR_TEMP_1020 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/25 06:54:49 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39695: [SPARK-42156] SparkConnectClient supports RetryPolicies now - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/25 07:12:15 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39731: [SPARK-42177][INFRA][3.4] Change master to branch-3.4 in GitHub Actions - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 07:12:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39735: [SPARK-42179][BUILD][SQL][3.3] Upgrade ORC to 1.7.8 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 07:34:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39732: [SPARK-42178][UI] Handle remaining null string values in ui protobuf serializer and add tests - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 07:38:02 UTC, 2 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39717: [SPARK-42168][3.2][SQL][PYTHON] Fix required child distribution of FlatMapCoGroupsInPandas (as in CoGroup) - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/01/25 07:38:41 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39735: [SPARK-42179][BUILD][SQL][3.3] Upgrade ORC to 1.7.8 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 07:40:06 UTC, 2 replies.
- [GitHub] [spark] peter-toth commented on pull request #39722: [WIP][SPARK-42162] Introduce MultiAdd expression as a memory optimization for canonicalizing large trees of Add expressions - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/01/25 07:41:24 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39732: [SPARK-42178][UI] Handle remaining null string values in ui protobuf serializer and add tests - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/25 07:43:44 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39700: [SPARK-41490][SQL] Assign name to _LEGACY_ERROR_TEMP_2441 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/25 07:48:33 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39733: Setting version to 3.5.0-SNAPSHOT - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/25 07:57:33 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39733: Setting version to 3.5.0-SNAPSHOT - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 08:03:58 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39736: [SPARK-42180][BUILD] Update Scala to 2.12.17 in `_config.yml` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/25 08:15:05 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39736: [SPARK-42180][BUILD] Update `SCALA_VERSION`in `_config.yml` to 2.12.17 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/25 08:18:40 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39736: [SPARK-42180][BUILD][DOCS] Update `SCALA_VERSION` in `_config.yml` to 2.12.17 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 08:27:54 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39736: [SPARK-42180][BUILD][DOCS] Update `SCALA_VERSION` in `_config.yml` to 2.12.17 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 08:28:17 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39736: [SPARK-42180][BUILD][DOCS] Update `SCALA_VERSION` in `_config.yml` to 2.12.17 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/25 08:28:58 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39737: [SPARK-42181][PYSPARK][TESTS] Skip `torch` tests when `torch` is not installed - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 08:36:33 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39737: [SPARK-42181][PYSPARK][TESTS] Skip `torch` tests when `torch` is not installed - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 08:39:54 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39642: [SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for `StreamingQueryProgressWrapper` - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/25 09:02:28 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39642: [SPARK-41677][CORE][SQL][SS][UI] Add Protobuf serializer for `StreamingQueryProgressWrapper` - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/25 09:02:43 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #39695: [SPARK-42156] SparkConnectClient supports RetryPolicies now - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/25 10:17:54 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39737: [SPARK-42181][PYSPARK][TESTS] Skip `torch` tests when `torch` is not installed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 10:31:00 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/25 10:32:00 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39737: [SPARK-42181][PYTHON][TESTS] Skip `torch` tests when `torch` is not installed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 10:32:05 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/25 10:32:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39737: [SPARK-42181][PYTHON][TESTS] Skip `torch` tests when `torch` is not installed - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 10:36:15 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39738: [SPARK-42182][CONNECT][TESTS] Make `ReusedConnectTestCase` to take Spark configurations - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 10:38:17 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #39739: [WIP] Accept return type in DDL strings for Python User-defined Functions in Spark Connect - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/25 10:40:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39585: [SPARK-42124][PYTHON][CONNECT] Scalar Inline Python UDF in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 10:42:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39585: [SPARK-42124][PYTHON][CONNECT] Scalar Inline Python UDF in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 10:43:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39737: [SPARK-42181][PYTHON][TESTS] Skip `torch` tests when `torch` is not installed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 10:48:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39737: [SPARK-42181][PYTHON][TESTS] Skip `torch` tests when `torch` is not installed - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 10:49:03 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39733: Setting version to 3.5.0-SNAPSHOT - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/25 10:51:12 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39737: [SPARK-42181][PYTHON][TESTS] Skip `torch` tests when `torch` is not installed - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 10:52:43 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #39733: [SPARK-38572][BUILD] Setting version to 3.5.0-SNAPSHOT - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/25 10:54:56 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39740: [SPARK-42183][PYTHON][ML][TESTS] Exclude pyspark.ml.torch.tests in MyPy tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 10:56:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39733: [SPARK-38572][BUILD] Setting version to 3.5.0-SNAPSHOT - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 10:56:36 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39740: [SPARK-42183][PYTHON][ML][TESTS] Exclude pyspark.ml.torch.tests in MyPy tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 10:58:35 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39733: [SPARK-42184][BUILD] Setting version to 3.5.0-SNAPSHOT - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/25 10:58:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39717: [SPARK-42168][3.2][SQL][PYTHON] Fix required child distribution of FlatMapCoGroupsInPandas (as in CoGroup) - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 11:00:45 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39739: [SPARK-42126][PYTHON][CONNECT] Accept return type in DDL strings for Python Scalar UDFs in Spark Connect - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/25 11:12:06 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #39739: [SPARK-42126][PYTHON][CONNECT] Accept return type in DDL strings for Python Scalar UDFs in Spark Connect - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/25 11:14:17 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39731: [SPARK-42177][INFRA][3.4] Change master to branch-3.4 in GitHub Actions - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/25 11:16:24 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39735: [SPARK-42179][BUILD][SQL][3.3] Upgrade ORC to 1.7.8 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 11:25:13 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39741: [SPARK-42185][INFRA] Add `branch-3.4` to publish_snapshot GitHub Action job - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 11:46:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39739: [SPARK-42126][PYTHON][CONNECT] Accept return type in DDL strings for Python Scalar UDFs in Spark Connect - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 11:47:10 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39741: [SPARK-42185][INFRA] Add `branch-3.4` to publish_snapshot GitHub Action job - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 11:48:22 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39741: [SPARK-42185][INFRA] Add `branch-3.4` to publish_snapshot GitHub Action job - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 11:48:46 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39741: [SPARK-42185][INFRA] Add `branch-3.4` to `publish_snapshot` GitHub Action job - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 11:50:20 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39741: [SPARK-42185][INFRA] Add `branch-3.4` to `publish_snapshot` GitHub Action job - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 11:50:37 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39739: [SPARK-42126][PYTHON][CONNECT] Accept return type in DDL strings for Python Scalar UDFs in Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/25 12:01:11 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #39731: [SPARK-42177][INFRA][3.4] Change master to branch-3.4 in GitHub Actions - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/01/25 12:01:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39742: [SPARK-42186][R] Make SparkR able to stop properly when the connection is timed-out - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 12:41:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39731: [SPARK-42177][INFRA][3.4] Change master to branch-3.4 in GitHub Actions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 12:50:00 UTC, 0 replies.
- [GitHub] [spark] tomvanbussel commented on a diff in pull request #39738: [SPARK-42182][CONNECT][TESTS] Make `ReusedConnectTestCase` to take Spark configurations - posted by "tomvanbussel (via GitHub)" <gi...@apache.org> on 2023/01/25 13:21:57 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39743: [SPARK-42187][CONNECT][TESTS] Avoid using RemoteSparkSession.builder.getOrCreate in tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 13:41:06 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39740: [SPARK-42183][PYTHON][ML][TESTS] Exclude pyspark.ml.torch.tests in MyPy tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 13:43:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39733: [SPARK-42184][BUILD] Setting version to 3.5.0-SNAPSHOT - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 13:50:30 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39744: [SPARK-38591][SQL][FOLLOW-UP] Fix ambiguous references for sorted cogroups - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/01/25 14:06:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39733: [SPARK-42184][BUILD] Setting version to 3.5.0-SNAPSHOT - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 16:49:15 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39733: [SPARK-42184][BUILD] Setting version to 3.5.0-SNAPSHOT - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 16:53:43 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39738: [SPARK-42182][CONNECT][TESTS] Make `ReusedConnectTestCase` to take Spark configurations - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 16:59:58 UTC, 0 replies.
- [GitHub] [spark] snmvaughan opened a new pull request, #39745: [SPARK-42188] Force SBT protobuf version to match Maven on branch 3.2 - posted by "snmvaughan (via GitHub)" <gi...@apache.org> on 2023/01/25 17:12:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39742: [SPARK-42186][R] Make SparkR be able to stop properly when the connection is timed-out - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 17:13:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39742: [SPARK-42186][R] Make SparkR be able to stop properly when the connection is timed-out - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 17:13:48 UTC, 0 replies.
- [GitHub] [spark] snmvaughan commented on pull request #39745: [SPARK-42188] Force SBT protobuf version to match Maven on branch 3.2 - posted by "snmvaughan (via GitHub)" <gi...@apache.org> on 2023/01/25 17:24:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39745: [SPARK-42188] Force SBT protobuf version to match Maven on branch 3.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 17:32:38 UTC, 1 replies.
- [GitHub] [spark] snmvaughan opened a new pull request, #39746: [SPARK-42188] Force SBT protobuf version to match Maven on branch 3.3 - posted by "snmvaughan (via GitHub)" <gi...@apache.org> on 2023/01/25 17:35:35 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39725: [SPARK-33573][FOLLOW-UP] Increment ignoredBlockBytes when shuffle push blocks are late or colliding - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/25 17:56:57 UTC, 2 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #39726: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/25 18:02:28 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/01/25 18:04:03 UTC, 3 replies.
- [GitHub] [spark] otterc commented on pull request #39725: [SPARK-33573][FOLLOW-UP] Increment ignoredBlockBytes when shuffle push blocks are late or colliding - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/01/25 18:20:41 UTC, 1 replies.
- [GitHub] [spark] mridulm commented on pull request #39725: [SPARK-33573][FOLLOW-UP] Increment ignoredBlockBytes when shuffle push blocks are late or colliding - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/25 18:25:35 UTC, 2 replies.
- [GitHub] [spark] huaxingao commented on pull request #39746: [SPARK-42188] Force SBT protobuf version to match Maven on branch 3.3 - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/01/25 18:47:05 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39732: [SPARK-42178][UI] Handle remaining null string values in ui protobuf serializer and add tests - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/25 18:54:05 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39744: [SPARK-38591][SQL][FOLLOW-UP] Fix ambiguous references for sorted cogroups - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/01/25 19:02:56 UTC, 3 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #39746: [SPARK-42188] Force SBT protobuf version to match Maven on branch 3.3 - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/01/25 19:26:46 UTC, 0 replies.
- [GitHub] [spark] linhongliu-db commented on pull request #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "linhongliu-db (via GitHub)" <gi...@apache.org> on 2023/01/25 19:34:57 UTC, 0 replies.
- [GitHub] [spark] vinodkc opened a new pull request, #39747: [SPARK-40686][SQL] Support udf 'luhn_check' - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/01/25 19:38:04 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39748: [SPARK-42190][K8S] Support `local` mode in spark.kubernetes.driver.master - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 19:49:10 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on pull request #39747: [SPARK-42191][SQL] Support udf 'luhn_check' - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/01/25 19:54:36 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on a diff in pull request #39747: [SPARK-42191][SQL] Support udf 'luhn_check' - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/25 20:09:53 UTC, 1 replies.
- [GitHub] [spark] dtenedor commented on pull request #39747: [SPARK-42191][SQL] Support udf 'luhn_check' - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/25 20:10:17 UTC, 2 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #39728: [SPARK-42173][CORE] RpcAddress equality can fail - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/01/25 21:24:10 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39746: [SPARK-42188][BUILD][3.3] Force SBT protobuf version to match Maven - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 21:31:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39710: [SPARK-42090][3.2] Introduce sasl retry count in RetryingBlockTransferor - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/25 22:24:14 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39726: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/25 22:32:05 UTC, 1 replies.
- [GitHub] [spark] gengliangwang closed pull request #39726: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/25 22:35:41 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39743: [SPARK-42187][CONNECT][TESTS] Avoid using RemoteSparkSession.builder.getOrCreate in tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 23:26:17 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39738: [SPARK-42182][CONNECT][TESTS] Make `ReusedConnectTestCase` to take Spark configurations - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/25 23:57:59 UTC, 1 replies.
- [GitHub] [spark] allisonport-db commented on pull request #38302: [SPARK-40834][SQL] Use SparkListenerSQLExecutionEnd to track final SQL status in UI - posted by "allisonport-db (via GitHub)" <gi...@apache.org> on 2023/01/26 00:28:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39748: [SPARK-42190][K8S] Support `local` mode in spark.kubernetes.driver.master - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 01:36:31 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39717: [SPARK-42168][3.2][SQL][PYTHON] Fix required child distribution of FlatMapCoGroupsInPandas (as in CoGroup) - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 01:43:41 UTC, 0 replies.
- [GitHub] [spark] viirya commented on a diff in pull request #39748: [SPARK-42190][K8S] Support `local` mode in `spark.kubernetes.driver.master` - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/01/26 01:48:10 UTC, 1 replies.
- [GitHub] [spark] entong commented on a diff in pull request #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "entong (via GitHub)" <gi...@apache.org> on 2023/01/26 01:49:25 UTC, 0 replies.
- [GitHub] [spark] vinodkc commented on a diff in pull request #39747: [SPARK-42191][SQL] Support udf 'luhn_check' - posted by "vinodkc (via GitHub)" <gi...@apache.org> on 2023/01/26 01:54:15 UTC, 17 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39748: [SPARK-42190][K8S] Support `local` mode in `spark.kubernetes.driver.master` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 01:59:56 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39748: [SPARK-42190][K8S] Support `local` mode in `spark.kubernetes.driver.master` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 02:08:22 UTC, 1 replies.
- [GitHub] [spark] viirya commented on pull request #39748: [SPARK-42190][K8S] Support `local` mode in `spark.kubernetes.driver.master` - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/01/26 02:12:18 UTC, 1 replies.
- [GitHub] [spark] huaxingao commented on pull request #39746: [SPARK-42188][BUILD][3.3] Force SBT protobuf version to match Maven - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/01/26 02:20:18 UTC, 0 replies.
- [GitHub] [spark] huaxingao closed pull request #39746: [SPARK-42188][BUILD][3.3] Force SBT protobuf version to match Maven - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/01/26 02:20:19 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #39745: [SPARK-42188][BUILD][3.2] Force SBT protobuf version to match Maven - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/01/26 02:23:59 UTC, 0 replies.
- [GitHub] [spark] huaxingao closed pull request #39745: [SPARK-42188][BUILD][3.2] Force SBT protobuf version to match Maven - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/01/26 02:24:00 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39748: [SPARK-42190][K8S] Support `local` mode in `spark.kubernetes.driver.master` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 02:41:16 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39726: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 03:06:39 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39748: [SPARK-42190][K8S] Support `local` mode in `spark.kubernetes.driver.master` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 03:23:48 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39738: [SPARK-42182][CONNECT][TESTS] Make `ReusedConnectTestCase` to take Spark configurations - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 04:19:50 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39738: [SPARK-42182][CONNECT][TESTS] Make `ReusedConnectTestCase` to take Spark configurations - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 04:20:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39748: [SPARK-42190][K8S] Support `local` mode in `spark.kubernetes.driver.master` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 04:22:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39743: [SPARK-42187][CONNECT][TESTS] Avoid using RemoteSparkSession.builder.getOrCreate in tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 04:24:19 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39743: [SPARK-42187][CONNECT][TESTS] Avoid using RemoteSparkSession.builder.getOrCreate in tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 04:33:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39747: [SPARK-42191][SQL] Support udf 'luhn_check' - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 04:44:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39707: [WIP][SPARK-42161][BUILD] Upgrade Apache Arrow to 11.0.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 04:46:08 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39707: [WIP][SPARK-42161][BUILD] Upgrade Apache Arrow to 11.0.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/26 04:57:08 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39749: [SPARK-42195][INFRA] Add Github action test job for branch-3.4 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/26 05:29:38 UTC, 0 replies.
- [GitHub] [spark] ganeshchand opened a new pull request, #39750: [SPARK-42196][SS] Fix typo - posted by "ganeshchand (via GitHub)" <gi...@apache.org> on 2023/01/26 05:54:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39751: [SPARK-42197][CONNECT] Reuses JVM initialization, and separate configuration groups to set in remote local mode - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 06:10:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39751: [SPARK-42197][CONNECT] Reuses JVM initialization, and separate configuration groups to set in remote local mode - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 06:11:16 UTC, 2 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39749: [SPARK-42195][INFRA] Add Github action test job for branch-3.4 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 06:13:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39749: [SPARK-42195][INFRA] Add Github action test job for branch-3.4 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 06:13:38 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39752: [SPARK-42168][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/01/26 07:23:04 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on pull request #39752: [SPARK-42168][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/01/26 07:25:06 UTC, 5 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39749: [SPARK-42195][INFRA] Add Github action test job for branch-3.4 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 07:30:39 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #39753: [SPARK-42125][CONNECT][PYTHON] Pandas UDF in Spark Connect - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/26 07:32:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39751: [SPARK-42197][CONNECT] Reuses JVM initialization, and separate configuration groups to set in remote local mode - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 07:33:45 UTC, 0 replies.
- [GitHub] [spark] holdenk commented on pull request #39728: [SPARK-42173][CORE] RpcAddress equality can fail - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/01/26 07:41:58 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39751: [SPARK-42197][CONNECT] Reuses JVM initialization, and separate configuration groups to set in remote local mode - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 08:37:11 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39749: [SPARK-42195][INFRA] Add Daily Scala 2.13 Github Action Job for branch-3.4 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/26 09:38:21 UTC, 5 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39749: [SPARK-42195][INFRA] Add Daily Scala 2.13 Github Action Job for branch-3.4 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/26 09:45:58 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39749: [SPARK-42195][INFRA] Add Daily Scala 2.13 Github Action Job for branch-3.4 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 09:52:08 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #39739: [SPARK-42126][PYTHON][CONNECT] Accept return type in DDL strings for Python Scalar UDFs in Spark Connect - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/26 11:15:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39751: [SPARK-42197][CONNECT] Reuses JVM initialization, and separate configuration groups to set in remote local mode - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 11:57:08 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39749: [SPARK-42195][INFRA] Add Daily Scala 2.13 Github Action Job for branch-3.4 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 11:58:03 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39749: [SPARK-42195][INFRA] Add Daily Scala 2.13 Github Action Job for branch-3.4 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 11:58:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39750: [SPARK-42196][SS] Fix typo - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/26 12:04:18 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39754: [SPARK-42199][SQL] Fix issues around Dataset.groupByKey - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/01/26 15:44:27 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39728: [SPARK-42173][CORE] RpcAddress equality can fail - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 17:12:58 UTC, 0 replies.
- [GitHub] [spark] robert3005 opened a new pull request, #39755: [SPARK-21195][CORE] Dynamically register metrics from sources as they are reported - posted by "robert3005 (via GitHub)" <gi...@apache.org> on 2023/01/26 17:20:50 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on pull request #39752: [SPARK-42168][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/01/26 17:23:08 UTC, 2 replies.
- [GitHub] [spark] NarekDW commented on a diff in pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/26 17:31:09 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/26 17:55:44 UTC, 8 replies.
- [GitHub] [spark] zhenlineo commented on pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/01/26 18:00:10 UTC, 7 replies.
- [GitHub] [spark] RunyaoChen commented on a diff in pull request #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "RunyaoChen (via GitHub)" <gi...@apache.org> on 2023/01/26 18:21:54 UTC, 6 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39756: [DON'T MERGE] Enable GA build and test for `connect-client-jvm` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/26 18:23:57 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39756: [DON'T MERGE] Enable GA build and test for `connect-client-jvm` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/26 18:27:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39756: [DON'T MERGE][INFRA] Enable GA build and test for `connect-client-jvm` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 18:42:02 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39756: [DON'T MERGE][INFRA] Enable GA build and test for `connect-client-jvm` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/26 18:44:35 UTC, 0 replies.
- [GitHub] [spark] LucaCanali opened a new pull request, #39757: [SPARK-41585][YARN][DOC] Improve doc of the excludeNodes configuration by clarifying the dependency with dynamic allocation - posted by "LucaCanali (via GitHub)" <gi...@apache.org> on 2023/01/26 18:59:15 UTC, 0 replies.
- [GitHub] [spark] otterc commented on a diff in pull request #39725: [SPARK-33573][FOLLOW-UP] Enhance ignoredBlockBytes in pushMergeMetrics to cover more scenarios - posted by "otterc (via GitHub)" <gi...@apache.org> on 2023/01/26 19:38:25 UTC, 3 replies.
- [GitHub] [spark] mridulm commented on pull request #37479: [SPARK-40045][SQL]Optimize the order of filtering predicates - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/26 19:39:55 UTC, 0 replies.
- [GitHub] [spark] shardulm94 commented on pull request #37479: [SPARK-40045][SQL]Optimize the order of filtering predicates - posted by "shardulm94 (via GitHub)" <gi...@apache.org> on 2023/01/26 19:42:47 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/26 19:45:20 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39758: [SPARK-42201][BUILD] `build/sbt` should allow `SBT_OPTS` to override JVM memory setting - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 20:18:33 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39725: [SPARK-33573][FOLLOW-UP] Enhance ignoredBlockBytes in pushMergeMetrics to cover more scenarios - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/26 20:34:58 UTC, 2 replies.
- [GitHub] [spark] rmcyang commented on a diff in pull request #39725: [SPARK-33573][FOLLOW-UP] Enhance ignoredBlockBytes in pushMergeMetrics to cover more scenarios - posted by "rmcyang (via GitHub)" <gi...@apache.org> on 2023/01/26 20:38:03 UTC, 1 replies.
- [GitHub] [spark] jchen5 opened a new pull request, #39759: [SPARK-36124][SQL] Support subqueries with correlation through INTERSECT/EXCEPT - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/01/26 20:55:14 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on pull request #36165: [SPARK-36620][SHUFFLE] Add Push Based Shuffle client side read metrics - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/01/26 21:25:55 UTC, 0 replies.
- [GitHub] [spark] zhenlineo opened a new pull request, #39760: [SPARK-42202][Connect][Test] Improve the E2E test server stop logic - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/01/26 21:28:17 UTC, 0 replies.
- [GitHub] [spark] zhenlineo closed pull request #39760: [SPARK-42202][Connect][Test] Improve the E2E test server stop logic - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/01/26 21:28:40 UTC, 0 replies.
- [GitHub] [spark] techaddict opened a new pull request, #39761: [SPARK-41757][CONNECT][TESTS][FOLLOW-UP] Enable connect.functions.col doctest - posted by "techaddict (via GitHub)" <gi...@apache.org> on 2023/01/26 22:03:39 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39725: [SPARK-33573][FOLLOW-UP] Enhance ignoredBlockBytes in pushMergeMetrics to cover more scenarios - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/26 22:30:01 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39758: [SPARK-42201][BUILD] `build/sbt` should allow `SBT_OPTS` to override JVM memory setting - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/26 22:59:31 UTC, 3 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39758: [SPARK-42201][BUILD] `build/sbt` should allow `SBT_OPTS` to override JVM memory setting - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 00:39:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39756: [SPARK-42200][INFRA] Enable GA build and test for `connect-client-jvm` module - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/27 02:28:27 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39761: [SPARK-41757][CONNECT][PYTHON][FOLLOW-UP] Enable connect.functions.col doctest - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/27 02:30:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39761: [SPARK-41757][CONNECT][PYTHON][FOLLOW-UP] Enable connect.functions.col doctest - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/27 02:31:23 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39762: [SPARK-42207][INFRA] Update `build_and_test.yml` to use `Ubuntu 22.04` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 02:51:59 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39756: [SPARK-42200][INFRA] Enable GA build and test for `connect-client-jvm` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 03:02:33 UTC, 6 replies.
- [GitHub] [spark] yabola commented on pull request #39687: [SPARK-41470][SQL] Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/01/27 03:35:37 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39762: [SPARK-42207][INFRA] Update `build_and_test.yml` to use `Ubuntu 22.04` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 03:41:24 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39756: [SPARK-42200][INFRA] Enable GA build and test for `connect-client-jvm` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 03:47:22 UTC, 6 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39756: [SPARK-42200][INFRA] Enable GA build and test for `connect-client-jvm` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 03:49:12 UTC, 0 replies.
- [GitHub] [spark] techaddict commented on pull request #39450: [SPARK-41897][CONNECT][TESTS] Enable tests with error mismatch in connect/test_parity_functions.py - posted by "techaddict (via GitHub)" <gi...@apache.org> on 2023/01/27 03:53:03 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39734: [SPARK-41812][SPARK-41823][CONNECT][PYTHON] Fix ambiguous columns issue in Join - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/27 04:39:45 UTC, 1 replies.
- [GitHub] [spark] JoshRosen opened a new pull request, #39763: [WIP][CORE][SPARK-42204] Remove redundant logging of TaskMetrics internal accumulators in event logs - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/01/27 04:54:20 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on pull request #39763: [WIP][CORE][SPARK-42204] Remove redundant logging of TaskMetrics internal accumulators in event logs - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/01/27 04:56:04 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on a diff in pull request #39763: [WIP][CORE][SPARK-42204] Remove redundant logging of TaskMetrics internal accumulators in event logs - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/01/27 05:04:34 UTC, 3 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/27 05:11:21 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39764: [SPARK-41849][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_input_file_name_udf ` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/27 05:13:05 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39739: [SPARK-42126][PYTHON][CONNECT] Accept return type in DDL strings for Python Scalar UDFs in Spark Connect - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/27 05:14:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39756: [SPARK-42200][INFRA] Enable GA build and test for `connect-client-jvm` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 05:17:17 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39762: [SPARK-42207][INFRA] Update `build_and_test.yml` to use `Ubuntu 22.04` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 05:19:14 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39756: [SPARK-42200][INFRA] Put `connect-client-jvm` into separate test groups and xxx - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 05:24:39 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39756: [SPARK-42200][INFRA] Put `connect-client-jvm` into separate test groups and xxx - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 05:26:00 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39765: [SPARK-41830][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_sample` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/27 05:38:56 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/27 05:46:13 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39766: [SPARK-41875][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_to` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/27 05:48:17 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on a diff in pull request #39765: [SPARK-41830][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_sample` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/27 05:49:53 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39751: [SPARK-42197][CONNECT] Reuses JVM initialization, and separate configuration groups to set in remote local mode - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/27 05:54:37 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #39756: [SPARK-42200][INFRA] Put `connect-client-jvm` into separate test groups and xxx - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 06:22:41 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39706: [SPARK-42158][SQL] Integrate `_LEGACY_ERROR_TEMP_1003` into `FIELD_NOT_FOUND` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/27 07:41:03 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39753: [SPARK-42125][CONNECT][PYTHON] Pandas UDF in Spark Connect - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/27 07:46:14 UTC, 3 replies.
- [GitHub] [spark] JoshRosen opened a new pull request, #39767: [SPARK-42205][CORE] Don't log accumulator values in stage / task start event logs - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/01/27 07:51:05 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39768: [SPARK-42209][CORE][CONNECT] Introduce test tag `ExtendedConnectTest` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 07:54:36 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39753: [SPARK-42125][CONNECT][PYTHON] Pandas UDF in Spark Connect - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/01/27 08:34:47 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39769: [SPARK-42190][K8S][FOLLOWUP] Fix to use the user-given number of threads - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 08:39:55 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39769: [SPARK-42190][K8S][FOLLOWUP] Fix to use the user-given number of threads - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 08:40:43 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39769: [SPARK-42190][K8S][FOLLOWUP] Fix to use the user-given number of threads - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 08:43:38 UTC, 0 replies.
- [GitHub] [spark] JoshRosen opened a new pull request, #39770: [WIP][SPARK-42206][CORE] Omit "Task Executor Metrics" field in eventlogs if values are all zero - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/01/27 08:49:28 UTC, 0 replies.
- [GitHub] [spark] JoshRosen commented on a diff in pull request #39770: [WIP][SPARK-42206][CORE] Omit "Task Executor Metrics" field in eventlogs if values are all zero - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2023/01/27 08:51:28 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39769: [SPARK-42190][K8S][FOLLOWUP] Fix to use the user-given number of threads - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 10:00:41 UTC, 0 replies.
- [GitHub] [spark] mridulm closed pull request #39725: [SPARK-33573][CORE][FOLLOW-UP] Enhance ignoredBlockBytes in pushMergeMetrics to cover more scenarios - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/27 10:15:37 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39725: [SPARK-33573][CORE][FOLLOW-UP] Enhance ignoredBlockBytes in pushMergeMetrics to cover more scenarios - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/27 10:16:46 UTC, 0 replies.
- [GitHub] [spark] renzhe-brian commented on pull request #35239: [SPARK-37952][DOCS] Add missing statements to ALTER TABLE document - posted by "renzhe-brian (via GitHub)" <gi...@apache.org> on 2023/01/27 10:50:46 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on a diff in pull request #39723: [WIP][SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/27 10:59:53 UTC, 1 replies.
- [GitHub] [spark] vicennial commented on a diff in pull request #39753: [SPARK-42125][CONNECT][PYTHON] Pandas UDF in Spark Connect - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2023/01/27 13:18:56 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39771: [SPARK-42213][CONNECT][BUILD][TESTS] Add `repl` as the test dependency of `connect-client-jvm` to solve `repl.Main` not found when maven test - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 13:52:07 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39695: [SPARK-42156] SparkConnectClient supports RetryPolicies now - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/01/27 14:46:16 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/27 15:05:59 UTC, 2 replies.
- [GitHub] [spark] MaxGekk closed pull request #39711: [SPARK-41931][SQL] Better error message for incomplete complex type definition - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/27 15:06:43 UTC, 0 replies.
- [GitHub] [spark] zhenlineo commented on pull request #39760: [SPARK-42202][Connect][Test] Improve the E2E test server stop logic - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/01/27 16:26:53 UTC, 1 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39772: [SPARK-42216][CORE][TESTS] Use `immutable.Map` by default and fix two check conditions in `util.JsonProtocolSuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 16:50:46 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39772: [SPARK-42216][CORE][TESTS] Use `immutable.Map` by default and fix two check conditions in `util.JsonProtocolSuite` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 17:17:45 UTC, 0 replies.
- [GitHub] [spark] sunchao closed pull request #39752: [SPARK-42168][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/01/27 17:20:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39771: [SPARK-42213][BUILD][CONNECT] Add `repl` test dependency to `connect-client-jvm` module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 17:22:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39752: [SPARK-42168][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 17:24:36 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39717: [SPARK-42168][3.2][SQL][PYTHON] Fix required child distribution of FlatMapCoGroupsInPandas (as in CoGroup) - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 17:26:34 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39772: [SPARK-42216][CORE][TESTS] Use `immutable.Map` by default and fix two check conditions in `util.JsonProtocolSuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 17:27:58 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39772: [SPARK-42216][CORE][TESTS] Use `immutable.Map` by default and fix two check conditions in `util.JsonProtocolSuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 17:31:46 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39771: [SPARK-42213][BUILD][CONNECT] Add `repl` test dependency to `connect-client-jvm` module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 17:34:58 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39760: [SPARK-42202][Connect][Test] Improve the E2E test server stop logic - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 17:47:54 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39768: [SPARK-42209][CORE][CONNECT] Introduce `ExtendedConnectTest` to make user can disable tests related to `RemoteSparkSession` selectively - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 17:50:24 UTC, 1 replies.
- [GitHub] [spark] LuciferYang closed pull request #39768: [SPARK-42209][CORE][CONNECT] Introduce `ExtendedConnectTest` to make user can disable tests related to `RemoteSparkSession` selectively - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 17:50:35 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39768: [SPARK-42209][CORE][CONNECT] Introduce `ExtendedConnectTest` to make user can disable tests related to `RemoteSparkSession` selectively - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 17:56:52 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39707: [SPARK-42161][BUILD] Upgrade Apache Arrow to 11.0.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/27 18:03:03 UTC, 2 replies.
- [GitHub] [spark] rmcyang commented on pull request #39725: [SPARK-33573][CORE][FOLLOW-UP] Enhance ignoredBlockBytes in pushMergeMetrics to cover more scenarios - posted by "rmcyang (via GitHub)" <gi...@apache.org> on 2023/01/27 18:52:07 UTC, 0 replies.
- [GitHub] [spark] anchovYu opened a new pull request, #39773: [WIP] Support lateral column alias in queries with Window - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/01/27 19:00:32 UTC, 0 replies.
- [GitHub] [spark] roczei commented on pull request #39595: [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions in most cases - posted by "roczei (via GitHub)" <gi...@apache.org> on 2023/01/27 19:20:42 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39763: [WIP][SPARK-42204][CORE] Remove redundant logging of TaskMetrics internal accumulators in event logs - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/27 19:50:39 UTC, 1 replies.
- [GitHub] [spark] bjornjorgensen opened a new pull request, #39774: [SPARK-42218][BUILD] Upgrade `netty` to version 4.1.87.Final - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/01/27 19:51:30 UTC, 0 replies.
- [GitHub] [spark] bjornjorgensen commented on pull request #39774: [SPARK-42218][BUILD] Upgrade `netty` to version 4.1.87.Final - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2023/01/27 19:57:08 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39764: [SPARK-41849][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_input_file_name_udf ` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 20:14:16 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39764: [SPARK-41849][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_input_file_name_udf ` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 20:14:38 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39766: [SPARK-41875][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_to` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 20:18:52 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39765: [SPARK-41830][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_sample` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 20:20:22 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39765: [SPARK-41830][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_sample` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 20:20:51 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39707: [SPARK-42161][BUILD] Upgrade Apache Arrow to 11.0.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 20:22:42 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on pull request #39763: [WIP][SPARK-42204][CORE] Remove redundant logging of TaskMetrics internal accumulators in event logs - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/27 20:38:33 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #39767: [SPARK-42205][CORE] Don't log accumulator values in stage / task start event logs - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/27 20:46:09 UTC, 0 replies.
- [GitHub] [spark] attilapiros opened a new pull request, #39775: [SPARK-42219] Introducing a config to close all active SparkContexts after the Main method has finished - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/01/27 21:36:24 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #32283: [SPARK-34674][CORE][K8S] Close SparkContext after the Main method has finished - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/01/27 21:41:38 UTC, 0 replies.
- [GitHub] [spark] attilapiros commented on pull request #39775: [SPARK-42219][CORE] Introducing a config to close all active SparkContexts after the Main method has finished - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/01/27 21:42:44 UTC, 3 replies.
- [GitHub] [spark] anchovYu commented on pull request #39773: [SPARK-42217][SQL] Support implicit lateral column alias in queries with Window - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2023/01/27 22:35:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39775: [SPARK-42219][CORE] Introducing a config to close all active SparkContexts after the Main method has finished - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/27 23:03:22 UTC, 7 replies.
- [GitHub] [spark] attilapiros commented on a diff in pull request #39775: [SPARK-42219][CORE] Introducing a config to close all active SparkContexts after the Main method has finished - posted by "attilapiros (via GitHub)" <gi...@apache.org> on 2023/01/27 23:05:52 UTC, 4 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39774: [SPARK-42218][BUILD] Upgrade `netty` to version 4.1.87.Final - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/28 00:10:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39765: [SPARK-41830][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_sample` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/28 00:42:51 UTC, 0 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39776: [SPARK-42220][CONNECT][BUILD] Upgrade buf from 1.12.0 to 1.13.1 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/01/28 01:39:51 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on pull request #39492: [SPARK-41876][CONNECT][PYTHON] `test_dataframe` should catch both `AnalysisException` and `SparkConnectAnalysisException` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/01/28 02:05:28 UTC, 0 replies.
- [GitHub] [spark] beliefer closed pull request #39492: [SPARK-41876][CONNECT][PYTHON] `test_dataframe` should catch both `AnalysisException` and `SparkConnectAnalysisException` - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/01/28 02:05:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39772: [SPARK-42216][CORE][TESTS] Fix two check conditions and remove redundant `toMap` in `util.JsonProtocolSuite` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/28 02:13:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39774: [SPARK-42218][BUILD] Upgrade `netty` to version 4.1.87.Final - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/28 02:23:41 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39774: [SPARK-42218][BUILD] Upgrade `netty` to version 4.1.87.Final - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/28 02:24:32 UTC, 0 replies.
- [GitHub] [spark] beliefer commented on a diff in pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/01/28 02:31:50 UTC, 2 replies.
- [GitHub] [spark] yabola commented on a diff in pull request #39687: [SPARK-41470][SQL] Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "yabola (via GitHub)" <gi...@apache.org> on 2023/01/28 02:36:25 UTC, 3 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39777: [SPARK-42221][SQL] Introduce a new conf for TimestampNTZ schema inference in JSON/CSV - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/28 04:10:30 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39777: [SPARK-42221][SQL] Introduce a new conf for TimestampNTZ schema inference in JSON/CSV - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/28 04:11:33 UTC, 2 replies.
- [GitHub] [spark] navinvishy closed pull request #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "navinvishy (via GitHub)" <gi...@apache.org> on 2023/01/28 04:27:43 UTC, 0 replies.
- [GitHub] [spark] navinvishy opened a new pull request, #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "navinvishy (via GitHub)" <gi...@apache.org> on 2023/01/28 04:46:45 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39772: [SPARK-42216][CORE][TESTS] Fix two check conditions and remove redundant `toMap` in `util.JsonProtocolSuite` - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/28 04:52:58 UTC, 0 replies.
- [GitHub] [spark] navinvishy commented on a diff in pull request #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "navinvishy (via GitHub)" <gi...@apache.org> on 2023/01/28 04:53:58 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/28 05:18:36 UTC, 1 replies.
- [GitHub] [spark] navinvishy commented on pull request #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "navinvishy (via GitHub)" <gi...@apache.org> on 2023/01/28 05:22:33 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39765: [SPARK-41830][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_sample` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/28 05:37:22 UTC, 5 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/28 05:40:14 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39450: [SPARK-41897][CONNECT][TESTS] Enable tests with error mismatch in connect/test_parity_functions.py - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/28 05:53:23 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39723: [SPARK-41302][SQL] Assign name to _LEGACY_ERROR_TEMP_1185 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/28 05:58:19 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39450: [SPARK-41897][CONNECT][TESTS] Enable tests with error mismatch in connect/test_parity_functions.py - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/28 06:02:10 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39450: [SPARK-41897][CONNECT][TESTS] Enable tests with error mismatch in connect/test_parity_functions.py - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/28 06:02:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39707: [SPARK-42161][BUILD] Upgrade Apache Arrow to 11.0.0 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/28 06:03:52 UTC, 0 replies.
- [GitHub] [spark] Yikun opened a new pull request, #39778: [SPARK-42214][INFRA] Enable infra image build for scheduled job - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/01/28 06:54:37 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #39778: [SPARK-42214][INFRA] Enable infra image build for scheduled job - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/01/28 06:55:01 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39778: [SPARK-42214][INFRA] Enable infra image build for scheduled job - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/28 06:57:22 UTC, 1 replies.
- [GitHub] [spark] kecheung opened a new pull request, #39779: Spark3.3 backport spark 41344 - posted by "kecheung (via GitHub)" <gi...@apache.org> on 2023/01/28 07:07:47 UTC, 0 replies.
- [GitHub] [spark] kecheung commented on pull request #39779: [SPARK-42222][SQL] Make error clearer when table not found in SupportsCatalogOptions catalog - posted by "kecheung (via GitHub)" <gi...@apache.org> on 2023/01/28 07:28:38 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39776: [SPARK-42220][CONNECT][BUILD] Upgrade buf from 1.12.0 to 1.13.1 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/28 07:34:47 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39765: [SPARK-41830][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_sample` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/28 07:36:27 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #39726: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/01/28 07:51:01 UTC, 0 replies.
- [GitHub] [spark] wankunde opened a new pull request, #39780: [SPARK-42223][SQL] Remove duplicate branches in CASE_WHEN and COALESCE function - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2023/01/28 08:25:46 UTC, 0 replies.
- [GitHub] [spark] Yikun closed pull request #39778: [SPARK-42214][INFRA] Enable infra image build for scheduled job - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/01/28 10:02:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39778: [SPARK-42214][INFRA] Enable infra image build for scheduled job - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/28 10:11:04 UTC, 0 replies.
- [GitHub] [spark] eejbyfeldt commented on a diff in pull request #38428: [SPARK-40912][CORE]Overhead of Exceptions in KryoDeserializationStream - posted by "eejbyfeldt (via GitHub)" <gi...@apache.org> on 2023/01/28 10:32:53 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39781: [SPARK-42168][3.3][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/01/28 10:45:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39781: [SPARK-42168][3.3][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/28 11:22:24 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39782: [SPARK-4224][CONNECT] Migrate `TypeError` into error framework for Spark Connect functions - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/28 12:04:23 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39782: [SPARK-4224][CONNECT] Migrate `TypeError` into error framework for Spark Connect functions - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/28 12:06:33 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39783: [SPARK-42225][CONNECT] Add `SparkConnectIllegalArgumentException` to handle Spark Connect error precisely. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/28 13:02:06 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39784: [SPARK-42226][BUILD] Upgrade `versions-maven-plugin` to 2.14.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/28 14:09:52 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39785: [SPARK-42192][PYTHON] Migrate the `TypeError` from `pyspark/sql/dataframe.py` into `PySparkTypeError` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/28 16:32:03 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39786: [SPARK-42194][PS] Allow `columns` parameter when creating DataFrame with Series. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/28 19:26:28 UTC, 0 replies.
- [GitHub] [spark] dtenedor commented on pull request #39726: [SPARK-42123][SQL] Include column default values in DESCRIBE and SHOW CREATE TABLE output - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2023/01/28 20:18:28 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39707: [SPARK-42161][BUILD] Upgrade Apache Arrow to 11.0.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/29 00:49:10 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39776: [SPARK-42220][CONNECT][BUILD] Upgrade buf from 1.12.0 to 1.13.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/29 00:49:50 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39776: [SPARK-42220][CONNECT][BUILD] Upgrade buf from 1.12.0 to 1.13.1 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/29 00:50:18 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39765: [SPARK-41830][CONNECT][PYTHON][TESTS][FOLLOWUP] Enable parity test `test_sample` - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/29 00:51:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39784: [SPARK-42226][BUILD] Upgrade `versions-maven-plugin` to 2.14.2 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/29 00:54:00 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39782: [SPARK-42224][CONNECT] Migrate `TypeError` into error framework for Spark Connect functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 01:02:52 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39782: [SPARK-42224][CONNECT] Migrate `TypeError` into error framework for Spark Connect functions - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 01:03:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39786: [SPARK-42194][PS] Allow `columns` parameter when creating DataFrame with Series. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 01:04:40 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39786: [SPARK-42194][PS] Allow `columns` parameter when creating DataFrame with Series. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 02:34:29 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39787: [SPARK-42224][FOLLOWUP] Raise `PySparkTypeError` instead of `TypeError`. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/29 03:54:36 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39784: [SPARK-42226][BUILD] Upgrade `versions-maven-plugin` to 2.14.2 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/29 04:35:51 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39783: [SPARK-42225][CONNECT] Add `SparkConnectIllegalArgumentException` to handle Spark Connect error precisely. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 05:25:31 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39701: [SPARK-41489][SQL] Assign name to _LEGACY_ERROR_TEMP_2415 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/29 05:44:48 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #39701: [SPARK-41489][SQL] Assign name to _LEGACY_ERROR_TEMP_2415 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/29 05:45:25 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39783: [SPARK-42225][CONNECT] Add `SparkConnectIllegalArgumentException` to handle Spark Connect error precisely. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/29 05:57:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39783: [SPARK-42225][CONNECT] Add `SparkConnectIllegalArgumentException` to handle Spark Connect error precisely. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 07:17:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39783: [SPARK-42225][CONNECT] Add `SparkConnectIllegalArgumentException` to handle Spark Connect error precisely. - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 07:18:21 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39787: [SPARK-42224][FOLLOWUP] Raise `PySparkTypeError` instead of `TypeError` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 07:19:18 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39787: [SPARK-42224][FOLLOWUP] Raise `PySparkTypeError` instead of `TypeError` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 07:19:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39785: [SPARK-42192][PYTHON] Migrate the `TypeError` from `pyspark/sql/dataframe.py` into `PySparkTypeError` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 07:23:26 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39781: [SPARK-42168][3.3][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 07:25:29 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39712: [SPARK-42172][CONNECT] Scala Client Mima Compatibility Tests - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 07:39:57 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39779: [SPARK-42222][SQL][3.3] Make error clearer when table not found in SupportsCatalogOptions catalog - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 08:07:20 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39788: [WIP] Move ClientE2ETestSuite into a separate module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/29 08:32:49 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39789: [SPARK-42228][CONNECT][BUILD] Add shade and relocation rule of grpc to connect-client-jvm module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/29 08:41:04 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39790: [SPARK-42094][PS] Support `fill_value` for `ps.Series.(add|radd)` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/29 08:49:18 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39790: [SPARK-42094][PS] Support `fill_value` for `ps.Series.(add|radd)` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/29 08:52:09 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39789: [SPARK-42228][CONNECT][BUILD] Add shade and relocation rule of grpc to connect-client-jvm module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/29 09:00:09 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39788: [WIP] Move ClientE2ETestSuite into a separate module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/29 09:10:15 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39788: [WIP] Move ClientE2ETestSuite into a separate module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/29 09:16:32 UTC, 1 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39789: [SPARK-42228][BUILD][CONNECT] Add shade and relocation rule of grpc to connect-client-jvm module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/29 09:32:05 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39790: [SPARK-42094][PS] Support `fill_value` for `ps.Series.(add|radd)` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/29 09:38:53 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39791: [SPARK-42229][CORE] Migrate `SparkCoreErrors` into error classes - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/29 10:01:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39720: [SPARK-41500] [SQL] Year/Month Interval operations bug fix - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/29 12:52:43 UTC, 2 replies.
- [GitHub] [spark] techaddict commented on pull request #39614: [SPARK-42002][CONNECT][PYTHON] Implement DataFrameWriterV2 - posted by "techaddict (via GitHub)" <gi...@apache.org> on 2023/01/29 13:27:57 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39747: [SPARK-42191][SQL] Support udf 'luhn_check' - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/29 13:36:55 UTC, 14 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39744: [SPARK-38591][SQL][FOLLOW-UP] Fix ambiguous references for sorted cogroups - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/29 13:45:18 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39734: [WIP][SPARK-41812][SPARK-41823][CONNECT][PYTHON] Fix ambiguous columns issue in Join - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/29 14:52:36 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39791: [SPARK-42229][CORE] Migrate `SparkCoreErrors` into error classes - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/29 16:27:52 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on pull request #39770: [WIP][SPARK-42206][CORE] Omit "Task Executor Metrics" field in eventlogs if values are all zero - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/29 18:44:15 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on a diff in pull request #39775: [SPARK-42219][CORE] Introducing a config to close all active SparkContexts after the Main method has finished - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/01/29 21:00:39 UTC, 0 replies.
- [GitHub] [spark] khalidmammadov commented on a diff in pull request #39785: [SPARK-42192][PYTHON] Migrate the `TypeError` from `pyspark/sql/dataframe.py` into `PySparkTypeError` - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/01/29 21:23:36 UTC, 7 replies.
- [GitHub] [spark] ganeshchand commented on pull request #39750: [SPARK-42196][SS] Fix typo - posted by "ganeshchand (via GitHub)" <gi...@apache.org> on 2023/01/29 21:30:19 UTC, 0 replies.
- [GitHub] [spark] sadikovi commented on pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2023/01/29 22:09:49 UTC, 0 replies.
- [GitHub] [spark] db-scnakandala commented on pull request #39722: [SPARK-42162] Introduce MultiAdd expression as a memory optimization for canonicalizing large trees of Add expressions - posted by "db-scnakandala (via GitHub)" <gi...@apache.org> on 2023/01/29 22:48:19 UTC, 2 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39792: [SPARK-42230][INFRA] Improve `lint` job by skipping PySpark and SparkR docs if unchanged - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 00:33:42 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39792: [SPARK-42230][INFRA] Improve `lint` job by skipping PySpark and SparkR docs if unchanged - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 00:40:54 UTC, 1 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39792: [SPARK-42230][INFRA] Improve `lint` job by skipping PySpark and SparkR docs if unchanged - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 00:41:04 UTC, 4 replies.
- [GitHub] [spark] xinrong-meng commented on a diff in pull request #39753: [SPARK-42125][CONNECT][PYTHON] Pandas UDF in Spark Connect - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/30 01:13:22 UTC, 2 replies.
- [GitHub] [spark] cloud-fan closed pull request #39592: [SPARK-42081][SQL] Improve the plan change validation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 02:00:52 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39744: [SPARK-38591][SQL][FOLLOW-UP] Fix ambiguous references for sorted cogroups - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 02:06:21 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on a diff in pull request #39792: [SPARK-42230][INFRA] Improve `lint` job by skipping PySpark and SparkR docs if unchanged - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/01/30 02:07:22 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39785: [SPARK-42192][PYTHON] Migrate the `TypeError` from `pyspark/sql/dataframe.py` into `PySparkTypeError` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 02:11:13 UTC, 14 replies.
- [GitHub] [spark] beliefer commented on pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2023/01/30 02:12:55 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #39625: [SPARK-42066][SQL] The DATATYPE_MISMATCH error class contains inappropriate and duplicating subclasses - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 02:23:32 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39777: [SPARK-42221][SQL] Introduce a new conf for TimestampNTZ schema inference in JSON/CSV - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 02:28:59 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39777: [SPARK-42221][SQL] Introduce a new conf for TimestampNTZ schema inference in JSON/CSV - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 02:30:28 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39660: [SPARK-42128][SQL] Support TOP (N) for MS SQL Server dialect as an alternative to Limit pushdown - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 02:46:39 UTC, 1 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 02:49:55 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 03:03:41 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39793: [SPARK-42233][SQL] Improve error message for PIVOT_AFTER_GROUP_BY - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 03:25:33 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39793: [SPARK-42233][SQL] Improve error message for PIVOT_AFTER_GROUP_BY - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 03:26:00 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39624: [SPARK-42101][SQL] Introduce Materializable and MaterializableQueryStage for AQE framework - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/01/30 03:26:40 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #39624: [SPARK-42101][SQL] Introduce Materializable and MaterializableQueryStage for AQE framework - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/01/30 03:29:23 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39789: [SPARK-42228][BUILD][CONNECT] Add shade and relocation rule of grpc to connect-client-jvm module - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/01/30 03:55:26 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #39794: [SPARK-41735][SQL] Use MINIMAL instead of STANDARD for SparkListenerSQLExecutionEnd - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/01/30 03:56:30 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39794: [SPARK-41735][SQL] Use MINIMAL instead of STANDARD for SparkListenerSQLExecutionEnd - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/01/30 03:56:38 UTC, 1 replies.
- [GitHub] [spark] itholic opened a new pull request, #39795: [SPARK-42234][SQL] Rename error class: `UNSUPPORTED_FEATURE.REPEATED_PIVOT` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 04:39:52 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39777: [SPARK-42221][SQL] Introduce a new conf for TimestampNTZ schema inference in JSON/CSV - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/30 05:02:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39789: [SPARK-42228][BUILD][CONNECT] Add shade and relocation rule of grpc to connect-client-jvm module - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 05:23:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39750: [SPARK-42196][SS] Fix typo in StreamingQuery.runId - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 05:41:53 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39750: [SPARK-42196][SS] Fix typo in StreamingQuery.runId - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 05:42:46 UTC, 0 replies.
- [GitHub] [spark] huaxingao commented on pull request #37479: [SPARK-40045][SQL]Optimize the order of filtering predicates - posted by "huaxingao (via GitHub)" <gi...@apache.org> on 2023/01/30 05:52:07 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39792: [SPARK-42230][INFRA] Improve `lint` job by skipping PySpark and SparkR docs if unchanged - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 06:16:00 UTC, 0 replies.
- [GitHub] [spark] jzhuge opened a new pull request, #39796: [SPARK-39800][SQL][WIP] DataSourceV2: View Support - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/01/30 07:21:01 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39797: [SPARK-42231][SQL] Turn `MISSING_STATIC_PARTITION_COLUMN` into `internalError` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 07:24:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39792: [SPARK-42230][INFRA] Improve `lint` job by skipping PySpark and SparkR docs if unchanged - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 07:37:18 UTC, 0 replies.
- [GitHub] [spark] mridulm commented on a diff in pull request #38428: [SPARK-40912][CORE]Overhead of Exceptions in KryoDeserializationStream - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/30 07:37:46 UTC, 2 replies.
- [GitHub] [spark] mridulm commented on pull request #39796: [SPARK-39800][SQL][WIP] DataSourceV2: View Support - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2023/01/30 07:44:53 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39794: [SPARK-41735][SQL] Use MINIMAL instead of STANDARD for SparkListenerSQLExecutionEnd - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 08:01:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39794: [SPARK-41735][SQL] Use MINIMAL instead of STANDARD for SparkListenerSQLExecutionEnd - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 08:01:42 UTC, 0 replies.
- [GitHub] [spark] cxzl25 opened a new pull request, #39798: [MINOR] Fix typo `Exlude` to `Exclude` in `HealthTracker` - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/01/30 08:18:29 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39799: [SPARK-42232][SQL] Rename error class: `UNSUPPORTED_FEATURE.JDBC_TRANSACTION` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 09:00:59 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on pull request #39720: [SPARK-41500] [SQL] Year/Month Interval operations bug fix - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/30 09:05:58 UTC, 1 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39800: [SPARK-41855][CONNECT][PYTHON][FOLLOWUP] Make `createDataFrame` accept `Decimal('NaN')` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/30 09:09:08 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39800: [SPARK-41855][CONNECT][PYTHON][FOLLOWUP] Make `createDataFrame` accept `Decimal('NaN')` - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/30 09:11:11 UTC, 0 replies.
- [GitHub] [spark] zhmin opened a new pull request, #39801: [MINOR][DOCS][SQL] Fix FoldablePropagation rule document - posted by "zhmin (via GitHub)" <gi...@apache.org> on 2023/01/30 09:12:11 UTC, 0 replies.
- [GitHub] [spark] Yikun commented on pull request #39792: [SPARK-42230][INFRA] Improve `lint` job by skipping PySpark and SparkR docs if unchanged - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2023/01/30 09:18:53 UTC, 0 replies.
- [GitHub] [spark] weiyuyilia opened a new pull request, #39802: [SPARK-42237][SQL] change binary to unsupported dataType in csv format - posted by "weiyuyilia (via GitHub)" <gi...@apache.org> on 2023/01/30 09:20:25 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39802: [SPARK-42237][SQL] Change binary to unsupported dataType in csv format - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 09:37:13 UTC, 0 replies.
- [GitHub] [spark] wangyum commented on a diff in pull request #39801: [MINOR][DOCS][SQL] Fix FoldablePropagation rule document - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2023/01/30 09:44:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39752: [SPARK-42168][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 10:08:35 UTC, 0 replies.
- [GitHub] [spark] EnricoMi opened a new pull request, #39803: [SPARK-42168][3.4][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/01/30 10:50:30 UTC, 0 replies.
- [GitHub] [spark] weiyuyilia commented on a diff in pull request #39802: [SPARK-42237][SQL] Change binary to unsupported dataType in csv format - posted by "weiyuyilia (via GitHub)" <gi...@apache.org> on 2023/01/30 10:50:48 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38760: [SPARK-41219][SQL] IntegralDivide use decimal(1, 0) to represent 0 - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 10:51:01 UTC, 5 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39625: [SPARK-42066][SQL] The DATATYPE_MISMATCH error class contains inappropriate and duplicating subclasses - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 10:55:07 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39625: [SPARK-42066][SQL] The DATATYPE_MISMATCH error class contains inappropriate and duplicating subclasses - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 10:55:26 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39802: [SPARK-42237][SQL] Change binary to unsupported dataType in CSV format - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 10:55:29 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39676: [SPARK-42134][SQL] Fix getPartitionFiltersAndDataFilters() to handle filters without referenced attributes - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 10:59:19 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39722: [SPARK-42162] Introduce MultiAdd expression as a memory optimization for canonicalizing large trees of Add expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/30 11:06:40 UTC, 1 replies.
- [GitHub] [spark] ulysses-you commented on pull request #38760: [SPARK-41219][SQL] IntegralDivide use decimal(1, 0) to represent 0 - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/01/30 11:19:13 UTC, 4 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39695: [SPARK-42156][CONNECT] SparkConnectClient supports RetryPolicies now - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 12:00:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39785: [SPARK-42192][PYTHON] Migrate the `TypeError` from `pyspark/sql/dataframe.py` into `PySparkTypeError` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 12:09:26 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39800: [SPARK-41855][CONNECT][PYTHON][FOLLOWUP] Make `createDataFrame` accept `Decimal('NaN')` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 12:12:34 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39800: [SPARK-41855][CONNECT][PYTHON][FOLLOWUP] Make `createDataFrame` accept `Decimal('NaN')` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 12:12:44 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39804: [SPARK-42236][SQL] Refine `NULLABLE_ARRAY_OR_MAP_ELEMENT` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 12:27:08 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39804: [SPARK-42236][SQL] Refine `NULLABLE_ARRAY_OR_MAP_ELEMENT` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 12:33:27 UTC, 1 replies.
- [GitHub] [spark] xinrong-meng commented on pull request #39695: [SPARK-42156][CONNECT] SparkConnectClient supports RetryPolicies now - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/30 13:02:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39700: [SPARK-41490][SQL] Assign name to _LEGACY_ERROR_TEMP_2441 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/30 13:21:55 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #39700: [SPARK-41490][SQL] Assign name to _LEGACY_ERROR_TEMP_2441 - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/30 13:22:35 UTC, 0 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #37525: [SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/01/30 13:26:52 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39803: [SPARK-42168][3.4][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 13:36:33 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39803: [SPARK-42168][3.4][SQL][PYTHON][FOLLOW-UP] Test FlatMapCoGroupsInPandas with Window function - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/30 13:36:57 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39805: [SPARK-42238][SQL] Introduce new error class: `INCOMPATIBLE_JOIN_TYPES` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 13:41:23 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39805: [SPARK-42238][SQL] Introduce new error class: `INCOMPATIBLE_JOIN_TYPES` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 13:41:43 UTC, 0 replies.
- [GitHub] [spark] zhmin commented on a diff in pull request #39801: [MINOR][DOCS][SQL] Fix FoldablePropagation rule document - posted by "zhmin (via GitHub)" <gi...@apache.org> on 2023/01/30 13:42:43 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39793: [SPARK-42233][SQL] Improve error message for `PIVOT_AFTER_GROUP_BY` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/30 13:50:20 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39793: [SPARK-42233][SQL] Improve error message for `PIVOT_AFTER_GROUP_BY` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/30 13:50:59 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39806: [SPARK-42239][SQL] Integrate `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 14:06:12 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39806: [SPARK-42239][SQL] Integrate `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 14:06:26 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39747: [SPARK-42191][SQL] Support udf 'luhn_check' - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/30 14:23:02 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #39798: [MINOR] Fix typo `Exlude` to `Exclude` in `HealthTracker` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/30 14:25:04 UTC, 2 replies.
- [GitHub] [spark] srowen commented on pull request #38428: [SPARK-40912][CORE]Overhead of Exceptions in KryoDeserializationStream - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/30 14:29:39 UTC, 1 replies.
- [GitHub] [spark] peter-toth commented on pull request #39722: [SPARK-42162] Introduce MultiAdd expression as a memory optimization for canonicalizing large trees of Add expressions - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2023/01/30 14:43:43 UTC, 0 replies.
- [GitHub] [spark] NarekDW closed pull request #39720: [SPARK-41500] [SQL] Year/Month Interval operations bug fix - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/30 15:24:21 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39807: [SPARK-42240][CONNECT][TESTS] Move ClientE2ETestSuite into a separate module to test shaded jvm client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/30 15:52:54 UTC, 0 replies.
- [GitHub] [spark] LuciferYang closed pull request #39788: [WIP] Move ClientE2ETestSuite into a separate module - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/30 15:54:05 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39807: [WIP][SPARK-42240][INFRA][CONNECT][TESTS] Move ClientE2ETestSuite into a separate module to test shaded jvm client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/30 16:01:50 UTC, 5 replies.
- [GitHub] [spark] databricks-david-lewis opened a new pull request, #39808: [SC-][FOLLOWUP] Revert SparkPath changes to FileIndex and FileRelation - posted by "databricks-david-lewis (via GitHub)" <gi...@apache.org> on 2023/01/30 16:06:03 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39807: [WIP][SPARK-42240][INFRA][CONNECT][TESTS] Move ClientE2ETestSuite into a separate module to test shaded jvm client - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/30 16:07:48 UTC, 1 replies.
- [GitHub] [spark] EnricoMi commented on a diff in pull request #39754: [SPARK-42199][SQL] Fix issues around Dataset.groupByKey - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2023/01/30 16:18:01 UTC, 0 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39795: [SPARK-42234][SQL] Rename error class: `UNSUPPORTED_FEATURE.REPEATED_PIVOT` - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/01/30 16:58:27 UTC, 2 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #38867: [SPARK-41234][SQL][PYTHON] Add `array_insert` function - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/01/30 17:06:18 UTC, 4 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39809: [SPARK-42230][INFRA][FOLLOWUP] Add `GITHUB_PREV_SHA` and `APACHE_SPARK_REF` to lint job - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 17:48:03 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39809: [SPARK-42230][INFRA][FOLLOWUP] Add `GITHUB_PREV_SHA` and `APACHE_SPARK_REF` to lint job - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 17:49:44 UTC, 3 replies.
- [GitHub] [spark] srielau commented on a diff in pull request #39805: [SPARK-42238][SQL] Introduce new error class: `INCOMPATIBLE_JOIN_TYPES` - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/01/30 17:53:34 UTC, 0 replies.
- [GitHub] [spark] databricks-david-lewis commented on pull request #39808: [SPARK-41970][FOLLOWUP] Revert SparkPath changes to FileIndex and FileRelation - posted by "databricks-david-lewis (via GitHub)" <gi...@apache.org> on 2023/01/30 17:53:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang opened a new pull request, #39810: [SPARK-42241][CONNECT][TESTS] Correct the condition of `SparkConnectServerUtils#findSparkConnectJar` to find the correct connect server jar for maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/30 17:55:46 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on a diff in pull request #39808: [SPARK-41970][FOLLOWUP] Revert SparkPath changes to FileIndex and FileRelation - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 18:00:04 UTC, 0 replies.
- [GitHub] [spark] databricks-david-lewis commented on a diff in pull request #39808: [SPARK-41970][FOLLOWUP] Revert SparkPath changes to FileIndex and FileRelation - posted by "databricks-david-lewis (via GitHub)" <gi...@apache.org> on 2023/01/30 18:01:25 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39808: [SPARK-41970][FOLLOWUP] Revert SparkPath changes to FileIndex and FileRelation - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 18:01:49 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39789: [SPARK-42228][BUILD][CONNECT] Add shade and relocation rule of grpc to connect-client-jvm module - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 18:02:40 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39810: [SPARK-42241][CONNECT][TESTS] Correct the condition of finding connect jar in `SparkConnectServerUtils#findSparkConnectJar` for maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/30 18:07:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39807: [WIP][SPARK-42240][INFRA][CONNECT][TESTS] Move `ClientE2ETestSuite` into a separate module and add new GA task to test shaded jvm client with maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/30 18:47:36 UTC, 2 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39807: [WIP][SPARK-42240][INFRA][CONNECT][TESTS] Move `ClientE2ETestSuite` into a separate module and add new GA task to test shaded jvm client with maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/30 18:58:24 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39760: [SPARK-42202][Connect][Test] Improve the E2E test server stop logic - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/01/30 19:43:47 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39806: [SPARK-42239][SQL] Integrate `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/30 19:56:37 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #39806: [SPARK-42239][SQL] Integrate `MUST_AGGREGATE_CORRELATED_SCALAR_SUBQUERY` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/30 19:57:09 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun closed pull request #39809: [SPARK-42230][INFRA][FOLLOWUP] Add `GITHUB_PREV_SHA` and `APACHE_SPARK_REF` to lint job - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 19:59:37 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/30 20:07:41 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on pull request #39796: [SPARK-39800][SQL][WIP] DataSourceV2: View Support - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/01/30 20:09:43 UTC, 3 replies.
- [GitHub] [spark] LuciferYang commented on a diff in pull request #39760: [SPARK-42202][Connect][Test] Improve the E2E test server stop logic - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/30 20:30:33 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on a diff in pull request #39695: [SPARK-42156][CONNECT] SparkConnectClient supports RetryPolicies now - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/01/30 21:15:06 UTC, 0 replies.
- [GitHub] [spark] grundprinzip commented on pull request #39695: [SPARK-42156][CONNECT] SparkConnectClient supports RetryPolicies now - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2023/01/30 21:15:33 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39777: [SPARK-42221][SQL] Introduce a new conf for TimestampNTZ schema inference in JSON/CSV - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/30 21:31:48 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun opened a new pull request, #39811: [SPARK-42242][BUILD] Upgrade `snappy-java` to 1.1.9.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 23:29:14 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #39811: [SPARK-42242][BUILD] Upgrade `snappy-java` to 1.1.9.0 - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/01/30 23:34:02 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39811: [SPARK-42242][BUILD] Upgrade `snappy-java` to 1.1.9.0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/30 23:34:26 UTC, 6 replies.
- [GitHub] [spark] gengliangwang opened a new pull request, #39812: [SPARK-42243][SQL] Use `spark.sql.inferTimestampNTZInDataSources.enabled` to infer timestamp type on partition columns - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/30 23:35:42 UTC, 0 replies.
- [GitHub] [spark] sigmod commented on pull request #39718: [SPARK-42163] Fix schema pruning for non-foldable array index or map key - posted by "sigmod (via GitHub)" <gi...@apache.org> on 2023/01/30 23:38:43 UTC, 1 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 23:41:09 UTC, 0 replies.
- [GitHub] [spark] itholic commented on a diff in pull request #39795: [SPARK-42234][SQL] Rename error class: `UNSUPPORTED_FEATURE.REPEATED_PIVOT` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/30 23:48:15 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39785: [SPARK-42192][PYTHON] Migrate the `TypeError` from `pyspark/sql/dataframe.py` into `PySparkTypeError` - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 00:06:42 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39808: [SPARK-41970][SQL][FOLLOWUP] Revert SparkPath changes to FileIndex and FileRelation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 00:09:59 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39808: [SPARK-41970][SQL][FOLLOWUP] Revert SparkPath changes to FileIndex and FileRelation - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 00:10:16 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39810: [SPARK-42241][CONNECT][TESTS] Fix the find connect jar condition in `SparkConnectServerUtils#findSparkConnectJar` for maven - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 00:11:35 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39810: [SPARK-42241][CONNECT][TESTS] Fix the find connect jar condition in `SparkConnectServerUtils#findSparkConnectJar` for maven - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 00:11:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39614: [SPARK-42002][CONNECT][PYTHON] Implement DataFrameWriterV2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 00:16:02 UTC, 1 replies.
- [GitHub] [spark] itholic commented on pull request #39797: [SPARK-42231][SQL] Turn `MISSING_STATIC_PARTITION_COLUMN` into `internalError` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/31 00:18:28 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39614: [SPARK-42002][CONNECT][PYTHON] Implement DataFrameWriterV2 - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 00:18:36 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #38947: [SPARK-41233][SQL] Add `array_prepend` function - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/31 00:46:21 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39773: [SPARK-42217][SQL] Support implicit lateral column alias in queries with Window - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/31 00:53:31 UTC, 3 replies.
- [GitHub] [spark] zhenlineo commented on pull request #39789: [SPARK-42228][BUILD][CONNECT] Add shade and relocation rule of grpc to connect-client-jvm module - posted by "zhenlineo (via GitHub)" <gi...@apache.org> on 2023/01/31 01:03:20 UTC, 2 replies.
- [GitHub] [spark] fe2s opened a new pull request, #39813: [SPARK-41554] fix changing of Decimal scale when scale decreased by m… - posted by "fe2s (via GitHub)" <gi...@apache.org> on 2023/01/31 01:03:24 UTC, 0 replies.
- [GitHub] [spark] fe2s commented on pull request #39381: [SPARK-41554] fix changing of Decimal scale when scale decreased by m… - posted by "fe2s (via GitHub)" <gi...@apache.org> on 2023/01/31 01:04:18 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng closed pull request #39753: [SPARK-42125][CONNECT][PYTHON] Pandas UDF in Spark Connect - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/31 01:12:36 UTC, 0 replies.
- [GitHub] [spark] xinrong-meng opened a new pull request, #39814: [WIP][SPARK-42208][CONNECT][PYTHON] Reuse UDF test cases under `pyspark.sql.tests` - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2023/01/31 01:21:05 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39815: [SPARK-42244][PYTHON] Refine error classes and messages - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/31 01:28:18 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39799: [SPARK-42232][SQL] Rename error class: `UNSUPPORTED_FEATURE.JDBC_TRANSACTION` - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/31 01:38:57 UTC, 0 replies.
- [GitHub] [spark] hvanhovell closed pull request #39760: [SPARK-42202][Connect][Test] Improve the E2E test server stop logic - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/01/31 01:49:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #37525: [SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 01:51:35 UTC, 1 replies.
- [GitHub] [spark] panbingkun opened a new pull request, #39816: [SPARK-42245][BUILD] Upgrade scalafmt from 3.6.1 to 3.7.1 - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2023/01/31 01:54:18 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39660: [SPARK-42128][SQL] Support TOP (N) for MS SQL Server dialect as an alternative to Limit pushdown - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 02:07:10 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on a diff in pull request #37525: [SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/01/31 02:07:38 UTC, 10 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39808: [SPARK-41970][SQL][FOLLOWUP] Revert SparkPath changes to FileIndex and FileRelation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 02:10:20 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39718: [SPARK-42163][SQL] Fix schema pruning for non-foldable array index or map key - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 02:16:15 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39718: [SPARK-42163][SQL] Fix schema pruning for non-foldable array index or map key - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 02:16:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39797: [SPARK-42231][SQL] Turn `MISSING_STATIC_PARTITION_COLUMN` into `internalError` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 02:27:14 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #39797: [SPARK-42231][SQL] Turn `MISSING_STATIC_PARTITION_COLUMN` into `internalError` - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 02:27:29 UTC, 0 replies.
- [GitHub] [spark] viirya commented on pull request #38760: [SPARK-41219][SQL] IntegralDivide use decimal(1, 0) to represent 0 - posted by "viirya (via GitHub)" <gi...@apache.org> on 2023/01/31 03:23:58 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon opened a new pull request, #39817: [SPARK-42250][PYTHON][ML] `predict_batch_udf` with float fails when the batch size consists of single value - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 03:45:30 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39810: [SPARK-42241][CONNECT][TESTS] Fix the find connect jar condition in `SparkConnectServerUtils#findSparkConnectJar` for maven - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/31 03:48:45 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39817: [SPARK-42250][PYTHON][ML] `predict_batch_udf` with float fails when the batch size consists of single value - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 03:48:58 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #38760: [SPARK-41219][SQL] IntegralDivide use decimal(1, 0) to represent 0 - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 03:58:27 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng opened a new pull request, #39818: [SPARK-42023][SPARK-42024][CONNECT][PYTHON] Make `createDataFrame` support `AtomicType -> StringType` coercion - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/31 04:11:48 UTC, 0 replies.
- [GitHub] [spark] srowen commented on a diff in pull request #39747: [SPARK-42191][SQL] Support udf 'luhn_check' - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/31 04:38:28 UTC, 0 replies.
- [GitHub] [spark] wayneguow opened a new pull request, #39819: [SPARK-42252][Core] Deprecate spark.shuffle.unsafe.file.output.buffer and add a new config - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/01/31 04:41:25 UTC, 0 replies.
- [GitHub] [spark] cxzl25 commented on pull request #39798: [MINOR] Fix typo `Exlude` to `Exclude` in `HealthTracker` - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2023/01/31 04:48:22 UTC, 0 replies.
- [GitHub] [spark] LuciferYang commented on pull request #39811: [SPARK-42242][BUILD] Upgrade `snappy-java` to 1.1.9.0 - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2023/01/31 04:57:06 UTC, 2 replies.
- [GitHub] [spark] itholic opened a new pull request, #39820: [SPARK-42249][SQL] Refining html link for documentation in error messages. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/31 04:59:04 UTC, 0 replies.
- [GitHub] [spark] itholic commented on pull request #39820: [SPARK-42249][SQL] Refining html link for documentation in error messages. - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/31 05:01:08 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39812: [SPARK-42243][SQL] Use `spark.sql.inferTimestampNTZInDataSources.enabled` to infer timestamp type on partition columns - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 05:30:18 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on a diff in pull request #39812: [SPARK-42243][SQL] Use `spark.sql.inferTimestampNTZInDataSources.enabled` to infer timestamp type on partition columns - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/31 05:38:53 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39818: [SPARK-42023][SPARK-42024][CONNECT][PYTHON] Make `createDataFrame` support `AtomicType -> StringType` coercion - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 06:27:20 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39818: [SPARK-42023][SPARK-42024][CONNECT][PYTHON] Make `createDataFrame` support `AtomicType -> StringType` coercion - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 06:27:38 UTC, 0 replies.
- [GitHub] [spark] gengliangwang commented on pull request #39812: [SPARK-42243][SQL] Use `spark.sql.inferTimestampNTZInDataSources.enabled` to infer timestamp type on partition columns - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/31 06:58:45 UTC, 0 replies.
- [GitHub] [spark] gengliangwang closed pull request #39812: [SPARK-42243][SQL] Use `spark.sql.inferTimestampNTZInDataSources.enabled` to infer timestamp type on partition columns - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2023/01/31 06:59:23 UTC, 0 replies.
- [GitHub] [spark] zhengruifeng commented on pull request #39818: [SPARK-42023][SPARK-42024][CONNECT][PYTHON] Make `createDataFrame` support `AtomicType -> StringType` coercion - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2023/01/31 07:01:14 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on pull request #39695: [SPARK-42156][CONNECT] SparkConnectClient supports RetryPolicies now - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 07:05:07 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39695: [SPARK-42156][CONNECT] SparkConnectClient supports RetryPolicies now - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 07:05:27 UTC, 0 replies.
- [GitHub] [spark] itholic opened a new pull request, #39821: [SPARK-42253][PYTHON] Add test for detecting duplicated error class - posted by "itholic (via GitHub)" <gi...@apache.org> on 2023/01/31 07:30:00 UTC, 0 replies.
- [GitHub] [spark] wayneguow commented on pull request #39819: [SPARK-42252][CORE] Deprecate spark.shuffle.unsafe.file.output.buffer and add a new config - posted by "wayneguow (via GitHub)" <gi...@apache.org> on 2023/01/31 07:32:27 UTC, 0 replies.
- [GitHub] [spark] NarekDW commented on a diff in pull request #39501: [SPARK-41295][SPARK-41296][SQL] Rename the error classes - posted by "NarekDW (via GitHub)" <gi...@apache.org> on 2023/01/31 08:14:12 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39791: [SPARK-42229][CORE] Migrate `SparkCoreErrors` into error classes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/31 08:30:25 UTC, 1 replies.
- [GitHub] [spark] MaxGekk closed pull request #39791: [SPARK-42229][CORE] Migrate `SparkCoreErrors` into error classes - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/31 08:31:31 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39516: [SPARK-41989][PYTHON] Avoid breaking logging config from pyspark.pandas - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/31 08:35:57 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39773: [SPARK-42217][SQL] Support implicit lateral column alias in queries with Window - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 08:46:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39667: [SPARK-42131][SQL] Extract the function that construct the select statement for JDBC dialect. - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 08:57:36 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39705: [SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 (and 1177) - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 09:00:41 UTC, 1 replies.
- [GitHub] [spark] cloud-fan closed pull request #39705: [SPARK-41488][SQL] Assign name to _LEGACY_ERROR_TEMP_1176 (and 1177) - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 09:01:26 UTC, 0 replies.
- [GitHub] [spark] digitallabs-reviewBot commented on pull request #39821: [SPARK-42253][PYTHON] Add test for detecting duplicated error class - posted by "digitallabs-reviewBot (via GitHub)" <gi...@apache.org> on 2023/01/31 09:09:44 UTC, 0 replies.
- [GitHub] [spark] ulysses-you opened a new pull request, #39822: [SPARK-42251][SQL] Forbid deicmal type if precision less than 1 - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/01/31 09:18:32 UTC, 0 replies.
- [GitHub] [spark] ulysses-you commented on pull request #39822: [SPARK-42251][SQL] Forbid deicmal type if precision less than 1 - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2023/01/31 09:19:21 UTC, 1 replies.
- [GitHub] [spark] steveloughran commented on pull request #39124: [DON'T MERGE] Test build and test with hadoop 3.3.5-RC0 - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2023/01/31 10:06:09 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39796: [SPARK-39800][SQL][WIP] DataSourceV2: View Support - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 10:28:04 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39796: [SPARK-39800][SQL][WIP] DataSourceV2: View Support - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 10:29:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39822: [SPARK-42251][SQL] Forbid deicmal type if precision less than 1 - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 10:33:52 UTC, 1 replies.
- [GitHub] [spark] HyukjinKwon closed pull request #39817: [SPARK-42250][PYTHON][ML] `predict_batch_udf` with float fails when the batch size consists of single value - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 10:42:02 UTC, 0 replies.
- [GitHub] [spark] HyukjinKwon commented on a diff in pull request #39821: [SPARK-42253][PYTHON] Add test for detecting duplicated error class - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2023/01/31 11:14:29 UTC, 0 replies.
- [GitHub] [spark] pan3793 commented on pull request #25536: [SPARK-28837][SQL] CTAS/RTAS should use nullable schema - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2023/01/31 12:23:50 UTC, 2 replies.
- [GitHub] [spark] khalidmammadov opened a new pull request, #39823: [SPARK-42257][CORE] Remove unused variable external sorter - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/01/31 12:44:56 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #25536: [SPARK-28837][SQL] CTAS/RTAS should use nullable schema - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 13:07:13 UTC, 2 replies.
- [GitHub] [spark] jchen5 commented on pull request #39759: [SPARK-36124][SQL] Support subqueries with correlation through INTERSECT/EXCEPT - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2023/01/31 13:12:55 UTC, 1 replies.
- [GitHub] [spark] srowen closed pull request #39798: [MINOR] Fix typo `Exlude` to `Exclude` in `HealthTracker` - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/31 14:14:02 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39823: [SPARK-42257][CORE] Remove unused variable external sorter - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/31 14:19:42 UTC, 1 replies.
- [GitHub] [spark] srowen commented on pull request #39811: [SPARK-42242][BUILD] Upgrade `snappy-java` to 1.1.9.0 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/31 14:54:33 UTC, 0 replies.
- [GitHub] [spark] cloud-fan closed pull request #37525: [SPARK-40086][SPARK-42049][SQL] Improve AliasAwareOutputPartitioning and AliasAwareQueryOutputOrdering to take all aliases into account - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 15:08:33 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39820: [SPARK-42249][SQL] Refining html link for documentation in error messages. - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/31 15:12:11 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39805: [SPARK-42238][SQL] Introduce new error class: `INCOMPATIBLE_JOIN_TYPES` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/31 15:16:36 UTC, 0 replies.
- [GitHub] [spark] srowen commented on pull request #39816: [SPARK-42245][BUILD] Upgrade scalafmt from 3.6.1 to 3.7.1 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/31 15:32:19 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39816: [SPARK-42245][BUILD] Upgrade scalafmt from 3.6.1 to 3.7.1 - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/31 15:32:22 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #38823: [SPARK-41290][SQL] Support GENERATED ALWAYS AS expressions for columns in create/replace table statements - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 15:59:05 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39722: [SPARK-42162] Introduce MultiCommutativeOp expression as a memory optimization for canonicalizing large trees of commutative expressions - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 16:21:17 UTC, 2 replies.
- [GitHub] [spark] cloud-fan commented on pull request #38038: [SPARK-42136] Refactor BroadcastHashJoinExec output partitioning calculation - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 16:24:31 UTC, 0 replies.
- [GitHub] [spark] MaxGekk commented on pull request #39804: [SPARK-42236][SQL] Refine `NULLABLE_ARRAY_OR_MAP_ELEMENT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/31 16:35:30 UTC, 0 replies.
- [GitHub] [spark] MaxGekk closed pull request #39804: [SPARK-42236][SQL] Refine `NULLABLE_ARRAY_OR_MAP_ELEMENT` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/31 16:36:16 UTC, 0 replies.
- [GitHub] [spark] amogh-jahagirdar commented on pull request #39796: [SPARK-39800][SQL][WIP] DataSourceV2: View Support - posted by "amogh-jahagirdar (via GitHub)" <gi...@apache.org> on 2023/01/31 17:48:29 UTC, 0 replies.
- [GitHub] [spark] dongjoon-hyun commented on pull request #39124: [DON'T MERGE] Test build and test with hadoop 3.3.5-RC0 - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2023/01/31 17:50:27 UTC, 0 replies.
- [GitHub] [spark] cloud-fan opened a new pull request, #39824: [SPARK-42259][SQL] ResolveGroupingAnalytics should take care of Python UDAF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 17:51:17 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on pull request #39824: [SPARK-42259][SQL] ResolveGroupingAnalytics should take care of Python UDAF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 17:51:34 UTC, 0 replies.
- [GitHub] [spark] cloud-fan commented on a diff in pull request #39824: [SPARK-42259][SQL] ResolveGroupingAnalytics should take care of Python UDAF - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2023/01/31 17:52:25 UTC, 0 replies.
- [GitHub] [spark] jzhuge commented on a diff in pull request #39796: [SPARK-39800][SQL][WIP] DataSourceV2: View Support - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2023/01/31 17:52:25 UTC, 1 replies.
- [GitHub] [spark] MaxGekk commented on a diff in pull request #39799: [SPARK-42232][SQL] Rename error class: `UNSUPPORTED_FEATURE.JDBC_TRANSACTION` - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2023/01/31 18:17:07 UTC, 0 replies.
- [GitHub] [spark] sunchao commented on a diff in pull request #39687: [SPARK-41470][SQL] Relax constraints on Storage-Partitioned-Join should assume InternalRow implements equals and hashCode - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2023/01/31 18:32:54 UTC, 0 replies.
- [GitHub] [spark] holdenk opened a new pull request, #39825: [SPARK-42261][SPARK-42260][K8S] Log Allocation Stalls and Trigger Allocation event without blocking on snapshot - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2023/01/31 18:56:36 UTC, 0 replies.
- [GitHub] [spark] hvanhovell commented on pull request #39615: [SPARK-42093][SQL] Move JavaTypeInference to AgnosticEncoders - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2023/01/31 19:30:41 UTC, 0 replies.
- [GitHub] [spark] planga82 opened a new pull request, #39826: [SPARK-42262][SQL] Table schema changes via V2SessionCatalog with HiveExternalCatalog - posted by "planga82 (via GitHub)" <gi...@apache.org> on 2023/01/31 20:15:39 UTC, 0 replies.
- [GitHub] [spark] planga82 commented on pull request #39826: [SPARK-42262][SQL] Table schema changes via V2SessionCatalog with HiveExternalCatalog - posted by "planga82 (via GitHub)" <gi...@apache.org> on 2023/01/31 20:17:39 UTC, 0 replies.
- [GitHub] [spark] xkrogen commented on a diff in pull request #39826: [SPARK-42262][SQL] Table schema changes via V2SessionCatalog with HiveExternalCatalog - posted by "xkrogen (via GitHub)" <gi...@apache.org> on 2023/01/31 21:04:53 UTC, 1 replies.
- [GitHub] [spark] khalidmammadov commented on pull request #39823: [SPARK-42257][CORE] Remove unused variable external sorter - posted by "khalidmammadov (via GitHub)" <gi...@apache.org> on 2023/01/31 21:40:37 UTC, 0 replies.
- [GitHub] [spark] srielau commented on pull request #39773: [SPARK-42217][SQL] Support implicit lateral column alias in queries with Window - posted by "srielau (via GitHub)" <gi...@apache.org> on 2023/01/31 21:45:25 UTC, 0 replies.
- [GitHub] [spark] srowen closed pull request #39823: [SPARK-42257][CORE] Remove unused variable external sorter - posted by "srowen (via GitHub)" <gi...@apache.org> on 2023/01/31 21:55:46 UTC, 0 replies.