You are viewing a plain text version of this content. The canonical link for it is here.
- Re: [PR] [SPARK-46547][SS] Fix deadlock between maintenance thread and streaming aggregation operator [spark] - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2024/01/01 00:11:49 UTC, 14 replies.
- Re: [PR] WIP: refactor Window operator [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/01 00:21:29 UTC, 0 replies.
- Re: [PR] [Don't merge and review] investigate root cause sbt [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/01 00:21:30 UTC, 0 replies.
- Re: [PR] [SPARK-39800][SQL][WIP] DataSourceV2: View Support [spark] - posted by "jzhuge (via GitHub)" <gi...@apache.org> on 2024/01/01 00:50:07 UTC, 2 replies.
- Re: [PR] [SPARK-46408][SQL] Support date_sub on V2ExpressionBuilder [spark] - posted by "caicancai (via GitHub)" <gi...@apache.org> on 2024/01/01 08:46:43 UTC, 23 replies.
- Re: [PR] [SPARK-46550][BUILD][SQL] Upgrade `datasketches-java` to 5.0.0 [spark] - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2024/01/01 14:04:22 UTC, 2 replies.
- Re: [PR] [SPARK-23890][SQL] Support CHANGE COLUMN to add nested fields to structs [spark] - posted by "ottomata (via GitHub)" <gi...@apache.org> on 2024/01/01 17:55:53 UTC, 0 replies.
- Re: [PR] [WIP][SPARK-46549][INFRA] Cache the Python dependencies for SQL tests [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/02 01:00:18 UTC, 2 replies.
- Re: [PR] [SPARK-46504][PS][TESTS][FOLLOWUPS] Moving move slow tests out of `IndexesTests` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/02 01:23:25 UTC, 0 replies.
- [PR] [SPARK-46553][PS] `FutureWarning` for `interpolate` with object dtype [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/02 01:43:29 UTC, 2 replies.
- Re: [PR] [SPARK-46540][PYTHON] Respect column names when Python data source read function outputs named Row objects [spark] - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2024/01/02 01:47:04 UTC, 4 replies.
- Re: [PR] [SPARK-46551][PYTHON][DOCS] Refine docstring of `flatten/sequence/shuffle` [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/02 01:47:27 UTC, 2 replies.
- Re: [PR] [SPARK-46554][BUILD] Upgrade slf4j to 2.0.10 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/02 01:48:29 UTC, 2 replies.
- Re: [PR] [SPARK-46546][DOCS] Fix the formatting of tables in `running-on-kubernetes` and `running-on-yarn` pages [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/02 02:10:20 UTC, 3 replies.
- Re: [PR] [SPARK-46361][PYTHON][CORE] Spark dataset chunk read api [spark] - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2024/01/02 02:18:31 UTC, 3 replies.
- Re: [PR] [SPARK-46536][SQL] Support GROUP BY calendar_interval_type [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/02 02:56:08 UTC, 36 replies.
- Re: [PR] [Don't merge & review] upgrade sbt 1.9.8 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/02 03:21:13 UTC, 1 replies.
- Re: [PR] [SPARK-46543][PYTHON][CONNECT] Make `json_tuple` throw PySparkValueError for empty fields [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/02 03:44:03 UTC, 1 replies.
- Re: [PR] [SPARK-46541][SQL][CONNECT] Fix the ambiguous column reference in self join [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/02 04:05:48 UTC, 35 replies.
- [PR] [SPARK-46555][PYTHON][DOCS] Refine docstring for DataFrame.createTempView/createOrReplaceTempView [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/02 04:34:12 UTC, 2 replies.
- [PR] [SPARK-46556][PYTHON][DOCS] Refine docstring for DataFrame.createGlobalTempView/createOrReplaceGlobalTempView [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/02 04:54:24 UTC, 2 replies.
- [PR] [SPARK-46557][PYTHON][DOCS] Refine docstring for DataFrame.schema/explain/printSchema [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/02 05:22:50 UTC, 3 replies.
- [PR] [SPARK-46558][CONNECT] Extract a helper function to eliminate the duplicate code that retrieves `MessageParameters` from `ErrorParams` in `GrpcExceptionConverter` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/02 05:58:00 UTC, 4 replies.
- Re: [PR] [Don't Merge] upgrade sbt 1.9.8 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/02 06:03:48 UTC, 4 replies.
- Re: [PR] [SPARK-45352][SQL] Eliminate foldable window partitions [spark] - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2024/01/02 06:13:29 UTC, 9 replies.
- Re: [PR] [SPARK-46524][SQL] Improve error messages for invalid save mode [spark] - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2024/01/02 06:14:22 UTC, 3 replies.
- [PR] [SPARK-46559][MLLIB] Wrap the `export` in the package name with backticks [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/02 06:46:08 UTC, 4 replies.
- [PR] [SPARK-46561][CORE][WEBUI] Use `exists` instead of `filter + nonEmpty` to check `showResourceColumn` in `MasterPage.scala` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/02 07:00:05 UTC, 1 replies.
- Re: [PR] [SPARK-46561][UI] Use `exists` instead of `filter + nonEmpty` to check `showResourceColumn` in `MasterPage.scala` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/02 07:08:20 UTC, 4 replies.
- [PR] [SPARK-46562][SQL] Remove retrieval of `keytabFile` from `UserGroupInformation` in `HiveAuthFactory` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/02 07:36:02 UTC, 1 replies.
- Re: [PR] [SPARK-46400][CORE][SQL] When there are corrupted files in the local maven repo, skip this cache and try again [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/02 07:42:35 UTC, 5 replies.
- [PR] [SPARK-46563][SQL] View simpleString not follow conf spark.sql.debug.maxToStringFields [spark] - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2024/01/02 08:27:17 UTC, 5 replies.
- Re: [PR] [SPARK-46544][SQL] Support v2 DESCRIBE TABLE EXTENDED with table stats [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/02 08:27:36 UTC, 3 replies.
- [PR] [SPARK-46564][PYTHON][INFRA] Exclude unrelated files via using omit options properly in PySpark coverage report [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/02 08:44:26 UTC, 2 replies.
- [PR] [SPARK-46565][PYTHON] Refine error classes and error messages for Python data sources [spark] - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2024/01/02 08:47:00 UTC, 2 replies.
- [PR] [SPARK-46560][PYTHON][DOCS] Refine docstring `reverse/map_contains_key` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/02 09:19:42 UTC, 16 replies.
- Re: [PR] [SPIP-IN-PROGRESS][DO-NOT-MERGE][SS] Add base support for new arbitrary state management operator, single valueState type, multiple state variables and underlying support for column families for RocksDBStateStoreProvider with/without changelog checkpointing [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/01/02 09:29:51 UTC, 25 replies.
- Re: [PR] [SPARK-46525][DOCKER][TESTS] Fix docker-integration-tests on Apple Sillicon [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/02 09:42:08 UTC, 2 replies.
- [PR] [SPARK-46504][PS][TESTS][FOLLOWUPS] Moving more slow tests out of `IndexesTests` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/02 10:38:29 UTC, 5 replies.
- [PR] [SPARK-46567][CORE] Remove ThreadLocal for ReadAheadInputStream [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/02 11:03:08 UTC, 11 replies.
- Re: [PR] [SPARK-45527][CORE] Use fraction to do the resource calculation [spark] - posted by "wbo4958 (via GitHub)" <gi...@apache.org> on 2024/01/02 11:51:12 UTC, 11 replies.
- Re: [PR] [SPARK-46179][SQL] Add CrossDbmsQueryTestSuites, which allows generating golden files with other DBMS, starting off with Postgres [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/02 11:56:53 UTC, 3 replies.
- [PR] [SPARK-46568][PYTHON] Make Python data source options a case-insensitive dictionary [spark] - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2024/01/02 12:20:57 UTC, 6 replies.
- Re: [PR] [SPARK-46552][SQL] Replace `UnsupportedOperationException` by `SparkUnsupportedOperationException` in `catalyst` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/02 12:37:21 UTC, 2 replies.
- Re: [PR] [SPARK-23890][SQL] Support DDL for adding nested columns to struct types [spark] - posted by "ottomata (via GitHub)" <gi...@apache.org> on 2024/01/02 15:00:48 UTC, 2 replies.
- Re: [PR] [SPARK-46425][INFRA] Pin the bundler version in CI [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/02 16:35:14 UTC, 0 replies.
- Re: [PR] [SPARK-46478][SQL][3.5] Revert SPARK-43049 to use oracle varchar(255) for string [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/02 18:50:16 UTC, 1 replies.
- [PR] [SS][SPIP-IN-PROGRESS][DO-NOT-MERGE] Add support for event time timers for new transformWithState operator [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/01/02 18:54:57 UTC, 3 replies.
- Re: [PR] [SPARK-43299][FOLLOWUP][CONNECT][SS] Followup on StreamingQueryExceptions [spark] - posted by "heyihong (via GitHub)" <gi...@apache.org> on 2024/01/02 19:38:12 UTC, 3 replies.
- Re: [PR] [SPARK-42307][SQL][BUILD][DOCS] Adding in a better name for `_LEGACY_ERROR_TEMP_2232` [spark] - posted by "hannahkamundson (via GitHub)" <gi...@apache.org> on 2024/01/02 20:08:12 UTC, 5 replies.
- Re: [PR] [SPARK-44001][PROTOBUF] Add option to allow unwrapping protobuf well known wrapper types [spark] - posted by "justaparth (via GitHub)" <gi...@apache.org> on 2024/01/02 20:26:42 UTC, 2 replies.
- [PR] [SPARK-46570][INFRA] Run Python 3.11 and 3.12 tests independently [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/02 20:40:29 UTC, 3 replies.
- Re: [PR] [SPARK-46383] Reduce Driver Heap Usage by Reducing the Lifespan of `TaskInfo.accumulables()` [spark] - posted by "utkarsh39 (via GitHub)" <gi...@apache.org> on 2024/01/02 22:36:55 UTC, 12 replies.
- [PR] [SPARK-45338][TEST][SQL][FOLLOWUP] Use scala.jdk.javaapi.CollectionConverters in JavaHigherOrderFunctionsSuite [spark] - posted by "rednaxelafx (via GitHub)" <gi...@apache.org> on 2024/01/02 22:56:54 UTC, 4 replies.
- Re: [PR] [SPARK-46466][SQL] Vectorized parquet reader should never do rebase for timestamp ntz [spark] - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2024/01/02 23:28:27 UTC, 0 replies.
- [PR] [SPARK-46571][PS][TEST] Re-enable TODOs that are resolved from recent Pandas [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/03 01:28:23 UTC, 2 replies.
- Re: [PR] [SPARK-39142][SPARK-42235][PYTHON] Add overloads in pandas function stub file [spark] - posted by "dennisobrien (via GitHub)" <gi...@apache.org> on 2024/01/03 01:49:17 UTC, 0 replies.
- Re: [PR] [SPARK-46512][CORE] Optimize shuffle reading when both sort and combine are used. [spark] - posted by "Ngone51 (via GitHub)" <gi...@apache.org> on 2024/01/03 02:14:21 UTC, 10 replies.
- Re: [PR] [SPARK-46361][PYTHON][CORE] Spark dataset chunk read api (developer API) [spark] - posted by "WeichenXu123 (via GitHub)" <gi...@apache.org> on 2024/01/03 02:44:35 UTC, 5 replies.
- [PR] [SPARK-46573][K8S] Use `appId` instead of `conf.appId` in `LoggingPodStatusWatcherImpl` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/03 06:00:42 UTC, 3 replies.
- [PR] [SPARK-37039][PS] Fix `Series.astype` to work properly with missing value [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/03 06:10:41 UTC, 6 replies.
- [PR] [SPARK-46382][SQL] XML: Capture values interspersed between elements follow-up [spark] - posted by "shujingyang-db (via GitHub)" <gi...@apache.org> on 2024/01/03 08:01:31 UTC, 0 replies.
- Re: [PR] [SPARK-46546][DOCS] Fix the formatting of tables in `running-on-yarn` pages [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/03 09:04:23 UTC, 22 replies.
- Re: [PR] [SPARK-46522][PYTHON] Block Python data source registration with name conflicts [spark] - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2024/01/03 09:54:58 UTC, 7 replies.
- [PR] [SPARK-46566][SQL] Session level config was not loaded when isolation is enable. [spark] - posted by "zhengchenyu (via GitHub)" <gi...@apache.org> on 2024/01/03 09:57:59 UTC, 1 replies.
- [PR] [SPARK-21492][SQL][FOLLOWUP] Remove invalid attachCleanupResourceChecker in JoinSuite [spark] - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2024/01/03 11:15:37 UTC, 1 replies.
- [PR] [WIP][SQL] Redact JDBC url in errors and logs [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/03 11:17:01 UTC, 0 replies.
- [PR] [SPARK-46575][SQL][HIVE] Make HiveThriftServer2.startWithContext DevelopApi retriable and fix flakiness of ThriftServerWithSparkContextInHttpSuite [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/03 11:51:11 UTC, 20 replies.
- [PR] [SPARK-46576][SQL] Improve error messages for unsupported data source save mode [spark] - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2024/01/03 11:53:24 UTC, 3 replies.
- [PR] [SPARK-46569][SQL] Remove ThreadLocal due to SecureRandom is thread safe since JDK9 [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/03 12:18:41 UTC, 5 replies.
- [PR] [SPARK-46577][SQL] HiveMetastoreLazyInitializationSuite leaks hive's SessionState [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/03 12:44:55 UTC, 3 replies.
- [PR] [WIP][SPARK-46578][SQL] Remove ThreadLocal for fallbackConf [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/03 13:22:12 UTC, 1 replies.
- Re: [PR] [SPARK-46550][BUILD][SQL] Upgrade `datasketches-java` to 5.0.1 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/03 17:28:23 UTC, 1 replies.
- Re: [PR] [SPARK-46539][SQL] SELECT * EXCEPT(all fields from a struct) results in an assertion failure [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/03 18:52:14 UTC, 6 replies.
- [PR] [WIP][CONNECT] Handle a non-existent error class [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/03 19:19:06 UTC, 0 replies.
- Re: [PR] [SPARK-46579][SQL] Redact JDBC url in errors and logs [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/03 20:02:10 UTC, 0 replies.
- Re: [PR] [SPARK-46382][SQL] XML: Refactor the handling of values interspersed between elements [spark] - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2024/01/03 22:01:17 UTC, 16 replies.
- Re: [PR] [SPARK-44556][SQL] Reuse `OrcTail` when enable vectorizedReader [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/03 22:29:08 UTC, 1 replies.
- [PR] [SPARK-46580][TESTS] Regenerate benchmark results [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/03 22:44:14 UTC, 10 replies.
- [PR] [WIP][SPARK-46581][CORE] Rename isZero to isUpdated in AccumulatorV2 [spark] - posted by "davintjong-db (via GitHub)" <gi...@apache.org> on 2024/01/03 23:36:24 UTC, 3 replies.
- Re: [PR] [SPARK-45452][SQL] Improve `InMemoryFileIndex` to use `FileSystem.listFiles` API [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/04 00:08:50 UTC, 8 replies.
- Re: [PR] [WIP][SPARK-46179][SQL] Generate golden files with Postgres/other DBMS using Python [spark] - posted by "andylam-db (via GitHub)" <gi...@apache.org> on 2024/01/04 00:11:59 UTC, 0 replies.
- Re: [PR] [SPARK-44959][BUILD] Upgrade sbt to 1.9.5 [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/04 00:19:12 UTC, 1 replies.
- [PR] [WIP][R][INFRA] Upgrade R Tools version from 4.0.2 to 4.3.2 in AppVeyor [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/04 00:23:32 UTC, 0 replies.
- [PR] [SPARK-46539][SQL][FOLLOWUP] fix golden files [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/04 01:51:35 UTC, 3 replies.
- Re: [PR] [SPARK-46584][SQL][TESTS] Remove invalid attachCleanupResourceChecker in JoinSuite [spark] - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2024/01/04 01:59:48 UTC, 3 replies.
- Re: [PR] [SPARK-46582][R][INFRA] Upgrade R Tools version from 4.0.2 to 4.3.2 in AppVeyor [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/04 02:09:42 UTC, 8 replies.
- Re: [PR] [SPARK-46534][SQL][HIVE] Skip initializing an unused hive client in HiveThriftServer2.startWithContext DeveloperApi [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/04 02:38:44 UTC, 0 replies.
- [PR] [SPARK-46585][CORE] Directly constructed `metricPeaks` as an `immutable.ArraySeq` instead of use `mutable.ArraySeq.toSeq` in `Executor` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/04 02:42:05 UTC, 0 replies.
- Re: [PR] [SPARK-46179][SQL] Add CrossDbmsQueryTestSuites, which runs other DBMS against golden files with other DBMS, starting with Postgres [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/04 02:51:40 UTC, 6 replies.
- [PR] [SQL][SPARK-46587] XML: Fix XSD big integer conversion [spark] - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2024/01/04 03:23:23 UTC, 0 replies.
- [PR] [SPARK-46504][PS][TESTS][FOLLOWUPS] Break the remaining part of `IndexesTests` into small test files [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/04 03:46:16 UTC, 0 replies.
- [PR] [SPARK-46412] update java and jdk info [spark] - posted by "engrravijain (via GitHub)" <gi...@apache.org> on 2024/01/04 04:04:25 UTC, 1 replies.
- Re: [PR] [SPARK-46412][K8S][DOCS] Update Java and JDK info in K8S testing [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/04 04:53:21 UTC, 11 replies.
- Re: [PR] [SPARK-45292][SQL][HIVE] Remove Guava from shared classes from IsolatedClientLoader [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/04 05:28:32 UTC, 4 replies.
- [PR] [SPARK-46530][PYTHON][SQL][FOLLOW-UP] Uses path separator instead of file separator to correctly check PySpark library existence [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/04 05:42:51 UTC, 3 replies.
- Re: [PR] [SPARK-46504][PS][TESTS][FOLLOWUP] Break the remaining part of `IndexesTests` into small test files [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/04 06:33:59 UTC, 1 replies.
- Re: [PR] [SPARK-41954][CORE] Add isDecommissioned in ExecutorDeadException [spark] - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2024/01/04 07:21:10 UTC, 0 replies.
- Re: [PR] [SPARK-46587][SQL] XML: Fix XSD big integer conversion [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/04 07:42:28 UTC, 1 replies.
- Re: [PR] [SPARK-46589][CONNECT] Handle a non-existent error class [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/04 07:46:37 UTC, 7 replies.
- [PR] [SPARK-46586][SQL] Support `s.c.immutable.ArraySeq` as `customCollectionCls` in `MapObjects` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/04 08:03:22 UTC, 15 replies.
- Re: [PR] [SPARK-46585][CORE] Directly constructed `metricPeaks` as an `immutable.ArraySeq` instead of use `mutable.ArraySeq.toSeq` in `Executor` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/04 08:07:01 UTC, 1 replies.
- Re: [PR] Collations proof of concept [spark] - posted by "milastdbx (via GitHub)" <gi...@apache.org> on 2024/01/04 09:24:03 UTC, 5 replies.
- Re: [PR] [SPARK-40876][SQL] Widening type promotion for decimals with larger scale in Parquet readers [spark] - posted by "johanl-db (via GitHub)" <gi...@apache.org> on 2024/01/04 09:30:00 UTC, 6 replies.
- [PR] [SPARK-46593][PS][TESTS] Refactor `data_type_ops` tests [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/04 10:22:46 UTC, 4 replies.
- [PR] [SPARK-46594][DOCS] The description of `spark.yarn.shuffle.server.recovery.disabled` should be in chapter `Configuring the External Shuffle Service` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/04 11:10:47 UTC, 0 replies.
- [PR] [SPARK-46592][DOCKER][TEST] OracleIntegrationSuite is flaky because of ORA-04021 [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/04 11:22:36 UTC, 3 replies.
- [PR] [SPARK-46595][PYTHON][DOCS] Refine docstring of `map_from_arrays/map_from_entries/map_concat` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/04 12:31:21 UTC, 3 replies.
- [PR] [SPARK-46596][CORE][TESTS] Correct package name of `SslTestUtils` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/04 12:34:49 UTC, 6 replies.
- [PR] [SPARK-46597][TESTS] Optimize the run tests command in the doc of XXIntegrationSuite in `docker-integration-tests` module [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/04 12:49:49 UTC, 0 replies.
- Re: [PR] [SPARK-46442][SQL] DS V2 supports push down PERCENTILE_CONT and PERCENTILE_DISC [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/04 13:23:08 UTC, 10 replies.
- [PR] [SPARK-46598][SQL] OrcColumnarBatchReader should respect the memory mode when creating c… [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/04 14:19:34 UTC, 0 replies.
- Re: [PR] [SPARK-46598][SQL] OrcColumnarBatchReader should respect the memory mode when creating column vectors for the missing column [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/04 14:20:59 UTC, 9 replies.
- Re: [PR] [SPARK-46597][SQL][TESTS] Optimize the run tests command in the doc of XXIntegrationSuite in `docker-integration-tests` module [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/04 18:45:33 UTC, 2 replies.
- [PR] Generate subqueries for correctness testing [spark] - posted by "andylam-db (via GitHub)" <gi...@apache.org> on 2024/01/04 21:28:37 UTC, 0 replies.
- [PR] [DRAFT][DO NOT MERGE] MapState - StateStore.iterator() not working [spark] - posted by "jingz-db (via GitHub)" <gi...@apache.org> on 2024/01/04 22:44:28 UTC, 1 replies.
- [PR] [SPARK-46599][SQL] XML: Use TypeCoercion.findTightestCommonType for compatibility check [spark] - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2024/01/05 01:38:15 UTC, 4 replies.
- Re: [PR] [WIP][SPARK-45720] Upgrade AWS SDK to v2 for Spark Kinesis connector module [spark] - posted by "junyuc25 (via GitHub)" <gi...@apache.org> on 2024/01/05 02:35:34 UTC, 5 replies.
- [PR] [SPARK-46600][SQL] Move shared code between SqlConf and SqlApiConf to SqlApiConfHelper [spark] - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2024/01/05 03:11:16 UTC, 15 replies.
- Re: [PR] [SPARK-46581][CORE] Update comment on isZero in AccumulatorV2 [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/05 03:36:30 UTC, 3 replies.
- [PR] [WIP][SQL] Propagate `allowExisting` in view creation when the view/table does not exists [spark] - posted by "anchovYu (via GitHub)" <gi...@apache.org> on 2024/01/05 04:07:48 UTC, 0 replies.
- Re: [PR] [SPARK-46602][SQL] Propagate `allowExisting` in view creation when the view/table does not exists [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/05 05:48:10 UTC, 3 replies.
- Re: [PR] [SPARK-46598][SQL] OrcColumnarBatchReader should should use ConstantColumnVector for missing columns [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/05 05:52:42 UTC, 0 replies.
- [PR] [SPARK-46603][PYTHON][DOCS] Refine docstring of `parse_url/url_encode/url_decode` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/05 05:59:17 UTC, 4 replies.
- [PR] Revert "[SPARK-46593][PS][TESTS] Refactor `data_type_ops` tests" [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/05 06:05:43 UTC, 2 replies.
- [PR] [SPARK-46601] Fix log error in handleStatusMessage [spark] - posted by "jiaoqingbo (via GitHub)" <gi...@apache.org> on 2024/01/05 06:22:14 UTC, 0 replies.
- [PR] [SPARK-46604][SQL] Make `Literal.apply` support `s.c.immuable.ArraySeq` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/05 06:25:45 UTC, 3 replies.
- [PR] [SPARK-46605][CONNECT] Make `lit/typedLit` function in connect module support `s.c.immutable.ArraySeq` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/05 06:28:08 UTC, 2 replies.
- Re: [PR] [SPARK-46601] [CORE] Fix log error in handleStatusMessage [spark] - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2024/01/05 06:53:19 UTC, 3 replies.
- [PR] [SPARK-46592][DOCKER][TESTS][FOLLOWUP] Add newline to ORACLE Docker preparation sql [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/05 07:14:58 UTC, 3 replies.
- [PR] [SPARK-46606][PYTHON][DOCS] Refine docstring of `convert_timezone/make_dt_interval/make_interval` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/05 09:22:39 UTC, 13 replies.
- [PR] [SPARK-46607][PYTHON][TESTS] Check the testing mode [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/05 09:52:02 UTC, 7 replies.
- [PR] [SPARK-46525][DOCKER][TESTS][FOLLOWUP] Fix docker-integration-tests on Apple Silicon for db2 and oracle with colima docker environment [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/05 12:20:09 UTC, 1 replies.
- [PR] [WIP][CORE] Remove ThreadLocal by replace SimpleDateFormat with DateTimeFormatter [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/05 12:32:42 UTC, 5 replies.
- [PR] [SPARK-46609][SQL] Avoid exponential explosion in PartitioningPreservingUnaryExecNode [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/05 16:36:43 UTC, 4 replies.
- Re: [PR] [SPARK-46515] Add MONTHNAME function [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/05 20:20:23 UTC, 3 replies.
- Re: [PR] [SPARK-43152][SS] Parametrisable output metadata path (_spark_metadata) [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/06 00:19:40 UTC, 1 replies.
- Re: [PR] [SPARK-46382][SQL] XML: Refactor inference and parsing [spark] - posted by "shujingyang-db (via GitHub)" <gi...@apache.org> on 2024/01/06 04:59:07 UTC, 4 replies.
- [PR] [SPARK-46610][SQL] Create table should throw exception when no value for a key in options [spark] - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2024/01/06 08:27:21 UTC, 5 replies.
- Re: [PR] [SPARK-46611][CORE] Remove ThreadLocal by replace SimpleDateFormat with DateTimeFormatter [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/07 07:16:45 UTC, 40 replies.
- Re: [PR] [SPARK-46248][SQL] XML: Support for ignoreCorruptFiles and ignoreMissingFiles options [spark] - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2024/01/07 08:34:14 UTC, 2 replies.
- Re: [PR] [SPARK-42332][SQL][DOCS] Changing the require to a SparkException in ComplexTypeMergingExpression [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/07 10:17:37 UTC, 8 replies.
- [PR] [WIP][CORE] Replace SimpleDateFormat with DateTimeFormatter [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/07 13:07:31 UTC, 1 replies.
- Re: [PR] Do not convert array type string retrieved from jdbc driver [spark] - posted by "phanhuyn (via GitHub)" <gi...@apache.org> on 2024/01/07 23:01:33 UTC, 0 replies.
- [PR] [SPARK-46613][SQL][PYTHON] Log full exception when failed to lookup Python Data Sources [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/08 00:01:53 UTC, 3 replies.
- Re: [PR] [SPARK-45372][CORE] Handle ClassNotFoundException when load extension [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/08 00:19:59 UTC, 1 replies.
- Re: [PR] [SPARK-38488][INFRA] Upgrade ffi to 1.15.5 with --enable-libffi-alloc to fix doc build error on MacOS M1 [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/08 02:55:51 UTC, 0 replies.
- Re: [PR] [SPARK-46608][SQL] Restore backward compatibility of `JdbcDialect.classifyException` [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/08 04:28:38 UTC, 7 replies.
- Re: [PR] [SPARK-46437][DOCS] Remove cruft from the built-in SQL functions documentation [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/08 04:42:35 UTC, 9 replies.
- [PR] [SPARK-46615][CONNECT] Support s.c.immutable.ArraySeq in ArrowDeserializers [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/08 06:19:41 UTC, 6 replies.
- [PR] [SPARK-46611][CORE][FOLLOWUP] Correct new use of DateTimeFormat.withZone in UIUtils [spark] - posted by "srowen (via GitHub)" <gi...@apache.org> on 2024/01/08 08:26:47 UTC, 3 replies.
- [PR] [SPARK-46618][SQL] Improve error messages for DATA_SOURCE_NOT_FOUND error [spark] - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2024/01/08 08:56:05 UTC, 4 replies.
- Re: [PR] [SPARK-46331][SQL] Removing CodegenFallback from subset of DateTime expressions and version() expression [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/08 09:14:39 UTC, 3 replies.
- [PR] [WIP][SPARK-46620][PS][CONNECT] Implement `Frame.asfreq` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/08 09:26:26 UTC, 0 replies.
- [PR] [SPARK-46617][SQL] Create-table-if-not-exists should not silently overwrite existing tables [spark] - posted by "adrians (via GitHub)" <gi...@apache.org> on 2024/01/08 09:59:56 UTC, 2 replies.
- [PR] [SPARK-46621][PYTHON] Address null from Exception.getMessage in Py4J captured exception [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/08 10:10:02 UTC, 7 replies.
- [PR] [SPARK-46622][CORE] Override `toString` method for `o.a.s.network.shuffledb.StoreVersion` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/08 11:47:17 UTC, 3 replies.
- [PR] [MINOR][CONNECT] Fix typo in error handling opType name [spark] - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2024/01/08 12:12:44 UTC, 2 replies.
- Re: [PR] [Don't merge & review] Upgrade rocksdbjni 20231121 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/08 13:22:02 UTC, 2 replies.
- [PR] [SPARK-46624][BUILD] Upgrade joda-time to 2.12.6 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/08 13:30:28 UTC, 2 replies.
- Re: [PR] [SPARK-46623][CORE][MLLIB][SQL] Replace SimpleDateFormat with DateTimeFormatter [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/08 13:45:26 UTC, 10 replies.
- [PR] Ensure that docs build successfully with SKIP_API=1 [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/08 15:01:23 UTC, 0 replies.
- Re: [PR] [MINOR][INFRA] Ensure that docs build successfully with SKIP_API=1 [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/08 17:57:21 UTC, 7 replies.
- [PR] [SPARK-46626][DOCS] Bump jekyll to 4.3.3 to enable support for Ruby 3.3.0 [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/08 18:56:58 UTC, 5 replies.
- [PR] [SPARK-46382][SQL]XML: Default ignoreSurroundingSpaces to true [spark] - posted by "shujingyang-db (via GitHub)" <gi...@apache.org> on 2024/01/08 21:54:56 UTC, 2 replies.
- [PR] [SPARK-46437][DOCS] Add custom tag for conditional Jekyll includes [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/08 22:05:20 UTC, 0 replies.
- [PR] [SPARK-46628][INFRA] Use SPDX short identifier in `license` name [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/08 23:12:19 UTC, 3 replies.
- Re: [PR] [SPARK-45959][SQL] Improving performance when addition of 1 column at a time causes increase in the LogicalPlan tree depth [spark] - posted by "ahshahid (via GitHub)" <gi...@apache.org> on 2024/01/08 23:29:22 UTC, 0 replies.
- Re: [PR] [ML] Remove usage of RDD APIs for load/save in spark-ml [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/09 00:19:48 UTC, 1 replies.
- Re: [PR] [SPARK-46525][DOCKER][TESTS][FOLLOWUP] Fix docker-integration-tests on Apple Silicon for db2 and oracle with third-party docker environments [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/09 01:53:13 UTC, 8 replies.
- Re: [PR] [SPARK-46474][INFRA] Upgrade upload-artifact action to v4 [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/09 03:25:21 UTC, 24 replies.
- [PR] [SPARK-40876][SQL][TESTS][FOLLOW-UP] Remove invalid decimal test case when ANSI mode is on [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/09 03:45:25 UTC, 3 replies.
- [PR] [SPARK-46627][SS][UI] Fix timeline tooltip content on streaming ui [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/09 03:48:48 UTC, 4 replies.
- [PR] [SPARK-46630][SQL] XML: Validate XML element name on write [spark] - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2024/01/09 04:54:51 UTC, 6 replies.
- [PR] [SPARK-46633][SQL] Fix Avro reader to handle zero-length blocks [spark] - posted by "sadikovi (via GitHub)" <gi...@apache.org> on 2024/01/09 04:56:07 UTC, 3 replies.
- [PR] [SPARK-XXXX][SS] Add maxBytesPerTrigger threshold [spark] - posted by "MaxNevermind (via GitHub)" <gi...@apache.org> on 2024/01/09 06:34:12 UTC, 7 replies.
- [PR] [SPARK-46593][PS][TESTS] Refactor `data_type_ops` tests again [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/09 07:33:18 UTC, 2 replies.
- [PR] [WIP][CORE][SQL][YARN] Replace AtomicBoolean with volatile boolean [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/09 07:44:20 UTC, 0 replies.
- Re: [PR] [SPARK-46052][CORE] Remove function TaskScheduler.killAllTaskAttempts [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/09 07:48:13 UTC, 10 replies.
- Re: [PR] [WIP][SPARK-37039][PS] Fix `Series.astype` to work properly with missing value [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/09 08:28:36 UTC, 0 replies.
- [PR] [SPARK-46635][PYTHON][DOCS] Refine docstring of `from_csv/schema_of_csv/to_csv` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/09 11:49:20 UTC, 7 replies.
- [PR] [SPARK-46634][SQL] literal validation should not drill down to null fields [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/09 11:54:58 UTC, 3 replies.
- [PR] [MINOR][PYTHON][TESTS] Retry `test_map_in_pandas_with_column_vector` and its parity test [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/09 13:14:23 UTC, 0 replies.
- [PR] [SPARK-46637][Doc] Enhancing the Visual Appeal of Spark doc website [spark] - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2024/01/09 20:03:15 UTC, 3 replies.
- Re: [PR] [SPARK-46437][DOCS] Add custom tags for conditional Jekyll includes [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/09 20:19:14 UTC, 2 replies.
- Re: [PR] [SPARK-45593][BUILD] Building a runnable distribution from master code running spark-sql raise error [spark] - posted by "c3-ffomenko (via GitHub)" <gi...@apache.org> on 2024/01/09 21:32:20 UTC, 7 replies.
- [PR] [WIP] retain empty message field in protobuf connector [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/01/09 22:10:40 UTC, 0 replies.
- [PR] [SPARK-46629] Fix for STRUCT type in SHOW CREATE TABLE command [spark] - posted by "vitaliili-db (via GitHub)" <gi...@apache.org> on 2024/01/09 23:00:14 UTC, 6 replies.
- [PR] [SPARK-46640][SQL] Fix RemoveRedundantAlias by excluding subquery attributes [spark] - posted by "nikhilsheoran-db (via GitHub)" <gi...@apache.org> on 2024/01/09 23:07:52 UTC, 7 replies.
- [PR] [SPARK-46639][SQL] Add WindowExec SQLMetrics [spark] - posted by "erenavsarogullari (via GitHub)" <gi...@apache.org> on 2024/01/09 23:19:48 UTC, 2 replies.
- Re: [PR] [SPARK-45134][SHUFFLE] Avoid repeated fallback when failed to fetch remote push-merged block meta [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/10 00:19:20 UTC, 1 replies.
- Re: [PR] [SPARK-46637][DOCS] Enhancing the Visual Appeal of Spark doc website [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/10 00:32:21 UTC, 2 replies.
- Re: [PR] [MINOR][PYTHON][TESTS] Retry `test_map_in_pandas_with_column_vector` [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/10 00:38:39 UTC, 1 replies.
- [PR] [MINOR] Add license header at docs/_plugins [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/10 00:43:22 UTC, 0 replies.
- Re: [PR] [SPARK-46641][SS] Add maxBytesPerTrigger threshold [spark] - posted by "MaxNevermind (via GitHub)" <gi...@apache.org> on 2024/01/10 00:53:21 UTC, 56 replies.
- [PR] [SPARK-46643][SQL][TESTS] Fix ORC tests to be independent from default compression [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/10 01:39:07 UTC, 3 replies.
- Re: [PR] [MINOR][DOCS] Add license header at docs/_plugins [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/10 01:41:37 UTC, 3 replies.
- [PR] [SPARK-46644] Fix add and merge in SQLMetric [spark] - posted by "davintjong-db (via GitHub)" <gi...@apache.org> on 2024/01/10 01:55:44 UTC, 0 replies.
- [PR] Spark 46474 followup [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/10 02:03:33 UTC, 1 replies.
- [PR] [SPARK-46646][SQL][TESTS] Improve `TPCDSQueryBenchmark` to support other file formats [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/10 02:32:56 UTC, 5 replies.
- [PR] [SPARK-46645][INFRA] Exclude unittest-xml-reporting in Python 3.12 image [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/10 02:34:51 UTC, 2 replies.
- [PR] try to fix upload [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/10 02:38:00 UTC, 1 replies.
- [PR] [SPARK-46648][SQL] Use `zstd` as the default value of `spark.sql.orc.compression.codec` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/10 02:58:57 UTC, 0 replies.
- Re: [PR] [SPARK-46648][SQL] Use `zstd` as the default ORC compression [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/10 03:06:06 UTC, 4 replies.
- [PR] [SPARK-46649][PYTHON][INFRA] Run PyPy 3 and Python 3.10 tests independently [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/10 03:22:08 UTC, 2 replies.
- Re: [PR] [MINOR][DOCS] Correct the usage example of Dataset in Java. [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/10 03:39:01 UTC, 1 replies.
- Re: [PR] [MINOR][DOCS] Correct the usage example of Dataset in Java [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/10 03:57:28 UTC, 1 replies.
- Re: [PR] [SPARK-46650][CORE][SQL][YARN] Replace AtomicBoolean with volatile boolean [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/10 03:58:55 UTC, 3 replies.
- [PR] [SPARK-46651][PS][TESTS] Split `FrameTakeTests` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/10 04:02:46 UTC, 2 replies.
- [PR] [SPARK-46652][SQL][TESTS] Remove `Snappy` from `TPCDSQueryBenchmark` benchmark case name [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/10 04:40:57 UTC, 4 replies.
- [PR] [SPARK-46642][PYTHON] Add `getMessageTemplate` to PySpark error framework [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/10 04:42:22 UTC, 1 replies.
- [PR] [WIP][SQL][CONNECT] Resolve inappropriate use of AtomicInteger [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/10 04:43:42 UTC, 1 replies.
- [PR] [SPARK-46653][SQL] Code-gen for full outer sort merge join output line by line [spark] - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2024/01/10 05:09:37 UTC, 3 replies.
- [PR] [SPARK-46950][SQL] Fix coalesce failed with BroadcastJoin and Union [spark] - posted by "jackylee-ch (via GitHub)" <gi...@apache.org> on 2024/01/10 05:22:15 UTC, 0 replies.
- Re: [PR] [SPARK-46644] Change add and merge in SQLMetric to use isZero [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/10 05:54:20 UTC, 7 replies.
- Re: [PR] [SPARK-46590][SQL] Fix coalesce failed with BroadcastJoin and Union [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/10 06:01:48 UTC, 29 replies.
- Re: [PR] [SPARK-46655][SQL] Skip query context catching in `DataFrame` methods [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/10 06:35:27 UTC, 15 replies.
- Re: [PR] [SPARK-46547][SS] Swallow non-fatal exception in maintenance task to avoid deadlock between maintenance thread and streaming aggregation operator [spark] - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2024/01/10 06:41:45 UTC, 6 replies.
- [PR] Spark 46474 fix [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/10 06:49:34 UTC, 0 replies.
- [PR] [SPARK-46437][FOLLOWUP] Update configuration.md to use include_api_gen [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/10 07:17:37 UTC, 2 replies.
- Re: [PR] [SPARK-19335][SPARK-38200][SQL] Add upserts for writing to JDBC [spark] - posted by "jatin5251 (via GitHub)" <gi...@apache.org> on 2024/01/10 09:55:53 UTC, 0 replies.
- [PR] [SPARK-46656][PS][TESTS] Split `GroupbyParitySplitApplyTests` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/10 11:14:07 UTC, 2 replies.
- [PR] [SPARK-46654][SQL] df.show() of pyspark displayed different results between Regular Spark and Spark Connect [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/10 13:17:00 UTC, 0 replies.
- Re: [PR] [SPARK-45022][SQL] Provide context for dataset API errors [spark] - posted by "ryan-johnson-databricks (via GitHub)" <gi...@apache.org> on 2024/01/10 15:26:29 UTC, 3 replies.
- [PR] [SPARK-46657][INFRA] Install `lxml` in Python 3.12 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/10 17:22:14 UTC, 3 replies.
- Re: [PR] [SPARK-42199][SQL] Fix issues around Dataset.groupByKey [spark] - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2024/01/10 19:00:14 UTC, 15 replies.
- [PR] [SPARK-46658][DOCS] Loosen Ruby dependency specification [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/10 19:14:19 UTC, 3 replies.
- [PR] Disable memory profiler for iterator UDFs [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/01/10 19:15:41 UTC, 0 replies.
- [PR] while talking with mohan [spark] - posted by "ramraghu474 (via GitHub)" <gi...@apache.org> on 2024/01/10 19:35:40 UTC, 1 replies.
- [PR] [SPARK-46660][CONNECT] ReattachExecute requests updates aliveness of SessionHolder [spark] - posted by "vicennial (via GitHub)" <gi...@apache.org> on 2024/01/10 20:20:10 UTC, 9 replies.
- [PR] [SPARK-46382][SQL] XML: Update doc for `ignoreSurroundingSpaces` [spark] - posted by "shujingyang-db (via GitHub)" <gi...@apache.org> on 2024/01/10 20:55:46 UTC, 2 replies.
- [PR] [SPARK-46662][K8S] Upgrade `kubernetes-client` to 6.10.0 [spark] - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2024/01/10 21:10:27 UTC, 0 replies.
- [PR] [SPARK-46664][CORE] Improve `Master` to recover quickly in case of zero workers and apps [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/10 22:20:53 UTC, 6 replies.
- [PR] [WIP][DO-NOT-MERGE] Upgrade rocksjni to 8.9.1 [spark] - posted by "neilramaswamy (via GitHub)" <gi...@apache.org> on 2024/01/10 23:36:08 UTC, 2 replies.
- [PR] [SPARK-46665][PYTHON] Remove Pandas dependency for `pyspark.testing` [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/10 23:58:26 UTC, 7 replies.
- [PR] [SPARK-46666][PYTHON][TESTS] Make lxml as an optional testing dependency in test_session [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/11 00:04:33 UTC, 3 replies.
- [PR] [WIP] Multiple input stream test [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/01/11 00:27:54 UTC, 1 replies.
- Re: [PR] [SPARK-46662][K8S][BUILD] Upgrade `kubernetes-client` to 6.10.0 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/11 00:50:02 UTC, 0 replies.
- [PR] [SPARK-46638][Python] Create Python UDTF API to acquire execution memory for 'eval' and 'terminate' methods [spark] - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2024/01/11 01:07:17 UTC, 1 replies.
- Re: [PR] [SPARK-46654][SQL] Make to_csv can correctly display complex types data [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/11 02:11:05 UTC, 7 replies.
- [PR] [SPARK-46614][PYTHON][DOCS] Refine docstring `make_timestamp/make_timestamp_ltz/make_timestamp_ntz/make_ym_interval` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/11 03:16:15 UTC, 2 replies.
- [PR] [SPARK-46668][DOCS] Parallelize Sphinx build of Python API docs [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/11 04:15:23 UTC, 3 replies.
- [PR] [SPARK-46670][PYTHON][SQL] Make DataSourceManager isolated and self clone-able [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/11 04:55:12 UTC, 2 replies.
- Re: [PR] [SPARK-46395][DOCS] Assign Spark configs to groups for use in documentation [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/11 05:21:53 UTC, 8 replies.
- Re: [PR] [SPARK-46670][PYTHON][SQL] Make DataSourceManager self clone-able by separating static and runtime Python Data Sources [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/11 05:24:11 UTC, 6 replies.
- [PR] [SPARK-46672][BUILD] Upgrade log4j2 to 2.22.1 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/11 05:42:11 UTC, 2 replies.
- [PR] Test commons-pool2 2.12.0 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/11 05:50:21 UTC, 0 replies.
- [PR] [SPARK-37039][PS][FOLLOWUP] Add migration guide for behavior change [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/11 05:52:12 UTC, 2 replies.
- [PR] [SPARK-46667][SQL] XML: Throw error on multiple XML data source [spark] - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2024/01/11 06:13:01 UTC, 6 replies.
- [PR] [SPARK-46675][SQL] Remove unused inferTimestampNTZ in ParquetReadSupport [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/01/11 08:09:53 UTC, 2 replies.
- Re: [PR] [SPARK-46638][Python] Create Python UDTF API to acquire execution memory for function evaluation [spark] - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2024/01/11 08:41:10 UTC, 19 replies.
- [PR] [MINOR][TEST] Set `datanucleus.autoStartMechanismMode=ignored` to clean the wall of noisy logs [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/11 09:08:25 UTC, 0 replies.
- [PR] [SPARK-46676][SS] dropDuplicatesWithinWatermark should not fail on canonicalization of the plan [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/01/11 09:17:10 UTC, 26 replies.
- [PR] [SPARK-46677][SQL][CONNECT] Fix `df.col("*")` resolution [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/11 10:36:48 UTC, 2 replies.
- Re: [PR] [SPARK-46678][TEST] Set `datanucleus.autoStartMechanismMode=ignored` to clean the wall of noisy logs [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/11 10:37:29 UTC, 2 replies.
- Re: [PR] [SPARK-46677][SQL][CONNECT] Fix `dataframe["*"]` resolution [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/11 10:44:37 UTC, 14 replies.
- Re: [PR] [SPARK-46680][BUILD] Upgrade Apache commons-pool2 to 2.12.0 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/11 11:58:25 UTC, 1 replies.
- [PR] [SPARK-46681][CORE] Refactor `ExecutorFailureTracker#maxNumExecutorFailures` to avoid unnecessary computations when `MAX_EXECUTOR_FAILURES` is configured [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/11 12:16:22 UTC, 0 replies.
- [PR] [SPARK-46368][CORE] Support `readyz` API [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/11 13:06:37 UTC, 0 replies.
- [PR] [SPARK-46664][CORE][TESTS][FOLLOWUP] Add a persistent `DriverInfo` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/11 13:37:42 UTC, 2 replies.
- Re: [PR] [SPARK-46485][SQL] V1Write should not add Sort when not needed [spark] - posted by "sweetpythoncode (via GitHub)" <gi...@apache.org> on 2024/01/11 15:18:39 UTC, 0 replies.
- Re: [PR] [SPARK-46368][CORE] Support `readyz` in REST Submission API [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/11 16:48:37 UTC, 9 replies.
- Re: [PR] [SPARK-46663][PYTHON] Disable memory profiler for pandas UDFs with iterators [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/01/11 19:27:37 UTC, 2 replies.
- [PR] [SPARK-46682][BUILD] Upgrade `curator` to 5.6.0 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/11 21:11:15 UTC, 4 replies.
- [PR] [SPARK-46684][PYTHON][CONNECT] Fix CoGroup.applyInPandas/Arrow to pass arguments properly [spark] - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2024/01/11 22:25:33 UTC, 2 replies.
- Re: [PR] [SPARK-46683] Write a subquery generator that generates subqueries of different variations to increase testing coverage [spark] - posted by "andylam-db (via GitHub)" <gi...@apache.org> on 2024/01/11 22:37:25 UTC, 0 replies.
- [PR] [SPARK-46684][PYTHON][CONNECT][3.5] Fix CoGroup.applyInPandas/Arrow to pass arguments properly [spark] - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2024/01/11 22:51:33 UTC, 2 replies.
- Re: [PR] [PROTOBUF] retain empty message field in protobuf connector [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/01/11 23:42:54 UTC, 0 replies.
- [PR] [SPARK-46686][PYTHON][CONNECT] Basic support of SparkSession based Python UDF profiler [spark] - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2024/01/12 01:03:56 UTC, 26 replies.
- [PR] [SPARK-46692][BUILD][PYSPARK] Fix potential issues with environment variable transmission `PYTHON_TO_TEST` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/12 02:31:09 UTC, 1 replies.
- Re: [PR] [SPARK-46692][BUILD] Fix potential issues with environment variable transmission `PYTHON_TO_TEST` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/12 03:05:14 UTC, 2 replies.
- Re: [PR] [SPARK-25895][Core]Adding testcase to compare Lz4 and Zstd compression [spark] - posted by "hackeryang (via GitHub)" <gi...@apache.org> on 2024/01/12 04:00:44 UTC, 0 replies.
- Re: [PR] [SPARK-46612][SQL] Do not convert array type string retrieved from jdbc driver [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/12 04:08:37 UTC, 7 replies.
- [PR] [SPARK-46693][SQL] Inject LocalLimitExec when matching OffsetAndLimit or LimitAndOffset [spark] - posted by "n-young-db (via GitHub)" <gi...@apache.org> on 2024/01/12 04:15:56 UTC, 12 replies.
- [PR] [SPARK-46694][SQL][TESTS] Drop the assumptions of 'hive version < 2.0' in Hive version related tests [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/12 04:40:01 UTC, 5 replies.
- [PR] [WIP][CORE] Replace Timer with single thread scheduled executor for ConsoleProgressBar. [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/12 04:45:39 UTC, 0 replies.
- Re: [PR] [SPARK-45592][SQL] Correctness issue in AQE with InMemoryTableScanExec [spark] - posted by "JJACOB0806 (via GitHub)" <gi...@apache.org> on 2024/01/12 05:33:13 UTC, 1 replies.
- [PR] [SPARK-46586][SQL][TESTS][FOLLOWUP] Replace `appended` with `:+` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/12 05:43:30 UTC, 6 replies.
- [PR] [MINOR][K8S] Remove unused local `val outstanding` from `ExecutorPodsAllocator#onNewSnapshots` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/12 06:15:40 UTC, 0 replies.
- [PR] [SPARK-46695][SQL][HIVE] Always setting hive.execution.engine to mr [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/01/12 06:17:19 UTC, 2 replies.
- [PR] [SPARK-46696][CORE] In ResourceProfileManager, function calls should occur after variable declarations [spark] - posted by "lyy-pineapple (via GitHub)" <gi...@apache.org> on 2024/01/12 06:44:21 UTC, 5 replies.
- Re: [PR] [MINOR][K8S] Remove unused local val `outstanding` from `ExecutorPodsAllocator#onNewSnapshots` [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/12 07:29:48 UTC, 2 replies.
- [PR] test [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/12 07:49:27 UTC, 0 replies.
- [PR] [SPARK-46697][SQL][HIVE] Eliminate Hive configuration time variables conversion [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/01/12 08:25:17 UTC, 6 replies.
- Re: [PR] test python test pip list [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/12 09:08:52 UTC, 0 replies.
- Re: [PR] [SPARK-46698][CORE] Replace Timer with single thread scheduled executor for ConsoleProgressBar. [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/12 09:45:05 UTC, 4 replies.
- Re: [PR] [SPARK-46665][PYTHON] Remove `assertPandasOnSparkEqual` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/12 09:58:09 UTC, 1 replies.
- Re: [PR] [SPARK-46094] Support Executor JVM Profiling [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/12 10:13:33 UTC, 11 replies.
- Re: [PR] [SPARK-46692][BUILD] Fix potential environmental variable transfer issue in `pyspark job GA` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/12 10:31:27 UTC, 0 replies.
- Re: [PR] [SPARK-46692][BUILD] Load `inputs.envs` always in `Python` and `Upload`-related steps [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/12 10:52:29 UTC, 12 replies.
- Re: [PR] [SPARK-46681][CORE] Refactor `ExecutorFailureTracker#maxNumExecutorFailures` to avoid calculating `defaultMaxNumExecutorFailures` when `MAX_EXECUTOR_FAILURES` is configured [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/12 10:56:14 UTC, 1 replies.
- Re: [PR] [SPARK-27950][DSTREAMS][Kinesis] dynamoDBEndpointUrl and cloudWatchMetricsLevel for Kinesis [spark] - posted by "anatol-ju (via GitHub)" <gi...@apache.org> on 2024/01/12 11:26:06 UTC, 0 replies.
- [PR] [SPARK-46699][PS][TESTS] Split `ArithmeticTests` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/12 12:34:32 UTC, 2 replies.
- [PR] [SPARK-46700][CORE] Count the last spilling for the shuffle disk spilling bytes metric [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/12 14:37:49 UTC, 9 replies.
- [PR] [SPARK-46703][INFRA] Disable `fail-fast` in Python CI [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/12 19:11:24 UTC, 3 replies.
- [PR] [SPARK-46704][CORE][UI] Fix `MasterPage` to sort `Running Drivers` table by `Duration` column correctly [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/12 20:07:59 UTC, 9 replies.
- [PR] Make RocksDB State Store Compaction Less Likely to fall behind [spark] - posted by "siying (via GitHub)" <gi...@apache.org> on 2024/01/12 20:40:05 UTC, 1 replies.
- [PR] Add messageFormat support to spark connect service [spark] - posted by "garlandz-db (via GitHub)" <gi...@apache.org> on 2024/01/12 22:58:28 UTC, 1 replies.
- [PR] Garlandz db/support message format in connect [spark] - posted by "garlandz-db (via GitHub)" <gi...@apache.org> on 2024/01/12 23:01:17 UTC, 0 replies.
- Re: [PR] Support message format in connect [spark] - posted by "garlandz-db (via GitHub)" <gi...@apache.org> on 2024/01/12 23:17:45 UTC, 2 replies.
- Re: [PR] [SPARK-44223][CORE] Remove LevelDB support [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/13 00:19:00 UTC, 1 replies.
- [PR] [SPARK-42960][PYTHON][SS][TESTS] Factor Connect/non-Connect specific logics out [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/13 00:24:45 UTC, 4 replies.
- [PR] [SPARK-46707] Added throwable field [spark] - posted by "kelvinjian-db (via GitHub)" <gi...@apache.org> on 2024/01/13 01:23:16 UTC, 0 replies.
- Re: [PR] [SPARK-46707] Added throwable field to expressions to improve predicate pushdown [spark] - posted by "09306677806 (via GitHub)" <gi...@apache.org> on 2024/01/13 01:31:03 UTC, 0 replies.
- [PR] [SPARK-46709] Expose partition_id column for state data source by default [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/01/13 04:50:26 UTC, 1 replies.
- [PR] [WIP][CORE] Replace Timer with single thread scheduled executor [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/13 11:55:56 UTC, 0 replies.
- Re: [PR] [SPARK-46709][SS] Expose partition_id column for state data source [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/01/13 13:45:46 UTC, 2 replies.
- Re: [PR] [SPARK-44496][SQL][FOLLOW-UP] CalendarIntervalType is also orderable [spark] - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2024/01/13 15:46:51 UTC, 0 replies.
- Re: [PR] Add Support for Scala 2.13 in Spark 3.4.1 [spark-docker] - posted by "Philosh (via GitHub)" <gi...@apache.org> on 2024/01/13 20:46:22 UTC, 0 replies.
- Re: [PR] [SPARK-45217][CORE] Support change log level of specific package or class [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/14 00:21:10 UTC, 1 replies.
- [PR] fix build python for pypy3 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/14 02:23:33 UTC, 2 replies.
- [PR] [WIP][CORE] Simplify the ReloadingX509TrustManager by the exit operation only depend on interrupt thread. [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/14 04:47:19 UTC, 0 replies.
- [PR] [SPARK-46710][SQL] Clean up the broadcast data generated when sql execution ends [spark] - posted by "yabola (via GitHub)" <gi...@apache.org> on 2024/01/14 05:58:32 UTC, 15 replies.
- [PR] [SPARK-46711] Fix RocksDB state provider race condition during rollback [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/01/14 05:58:42 UTC, 0 replies.
- Re: [PR] [SPARK-46711][SS] Fix RocksDB state provider race condition during rollback [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/01/14 06:00:38 UTC, 3 replies.
- [PR] [SPARK-46712][Minor][SS] Fix indent for streaming aggregation operator [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/01/14 12:07:53 UTC, 1 replies.
- [PR] [SPARK-46713][PYTHON][DOCS] Refine docstring of `map_keys/map_values/map_entries` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/14 13:01:37 UTC, 3 replies.
- [PR] SPARK-46714 overwrite a partition with custom location [spark] - posted by "adrian-wang (via GitHub)" <gi...@apache.org> on 2024/01/14 14:47:39 UTC, 0 replies.
- Re: [PR] [SPARK-46714][SQL] Overwrite a partition with custom location [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/14 15:08:39 UTC, 2 replies.
- Re: [PR] [SPARK-46707][SQL] Added throwable field to expressions to improve predicate pushdown [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/14 23:23:19 UTC, 10 replies.
- Re: [PR] [SPARK-45435] Document that lazy checkpoint may not be a consistent snapshot [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/15 00:20:22 UTC, 1 replies.
- Re: [PR] [SPARK-45290][INFRA][Launcher]Fix The return status is incorrect in standalone mode. The status of endevent is correct [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/15 00:20:23 UTC, 1 replies.
- Re: [PR] [MINOR][SS] Fix indent for streaming aggregation operator [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/01/15 00:25:48 UTC, 1 replies.
- [PR] [MINOR][DOCS] Add missing part of RDD mapPartitions* docstrings [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/15 00:45:04 UTC, 3 replies.
- Re: [PR] [DO-NOT-MERGE][PYTHON] Simple Pythonic catalog reference [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/15 00:48:01 UTC, 0 replies.
- [PR] [INFRA] Pin `sphinxcontrib-applehelp==1.0.4` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/15 01:46:46 UTC, 2 replies.
- [PR] Test upgrade sphinx to 5.0.0 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/15 03:00:51 UTC, 3 replies.
- Re: [PR] [SPARK-46715][INFRA] Pin `sphinxcontrib-*` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/15 03:14:01 UTC, 9 replies.
- Re: [PR] [SPARK-28551][SQL] CTAS with LOCATION should not allow to a non-empty directory. [spark] - posted by "PengleiShi (via GitHub)" <gi...@apache.org> on 2024/01/15 04:01:52 UTC, 4 replies.
- [PR] [SPARK-46716][SS][TESTS] Add a test regarding to backward compatibility check for Scala StreamingQueryListener [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/01/15 04:06:32 UTC, 4 replies.
- Re: [PR] [SPARK-46698][CORE][FOLLOWUP] Replace Timer with single thread scheduled executor [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/15 06:20:34 UTC, 10 replies.
- Re: [PR] [SPARK-46717][CORE] Simplify the ReloadingX509TrustManager by the exit operation only depend on interrupt thread. [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/15 06:22:08 UTC, 3 replies.
- [PR] [SPARK-46719][PS][TESTS] Rebalance `pyspark_pandas` and `pyspark_pandas_slow` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/15 06:38:42 UTC, 2 replies.
- [PR] [WIP][CORE] Simplify the ContextCleaner by the exit operation only depend on interrupt thread. [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/15 06:48:53 UTC, 0 replies.
- Re: [PR] [SPARK-45435][DOC] Document that lazy checkpoint may not be a consistent snapshot [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/15 06:55:49 UTC, 6 replies.
- [PR] [SPARK-45435][DOC][FOLLOWUPS] Fix two typos [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/15 07:53:05 UTC, 2 replies.
- [PR] [SPARK-46720][SQL][PYTHON] Refactor Python Data Source to aline with other DSv2 built-in Data Sources [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/15 08:27:14 UTC, 0 replies.
- Re: [PR] [SPARK-46720][SQL][PYTHON] Refactor Python Data Source to align with other DSv2 built-in Data Sources [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/15 08:34:41 UTC, 2 replies.
- [PR] [CORE][TESTS] make gpu fraction tests more robust [spark] - posted by "wbo4958 (via GitHub)" <gi...@apache.org> on 2024/01/15 09:00:08 UTC, 0 replies.
- Re: [PR] [SPARK-46721][CORE][TESTS] make gpu fraction tests more robust [spark] - posted by "wbo4958 (via GitHub)" <gi...@apache.org> on 2024/01/15 09:04:10 UTC, 2 replies.
- [PR] [SPARK-46722][CONNECT] Add a test regarding to backward compatibility check for StreamingQueryListener in Spark Connect (Scala/PySpark) [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/01/15 09:23:55 UTC, 4 replies.
- [PR] [SPARK-46724][DOCS] Update tuning.md to use java 17 doc links [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/15 10:44:56 UTC, 3 replies.
- [PR] fix build_python test [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/15 11:08:01 UTC, 0 replies.
- [PR] [WIP][SQL] Port `classifyException()` in JDBC dialects on error classes [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/15 11:12:49 UTC, 0 replies.
- [PR] [SPARK-46723][CONNECT][SCALA] Make addArtifact retryable [spark] - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2024/01/15 11:48:53 UTC, 5 replies.
- [PR] [SPARK-46726][PS][TESTS] Rebalance `pyspark_pandas_connect_part?` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/15 13:23:27 UTC, 2 replies.
- [PR] [SPARK-45827] Disallow partitioning on Variant column [spark] - posted by "cashmand (via GitHub)" <gi...@apache.org> on 2024/01/15 20:20:40 UTC, 5 replies.
- Re: [PR] [SPARK-46717][CORE] Simplify `ReloadingX509TrustManager` by the exit operation only depend on interrupt thread. [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/15 22:31:21 UTC, 3 replies.
- Re: [PR] [SPARK-45369][SQL] Push down limit through generate [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/16 00:19:10 UTC, 1 replies.
- Re: [PR] [SPARK-44571][SQL] Eliminate the Join by combine multiple Aggregates [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/16 00:19:11 UTC, 0 replies.
- Re: [PR] [WIP][SPARK-43221][CORE] the BlockManager with the persisted block is preferred [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/16 00:19:13 UTC, 1 replies.
- [PR] [SPARK-46670][PYTHON][SQL][FOLLOW-UP] Do not pass static Python Data Sources around when cloning DataSourceManager [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/16 00:40:52 UTC, 2 replies.
- [PR] [SPARK-46715][INFRA][3.5] Pin `sphinxcontrib-*` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/16 00:41:45 UTC, 2 replies.
- [PR] [SPARK-46728][PYTHON] Check Pandas installation properly [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/16 01:43:32 UTC, 23 replies.
- [PR] [SPARK-46729][DOCS] Withdraw the recommendation of using Concurrent Mark Sweep (CMS) Garbage Collector [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/16 03:38:28 UTC, 2 replies.
- [PR] [SPARK-46730][PYTHON][DOCS] Refine docstring of `str_to_map/map_filter/map_zip_with` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/16 04:04:34 UTC, 4 replies.
- [PR] [SPARK-46677][CONNECT][FOLLOWUPS] Fix `dataset.col("*")` in Scala Client [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/16 04:47:08 UTC, 1 replies.
- [PR] test config [spark] - posted by "yabola (via GitHub)" <gi...@apache.org> on 2024/01/16 04:59:27 UTC, 1 replies.
- Re: [PR] [WIP][SPARK-24815] [CORE] Trigger Interval based DRA for Structured Streaming [spark] - posted by "mentasm (via GitHub)" <gi...@apache.org> on 2024/01/16 05:12:15 UTC, 10 replies.
- [PR] [SPARK-46673][PYTHON][DOCS] Refine docstring `aes_encrypt/aes_decrypt/try_aes_decrypt` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/16 07:28:29 UTC, 4 replies.
- [PR] [SPARK-46731][SS] Manage state store provider instance by state data source - reader [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/01/16 07:40:44 UTC, 8 replies.
- Re: [PR] [WIP][SPARK-46677][CONNECT][FOLLOWUPS] Fix `dataset.col("*")` in Scala Client [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/16 08:12:44 UTC, 0 replies.
- [PR] [SPARK-46677][CONNECT][FOLLOWUP] Convert `count(df["*"])` to `count(1)` on client side [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/16 08:24:47 UTC, 2 replies.
- [PR] [SPARK-46732][CONNECT]Make Subquery/Broadcast thread work with Connect's artifact management [spark] - posted by "xieshuaihu (via GitHub)" <gi...@apache.org> on 2024/01/16 08:43:48 UTC, 4 replies.
- Re: [PR] [WIP][CORE] Simplify the ContextCleaner|BlockManager by the exit operation only depend on interrupt thread. [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/16 08:46:54 UTC, 0 replies.
- Re: [PR] [SPARK-46727][SQL] Port `classifyException()` in JDBC dialects on error classes [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/16 08:50:19 UTC, 5 replies.
- Re: [PR] [SPARK-46677][CONNECT][FOLLOWUP] Fix `dataset.col("*")` in Scala Client [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/16 12:09:04 UTC, 1 replies.
- [PR] [SPARK-46734][INFRA] Combine pip installations for lint and doc respectively [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/16 12:25:57 UTC, 4 replies.
- Re: [PR] [SPARK-46733][CORE] Simplify the BlockManager by the exit operation only depend on interrupt thread. [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/16 13:36:24 UTC, 9 replies.
- Re: [PR] [SPARK-43919][SQL] Extract JSON functionality out of Row [spark] - posted by "tfinn-ias (via GitHub)" <gi...@apache.org> on 2024/01/16 14:28:09 UTC, 4 replies.
- [PR] [SPARK-46395][CORE] Assign Spark configs to groups for use in documentation [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/16 16:25:32 UTC, 0 replies.
- [PR] [WIP][SQL] Add the error class `UNSUPPORTED_CALL` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/16 16:50:04 UTC, 1 replies.
- [PR] [SPARK-46725] Add DAYNAME function [spark] - posted by "PetarVasiljevic-DB (via GitHub)" <gi...@apache.org> on 2024/01/16 17:51:35 UTC, 0 replies.
- [PR] [SPARK-46735][PYTHON][TESTS] `pyspark.sql.tests.test_group` should skip Pandas tests if not available [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/16 18:50:44 UTC, 1 replies.
- [PR] [SPARK-46663][PYTHON][3.5] Disable memory profiler for pandas UDFs with iterators [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/01/16 19:35:20 UTC, 2 replies.
- Re: [PR] [SPARK-46735][PYTHON][TESTS] `pyspark.sql.tests.test_group` should skip Pandas/PyArrow tests if not available [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/16 20:47:13 UTC, 6 replies.
- Re: [PR] [SPARK-46629] Fix for STRUCT type DDL not picking up nullability and comment [spark] - posted by "vitaliili-db (via GitHub)" <gi...@apache.org> on 2024/01/16 21:20:17 UTC, 5 replies.
- [PR] [SPARK-46737][SQL][TESTS] Use the default ORC compression in OrcReadBenchmark [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/17 00:34:34 UTC, 8 replies.
- [PR] [SPARK-46715][INFRA][3.4] Pin `sphinxcontrib-*` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/17 01:14:05 UTC, 2 replies.
- [PR] [SPARK-46732][CONNECT][3.5]Make Subquery/Broadcast thread work with Connect's artifact management [spark] - posted by "xieshuaihu (via GitHub)" <gi...@apache.org> on 2024/01/17 01:57:28 UTC, 3 replies.
- Re: [PR] [SPARK-45433][SQL] Fix CSV/JSON schema inference when timestamps do not match specified timestampFormat [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/17 02:03:22 UTC, 9 replies.
- Re: [PR] [SPARK-46705][SS] Make RocksDB State Store Compaction Less Likely to fall behind [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/01/17 03:39:42 UTC, 7 replies.
- [PR] [SPARK-46740][SQL] Only convert to Parquet FileScan for normal Hive Parquet table [spark] - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2024/01/17 03:46:07 UTC, 3 replies.
- [PR] [SPARK-46742][SQL][TESTS] Add ORC compression tests for `hive` module `OrcFileFormat` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/17 04:12:37 UTC, 3 replies.
- Re: [PR] [SPARK-46734][INFRA] Combine pip installations for lint and doc [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/17 05:27:52 UTC, 2 replies.
- Re: [PR] [SPARK-46683] Write a subquery generator that generates subqueries permutations to increase testing coverage [spark] - posted by "andylam-db (via GitHub)" <gi...@apache.org> on 2024/01/17 05:31:09 UTC, 5 replies.
- Re: [PR] [SPARK-46282][PYTHON][DOCS] Create a Standalone Page for DataFrame API in PySpark Documentation [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/17 05:42:16 UTC, 1 replies.
- Re: [PR] [SPARK-46306][PS] Fix `LocIndexer` to work properly when the key is missing [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/17 05:42:50 UTC, 1 replies.
- Re: [PR] [SPARK-46219][SQL] Unwrap cast in join predicates [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/17 07:01:24 UTC, 11 replies.
- [PR] [SPARK-46282][PYTHON][DOCS] Rename "Spark SQL" page to "DataFrame API" [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/17 07:05:36 UTC, 2 replies.
- Re: [PR] [SPARK-46617][SQL] Create-table-if-not-exists should not silently overwrite existing data-files [spark] - posted by "adrians (via GitHub)" <gi...@apache.org> on 2024/01/17 08:41:45 UTC, 1 replies.
- Re: [PR] [SPARK-46739][SQL] Add the error class `UNSUPPORTED_CALL` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/17 08:51:03 UTC, 3 replies.
- [PR] [SPARK-46741][SQL] Cache Table with CET won't work [spark] - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2024/01/17 09:04:47 UTC, 1 replies.
- [PR] [SPARK-46745][INFRA] Purge pip cache in dockerfile [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/17 09:10:59 UTC, 5 replies.
- [PR] [SPARK-46744][CORE][SQL][CONNECT] Prompt for exit command after `spark-shell` `spark-sql` `spark-connect-repl` startup [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/17 09:18:57 UTC, 1 replies.
- Re: [PR] [WIP][SPARK-46725][SQL] Add DAYNAME function [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/17 10:21:00 UTC, 3 replies.
- [PR] [SPARK-46746][SQL][AVRO] Attach codec extension to avro datasource files [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/17 12:51:33 UTC, 3 replies.
- [PR] [MINOR][CONNECT][PYTHON] Some code clean up [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/17 13:17:43 UTC, 0 replies.
- [PR] [WIP][SQL] Replace `UnsupportedOperationException` by `SparkUnsupportedOperationException` in `sql` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/17 14:22:56 UTC, 0 replies.
- [PR] [SPARK-46748][CORE] Remove `*slave*` scripts [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/17 17:54:41 UTC, 0 replies.
- [PR] [SPARK-46749][DOCS] Document `SPARK_LOG_*` and `SPARK_PID_DIR` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/17 18:36:49 UTC, 3 replies.
- Re: [PR] [SPARK-46736][PROTOBUF] retain empty message field in protobuf connector [spark] - posted by "rangadi (via GitHub)" <gi...@apache.org> on 2024/01/17 19:14:20 UTC, 33 replies.
- Re: [PR] [SPARK-46750][CONNECT][PYTHON] DataFrame APIs code clean up [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/18 00:17:43 UTC, 1 replies.
- Re: [PR] [TEST ONLY][SQL] Test resolve column references with PLAN_ID [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/18 00:19:26 UTC, 3 replies.
- Re: [PR] [SPARK-37508][SQL] Add Unicode[encode/decode]() function [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/18 00:19:28 UTC, 1 replies.
- Re: [PR] [SPARK-44919][AVRO] Avro connector: convert a union of a single primitive type to a StructType [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/18 00:19:29 UTC, 1 replies.
- [PR] [WIP] Basic support of SparkSession-based memory profiler [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/01/18 00:45:42 UTC, 0 replies.
- Re: [PR] [SPARK-46744][SPARK-SHELL][SQL][CONNECT] Display clear `exit command` for the terminal: `spark-shell`, `spark-sql` and `spark-connect-repl` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/18 01:05:09 UTC, 2 replies.
- [PR] [SPARK-46751][PYTHON][TESTS] Skip test_datasource if PyArrow is not installed [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/18 01:31:01 UTC, 1 replies.
- Re: [PR] [SPARK-46748][CORE] Remove `*slave*.sh` scripts [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/18 02:26:19 UTC, 1 replies.
- Re: [PR] [SPARK-46741][SQL] Cache Table with CTE won't work [spark] - posted by "AngersZhuuuu (via GitHub)" <gi...@apache.org> on 2024/01/18 02:34:35 UTC, 10 replies.
- [PR] [SPARK-46752][SQL][TESTS] Use default ORC compression in data source benchmarks [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/18 02:38:28 UTC, 4 replies.
- [PR] [SPARK-46753][PYTHON][DOCS] Fix pypy3 python test [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/18 03:15:02 UTC, 6 replies.
- [PR] [SPARK-46755][K8S][INFRA][DOCS] Upgrade `Volcano` to 1.8.2 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/18 04:20:31 UTC, 4 replies.
- [PR] [SPARK-46754][SQL][AVRO] Fix compression code resolution in avro table definition and write options [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/18 04:24:37 UTC, 7 replies.
- [PR] [SPARK-46757][INFRA] Reduce the number of layers of testing dockerfile [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/18 04:50:01 UTC, 2 replies.
- Re: [PR] [SPARK-46744][SPARK-SHELL][SQL][CONNECT][PYTHON][R] Display clear `exit command` for all spark terminal [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/18 06:22:25 UTC, 0 replies.
- [PR] [SPARK-46758][INFRA] Upgrade github cache action to v4 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/18 06:45:02 UTC, 4 replies.
- [PR] [MINOR][DOCS] Add zstandard as a candidate to fix the desc of spark.sql.avro.compression.codec [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/18 07:09:34 UTC, 5 replies.
- [PR] [SPARK-46756][SQL] Add rule to rewrite null safe equality join keys [spark] - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2024/01/18 07:27:12 UTC, 1 replies.
- Re: [PR] [SPARK-46708] Support message format in connect [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/18 08:05:15 UTC, 7 replies.
- Re: [PR] [SPARK-44635][CORE] Handle shuffle fetch failures in decommissions [spark] - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2024/01/18 08:19:09 UTC, 0 replies.
- [PR] [SPARK-40546][DOCS] Remove unused TODO [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/18 09:21:48 UTC, 2 replies.
- Re: [PR] [SPARK-46753][PYTHON][TESTS] Fix pypy3 python test [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/18 10:41:36 UTC, 85 replies.
- [PR] [SPARK-46759][SQL][AVRO] Codec xz and zstandard support compression level for avro files [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/18 13:01:04 UTC, 9 replies.
- [PR] [SPARK-46760][SQL][DOCS] Make the document of spark.sql.adaptive.coalescePartitions.parallelismFirst clearer [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/18 13:01:45 UTC, 5 replies.
- Re: [PR] [SPARK-43341][SQL] Patch StructType.toDDL not picking up on non-nullability of nested column [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/18 14:28:55 UTC, 0 replies.
- [PR] [MINOR][DOCS] Remove unneeded production tag block method [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/18 15:25:21 UTC, 2 replies.
- Re: [PR] [SPARK-28973][SQL] Add `TimeType` and support `java.time.LocalTime` as its external type. [spark] - posted by "zeddit (via GitHub)" <gi...@apache.org> on 2024/01/18 15:44:50 UTC, 0 replies.
- [PR] [WIP][SQL] Don't use the NTZ parser for inferring TIMESTAMP_LTZ in CSV [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/18 16:33:38 UTC, 0 replies.
- [PR] [SPARK-46576][SQL][FOLLOWUP] Improve error messages for unsupported data source save mode [spark] - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2024/01/18 21:13:51 UTC, 2 replies.
- Re: [PR] SPARK-45200: All log4j2 configuration file override [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/19 00:19:40 UTC, 1 replies.
- Re: [PR] [SPARK-45463][CORE][SHUFFLE] Support reliable store with specified executorId [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/19 00:19:42 UTC, 1 replies.
- Re: [PR] [SPARK-44033][PYTHON] Added support for binary ops for list like objects [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/19 00:19:45 UTC, 1 replies.
- Re: [PR] [SPARK-41341][CORE] Wait shuffle fetch to finish when decommission executor [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/19 00:19:47 UTC, 1 replies.
- [PR] [SPARK-46764][DOCS] Reorganize script to build API docs [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/19 02:05:58 UTC, 11 replies.
- [PR] [SPARK-46766][SQL][AVRO] ZSTD Buffer Pool Support For AVRO datasource [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/19 02:58:40 UTC, 4 replies.
- [PR] [SPARK-46765][PYTHON][CONNECT] Make `shuffle` specify the datatype of `seed` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/19 03:38:58 UTC, 3 replies.
- Re: [PR] [SPARK-46590][SQL] Fix coalesce failed with unexpected partition indeces [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/19 05:14:40 UTC, 12 replies.
- [PR] [SPARK-46767][PYTHON][DOCS] Refine docstring of `abs/acos/acosh` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/19 05:37:51 UTC, 8 replies.
- [PR] [SPARK-46768][BUILD] Upgrade the Guava version used by the connect module to 33.0-jre [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/19 05:39:57 UTC, 0 replies.
- Re: [PR] [SPARK-46769][SQL] Fix inferring TIMESTAMP_NTZ in JSON/CSV [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/19 06:14:51 UTC, 2 replies.
- [PR] [SPARK-46770][K8S][TESTS] Remove legacy `docker-for-desktop` logic [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/19 07:44:22 UTC, 3 replies.
- [PR] [SPARK-46718][BUILD] Test arrow 15 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/19 08:10:14 UTC, 10 replies.
- Re: [PR] [SPARK-46769][SQL] Fix type inferring for timestamps without time zone in JSON/CSV [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/19 09:18:30 UTC, 3 replies.
- Re: [PR] [SPARK-46768][BUILD] Upgrade Guava used by the connect module to 33.0-jre [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/19 09:20:33 UTC, 2 replies.
- [PR] [SPARK-46773][BUILD][CONNECT] Change to use whitelist to `generate assemblyExcludedJars` for the connect server module [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/19 09:43:26 UTC, 0 replies.
- Re: [PR] [SPARK-39910][SQL] Delegate path qualification to filesystem during DataSource file path globbing [spark] - posted by "tigrulya-exe (via GitHub)" <gi...@apache.org> on 2024/01/19 09:46:05 UTC, 0 replies.
- Re: [PR] [SPARK-46773][BUILD][CONNECT] Change to use include-list to `generate assemblyExcludedJars` for the connect server module [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/19 09:57:19 UTC, 3 replies.
- [PR] [SPARK-46774][SQL][AVRO] Use mapreduce.output.fileoutputformat.compress instead of deprecated mapred.output.compress in Avro write jobs [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/19 10:25:27 UTC, 4 replies.
- [PR] [SPARK-46769][SQL] Refine timestamp related schema inference [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/19 11:20:01 UTC, 14 replies.
- [PR] [SPARK-45593][FOLLOWUP] Correct relocation connect guava dependency. [spark] - posted by "Yikf (via GitHub)" <gi...@apache.org> on 2024/01/19 13:57:04 UTC, 1 replies.
- [PR] [SPARK-46775][SS] Fix formatting of Kinesis docs [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/19 16:50:12 UTC, 1 replies.
- [PR] [SPARK-40876][SQL] Widening type promotion from integers to decimal in Parquet vectorized reader [spark] - posted by "johanl-db (via GitHub)" <gi...@apache.org> on 2024/01/19 16:53:49 UTC, 9 replies.
- [PR] [MINOR][DOCS] Remove mention of Jenkins from "Building Spark" docs [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/19 17:00:42 UTC, 2 replies.
- Re: [PR] [SPARK-46108][SQL] keepInnerXmlAsRaw option for Built-in XML Data Source [spark] - posted by "adriennn (via GitHub)" <gi...@apache.org> on 2024/01/19 17:32:34 UTC, 5 replies.
- [PR] [SPARK-46780][K8S][TESTS] Support skipping R image build step [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/19 21:26:10 UTC, 1 replies.
- [PR] [SPARK-46779][SQL] `InMemoryRelation` instances of the same cached plan should be semantically equivalent [spark] - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2024/01/19 21:27:19 UTC, 2 replies.
- Re: [PR] [SPARK-46780][K8S][TESTS] Support skipping R image build step in SBT [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/19 21:32:49 UTC, 0 replies.
- Re: [PR] [SPARK-46780][K8S][TESTS] Improve SBT K8s IT to skip R image build step if not needed [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/19 22:05:08 UTC, 8 replies.
- [PR] [SPARK-46781][PYTHON][TESTS] Test custom data source and input partition (pyspark.sql.datasource) [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/01/19 23:43:14 UTC, 4 replies.
- Re: [PR] [WIP][SPARK-46467][PS][TESTS] Improve and test exceptions of TimedeltaIndex [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/01/19 23:48:22 UTC, 1 replies.
- Re: [PR] [SPARK-46009][SQL][CONNECT] Merge the parse rule of PercentileCont and PercentileDisc into functionCall [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/20 01:49:39 UTC, 5 replies.
- [PR] [SPARK-46783][K8S][TESTS] Use `built-in` storage classes in PVTestsSuite [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/20 01:54:00 UTC, 3 replies.
- [PR] [SPARK-46784][K8S][TESTS] Create and use a K8s test tag for `PersistentVolume` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/20 02:19:54 UTC, 2 replies.
- [PR] [SPARK-46009][SQL][DOCS][FOLLOWUP] Change the error code for invalid inverse distribution function [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/20 13:32:13 UTC, 4 replies.
- Re: [PR] [SPARK-46725][SQL] Add DAYNAME function [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/20 14:07:32 UTC, 7 replies.
- [PR] [WIP] docs: restructure the docs index page [spark] - posted by "MrPowers (via GitHub)" <gi...@apache.org> on 2024/01/20 15:56:33 UTC, 2 replies.
- [PR] [SPARK-44495][INFRA][K8S] Use the latest minikube in K8s IT [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/20 19:18:06 UTC, 4 replies.
- [PR] [SPARK-46785][K8S][TESTS] Split the local storage PVs test into driver and executor PV tests [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/20 22:26:26 UTC, 3 replies.
- [PR] [SPARK-46785][K8S][TESTS][3.4] Split the local storage PVs test into driver and executor PV tests [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/20 22:43:27 UTC, 1 replies.
- [PR] [SPARK-46786][K8S] Fix `MountVolumesFeatureStep` to use `ReadWriteOncePod` instead of `ReadWriteOnce` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/20 22:57:54 UTC, 10 replies.
- [PR] [SPARK-46777] Refactor `StreamingDataSourceV2Relation` catalyst structure to be more on-par with the batch version [spark] - posted by "jackierwzhang (via GitHub)" <gi...@apache.org> on 2024/01/20 23:22:14 UTC, 0 replies.
- Re: [PR] [SPARK-46785][K8S][TESTS][3.5] Split the local storage PVs test into driver and executor PV tests [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/21 00:09:16 UTC, 0 replies.
- [PR] [SPARK-44495][INFRA][K8S][3.5] Use the latest minikube in K8s IT [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/21 01:05:44 UTC, 4 replies.
- Re: [PR] [SPARK-44494][INFRA][3.4] Use `minikube` v1.30.1 for `k8s-integration-tests` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/21 01:59:30 UTC, 1 replies.
- Re: [PR] [SPARK-44495][INFRA][K8S][3.4] Use the latest minikube in K8s IT [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/21 04:07:20 UTC, 3 replies.
- [PR] [SPARK-46787][CONNECT] `bloomFilter` function should throw `AnalysisException` for invalid input [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/21 06:11:20 UTC, 16 replies.
- Re: [PR] SPARK-45522: [WIP] Jetty 9 -12 upgrade [spark] - posted by "HiuKwok (via GitHub)" <gi...@apache.org> on 2024/01/21 11:05:09 UTC, 1 replies.
- Re: [PR] SPARK-45522: Jetty 9 -> 10 upgrade [spark] - posted by "HiuKwok (via GitHub)" <gi...@apache.org> on 2024/01/21 13:10:30 UTC, 9 replies.
- Re: [PR] [SPARK-42332][SQL] Changing the require to a SparkException in ComplexTypeMergingExpression [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/21 20:26:16 UTC, 5 replies.
- [PR] [WIP][SPARK-42332][SQL] Reverting an import change that shouldn't have been made [spark] - posted by "hannahkamundson (via GitHub)" <gi...@apache.org> on 2024/01/21 20:40:14 UTC, 0 replies.
- Re: [PR] [MINOR][DOCS] Adjust formatting of "Building Spark" docs [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/21 20:44:13 UTC, 1 replies.
- Re: [PR] [SPARK-42332][SQL] Reverting an import change that shouldn't have been made [spark] - posted by "hannahkamundson (via GitHub)" <gi...@apache.org> on 2024/01/21 20:45:36 UTC, 3 replies.
- Re: [PR] [SPARK-44867][CONNECT][DOCS] Refactor Spark Connect Docs to incorporate Scala setup [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/21 21:52:34 UTC, 0 replies.
- Re: [PR] [SPARK-45468][UI] More transparent proxy handling for HTTP redirects [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/22 00:20:24 UTC, 1 replies.
- [PR] [MINOR][DOCS] Fix broken ML paper links [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/22 00:40:30 UTC, 2 replies.
- Re: [PR] [SPARK-46775][DOCS] Fix formatting of Kinesis docs [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/22 00:49:56 UTC, 1 replies.
- [PR] [MINOR][DOCS] Miscellaneous link and anchor fixes [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/22 00:58:49 UTC, 2 replies.
- Re: [PR] [SPARK-46777][SS] Refactor `StreamingDataSourceV2Relation` catalyst structure to be more on-par with the batch version [spark] - posted by "jackierwzhang (via GitHub)" <gi...@apache.org> on 2024/01/22 01:12:17 UTC, 4 replies.
- [PR] [MINOR][DOCS] Fix SQL Error links and link anchors [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/22 01:54:46 UTC, 7 replies.
- [PR] [SPARK-46788][DOCS] Add instructions for running Docker integration tests [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/22 05:49:27 UTC, 2 replies.
- [PR] [SPARK-46789][K8S][TESTS] Add `VolumeSuite` to K8s IT [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/22 06:34:34 UTC, 5 replies.
- [PR] [SPARK-46791][SQL] Support Java Set in JavaTypeInference [spark] - posted by "viirya (via GitHub)" <gi...@apache.org> on 2024/01/22 08:15:10 UTC, 4 replies.
- Re: [PR] [SPARK-43049][SQL] Use CLOB instead of VARCHAR(255) for StringType for Oracle JDBC [spark] - posted by "gatorsmile (via GitHub)" <gi...@apache.org> on 2024/01/22 08:30:30 UTC, 1 replies.
- [PR] [SPARK-46738][CONNECT] `Cast` displayed different results between Regular Spark and Spark Connect [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/22 09:16:27 UTC, 0 replies.
- [PR] [SPARK-46790][SQL] Support coalesce partition through cartesian product [spark] - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2024/01/22 09:18:41 UTC, 3 replies.
- Re: [PR] [SPARK-38230][SQL] InsertIntoHadoopFsRelationCommand unnecessarily fetches details of partitions [spark] - posted by "npsables (via GitHub)" <gi...@apache.org> on 2024/01/22 09:53:01 UTC, 2 replies.
- [PR] [SPARK-46417][SQL][FOLLOWUP] Log error message when calling hive.getTable throw HiveException and throwException is false [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/01/22 12:05:37 UTC, 6 replies.
- [PR] [SPARK-46792] Refactor ChannelBuilder into DefaultChannelBuilder and ChannelBuilder [spark] - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2024/01/22 12:21:25 UTC, 3 replies.
- [PR] [SPARK-46794][SQL] Remove subqueries from LogicalRDD constraints [spark] - posted by "tomvanbussel (via GitHub)" <gi...@apache.org> on 2024/01/22 14:04:01 UTC, 6 replies.
- [PR] [WIP][SPARK-35878][CORE] Revert S3A endpoint fixup logic of SPARK-35878 [spark] - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2024/01/22 16:00:03 UTC, 5 replies.
- [PR] [SPARK-46763] Fix assertion failure in ReplaceDeduplicateWithAggregate for duplicate attributes [spark] - posted by "nikhilsheoran-db (via GitHub)" <gi...@apache.org> on 2024/01/22 17:01:07 UTC, 3 replies.
- Re: [PR] [SPARK-45579][CORE] Catch errors for FallbackStorage.copy [spark] - posted by "ukby1234 (via GitHub)" <gi...@apache.org> on 2024/01/22 18:04:53 UTC, 0 replies.
- [PR] [MINOR][Test][Connect] Discard stdout / stderr of test Spark connect server if not isDebug [spark] - posted by "EnricoMi (via GitHub)" <gi...@apache.org> on 2024/01/22 18:48:26 UTC, 0 replies.
- Re: [PR] [SPARK-46687][PYTHON][CONNECT] Basic support of SparkSession-based memory profiler [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/01/22 21:34:49 UTC, 7 replies.
- [PR] [SS] [SPARK-46796] Ensure the correct remote files (mentioned in metadata.zip) are used on RocksDB version load [spark] - posted by "sahnib (via GitHub)" <gi...@apache.org> on 2024/01/22 21:53:59 UTC, 1 replies.
- [PR] [SPARK-46797][CORE] Rename `spark.deploy.spreadOut` to `spark.deploy.spreadOutApps` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/22 23:23:05 UTC, 5 replies.
- Re: [PR] [SPARK-46738][CONNECT] Make the display of `cast` in `Regular Spark` and `Spark Connect` consistent [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/23 00:13:50 UTC, 1 replies.
- Re: [PR] [SPARK-43396][CORE] Add config to control max ratio of decommissioning executors [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/23 00:19:44 UTC, 1 replies.
- Re: [PR] [SPARK-42669][CONNECT] Short circuit local relation RPCs [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/23 00:19:45 UTC, 0 replies.
- [PR] [SPARK-46799][CORE][TESTS] Improve `MasterSuite` to use nanoTime-based appIDs and workerIDs [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/23 00:29:03 UTC, 3 replies.
- [PR] [SPARK-46800][CORE] Support `spark.deploy.spreadOutDrivers` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/23 00:51:21 UTC, 3 replies.
- [PR] [SPARK-46801][PYTHON][TESTS] Do not treat exit code 5 as a test failure in Python testing script [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/23 00:52:15 UTC, 2 replies.
- [PR] [SPARK-46802][PYTHON][TESTS] Clean up obsolete code in PySpark coverage script [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/23 01:14:59 UTC, 2 replies.
- [PR] [SPARK-46804][DOCS][TESTS] Recover the generated documents [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/23 01:45:19 UTC, 7 replies.
- [PR] [SPARK-46803][BUILD] Remove scala-2.13 profile [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/23 01:46:41 UTC, 4 replies.
- [PR] [SPARK-46805][BUILD] Upgrade `scalafmt` to 3.7.17 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/23 01:55:51 UTC, 2 replies.
- [PR] [SPARK-46806][PYTHON] Improve error message for spark.table when argument type is wrong [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/23 02:11:07 UTC, 3 replies.
- Re: [PR] [SPARK-46738][CONNECT][PYTHON] Make the display of `cast` in `Regular Spark` and `Spark Connect` consistent [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/23 02:52:26 UTC, 0 replies.
- [PR] [SPARK-46807][DOCS] Add automation notice to generated SQL error class docs [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/23 03:22:30 UTC, 4 replies.
- [PR] [SPARK-46808][PYTHON] Refine error classes in Python with automatic sorting function [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/23 03:33:35 UTC, 10 replies.
- Re: [PR] [SPARK-46718][BUILD] Upgrade Arrow to 15.0.0 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/23 05:25:27 UTC, 1 replies.
- Re: [PR] [SPARK-46417][SQL] Do not fail when calling hive.getTable and throwException is false [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/23 05:58:11 UTC, 8 replies.
- [PR] [SPARK-46772][SQL] Benchmarking Avro with Compression Codecs [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/23 07:42:28 UTC, 3 replies.
- [PR] WIP: [SPARK-46811][INFRA] Fix Github Action: TypeError: Cannot read properties of undefined (reading 'head_sha') [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/23 09:08:30 UTC, 0 replies.
- Re: [PR] [SPARK-46792][PYTHON][CONNECT] Refactor ChannelBuilder into DefaultChannelBuilder and ChannelBuilder [spark] - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2024/01/23 09:47:17 UTC, 5 replies.
- [PR] [SPARK-45720] Upgrade AWS SDK to v2 for Spark Kinesis connector module [spark] - posted by "junyuc25 (via GitHub)" <gi...@apache.org> on 2024/01/23 09:57:36 UTC, 4 replies.
- Re: [PR] [SPARK-43403][UI] Ensure old SparkUI in HistoryServer has been detached before loading new one [spark] - posted by "zhouyifan279 (via GitHub)" <gi...@apache.org> on 2024/01/23 10:04:03 UTC, 7 replies.
- Re: [PR] [SPARK-46653][SQL] Code-gen for full outer sort merge join output row by row [spark] - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2024/01/23 10:14:25 UTC, 6 replies.
- [PR] Make mapInPandas / mapInArrow support ResourceProfile [spark] - posted by "wbo4958 (via GitHub)" <gi...@apache.org> on 2024/01/23 10:59:36 UTC, 1 replies.
- [PR] [WIP][SQL] Avoid parquet footer reads twice [spark] - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2024/01/23 11:10:57 UTC, 0 replies.
- Re: [PR] [SPARK-46795][SQL] Replace `UnsupportedOperationException` by `SparkUnsupportedOperationException` in `sql/core` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/23 11:23:56 UTC, 4 replies.
- [PR] [SPARK-46590][SQL][FOLLOWUP] Update CoalesceShufflePartitions comments [spark] - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2024/01/23 11:51:29 UTC, 4 replies.
- [PR] [SPARK-46813][CORE] Don't set the executor id to "driver" when SparkContext is created by the executor side [spark] - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2024/01/23 12:21:40 UTC, 3 replies.
- Re: [PR] [SPARK-46432][BUILD] Upgrade Netty to 4.1.104.Final [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/23 14:00:52 UTC, 2 replies.
- Re: [PR] [SPARK-46069][SQL] Support unwrap timestamp type to date type [spark] - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2024/01/23 15:46:53 UTC, 0 replies.
- Re: [PR] [SPARK-46772][SQL][TESTS] Benchmarking Avro with Compression Codecs [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/23 16:06:26 UTC, 3 replies.
- Re: [PR] [SPARK-46796][SS] Ensure the correct remote files (mentioned in metadata.zip) are used on RocksDB version load [spark] - posted by "sahnib (via GitHub)" <gi...@apache.org> on 2024/01/23 16:20:31 UTC, 2 replies.
- Re: [PR] [SS][SPARK-46816] Add base support for new arbitrary state management operator, single valueState type, multiple state variables and underlying support for column families for RocksDBStateStoreProvider with/without changelog checkpointing [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/01/23 18:53:19 UTC, 4 replies.
- [PR] [SPARK-46817][CORE] Fix `spark-daemon.sh` usage by adding `decommission` command [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/23 21:38:21 UTC, 4 replies.
- [PR] [SPARK-46818][SQL] Improve error messages for Range with non-foldable inputs [spark] - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2024/01/23 21:50:12 UTC, 3 replies.
- [PR] [SPARK-44815][CONNECT]Cache df.schema to avoid extra RPC [spark] - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2024/01/24 01:25:34 UTC, 0 replies.
- Re: [PR] [SPARK-40876][SQL][FOLLOWUP] Widening type promotion from integers to decimal in Parquet vectorized reader [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/24 01:31:29 UTC, 3 replies.
- Re: [PR] [SPARK-46811][INFRA] Fix Github Action: TypeError: Cannot read properties of undefined (reading 'head_sha') [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/24 02:02:46 UTC, 5 replies.
- [PR] [SPARK-46530][PYTHON][SQL][FOLLOW-UP] Avoid checking Py4J and PySpark library when initial lookup of Python Data Sources [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/24 02:29:00 UTC, 2 replies.
- [PR] [SPARK-46820][PYTHON] Improve error message when `createDataFrame` fails nullability check [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/24 03:19:23 UTC, 15 replies.
- Re: [PR] [SPARK-43306][PYTHON] Migrate `ValueError` from Spark SQL types into error class [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/24 03:29:19 UTC, 0 replies.
- Re: [PR] [SPARK-46812][SQL][CONNECT][PYTHON] Make mapInPandas / mapInArrow support ResourceProfile [spark] - posted by "wbo4958 (via GitHub)" <gi...@apache.org> on 2024/01/24 03:38:08 UTC, 8 replies.
- [PR] [SPARK-46822][SQL] Respect spark.sql.legacy.charVarcharAsString when casting jdbc type to catalyst type in jdbc [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/24 03:49:33 UTC, 3 replies.
- [PR] [SPARK-46823][CONNECT][PYTHON] `LocalDataToArrowConversion` should check the nullability [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/24 04:11:00 UTC, 3 replies.
- Re: [PR] [SPARK-46820][PYTHON] Fix error message regression by restoring `new_msg` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/24 04:14:54 UTC, 7 replies.
- [PR] [SPARK-46811][FOLLOWUP] Increase check_run response result from 30 to 100 [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/24 04:25:11 UTC, 3 replies.
- [PR] [WIP][SPARK-46819] Move error categories and states into JSON [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/24 06:03:36 UTC, 2 replies.
- [PR] [SPARK-46824][PS][BUILD] Enable Pandas-on-Spark test without optional dependency on PyPy [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/24 06:10:12 UTC, 19 replies.
- [PR] [SPARK-46825][DOCS] Build Spark only once when building docs [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/24 06:50:20 UTC, 2 replies.
- [PR] [SPARK-46826][INFRA] Reset `grpcio` installation version of `Python linter dependencies for branch-3.4/branch-3.5` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/24 07:44:13 UTC, 12 replies.
- [PR] [SPARK-46827][CORE] Fix `RocksDBPersistenceEngine` to support a symbolic link [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/24 08:30:51 UTC, 0 replies.
- Re: [PR] [SPARK-46827][CORE] Make `RocksDBPersistenceEngine` to support a symbolic link [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/24 08:45:23 UTC, 3 replies.
- [PR] [SPARK-46828][SQL] Remove the invalid assertion of remote mode for spark sql shell [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/24 09:09:32 UTC, 3 replies.
- [PR] [SPARK-46620][PS][CONNECT] Introduce a basic fallback mechanism for frame methods [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/24 09:16:21 UTC, 12 replies.
- [PR] [SPARK-46829][CORE] Remove createExecutorEnv from SparkEnv [spark] - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2024/01/24 10:17:38 UTC, 2 replies.
- Re: [PR] [SPARK-45593][BUILD][FOLLOWUP] Correct relocation connect guava dependency [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/24 11:03:07 UTC, 1 replies.
- [PR] Assign new name for _LEGACY_ERROR_TEMP_2102 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/24 12:07:10 UTC, 0 replies.
- [PR] [WIP][SQL] Disable CSV column pruning in the multi-line mode [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/24 17:10:02 UTC, 1 replies.
- Re: [PR] [SPARK-46819][CORE] Move error categories and states into JSON [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/24 19:28:21 UTC, 10 replies.
- [PR] [SPARK-46846][CORE] Make `WorkerResourceInfo` extend `Serializable` explicitly [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/24 21:01:59 UTC, 4 replies.
- [PR] [SPARK-46808][PYTHON][FOLLOW-UP] Add a guide for automatic sorting in `test_error_classes_sorted` [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/24 23:59:29 UTC, 2 replies.
- [PR] [SPARK-46848] XML: Enhance XML bad record handling with partial results support [spark] - posted by "shujingyang-db (via GitHub)" <gi...@apache.org> on 2024/01/25 00:11:42 UTC, 4 replies.
- Re: [PR] should use current plan's output in ScanOperation [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/25 00:19:43 UTC, 1 replies.
- [PR] [SPARK-46849][SQL] Run optimizer on CREATE TABLE column defaults [spark] - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2024/01/25 01:09:49 UTC, 9 replies.
- Re: [PR] [SPARK-46432][BUILD] Upgrade Netty to 4.1.106.Final [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/25 01:42:31 UTC, 6 replies.
- Re: [PR] [SPARK-46850][SQL] Convert `_LEGACY_ERROR_TEMP_2102 ` to `UNSUPPORTED_DATATYPE` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/25 02:28:00 UTC, 7 replies.
- [PR] [SPARK-46851][INFRA] Upgrade `buf` to v1.29.0 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/25 02:34:56 UTC, 8 replies.
- Re: [PR] [SPARK-46816][SS] Add base support for new arbitrary state management operator, single valueState type, multiple state variables and underlying support for column families for RocksDBStateStoreProvider with/without changelog checkpointing [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/01/25 04:33:33 UTC, 6 replies.
- Re: [PR] [SPARK-46256][CORE] Parallel Compression Support for ZSTD [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/01/25 06:45:19 UTC, 1 replies.
- [PR] [SPARK-46855][INFRA] Add `sketch` to the dependencies of the `catalyst` module in `module.py` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/25 07:01:37 UTC, 0 replies.
- [PR] [SPARK-46856][PS][TESTS] Apply approximate equality in ewm tests [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/25 07:14:05 UTC, 3 replies.
- Re: [PR] [SPARK-46851][DOCS] Remove `buf` version information from the doc `contributing.rst` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/25 07:47:07 UTC, 4 replies.
- [PR] [SPARK-46857][CORE] Make `Master` log `appId` in case of worker loss [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/25 08:03:03 UTC, 1 replies.
- [PR] [SPARK-46858][PYTHON][PS][BUILD] Upgrade Pandas to 2.2.0 [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/25 08:57:16 UTC, 6 replies.
- [PR] [SPARK-46861] Avoid Deadlock in DAGScheduler [spark] - posted by "fred-db (via GitHub)" <gi...@apache.org> on 2024/01/25 13:37:10 UTC, 0 replies.
- Re: [PR] [SPARK-46862][SQL] Disable CSV column pruning in the multi-line mode [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/25 14:37:06 UTC, 14 replies.
- Re: [PR] [SPARK-46855][INFRA] Add `sketch` to the dependencies of the `catalyst` in `module.py` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/25 16:26:18 UTC, 2 replies.
- Re: [PR] [SPARK-46861][CORE] Avoid Deadlock in DAGScheduler [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/25 16:32:17 UTC, 8 replies.
- [PR] [SPARK-46864][SS] Onboard Arbitrary StateV2 onto New Error Class Framework [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/01/25 18:33:31 UTC, 27 replies.
- [PR] [SPARK-46865][SS] Add Batch Support for TransformWithState Operator [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/01/25 18:46:42 UTC, 29 replies.
- [PR] [SPARK-46863][DOCS] Cleanup custom CSS [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/25 19:16:23 UTC, 3 replies.
- [PR] [SPARK-46867][PYTHON][CONNECT] Remove unnecessary dependency from test_mixed_udf_and_sql.py [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/01/25 20:07:24 UTC, 0 replies.
- Re: [PR] [SPARK-46867][PYTHON][CONNECT][TESTS] Remove unnecessary dependency from test_mixed_udf_and_sql.py [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/25 20:31:49 UTC, 3 replies.
- [PR] [SS] Initial State without state reader implementation for State API v2. [spark] - posted by "jingz-db (via GitHub)" <gi...@apache.org> on 2024/01/25 21:36:39 UTC, 0 replies.
- Re: [PR] [SPARK-35878][CORE] Revert S3A endpoint fixup logic of SPARK-35878 [spark] - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2024/01/25 22:25:31 UTC, 1 replies.
- [PR] [SPARK-46868][CORE] Support Spark Worker Log UI [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/25 22:57:55 UTC, 20 replies.
- Re: [PR] SPARK-45478: Codegen sum(decimal_column / 2) computes div twice [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/26 00:18:28 UTC, 1 replies.
- Re: [PR] [SPARK-45537][CORE]Fix the issue where the last task may get stuck in a multi-profile [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/26 00:18:30 UTC, 1 replies.
- Re: [PR] [SPARK-46869][K8S] Add `logrotate` to Spark docker files [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/26 00:28:03 UTC, 5 replies.
- [PR] [SPARK-46870][CORE] Support Spark Master Log UI [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/26 03:00:34 UTC, 6 replies.
- [PR] [MINOR][INFRA][DOCS] Update the location for error class README.md [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/26 03:22:21 UTC, 1 replies.
- Re: [PR] [MINOR][INFRA] Update the location for error class README.md [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/01/26 04:05:31 UTC, 1 replies.
- [PR] [WIP][SQL] Remove unnecessary synchronized [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/26 04:30:34 UTC, 0 replies.
- [PR] [SPARK-46855][INFRA][3.5] Add sketch to the dependencies of the catalyst in module.py [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/26 05:10:20 UTC, 0 replies.
- [PR] [SPARK-46855][INFRA][3.4] Add `sketch` to the dependencies of the `catalyst` in `module.py` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/26 05:16:07 UTC, 4 replies.
- Re: [PR] [SPARK-46855][INFRA][3.5] Add `sketch` to the dependencies of the `catalyst` in `module.py` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/26 05:22:15 UTC, 3 replies.
- [PR] [SPARK-46871][PS][TESTS] Clean up the imports in `pyspark.pandas.tests.computation.*` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/26 05:24:05 UTC, 3 replies.
- [PR] [SPARK-46872][CORE] Fix `log-view.js` to be non-module [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/26 05:48:39 UTC, 0 replies.
- Re: [PR] [SPARK-46872][CORE] Recover `log-view.js` to be non-module [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/26 05:56:42 UTC, 6 replies.
- Re: [PR] [SPARK-46003][UI][TESTS] Create an ui-test module with Jest to test ui javascript code [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/26 05:58:48 UTC, 3 replies.
- [PR] [MINOR][DOCS] Remove Canonicalize in docs [spark] - posted by "jlfsdtc (via GitHub)" <gi...@apache.org> on 2024/01/26 06:07:08 UTC, 6 replies.
- [PR] [SPARK-46873][SS] Do not recreate new StreamingQueryManager for the same Spark Session [spark] - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2024/01/26 06:40:50 UTC, 5 replies.
- [PR] [SPARK-46874][PYTHON] Remove `pyspark.pandas` dependency from `assertDataFrameEqual` [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/26 08:57:05 UTC, 14 replies.
- [PR] [SPARK-46875][SQL] Provide better prompts when option's mode is null [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/26 09:28:07 UTC, 4 replies.
- Re: [PR] [SPARK-46875][SQL] When the `mode` is null, a `NullPointException` should `not` be thrown [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/26 11:10:33 UTC, 5 replies.
- [PR] [SPARK-46831][SQL] Collations - Extending StringType and PhysicalStringType with collationId field [spark] - posted by "dbatomic (via GitHub)" <gi...@apache.org> on 2024/01/26 11:21:32 UTC, 9 replies.
- [PR] [SPARK-46810][DOCS] Explain error class terminology in internal README [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/26 15:44:40 UTC, 5 replies.
- [PR] [WIP] Adding deleteIfExists operator to StatefulProcessorHandleImpl [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/01/26 21:08:52 UTC, 3 replies.
- [PR] [SPARK-46879][SQL] Run optimizer on REPLACE TABLE column defaults [spark] - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2024/01/26 21:11:54 UTC, 2 replies.
- [PR] [SPARK-46880][PYTHON][CONNECT][TESTS] Improve and test warning for Arrow-optimized Python UDF [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/01/26 21:30:42 UTC, 3 replies.
- [PR] [SPARK-46881][CORE] Support `spark.deploy.workerSelectionPolicy` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/26 23:07:31 UTC, 6 replies.
- Re: [PR] [WIP][SPARK-45230][SQL] Plan sorter for Aggregate after SMJ [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/27 00:17:24 UTC, 1 replies.
- Re: [PR] [SPARK-43035][Connect] Add error class in Spark Connect server's ErrorInfo [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/27 00:17:25 UTC, 1 replies.
- [PR] [SPARK-46882][SS][TEST] Replace unnecessary AtomicInteger with int [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/27 05:54:44 UTC, 6 replies.
- [PR] [SPARK-46883][CORE] Support `/json/clusterutilization` API [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/27 06:05:54 UTC, 3 replies.
- [PR] [WIP] [SPARK-46884] Spark Connect - ExecutePlanRequest new property - job description [spark] - posted by "GoEddie (via GitHub)" <gi...@apache.org> on 2024/01/27 08:09:15 UTC, 0 replies.
- [PR] [WIP][SPARK-46862][SQL][FOLLOWUP] Fix column pruning without schema enforcing in V1 CSV datasource [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/27 09:39:40 UTC, 0 replies.
- Re: [PR] [SPARK-46862][SQL][FOLLOWUP] Fix column pruning without schema enforcing in V1 CSV datasource [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/27 16:22:24 UTC, 1 replies.
- [PR] [SPARK-46885][SQL] Push down filters through `TypedFilter` [spark] - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2024/01/27 17:15:51 UTC, 2 replies.
- [PR] [SPARK-46886][CORE] Enable `spark.ui.prometheus.enabled` by default [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/27 19:46:46 UTC, 13 replies.
- [PR] [SPARK-46887][DOCS] Document a few missed `spark.ui.*` configs to `Configuration` page [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/27 23:43:01 UTC, 3 replies.
- Re: [PR] [SPARK-45607][SQL] Collapse repartition operators with a project [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/28 00:20:02 UTC, 1 replies.
- Re: [PR] [SPARK-45604] Prevent SEGFAULT on OffHeapColumnVector by providing explicit memory boundary check [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/28 00:20:04 UTC, 1 replies.
- Re: [PR] [MINOR][SQL] Update HintErrorLogger.scala to fix typo in the error message [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/28 00:20:05 UTC, 1 replies.
- [PR] [SPARK-46883][CORE][FOLLOWUP] Fix clusterutilization to handle 0 worker csae [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/28 01:09:20 UTC, 0 replies.
- Re: [PR] [SPARK-46883][CORE][FOLLOWUP] Fix `clusterutilization` API to handle 0 worker case [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/28 01:12:37 UTC, 7 replies.
- [PR] [SPARK-46888][CORE] Fix `Master` to reject worker kill request if decommission is disabled [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/28 02:14:19 UTC, 0 replies.
- Re: [PR] [SPARK-46888][CORE] Fix `Master` to reject `/workers/kill/` requests if decommission is disabled [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/28 02:30:10 UTC, 4 replies.
- [PR] [SPARK-46889][CORE] Validate `spark.master.ui.decommission.allow.mode` setting [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/28 06:19:21 UTC, 3 replies.
- Re: [PR] [SPARK-46137] update janino to version 3.1.11 [spark] - posted by "igreenfield (via GitHub)" <gi...@apache.org> on 2024/01/28 09:57:23 UTC, 0 replies.
- [PR] [Spark-46891] Allow injecting LogicalPlan Statistics visitor. [spark] - posted by "igreenfield (via GitHub)" <gi...@apache.org> on 2024/01/28 10:10:55 UTC, 0 replies.
- [PR] [SPARK-46892][BUILD] Upgrade dropwizard metrics 4.2.25 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/28 13:08:47 UTC, 3 replies.
- Re: [PR] [SPARK-3734] DriverRunner should not read SPARK_HOME from submitter's environment [spark] - posted by "thanhdanh1803 (via GitHub)" <gi...@apache.org> on 2024/01/28 16:13:26 UTC, 0 replies.
- [PR] [MINOR][DOCS] Miscellaneous documentation improvements [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/28 20:06:11 UTC, 2 replies.
- [PR] [SPARK-46894][PYTHON] Move PySpark error conditions into standalone JSON file [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/28 21:43:04 UTC, 18 replies.
- Re: [PR] [SPARK-45612][CORE] Allow cached RDDs to migrate to fallback storage during decommission [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/29 00:18:44 UTC, 1 replies.
- Re: [PR] [SPARK-44635][CORE] Refresh shuffle locations when decommission happens [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/29 00:18:47 UTC, 1 replies.
- Re: [PR] [SPARK-45594][SQL] Auto repartition before writing data into partitioned or bucket table [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/01/29 00:18:49 UTC, 1 replies.
- [PR] [MINOR][DOCS] Remove unneeded comments from global.html [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/01/29 00:56:49 UTC, 2 replies.
- Re: [PR] [SPARK-46895][CORE] Replace Timer with single thread scheduled executor [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/29 01:31:31 UTC, 2 replies.
- [PR] [SPARK-46896][PS][TESTS] Clean up the imports in `pyspark.pandas.tests.{frame, series, groupby}.*` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/29 03:13:30 UTC, 2 replies.
- [PR] [SPARK-46897][PYTHON][DOCS] Refine docstring of `bit_and/bit_or/bit_xor` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/29 03:38:10 UTC, 3 replies.
- [PR] [WIP][PYTHON] Upgrade `pyarrow` to 15.0.0 [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/29 04:04:40 UTC, 0 replies.
- [PR] [SPARK-46898][CONNECT] Simplify the protobuf function transformation in Planner [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/29 04:08:58 UTC, 3 replies.
- [PR] [SPARK-46899][CORE] Remove POST APIs from `MasterWebUI` when `spark.ui.killEnabled` is `false` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/29 04:55:17 UTC, 0 replies.
- [PR] [SS] Add a check for stateful operator change for streaming [spark] - posted by "jingz-db (via GitHub)" <gi...@apache.org> on 2024/01/29 05:16:38 UTC, 0 replies.
- [PR] [SPARK-46900][BUILD] Upgrade slf4j to 2.0.11 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/29 06:14:55 UTC, 3 replies.
- Re: [PR] [SPARK-46899][CORE] Remove `POST` APIs from `MasterWebUI` when `spark.ui.killEnabled` is `false` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/29 06:18:08 UTC, 5 replies.
- Re: [PR] [SPARK-46721][CORE][TESTS] Make gpu fraction tests more robust [spark] - posted by "Ngone51 (via GitHub)" <gi...@apache.org> on 2024/01/29 07:00:03 UTC, 1 replies.
- [PR] [WIP][BUILD] Test `grpcio*` 1.60.0 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/29 07:35:23 UTC, 1 replies.
- Re: [PR] [WIP][SPARK-46858][PYTHON][PS][BUILD] Upgrade Pandas to 2.2.0 [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/01/29 07:35:34 UTC, 4 replies.
- [PR] [WIP][CORE] Fix Spark History Server UI [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/29 07:36:43 UTC, 7 replies.
- Re: [PR] [SPARK-46901][PYTHON] Upgrade `pyarrow` to 15.0.0 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/29 07:41:53 UTC, 2 replies.
- Re: [PR] [SPARK-46877][SQL] Remove unnecessary synchronized [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/29 08:33:24 UTC, 4 replies.
- [PR] [SPARK-46902][UI] Fix Spark History Server UI for using un-exported setAppLimit [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/29 08:35:11 UTC, 5 replies.
- [PR] [SPARK-46903][CORE] Support Spark History Server Log UI [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/29 08:50:02 UTC, 5 replies.
- [PR] [SPARK-46893][UI] Remove inline scripts from UI descriptions [spark] - posted by "rshkv (via GitHub)" <gi...@apache.org> on 2024/01/29 09:25:06 UTC, 2 replies.
- [PR] [SPARK-46904][UI] Fix wrong display of History UI summary [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/29 10:24:49 UTC, 0 replies.
- [PR] [SPARK-46849][SQL][FOLLOWUP] Add dedicated class to keep column definition instead of StructField in Create/ReplaceTable command [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/29 15:51:28 UTC, 2 replies.
- Re: [PR] [SPARK-46904][UI] Fix display issue of History UI summary [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/29 16:47:34 UTC, 2 replies.
- [PR] [SPARK-46907][CORE] Show driver log location in Spark History Server [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/29 18:08:41 UTC, 4 replies.
- [PR] [WIP][SQL] Replace `UnsupportedOperationException` by `SparkUnsupportedOperationException` in `sql/api` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/01/29 18:42:57 UTC, 0 replies.
- Re: [PR] [SPARK-46905][SQL] Add dedicated class to keep column definition instead of StructField in Create/ReplaceTable command [spark] - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2024/01/29 19:13:37 UTC, 7 replies.
- [PR] [SPARK-46908] [WIP] Support star clause outside select [spark] - posted by "srielau (via GitHub)" <gi...@apache.org> on 2024/01/29 20:33:00 UTC, 0 replies.
- [PR] [SPARK-46890][SQL] Fix CSV parsing bug with existence default values and column pruning [spark] - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2024/01/29 22:15:12 UTC, 7 replies.
- [PR] [SPARK-46910][PYTHON] Eliminate JDK Requirement in PySpark Installation [spark] - posted by "asl3 (via GitHub)" <gi...@apache.org> on 2024/01/29 22:57:50 UTC, 4 replies.
- Re: [PR] [SPARK-46906][SS] Add a check for stateful operator change for streaming [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/01/30 00:34:25 UTC, 0 replies.
- [PR] [MINOR][SQL]Replace magic value with constants [spark] - posted by "jlfsdtc (via GitHub)" <gi...@apache.org> on 2024/01/30 03:01:39 UTC, 2 replies.
- [PR] [SPARK-46915][SQL] Simplify `UnaryMinus` and align error class [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/30 03:24:24 UTC, 0 replies.
- [PR] [SPARK-46912] Use worker JAVA_HOME and SPARK_HOME instead of from submitter [spark] - posted by "thanhdanh1803 (via GitHub)" <gi...@apache.org> on 2024/01/30 03:30:25 UTC, 1 replies.
- [PR] [SPARK-46914][UI] Shorten app name in the summary table on the History Page [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/30 03:35:08 UTC, 5 replies.
- [PR] [SPARK-46916][PS][TESTS] Clean up `pyspark.pandas.tests.indexes.*` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/01/30 03:56:28 UTC, 2 replies.
- [PR] [WIP][SPARK-46876]csv line containing delimiter can't be treated as empty line [spark] - posted by "doki23 (via GitHub)" <gi...@apache.org> on 2024/01/30 06:25:15 UTC, 2 replies.
- [PR] [SPARK-46747][SQL] Avoid scan in getTableExistsQuery for JDBC Dialects [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/01/30 07:24:10 UTC, 2 replies.
- Re: [PR] [SPARK-46915][SQL] Simplify `UnaryMinus` `Abs` and align error class [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/30 10:58:04 UTC, 1 replies.
- [PR] [SPARK-46918][YARN] Replace self-defined variables with Hadoop ContainerExitStatus [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/01/30 11:08:36 UTC, 3 replies.
- [PR] [SPARK-46920][YARN] Improve executor exit error message on YARN [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/01/30 12:28:30 UTC, 4 replies.
- [PR] [SPARK-46921] Move `ProblemFilters` that do not belong to `defaultExcludes` to `v40excludes`. [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/01/30 13:56:09 UTC, 0 replies.
- Re: [PR] [SPARK-46919][BUILD][CONNECT] Upgrade `grpcio*` to 1.60.0 and `grpc-java` to 1.61.0 [spark] - posted by "srowen (via GitHub)" <gi...@apache.org> on 2024/01/30 14:40:16 UTC, 1 replies.
- [PR] [SPARK-46922][CORE][SQL] Better handling for runtime user errors [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/30 15:03:49 UTC, 9 replies.
- Re: [PR] [SPARK-46473][SQL] Reuse `getPartitionedFile` method [spark] - posted by "srowen (via GitHub)" <gi...@apache.org> on 2024/01/30 16:04:42 UTC, 0 replies.
- Re: [PR] [SPARK-45708][BUILD] Retry mvn deploy [spark] - posted by "srowen (via GitHub)" <gi...@apache.org> on 2024/01/30 16:13:15 UTC, 0 replies.
- Re: [PR] [SPARK-46921][BUILD] Move `ProblemFilters` that do not belong to `defaultExcludes` to `v40excludes` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/30 16:44:39 UTC, 1 replies.
- Re: [PR] [MINOR][SQL] Use `DecimalType.MINIMUM_ADJUSTED_SCALE` instead of magic number `6` in `Divide` class [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/30 18:26:50 UTC, 3 replies.
- [PR] [SPARK-46798] Kafka custom partition location assignment in Spark Structured Streaming (rack awareness) [spark] - posted by "rschwagercharter (via GitHub)" <gi...@apache.org> on 2024/01/30 18:52:51 UTC, 0 replies.
- Re: [PR] [SPARK-46911][SS] Adding deleteIfExists operator to StatefulProcessorHandleImpl [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/01/30 18:55:23 UTC, 8 replies.
- Re: [PR] test to use lower case attribute reference [spark] - posted by "copperybean (via GitHub)" <gi...@apache.org> on 2024/01/31 09:45:57 UTC, 0 replies.
- [PR] [SPARK-46833][SQL] Collations - Introducing CollationFactory which provides comparison and hashing rules for supported collations [spark] - posted by "dbatomic (via GitHub)" <gi...@apache.org> on 2024/01/31 10:11:07 UTC, 1 replies.
- Re: [PR] [SPARK-45522][BUILD][CORE][SQL][UI] Migrate from Jetty 9 to Jetty 10 [spark] - posted by "HiuKwok (via GitHub)" <gi...@apache.org> on 2024/01/31 10:11:22 UTC, 2 replies.
- Re: [PR] [SPARK-45110][BUILD] Upgrade rocksdbjni to 8.8.1 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/01/31 10:42:45 UTC, 0 replies.
- [PR] [SPARK-46933] Add query execution time metric to connectors which use JDBCRDD [spark] - posted by "urosstan-db (via GitHub)" <gi...@apache.org> on 2024/01/31 11:57:14 UTC, 4 replies.
- Re: [PR] [SPARK-46929][CORE][CONNECT][SS] Use ThreadUtils.shutdown to close thread pools [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/01/31 12:32:19 UTC, 3 replies.
- [PR] [SPARK-45807][SQL] Return View after calling replaceView(..) [spark] - posted by "nastra (via GitHub)" <gi...@apache.org> on 2024/01/31 13:34:50 UTC, 1 replies.
- Re: [PR] [SPARK-46487][SQL] Push down part of filter through aggregate with nondeterministic field [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/01/31 13:56:33 UTC, 2 replies.
- Re: [PR] [SPARK-46908] Support star clause in WHERE clause [spark] - posted by "gatorsmile (via GitHub)" <gi...@apache.org> on 2024/01/31 16:38:55 UTC, 0 replies.
- Re: [PR] [SPARK-46930][SQL] Add support for a custom prefix for Union type fields in Avro [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/31 16:39:51 UTC, 1 replies.
- Re: [PR] [SPARK-46931][PS] Implement `{Frame, Series}.to_hdf` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/01/31 16:41:26 UTC, 1 replies.
- Re: [PR] [SS][SPARK-46928] Add support for ListState in Arbitrary State API v2. [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/01/31 22:23:06 UTC, 2 replies.