You are viewing a plain text version of this content. The canonical link for it is here.
- Re: [PR] [SPARK-46753][PYTHON][TESTS] Fix pypy3 python test [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/01 00:18:04 UTC, 0 replies.
- Re: [PR] [SPARK-43238][CORE] Support only decommission idle workers in standalone [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/01 00:19:20 UTC, 0 replies.
- Re: [PR] [SPARK-46931][PS] Implement `{Frame, Series}.to_hdf` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/01 00:43:22 UTC, 0 replies.
- Re: [PR] [SPARK-46929][CORE][CONNECT][SS] Use ThreadUtils.shutdown to close thread pools [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/02/01 01:19:41 UTC, 0 replies.
- Re: [PR] [SPARK-46882][SS][TEST] Replace unnecessary AtomicInteger with int [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/02/01 01:44:00 UTC, 3 replies.
- Re: [PR] [SPARK-46865][SS] Add Batch Support for TransformWithState Operator [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/02/01 01:47:51 UTC, 25 replies.
- [PR] [SPARK-46935] Consolidate error documentation [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/01 02:03:30 UTC, 1 replies.
- Re: [PR] [SPARK-46923][DOCS] Limit width of configuration tables [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/01 02:09:52 UTC, 0 replies.
- Re: [PR] [SPARK-46935][DOCS] Consolidate error documentation [spark] - posted by "srielau (via GitHub)" <gi...@apache.org> on 2024/02/01 02:16:04 UTC, 4 replies.
- Re: [PR] [MINOR][DOCS] Remove Canonicalize in docs [spark] - posted by "jlfsdtc (via GitHub)" <gi...@apache.org> on 2024/02/01 02:24:24 UTC, 0 replies.
- Re: [PR] [SPARK-46487][SQL] Push down part of filter through aggregate with nondeterministic field [spark] - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2024/02/01 02:38:22 UTC, 2 replies.
- Re: [PR] [SPARK-46908] Support star clause in WHERE clause [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/01 02:52:08 UTC, 21 replies.
- Re: [PR] [SPARK-46933] Add query execution time metric to connectors which use JDBCRDD [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/01 03:17:35 UTC, 1 replies.
- [PR] [SPARK-46936][PS] Implement `Frame.to_feather` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/01 03:22:49 UTC, 2 replies.
- Re: [PR] [SPARK-46922][CORE][SQL] Better handling for runtime user errors [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/01 03:25:11 UTC, 0 replies.
- [PR] retryable test [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/01 03:39:16 UTC, 0 replies.
- [PR] [SPARK-46852] Remove use of explicit key encoder and pass it implicitly to the operator for transformWithState operator [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/02/01 03:40:59 UTC, 0 replies.
- Re: [PR] [SPARK-46760][SQL][DOCS] Make the document of spark.sql.adaptive.coalescePartitions.parallelismFirst clearer [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/01 03:44:14 UTC, 6 replies.
- [PR] [SPARK-46707][SQL][FOLLOWUP] Push down throwable predicate through aggregates [spark] - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2024/02/01 03:45:31 UTC, 17 replies.
- Re: [PR] [SPARK-46852][SS] Remove use of explicit key encoder and pass it implicitly to the operator for transformWithState operator [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/02/01 03:48:35 UTC, 2 replies.
- Re: [PR] [SPARK-46473][SQL] Reuse `getPartitionedFile` method [spark] - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2024/02/01 04:05:20 UTC, 2 replies.
- Re: [PR] [SPARK-45527][CORE] Use fraction to do the resource calculation [spark] - posted by "wbo4958 (via GitHub)" <gi...@apache.org> on 2024/02/01 05:17:38 UTC, 45 replies.
- Re: [PR] [SPARK-46228][SQL] Insert window group limit node for limit outside of window [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/01 05:33:20 UTC, 12 replies.
- Re: [PR] [SPARK-46400][CORE][SQL] When there are corrupted files in the local maven repo, skip this cache and try again [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/01 05:34:08 UTC, 2 replies.
- Re: [PR] [SPARK-42199][SQL] Fix issues around Dataset.groupByKey [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/01 05:47:51 UTC, 3 replies.
- [PR] [WIP][SPARK-46937][SQL] Improve concurrency performance for FunctionRegistry [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/02/01 05:52:07 UTC, 0 replies.
- Re: [PR] [SPARK-46617][SQL] Create-table-if-not-exists should not silently overwrite existing data-files [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/01 06:10:38 UTC, 4 replies.
- [PR] [SPARK-46939][CORE] Simplify IndylambdaScalaClosures#getSerializationProxy [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/01 06:27:53 UTC, 1 replies.
- [PR] [MINOR][SQL] Clean up outdated comments from `hash` function in `Metadata` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/01 07:30:11 UTC, 2 replies.
- [PR] [SPARK-46940][CORE] Remove unused `updateSparkConfigFromProperties` and `isAbsoluteURI` in `o.a.s.u.Utils` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/01 08:01:02 UTC, 7 replies.
- Re: [PR] [SPARK-46939][CORE] Remove `isClosureCandidate` check from `getSerializationProxy` function in IndylambdaScalaClosures [spark] - posted by "rednaxelafx (via GitHub)" <gi...@apache.org> on 2024/02/01 08:14:48 UTC, 2 replies.
- [PR] [SPARK-46941][SQL] Can't insert window group limit node for top-k computation if contains SizeBasedWindowFunction [spark] - posted by "zml1206 (via GitHub)" <gi...@apache.org> on 2024/02/01 08:53:52 UTC, 4 replies.
- [PR] [SPARK-46942][CORE] Ignore --num-executors config if DYN_ALLOCATION_ENABLED is true and allow remove idle executors [spark] - posted by "wankunde (via GitHub)" <gi...@apache.org> on 2024/02/01 08:57:05 UTC, 1 replies.
- Re: [PR] [SPARK-46876]csv line containing delimiter can't be treated as empty line [spark] - posted by "doki23 (via GitHub)" <gi...@apache.org> on 2024/02/01 09:34:40 UTC, 2 replies.
- [PR] [SPARK-46943][SQL] Support for configuring ShuffledHashJoin plan size Threshold [spark] - posted by "cxzl25 (via GitHub)" <gi...@apache.org> on 2024/02/01 09:48:21 UTC, 3 replies.
- Re: [PR] [SPARK-46891][SQL] Allow injecting LogicalPlan Statistics visitor. [spark] - posted by "igreenfield (via GitHub)" <gi...@apache.org> on 2024/02/01 10:44:48 UTC, 0 replies.
- Re: [PR] [SPARK-45110][BUILD] Upgrade rocksdbjni to 8.8.1 [spark] - posted by "srowen (via GitHub)" <gi...@apache.org> on 2024/02/01 12:37:07 UTC, 1 replies.
- [PR] [SPARK-46944] Follow up to SPARK-46792 (ChannelBuilder refactoring): Fix minor typing oversight [spark] - posted by "cdkrot (via GitHub)" <gi...@apache.org> on 2024/02/01 12:56:49 UTC, 5 replies.
- Re: [PR] [SPARK-46937][SQL] Improve concurrency performance for FunctionRegistry [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/02/01 13:03:38 UTC, 2 replies.
- Re: [PR] [SPARK-45807][SQL] Return View after calling replaceView(..) [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/01 13:33:57 UTC, 6 replies.
- Re: [PR] [SPARK-46911][SS] Adding deleteIfExists operator to StatefulProcessorHandleImpl [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/01 13:47:33 UTC, 10 replies.
- Re: [PR] [SPARK-46833][SQL] Collations - Introducing CollationFactory which provides comparison and hashing rules for supported collations [spark] - posted by "dbatomic (via GitHub)" <gi...@apache.org> on 2024/02/01 13:55:44 UTC, 48 replies.
- [PR] [MINOR][DOCS] Fix outgoing links from SELECT SQL reference page [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/01 14:16:23 UTC, 2 replies.
- Re: [PR] [SPARK-46933][SQL] Add query execution time metric to connectors which use JDBCRDD [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/01 14:48:12 UTC, 1 replies.
- Re: [PR] [SPARK-39910][SQL] Delegate path qualification to filesystem during DataSource file path globbing [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/01 15:15:32 UTC, 10 replies.
- Re: [PR] [SPARK-46747][SQL] Avoid scan in getTableExistsQuery for JDBC Dialects [spark] - posted by "bala-bellam (via GitHub)" <gi...@apache.org> on 2024/02/01 17:19:53 UTC, 1 replies.
- Re: [PR] [SS][SPARK-46928] Add support for ListState in Arbitrary State API v2. [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/02/01 18:25:20 UTC, 118 replies.
- Re: [PR] [SPARK-46890][SQL] Fix CSV parsing bug with existence default values and column pruning [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/01 18:41:06 UTC, 8 replies.
- Re: [PR] [SPARK-46908][SQL] Support star clause in WHERE clause [spark] - posted by "srielau (via GitHub)" <gi...@apache.org> on 2024/02/01 19:22:05 UTC, 2 replies.
- Re: [PR] [SPARK-46864][SS] Onboard Arbitrary StateV2 onto New Error Class Framework [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/01 20:37:43 UTC, 1 replies.
- [PR] [SPARK-46945][K8S] Add `spark.kubernetes.legacy.useReadWriteOnceAccessMode` for old K8s clusters [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/01 21:07:24 UTC, 8 replies.
- [PR] [SPARK-46945][K8S][3.5] Add `spark.kubernetes.legacy.useReadWriteOnceAccessMode` for old K8s clusters [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/01 21:15:08 UTC, 4 replies.
- [PR] [SQL] Supporting broadcast of multiple filtering keys in DynamicPruning [spark] - posted by "longvu-db (via GitHub)" <gi...@apache.org> on 2024/02/01 21:46:10 UTC, 0 replies.
- [PR] Avro serialization [spark] - posted by "jingz-db (via GitHub)" <gi...@apache.org> on 2024/02/01 22:18:48 UTC, 0 replies.
- Re: [PR] [SPARK-46812][SQL][PYTHON] Make mapInPandas / mapInArrow support ResourceProfile [spark] - posted by "wbo4958 (via GitHub)" <gi...@apache.org> on 2024/02/01 23:57:20 UTC, 12 replies.
- Re: [PR] [SPARK-42261][SPARK-42260][K8S] Log Allocation Stalls and Trigger Allocation event without blocking on snapshot [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/02 00:17:59 UTC, 2 replies.
- Re: [PR] [SPARK-46946][SQL] Supporting broadcast of multiple filtering keys in DynamicPruning [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/02 02:15:39 UTC, 5 replies.
- [PR] [SPARK-42727][CORE] Fix can't executing spark commands in the root directory when local mode is specified [spark] - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2024/02/02 02:24:13 UTC, 5 replies.
- [PR] [SPARK-46949][SQL] Support CHAR/VARCHAR through ResolveDefaultColumns [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/02 02:32:56 UTC, 8 replies.
- [PR] [WIP][SPARK-46950][CORE][SQL] Align not available codec error-class [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/02 02:40:23 UTC, 2 replies.
- Re: [PR] [SPARK-46945][K8S][3.4] Add `spark.kubernetes.legacy.useReadWriteOnceAccessMode` for old K8s clusters [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/02 02:45:44 UTC, 2 replies.
- Re: [PR] [SPARK-46922][CORE][SQL] Do not wrap runtime user-facing errors [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/02 03:03:23 UTC, 6 replies.
- Re: [PR] [SPARK-46683][SQL][TESTS][FOLLOW-UP] Fix typo, use queries in partition set [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/02 03:23:04 UTC, 1 replies.
- [PR] [SPARK-43742][TEST] Wrap withTable for a test in ResolveDefaultColumnsSuite [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/02 05:39:01 UTC, 0 replies.
- Re: [PR] [SPARK-46950][CORE][SQL] Align `not available codec` error-class [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/02 05:52:42 UTC, 4 replies.
- [PR] [SPARK-46952][SQL] XML: Limit size of corrupt record [spark] - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2024/02/02 06:10:57 UTC, 8 replies.
- [PR] Change the signature of the hllInvalidLgK query execution error to take an integer as 4th argument [spark] - posted by "mkaravel (via GitHub)" <gi...@apache.org> on 2024/02/02 06:13:56 UTC, 1 replies.
- Re: [PR] [SPARK-46953][TEST]] Wrap withTable for a test in ResolveDefaultColumnsSuite [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/02 07:17:44 UTC, 1 replies.
- [PR] [SPARK-46955][PS] Implement `Frame.to_stata` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/02 07:53:54 UTC, 2 replies.
- [PR] [SPARK-46954][SQL] XML: Optimize schema index lookup [spark] - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2024/02/02 08:25:26 UTC, 3 replies.
- [PR] [WIP][CORE] Rewrite `OpenHashSet#hasher` with `pattern matching` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/02 08:38:18 UTC, 1 replies.
- [PR] [SPARK-46956][SQL] Improve the error prompt when `SaveMode` is null in API `DataFrameWriter.mode` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/02 09:04:32 UTC, 3 replies.
- [PR] [SPARK-46958][SQL] Fix bug when canUpcast and result non-foldable expression [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/02 10:46:37 UTC, 1 replies.
- Re: [PR] [SPARK-46958][SQL] Fix bug when the canUpcast branch result non-foldable default value expression [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/02 11:58:40 UTC, 1 replies.
- Re: [PR] [SPARK-44815][CONNECT]Cache df.schema to avoid extra RPC [spark] - posted by "hvanhovell (via GitHub)" <gi...@apache.org> on 2024/02/02 12:47:24 UTC, 1 replies.
- [PR] [WIP][Spark 44646] Reduce usage of log4j core [spark] - posted by "mucharafal (via GitHub)" <gi...@apache.org> on 2024/02/02 15:52:52 UTC, 0 replies.
- [PR] [WIP] Using ProcessorContext to store and retrieve handle [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/02/02 17:31:37 UTC, 0 replies.
- Re: [PR] [SPARK-46915][SQL] Simplify `UnaryMinus` `Abs` and align error class [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/02 17:33:08 UTC, 1 replies.
- [PR] [MINOR][DOCS] Explain that the default fractional numeric literal is a decimal [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/02 17:33:13 UTC, 0 replies.
- [PR] [WIP] Testing Multiple Input Streams with TransformWithState operator [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/02/02 18:06:24 UTC, 0 replies.
- Re: [PR] [SPARK-46526][SQL] Support LIMIT over correlated subqueries where predicates only reference outer table [spark] - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2024/02/02 19:34:55 UTC, 4 replies.
- [PR] [SPARK-46963] Verify AQE is not enabled for Structured Streaming [spark] - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2024/02/02 19:44:45 UTC, 3 replies.
- Re: [PR] [SPARK-46964] Change the signature of the hllInvalidLgK query execution error to take an integer as 4th argument [spark] - posted by "mkaravel (via GitHub)" <gi...@apache.org> on 2024/02/02 19:47:26 UTC, 0 replies.
- Re: [PR] [SPARK-46964][SQL] Change the signature of the hllInvalidLgK query execution error to take an integer as 4th argument [spark] - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2024/02/02 19:54:03 UTC, 1 replies.
- Re: [PR] [SS][WIP] Serialization using case classes/primitives/POJO based on Avro for Arbitrary State API v2. [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/02/02 22:11:28 UTC, 4 replies.
- [PR] [SPARK-46965][CORE] Check `logType` in `Utils.getLog` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/03 00:15:47 UTC, 10 replies.
- Re: [PR] ExternalSorter#mergeSort complexity should be linear if data is already sorted [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/03 00:18:23 UTC, 1 replies.
- Re: [PR] [SPARK-43829][CONNECT] Improve SparkConnectPlanner by reuse Dataset and avoid construct new Dataset [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/03 00:18:25 UTC, 5 replies.
- [PR] [SPARK-46966][Python] Add UDTF API for 'analyze' method to indicate subset of input table columns to select [spark] - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2024/02/03 00:43:06 UTC, 11 replies.
- [PR] SPARK-44111: Bump jetty to v11 [spark] - posted by "HiuKwok (via GitHub)" <gi...@apache.org> on 2024/02/03 05:52:33 UTC, 6 replies.
- [PR] [SPARK-46967][CORE][UI] Hide `Thread Dump` and `Heap Histogram` of `Dead` executors in `Executors` UI [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/03 07:35:13 UTC, 5 replies.
- Re: [PR] [SPARK-46968][SQL] Replace `UnsupportedOperationException` by `SparkUnsupportedOperationException` in `sql` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/03 07:39:29 UTC, 3 replies.
- Re: [PR] [SPARK-46899][CORE] Remove `POST` APIs from `MasterWebUI` when `spark.ui.killEnabled` is `false` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/03 07:43:05 UTC, 1 replies.
- [PR] [SPARK-46899][CORE][FOLLOWUP] Enable `/workers/kill` if `spark.decommission.enabled=true` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/03 08:01:43 UTC, 4 replies.
- [PR] [SPARK-46969][SQL][TESTS] Recover `to_timestamp('366', 'DD')` test case of `datetime-parsing-invalid.sql` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/03 10:34:37 UTC, 4 replies.
- [PR] [MINOR][DOCS] Remove Java8/11 at `IgnoreUnrecognizedVMOptions` description [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/03 10:52:27 UTC, 2 replies.
- Re: [PR] [MINOR][DOCS] Remove Java 8/11 at `IgnoreUnrecognizedVMOptions` description [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/03 10:55:28 UTC, 5 replies.
- [PR] [SPARK-45276][INFRA][FOLLOWUP] Fix Java version comment from 11 to 17 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/03 11:01:08 UTC, 6 replies.
- [PR] [MINOR][DOCS] Clean markup in NULL semantics documentation [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/03 21:52:52 UTC, 2 replies.
- Re: [PR] [SPARK-46895][CORE] Replace Timer with single thread scheduled executor [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/03 22:09:21 UTC, 10 replies.
- Re: [PR] [SPARK-45668][CORE] Improve the assert message in RollingEventLogFilesFileReader [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/04 00:20:13 UTC, 1 replies.
- Re: [PR] [SPARK-44405][SQL][TESTS] Reduce code duplication in group-based DELETE and MERGE tests [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/04 00:20:15 UTC, 1 replies.
- Re: [PR] [SPARK-46641][SS] Add maxBytesPerTrigger threshold [spark] - posted by "MaxNevermind (via GitHub)" <gi...@apache.org> on 2024/02/04 00:44:51 UTC, 16 replies.
- Re: [PR] [SPARK-46958][SQL] Add missing timezone to coerce default values [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/04 03:51:22 UTC, 4 replies.
- Re: [PR] [SPARK-46970][CORE] Rewrite `OpenHashSet#hasher` with `pattern matching` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/04 04:51:43 UTC, 5 replies.
- Re: [PR] [SPARK-46919][BUILD][CONNECT] Upgrade `grpcio*` to 1.60.0 and `grpc-java` to 1.61.0 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/04 05:38:12 UTC, 0 replies.
- Re: [PR] [SPARK-46654][SQL] Make to_csv can correctly display complex types data [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/04 06:15:56 UTC, 3 replies.
- [PR] [SPARK-46971][SQL] When the `compression` is null, a `NullPointException` should not be thrown [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/04 07:28:13 UTC, 3 replies.
- [PR] [Do not merge] Replace Timer with single thread scheduled executor [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/02/04 07:55:42 UTC, 1 replies.
- Re: [PR] [SPARK-42789][SQL] Rewrite multiple GetJsonObjects to a JsonTuple if their json expressions are the same [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/04 08:11:50 UTC, 2 replies.
- [PR] [SPARK-46400][CORE][SQL][3.5] When there are corrupted files in the local maven repo, skip this cache and try again [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/04 08:23:50 UTC, 4 replies.
- [PR] [SPARK-46972][SQL] Fix asymmetrical replacement for char/varchar in V2SessionCatalog.createTable [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/04 09:30:53 UTC, 5 replies.
- [PR] [SPARK-42789][SQL] Rewrite multiple GetJsonObject that consumes same JSON to single JsonTuple [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/02/04 12:10:41 UTC, 1 replies.
- [PR] [MINOR][DOCS] Add Missing space in `docs/configuration.md` [spark] - posted by "KKtheGhost (via GitHub)" <gi...@apache.org> on 2024/02/04 13:17:13 UTC, 2 replies.
- [PR] [MINOR][DOCS] Fix various broken links and link anchors [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/04 15:42:58 UTC, 5 replies.
- [PR] [SPARK-46962[SS] Implement python worker to run python streaming data source [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/02/04 21:24:23 UTC, 1 replies.
- Re: [PR] [SPARK-46512][CORE] Optimize shuffle reading when both sort and combine are used. [spark] - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2024/02/04 22:55:04 UTC, 4 replies.
- Re: [PR] [SPARK-45621] Add feature to evaluate subquery before push down filter Optimizer rule [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/05 00:19:13 UTC, 1 replies.
- [PR] [SPARK-46974][SQL][TEST] Recover a test case for day-of-year 2-letter 'DD' pattern parsing 3-digit values [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/05 04:05:46 UTC, 2 replies.
- Re: [PR] [SPARK-46962][SS][PYTHON] Implement python worker to run python streaming data source [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/02/05 04:38:26 UTC, 19 replies.
- Re: [PR] [MINOR][DOCS] The default fractional numeric literal in SQL is a decimal [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/05 05:56:27 UTC, 5 replies.
- [PR] [MINOR][TEST] Add output/exception to error message when schema not matched [spark] - posted by "viirya (via GitHub)" <gi...@apache.org> on 2024/02/05 06:09:20 UTC, 0 replies.
- Re: [PR] [SPARK-28346][SQL] clone the query plan between analyzer, optimizer and planner [spark] - posted by "MasterDDT (via GitHub)" <gi...@apache.org> on 2024/02/05 06:40:50 UTC, 1 replies.
- [PR] [SPARK-46975][PS] Move `to_{hdf, feather, stata}` to the fallback list [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/05 07:07:41 UTC, 10 replies.
- [PR] [MINOR][DOCS] Document default fractional numeric literals in SQL [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/05 07:25:19 UTC, 9 replies.
- [PR] [SPARK-46976][PS] Implement `DataFrameGroupBy.corr` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/05 07:36:41 UTC, 5 replies.
- [PR] [SPARK-36832][kubernetes]: implement launcher protocol for K8s client to manage app using SparkAppHandle [spark] - posted by "Vensence (via GitHub)" <gi...@apache.org> on 2024/02/05 07:58:35 UTC, 0 replies.
- Re: [PR] [SPARK-46920][YARN] Improve executor exit error message on YARN [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/02/05 09:33:13 UTC, 5 replies.
- [PR] [SPARK-46977][CORE] A failed request to obtain a token from one NameNode should not block subsequent token requests [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/02/05 10:22:10 UTC, 2 replies.
- [PR] [SPARK-46978][PYTHON][DOCS] Refine docstring of `sum_distinct/array_agg/count_if` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/05 10:22:55 UTC, 3 replies.
- Re: [PR] [MINOR][TEST] Add output/exception to error message when schema not matched in `TPCDSQueryTestSuite` [spark] - posted by "viirya (via GitHub)" <gi...@apache.org> on 2024/02/05 15:11:25 UTC, 2 replies.
- Re: [PR] [SPARK-46977][CORE] A failed request to obtain a token from one NameNode should not skip subsequent token requests [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/02/05 16:10:13 UTC, 2 replies.
- [PR] [SPARK-46849][SQL][FOLLOWUP] Column default value cannot reference session variables [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/05 17:32:40 UTC, 3 replies.
- [PR] [WIP][SQL] Replace `IllegalArgumentException` by `SparkIllegalArgumentException` in `catalyst` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/05 19:30:29 UTC, 0 replies.
- [PR] Avoid using internal APIs in tests [spark] - posted by "markj-db (via GitHub)" <gi...@apache.org> on 2024/02/05 22:06:54 UTC, 0 replies.
- [PR] [WIP] Support profiling in other types of UDF [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/02/05 23:42:06 UTC, 0 replies.
- Re: [PR] [SPARK-46980][SQL][MINOR] Avoid using internal APIs in dataframe end-to-end tests [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/06 01:10:57 UTC, 1 replies.
- [PR] [SPARK-45599][CORE] Use object equality in OpenHashSet [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/06 02:56:42 UTC, 36 replies.
- Re: [PR] [SPARK-46960][SS] Testing Multiple Input Streams with TransformWithState operator [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/06 03:28:16 UTC, 1 replies.
- [PR] [SPARK-46170][SQL][3.5] Support inject adaptive query post planner strategy rules in SparkSessionExtensions [spark] - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2024/02/06 03:55:29 UTC, 4 replies.
- [PR] [SPARK-46979] Add support for specifying key and value encoder separately and also for each col family in RocksDB state store provider [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/02/06 04:15:18 UTC, 0 replies.
- Re: [PR] [SPARK-46979][SS] Add support for specifying key and value encoder separately and also for each col family in RocksDB state store provider [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/02/06 04:18:26 UTC, 21 replies.
- Re: [PR] [WIP][SS] Python streaming source [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/02/06 04:27:36 UTC, 0 replies.
- [PR] [SPARK-46934][SQL] Read/write roundtrip for struct type with special characters with HMS [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/06 04:45:16 UTC, 4 replies.
- Re: [PR] [MINOR][DOCS] Clarify docs on default fractional numeric literals in SQL [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/06 04:46:56 UTC, 1 replies.
- [PR] [SPARK-46982][SQL] Remove _LEGACY_ERROR_TEMP_2187 in favor of CANNOT_RECOGNIZE_HIVE_TYPE [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/06 05:39:42 UTC, 4 replies.
- [PR] [SQL][SPARK-46954] XML: Wrap InputStreamReader with BufferedReader [spark] - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2024/02/06 06:00:46 UTC, 0 replies.
- Re: [PR] [SPARK-46954][SQL] XML: Wrap InputStreamReader with BufferedReader [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/06 07:11:13 UTC, 1 replies.
- [PR] [SPARK-46984][PYTHON] Remove pyspark.copy_func [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/06 07:34:13 UTC, 3 replies.
- [PR] [SPARK-46985][PYTHON] Move pyspark._NoValue to pyspark.sql._NoValue [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/06 07:55:22 UTC, 1 replies.
- [PR] [SPARK-46986][PYTHON] Move pyspark.loose_version to pyspark.sql.loose_version [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/06 08:09:02 UTC, 1 replies.
- [PR] [SPARK-46987][CONNECT] `ProtoUtils.abbreviate` avoid unnecessary `setField` operation [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/06 09:03:03 UTC, 3 replies.
- [PR] [SPARK-46989][SQL][CONNECT] Improve concurrency performance for SparkSession [spark] - posted by "beliefer (via GitHub)" <gi...@apache.org> on 2024/02/06 11:11:00 UTC, 3 replies.
- Re: [PR] (DRAFT) [SPARK-46798] Kafka custom partition location assignment in Spark Structured Streaming (rack awareness) [spark] - posted by "subham611 (via GitHub)" <gi...@apache.org> on 2024/02/06 14:05:28 UTC, 1 replies.
- [PR] [MINOR][DOCS] Show sort order of NaN relative to infinity [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/06 16:20:19 UTC, 3 replies.
- [PR] assorted copy edits to migration instructions [spark] - posted by "elharo (via GitHub)" <gi...@apache.org> on 2024/02/06 18:23:57 UTC, 0 replies.
- Re: [PR] [SPARK-46688][SPARK-46691][PYTHON][CONNECT] Support v2 profiling in aggregate Pandas UDFs [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/02/06 19:23:16 UTC, 2 replies.
- [PR] [MINOR][PYTHON] refactor PythonWrite to prepare for supporting python source streaming write [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/02/06 19:33:14 UTC, 0 replies.
- Re: [PR] [MINOR][PYTHON] refactor PythonWrite to prepare for supporting python data source streaming write [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/02/06 19:36:57 UTC, 1 replies.
- [PR] [SPARK-46689][SPARK-46690][PYTHON][CONNECT] Support v2 profiling in group/cogroup applyInPandas [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/02/06 22:07:54 UTC, 1 replies.
- [PR] [SPARK-46913][WIP] Add support for processing/event time based timers with transformWithState operator [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/02/06 23:20:07 UTC, 0 replies.
- [PR] [WIP][SPARK-46947][CORE] Delay memory manager initialization until Driver plugin is loaded [spark] - posted by "sunchao (via GitHub)" <gi...@apache.org> on 2024/02/06 23:34:18 UTC, 9 replies.
- [PR] [DO-NOT-MERGE] Decouple PySpark core API to pyspark.core package [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/07 04:46:16 UTC, 0 replies.
- [PR] [SPARK-46996][SQL] Allow AQE coalesce final stage in SQL cached plan [spark] - posted by "liuzqt (via GitHub)" <gi...@apache.org> on 2024/02/07 04:46:46 UTC, 7 replies.
- [PR] [SPARK-46997][CORE] Enable `spark.worker.cleanup.enabled` by default [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/07 06:16:29 UTC, 5 replies.
- [PR] [SPARK-43117][CONNECT] Make `ProtoUtils.abbreviate` support repeated fields [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/07 07:03:41 UTC, 9 replies.
- Re: [PR] [SPARK-45762][CORE] Support shuffle managers defined in user jars by changing startup order [spark] - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2024/02/07 07:58:02 UTC, 2 replies.
- [PR] [WIP][SQL] Deprecate the SQL config `spark.sql.legacy.allowZeroIndexInFormatString` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/07 08:40:54 UTC, 0 replies.
- Re: [PR] [SPARK-46998][SQL] Deprecate the SQL config `spark.sql.legacy.allowZeroIndexInFormatString` [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/07 11:59:13 UTC, 5 replies.
- [PR] [SPARK-46999][SQL] ExpressionWithUnresolvedIdentifier should include other expressions in the expression tree [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/07 13:25:01 UTC, 3 replies.
- [PR] [SPARK-46993] Fix constant folding for session variables [spark] - posted by "srielau (via GitHub)" <gi...@apache.org> on 2024/02/07 16:25:52 UTC, 3 replies.
- [PR] [SPARK-47000][CORE] Use `getTotalMemorySize` in `WorkerArguments` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/07 18:30:00 UTC, 4 replies.
- Re: [PR] [SPARK-46961][SS] Using ProcessorContext to store and retrieve handle [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/02/07 19:21:23 UTC, 8 replies.
- [PR] [SPARK-47003][K8S] Detect and fail on invalid volume sizes (< 1KiB) in K8s [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/07 21:26:08 UTC, 17 replies.
- Re: [PR] [SPARK-46689][SPARK-46690][PYTHON][CONNECT] Support v2 profiling in group/cogroup applyInPandas/applyInArrow [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/02/07 21:58:44 UTC, 5 replies.
- [PR] [SPARK-47002][Python] Return better error message if UDTF 'analyze' method 'orderBy' field accidentally returns a list of strings [spark] - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2024/02/07 22:01:37 UTC, 4 replies.
- [PR] [SPARK-47004] Added more tests to ClientStreamingQuerySuite to increase Scala client test coverage [spark] - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2024/02/07 22:10:45 UTC, 2 replies.
- [PR] [SPARK-46832][SQL] Introducing Collate and Collation expressions [spark] - posted by "dbatomic (via GitHub)" <gi...@apache.org> on 2024/02/07 22:15:53 UTC, 30 replies.
- [PR] [REFERENCE][DO-NOT-MERGE] Initial implementation of python streaminng data source [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/02/07 22:49:19 UTC, 0 replies.
- Re: [PR] [SPARK-42304][FOLLOWUP][SQL] Add test for `GET_TABLES_BY_TYPE_UNSUPPORTED_BY_HIVE_VERSION` [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/08 00:17:59 UTC, 1 replies.
- Re: [PR] [SPARK-46994][PYTHON] Refactor PythonWrite to prepare for supporting python data source streaming write [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/08 03:16:34 UTC, 1 replies.
- [PR] [SPARK-47005][PYTHON][DOCS] Refine docstring of `asc_nulls_first/asc_nulls_last/desc_nulls_first/desc_nulls_last` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/08 03:19:26 UTC, 2 replies.
- [PR] [MINOR][PYTHON][SQL][TESTS] Don't load Python Data Source when Python executable is not available even for testing [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/08 04:12:57 UTC, 4 replies.
- Re: [PR] [SPARK-46993][SQL] Fix constant folding for session variables [spark] - posted by "srielau (via GitHub)" <gi...@apache.org> on 2024/02/08 05:11:07 UTC, 3 replies.
- Re: [PR] [SPARK-46615][CONNECT] Support s.c.immutable.ArraySeq in ArrowDeserializers [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/08 06:57:21 UTC, 1 replies.
- [PR] [WIP][CORE] Refactor `refill()` method to `isExhausted()` in `NioBufferedFileInputStream` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/08 08:03:50 UTC, 1 replies.
- [PR] [SPARK-47007] SortMap function [spark] - posted by "stefankandic (via GitHub)" <gi...@apache.org> on 2024/02/08 13:58:45 UTC, 0 replies.
- [PR] [SPARK-47011][MLLIB] Remove deprecated `BinaryClassificationMetrics.scoreLabelsWeight` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/08 17:39:47 UTC, 6 replies.
- Re: [PR] [SPARK-46831][SQL] Collations - Extending StringType and PhysicalStringType with collationId field [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/08 17:57:14 UTC, 3 replies.
- [PR] [SPARK-46831][INFRA][FOLLOWUP] Fix a wrong JIRA ID in MimaExcludes [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/08 17:59:28 UTC, 5 replies.
- [PR] [WIP] POC to add TTL for ValueState [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/02/08 21:48:46 UTC, 1 replies.
- [PR] Implement methods dumpPerfProfiles and dumpMemoryProfiles of SparkSession [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/02/09 00:07:42 UTC, 0 replies.
- Re: [PR] [SPARK-45736][EXAMPLE] Use \s+ as separator when testing kafka source and network source [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/09 00:17:57 UTC, 1 replies.
- Re: [PR] [SPARK-35946][PYTHON] Respect Py4J server in InheritableThread API [spark] - posted by "pratyush-prateek (via GitHub)" <gi...@apache.org> on 2024/02/09 05:23:41 UTC, 2 replies.
- Re: [PR] [SPARK-35303][PYTHON] Enable pinned thread mode by default [spark] - posted by "pratyush-prateek (via GitHub)" <gi...@apache.org> on 2024/02/09 06:09:51 UTC, 1 replies.
- [PR] [SPARK-46355][SQL][FOLLOW-UP] XML: Test to check number of open files [spark] - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2024/02/09 08:01:08 UTC, 0 replies.
- [PR] [SPARK-44914][BUILD] Upgrade Apache Ivy to 2.5.2 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/09 09:09:19 UTC, 26 replies.
- [PR] [MINOR][DOCS] Remove outdated `antlr4` comment in pom.xml [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/09 09:27:32 UTC, 0 replies.
- Re: [PR] [MINOR][DOCS] Remove outdated `antlr4` version comment in `pom.xml` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/09 09:50:23 UTC, 3 replies.
- Re: [PR] [SPARK-46355][SQL][TESTS][FOLLOWUP] Test to check number of open files [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/09 10:37:52 UTC, 1 replies.
- Re: [PR] [SPARK-45274][CORE][SQL][UI] Implementation of a new DAG drawing approach for job/stage/plan graphics to avoid fork [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/09 11:17:08 UTC, 1 replies.
- Re: [PR] [SPARK-47006][CORE] Refactor `refill()` method to `isExhausted()` in `NioBufferedFileInputStream` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/09 11:32:41 UTC, 2 replies.
- Re: [PR] [SPARK-47004][CONNECT] Added more tests to ClientStreamingQuerySuite to increase Scala client test coverage [spark] - posted by "bogao007 (via GitHub)" <gi...@apache.org> on 2024/02/09 23:12:30 UTC, 2 replies.
- Re: [PR] [SPARK-45744][CORE] Switch `spark.history.store.serializer` to use `PROTOBUF` by default [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/10 00:17:38 UTC, 1 replies.
- [PR] [SPARK-45789] Support DESCRIBE TABLE for clustering columns [spark] - posted by "imback82 (via GitHub)" <gi...@apache.org> on 2024/02/10 00:36:09 UTC, 1 replies.
- [PR] [SPARK-47020][CORE] Fix `RealBrowserUISeleniumSuite` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/10 00:54:56 UTC, 0 replies.
- Re: [PR] [SPARK-47020][CORE][TESTS] Fix `RealBrowserUISeleniumSuite` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/10 01:31:30 UTC, 3 replies.
- Re: [PR] [SPARK-47007][SQL] SortMap function [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/10 01:45:12 UTC, 1 replies.
- [PR] [SPARK-44445][BUILD][TESTS] Upgrade to `htmlunit` 3.3.0 and `htmlunit3-driver` 4.17.0 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/10 05:07:47 UTC, 0 replies.
- [PR] [SPARK-47021][BUILD][TESTS] Fix `kvstore` module to have explicit `commons-lang3` test dependency [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/10 05:46:49 UTC, 3 replies.
- Re: [PR] [SPARK-44445][BUILD][TESTS] Upgrade to `htmlunit` 3.10.0 and `htmlunit3-driver` 4.17.0 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/10 06:07:04 UTC, 3 replies.
- Re: [PR] [WIP][SPARK-4836][UI] Show all stage attempts on UI's job details page [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/11 00:20:15 UTC, 2 replies.
- Re: [PR] [SPARK-46991][SQL] Replace `IllegalArgumentException` by `SparkIllegalArgumentException` in `catalyst` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/11 11:48:24 UTC, 2 replies.
- [PR] [SPARK-47022][CONNECT][TESTS][3.5] Fix `connect/client/jvm` to have explicit `commons-lang3` test dependency [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/11 19:34:26 UTC, 0 replies.
- Re: [PR] [SPARK-47022][CONNECT][TESTS][3.5] Fix `connect/client/jvm` to have explicit `commons-(io|lang3)` test dependency [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/11 22:38:23 UTC, 2 replies.
- Re: [PR] [SPARK-44445][BUILD][TESTS] Use `org.seleniumhq.selenium.htmlunit3-driver` instead of `net.sourceforge.htmlunit` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/11 23:18:29 UTC, 36 replies.
- Re: [PR] [SPARK-45740][SQL] Relax the node prefix of SparkPlanGraphCluster [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/12 00:18:58 UTC, 1 replies.
- Re: [PR] [SPARK-44065][SQL] Optimize BroadcastHashJoin skew in OptimizeSkewedJoin [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/12 00:18:59 UTC, 1 replies.
- [PR] [WIP][SQL][TESTS] Check `SparkUnsupportedOperationException` instead of `UnsupportedOperationException` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/12 10:44:26 UTC, 0 replies.
- [PR] [MINOR][SQL] Show only number of test blocks when there is a mismatch [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/12 14:08:48 UTC, 4 replies.
- [PR] [SPARK-47023][BUILD] Upgrade `aircompressor` to 1.26 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/12 17:06:12 UTC, 11 replies.
- [PR] [SPARK-47025][BUILD][TESTS] Switch `Guava 19.0` dependency scope from `provided` to `test` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/12 18:34:23 UTC, 4 replies.
- [PR] [SPARK-47026] Enable JSON sources in default value nested type tests [spark] - posted by "markj-db (via GitHub)" <gi...@apache.org> on 2024/02/12 19:41:14 UTC, 1 replies.
- [PR] [SPARK-47027][PYTHON][TESTS] Use temporary directories for profiler test outputs [spark] - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2024/02/12 19:44:57 UTC, 3 replies.
- [PR] [SPARK-47025][BUILD][TESTS] Upgrade `Guava` dependency in `docker-integration-tests` test module [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/12 21:22:55 UTC, 6 replies.
- Re: [PR] [SPARK-47014][PYTHON][CONNECT] Implement methods dumpPerfProfiles and dumpMemoryProfiles of SparkSession [spark] - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2024/02/12 21:39:28 UTC, 9 replies.
- [PR] [SPARK-47030][TESTS] Add `WebBrowserTest` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/12 22:46:20 UTC, 2 replies.
- Re: [PR] [SPARK-45782][CORE][PYTHON] Add Dataframe API df.explainString() [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/13 00:18:48 UTC, 1 replies.
- Re: [PR] [SPARK-47026][SQL][TESTS] Enable JSON sources in default value nested type tests [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/13 00:37:58 UTC, 2 replies.
- Re: [PR] [SPARK-47028][SQL][TESTS] Check `SparkUnsupportedOperationException` instead of `UnsupportedOperationException` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/13 08:11:33 UTC, 2 replies.
- Re: [PR] [WIP][SPARK-46858][PYTHON][PS][BUILD] Upgrade Pandas to 2.2.0 [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/02/13 10:41:38 UTC, 0 replies.
- [PR] [MINOR][CONNECT] Allow Spark Connect Server Script to wait [spark] - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2024/02/13 13:00:45 UTC, 0 replies.
- Re: [PR] [SPARK-46906][SS] Add a check for stateful operator change for streaming [spark] - posted by "jingz-db (via GitHub)" <gi...@apache.org> on 2024/02/13 17:53:18 UTC, 10 replies.
- [PR] [SPARK-47035][SS][CONNECT] Protocol for Client-Side Listener [spark] - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2024/02/13 20:09:16 UTC, 15 replies.
- [PR] [SS} Cleanup RocksDB file tracking for previously uploaded files if files were deleted from local directory [spark] - posted by "sahnib (via GitHub)" <gi...@apache.org> on 2024/02/13 20:13:32 UTC, 0 replies.
- Re: [PR] [SS][SPARK-47036] Cleanup RocksDB file tracking for previously uploaded files if files were deleted from local directory [spark] - posted by "sahnib (via GitHub)" <gi...@apache.org> on 2024/02/13 21:15:56 UTC, 0 replies.
- [PR] [SPARK-47037] ] Fix AliasAwareOutputExpression outputPartitioning [spark] - posted by "liorregev (via GitHub)" <gi...@apache.org> on 2024/02/13 22:15:53 UTC, 0 replies.
- [PR] [SS] Add MapState implementation for State API v2. [spark] - posted by "jingz-db (via GitHub)" <gi...@apache.org> on 2024/02/14 00:08:35 UTC, 1 replies.
- Re: [PR] [Don't merge & review] verify sbt on master [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/14 00:18:11 UTC, 1 replies.
- Re: [PR] [SPARK-46858][PYTHON][PS][BUILD] Upgrade Pandas to 2.2.0 [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/02/14 01:20:16 UTC, 20 replies.
- Re: [PR] [SPARK-46820][PYTHON] Fix error message regression by restoring `new_msg` [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/02/14 01:23:33 UTC, 4 replies.
- Re: [PR] [SPARK-45396][PYTHON] Add doc entry for `pyspark.ml.connect` module, and adds `Evaluator` to `__all__` at `ml.connect` [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/14 01:44:47 UTC, 3 replies.
- Re: [PR] [SPARK-47036][SS] Cleanup RocksDB file tracking for previously uploaded files if files were deleted from local directory [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/14 03:20:14 UTC, 4 replies.
- Re: [PR] [SPARK-46962][SS][PYTHON] Add interface for python streaming data source API and implement python worker to run python streaming data source [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/14 05:35:01 UTC, 25 replies.
- [PR] [SPARK-43259][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_2024 [spark] - posted by "mihailom-db (via GitHub)" <gi...@apache.org> on 2024/02/14 09:16:09 UTC, 22 replies.
- [PR] [SPARK-47038][BUILD] Remove shaded `protobuf-java` 2.6.1 dependency from `kinesis-asl-assembly` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/14 10:11:35 UTC, 11 replies.
- [PR] [SPARK-47039][TESTS] Add a checkstyle rule to ban `commons-lang2` in Java code [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/14 10:52:44 UTC, 0 replies.
- Re: [PR] [SPARK-47040][CONNECT] Allow Spark Connect Server Script to wait [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/14 11:03:35 UTC, 6 replies.
- [PR] [WIP][SQL] Replace `IllegalArgumentException` by `SparkIllegalArgumentException` in `sql/api` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/14 11:25:13 UTC, 0 replies.
- [PR] PushDownUtils should support not only FileScanBuilder but any SupportsPushDownCatalystFilters [spark] - posted by "faucct (via GitHub)" <gi...@apache.org> on 2024/02/14 12:18:37 UTC, 0 replies.
- [PR] [WIP][SPARK-47042][BUILD] add missing explicit dependency 'commons-lang3' to the module 'spark-common-utils' [spark] - posted by "William1104 (via GitHub)" <gi...@apache.org> on 2024/02/14 14:57:00 UTC, 1 replies.
- [PR] SPARK-47042 add missing explicit dependency 'commons-lang3' to the module 'spark-common-utils' [spark] - posted by "William1104 (via GitHub)" <gi...@apache.org> on 2024/02/14 15:07:15 UTC, 0 replies.
- [PR] [SPARK-47044] Add executed query for JDBC external datasources. [spark] - posted by "urosstan-db (via GitHub)" <gi...@apache.org> on 2024/02/14 15:15:35 UTC, 0 replies.
- [PR] SPARK-47043 add `jackson-core` and `jackson-annotations` dependencies to module `spark-common-utils` [spark] - posted by "William1104 (via GitHub)" <gi...@apache.org> on 2024/02/14 15:28:04 UTC, 0 replies.
- Re: [PR] [SPARK-47043][BUILD] add `jackson-core` and `jackson-annotations` dependencies to module `spark-common-utils` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/14 16:36:10 UTC, 8 replies.
- Re: [PR] [SPARK-47042][BUILD] add missing explicit dependency 'commons-lang3' to the module 'spark-common-utils' [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/14 16:36:27 UTC, 4 replies.
- Re: [PR] [SPARK-47045][SQL] Replace `IllegalArgumentException` by `SparkIllegalArgumentException` in `sql/api` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/14 16:46:20 UTC, 4 replies.
- [PR] [SPARK-47015][Collation] Disable partitioning on collated columns [spark] - posted by "stefankandic (via GitHub)" <gi...@apache.org> on 2024/02/14 17:01:00 UTC, 2 replies.
- Re: [PR] [SPARK-47039][TESTS] Add a checkstyle rule to ban `commons-lang` in Java code [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/14 17:13:19 UTC, 4 replies.
- [PR] [SPARK-47009][Collation] Enable create table support for collation [spark] - posted by "stefankandic (via GitHub)" <gi...@apache.org> on 2024/02/14 17:34:06 UTC, 2 replies.
- [PR] [SPARK-47049][BUILD] Ban non-shaded Hadoop dependencies [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/14 18:40:38 UTC, 3 replies.
- [PR] [SPARK-47051][INFRA] Create a new test pipeline for `yarn` and `connect` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/14 23:09:42 UTC, 10 replies.
- [PR] [wip] value state ttl poc [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/02/14 23:22:24 UTC, 1 replies.
- [PR] [SPARK-47052][WIP] Separate state tracking variables from MicroBatchExecution/StreamExecution [spark] - posted by "jerrypeng (via GitHub)" <gi...@apache.org> on 2024/02/14 23:54:58 UTC, 0 replies.
- Re: [PR] [SPARK-45669][CORE] Ensure the continuity of rolling log index [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/15 00:18:15 UTC, 1 replies.
- [PR] [SPARK-46906][INFRA] Bump python libraries (pandas, pyarrow) in Docker image for release script [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/15 04:10:35 UTC, 2 replies.
- [PR] [SPARK-46906][INFRA][3.5] Bump python libraries (pandas, pyarrow) in Docker image for release script [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/15 04:50:40 UTC, 1 replies.
- Re: [PR] [SPARK-47044] Add executed query for JDBC external datasources to explain output [spark] - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2024/02/15 05:29:37 UTC, 5 replies.
- [PR] [SPARK-46687][TESTS][PYTHON] Skip MemoryProfilerParityTests when codecov enabled [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/15 05:44:41 UTC, 1 replies.
- Re: [PR] [SPARK-46687][TESTS][PYTHON][FOLLOW-UP] Skip MemoryProfilerParityTests when codecov enabled [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/15 05:49:31 UTC, 1 replies.
- Re: [PR] [SPARK-47053][INFRA][3.5] Bump python libraries (pandas, pyarrow) in Docker image for release script [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/15 05:59:02 UTC, 1 replies.
- Re: [PR] [SPARK-47009][SQL] Enable create table support for collation [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/15 06:02:00 UTC, 19 replies.
- [PR] [SPARK-47054][PYTHON][TESTS] Remove pinned version of torch for Python 3.12 support [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/15 06:40:10 UTC, 4 replies.
- [PR] [MINOR][CONNECT] Move Connect Plugins to Java for Compatibility [spark] - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2024/02/15 08:43:20 UTC, 0 replies.
- [PR] [SPARK-47055][PYTHON] Upgrade MyPy 1.8.0 [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/15 08:43:55 UTC, 7 replies.
- [PR] [SPARK-47056][TESTS] Add `scalastyle` and `checkstyle` rules to ban `FileBackedOutputStream` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/15 08:48:50 UTC, 3 replies.
- [PR] [MINOR][CONNECT] Improve usability of the start-conect-server-script.sh [spark] - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2024/02/15 08:54:17 UTC, 0 replies.
- [PR] [WIP][SQL][TESTS] Check `SparkIllegalArgumentException` instead of `IllegalArgumentException` in `catalyst` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/15 08:59:42 UTC, 0 replies.
- [PR] [SPARK-46078][PYTHON][TESTS] Upgrade `pytorch` for Python 3.12 [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/15 09:15:01 UTC, 2 replies.
- [PR] [SPARK-47058][TESTS] Add `scalastyle` and `checkstyle` rules to ban `AtomicDoubleArray|CompoundOrdering` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/15 09:48:10 UTC, 6 replies.
- Re: [PR] [SPARK-36964][CORE][YARN] Share cached dnsToSwitchMapping for yarn locality container requests [spark] - posted by "vbmacher (via GitHub)" <gi...@apache.org> on 2024/02/15 10:07:20 UTC, 0 replies.
- [PR] [SPARK-47059][SQL] Attach error context for ALTER COLUMN v1 command [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/15 10:28:41 UTC, 3 replies.
- [PR] Update pom.xml [spark] - posted by "ArunDhamotharan (via GitHub)" <gi...@apache.org> on 2024/02/15 10:42:19 UTC, 4 replies.
- Re: [PR] [SPARK-45789][SQL] Support DESCRIBE TABLE for clustering columns [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/15 13:05:30 UTC, 3 replies.
- Re: [PR] [SPARK-47062][CONNECT] Move Connect Plugins to Java for Compatibility [spark] - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2024/02/15 15:14:22 UTC, 2 replies.
- [PR] [SPARK-47050][SQL] Collect and publish partition level metrics [spark] - posted by "snmvaughan (via GitHub)" <gi...@apache.org> on 2024/02/15 15:37:29 UTC, 0 replies.
- Re: [PR] [SPARK-47040][CONNECT][FOLLOWUP] Improve usability of the start-conect-server-script.sh [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/15 16:37:23 UTC, 4 replies.
- [PR] [SPARK-47064][SQL][TESTS] Use Scala 2.13 Spark distribution in `HiveExternalCatalogVersionsSuite` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/15 17:29:50 UTC, 5 replies.
- Re: [PR] [SPARK-46400][CORE][SQL][3.4] When there are corrupted files in the local maven repo, skip this cache and try again [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/15 17:53:29 UTC, 0 replies.
- [PR] Count bug in temp views [spark] - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2024/02/15 17:57:08 UTC, 0 replies.
- [PR] [SPARK-47066][INFRA] Add Apple Silicon Maven build test to GitHub Action CI [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/15 18:58:08 UTC, 1 replies.
- Re: [PR] [SPARK-47053][INFRA] Bump python libraries (pandas, pyarrow) in Docker image for release script [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/02/15 19:35:22 UTC, 0 replies.
- [PR] [WIP][SQL] Fix supported interval formats in error messages [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/15 21:01:40 UTC, 0 replies.
- Re: [PR] [SPARK-46743] Count bug after constant folding [spark] - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2024/02/15 21:37:33 UTC, 0 replies.
- Re: [PR] [SPARK-47066][INFRA] Add `Apple Silicon` Maven build test to GitHub Action CI [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/15 22:14:08 UTC, 4 replies.
- [PR] [SPARK-47067][INFRA] Add Daily Apple Silicon Github Action Job (Java/Scala) [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/15 22:45:43 UTC, 5 replies.
- [PR] [WIP] Consolidated API for V2 profiling [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/02/15 23:20:02 UTC, 0 replies.
- [PR] Encoder ttl poc [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/02/15 23:28:55 UTC, 4 replies.
- [PR] [WIP] Add Variant type info to PySpark [spark] - posted by "desmondcheongzx (via GitHub)" <gi...@apache.org> on 2024/02/16 00:36:21 UTC, 0 replies.
- Re: [PR] [WIP] TransformWithState Batch Support [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/02/16 00:56:16 UTC, 0 replies.
- [PR] Recover -1 and 0 case for spark.sql.execution.arrow.maxRecordsPerBatch [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/16 01:24:49 UTC, 0 replies.
- Re: [PR] [SPARK-47069][PYTHON] Introduce `spark.profile.show/dump` for SparkSession-based profiling [spark] - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2024/02/16 01:52:57 UTC, 7 replies.
- [PR] [SPARK-XXX] Fix invalid aggregation after in-subquery rewrite [spark] - posted by "anton5798 (via GitHub)" <gi...@apache.org> on 2024/02/16 02:20:38 UTC, 0 replies.
- Re: [PR] [SPARK-38098][PYTHON] Add support for ArrayType of nested StructType to arrow-based conversion [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/02/16 02:37:44 UTC, 0 replies.
- Re: [PR] [SPARK-47068][PYTHON][TESTS] Recover -1 and 0 case for spark.sql.execution.arrow.maxRecordsPerBatch [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/16 03:41:17 UTC, 1 replies.
- [PR] [SPARK-47071][SQL] Inline With expression if it contains special expression [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/16 06:36:58 UTC, 5 replies.
- Re: [PR] [SPARK-45720] Upgrade AWS SDK to v2 for Spark Kinesis connector module [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/16 07:18:04 UTC, 3 replies.
- Re: [PR] [SPARK-47072][SQL] Fix supported interval formats in error messages [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/16 07:56:28 UTC, 3 replies.
- Re: [PR] [SPARK-45292][SQL][HIVE] Remove Guava from shared classes from IsolatedClientLoader [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/16 07:57:08 UTC, 3 replies.
- [PR] [SPARK-47057][SQL] Reeanble MyPy data test [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/16 08:12:47 UTC, 0 replies.
- [PR] [SPARK-47073][BUILD] Upgrade `versions-maven-plugin` to 2.16.2 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/16 08:31:05 UTC, 0 replies.
- Re: [PR] [SPARK-47060][SQL][TESTS] Check `SparkIllegalArgumentException` instead of `IllegalArgumentException` in `catalyst` [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/16 08:44:12 UTC, 1 replies.
- Re: [PR] [SPARK-47073][BUILD] Upgrade several Maven plugins to the latest versions [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/16 08:59:13 UTC, 3 replies.
- Re: [PR] [SPARK-47057][PYTHON] Reeanble MyPy data test [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/16 10:15:40 UTC, 1 replies.
- [PR] [SPARK-47074][INFRA] Fix outdated comments in GitHub Action scripts [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/16 10:23:22 UTC, 4 replies.
- [PR] [SPARK-47075][BUILD] Add `derby-provided` profile [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/16 10:58:40 UTC, 5 replies.
- [PR] [SPARK-47072][SQL][3.5] Fix supported interval formats in error messages [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/16 12:42:06 UTC, 2 replies.
- [PR] [SPARK-47072][SQL][3.4] Fix supported interval formats in error messages [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/16 14:20:35 UTC, 2 replies.
- [PR] [SPARK-45357][CONNECT][TESTS][3.5] Normalize `dataframeId` when comparing `CollectMetrics` in `SparkConnectProtoSuite` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/16 15:09:44 UTC, 6 replies.
- Re: [PR] [SPARK-47070] Fix invalid aggregation after in-subquery rewrite [spark] - posted by "agubichev (via GitHub)" <gi...@apache.org> on 2024/02/16 18:14:54 UTC, 3 replies.
- [PR] [WIP][SPARK-47032][Python] Prototype for adding pass-through columns to Python UDTF API [spark] - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2024/02/16 19:03:05 UTC, 2 replies.
- [PR] [SPARK-47076][CORE][TESTS] Fix HistoryServerSuite.`incomplete apps get refreshed` test to start with empty storeDir [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/16 20:50:07 UTC, 9 replies.
- [PR] [SPARK-47077][BUILD][TEST] Fix sbt build Revert [SPARK-44445][BUILD][TESTS] [spark] - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2024/02/16 23:14:53 UTC, 3 replies.
- [PR] [SPARK-42285][DOC] Update Parquet data source doc on the timestamp_ntz inference option [spark] - posted by "gengliangwang (via GitHub)" <gi...@apache.org> on 2024/02/16 23:27:26 UTC, 4 replies.
- Re: [PR] [SPARK-46710][SQL] Clean up the broadcast data generated when sql execution ends [spark] - posted by "yabola (via GitHub)" <gi...@apache.org> on 2024/02/18 13:39:31 UTC, 4 replies.
- [PR] [WIP][SQL] Raise Spark's exception with an error class in config value check [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/18 16:39:21 UTC, 0 replies.
- [PR] Using BigDecimal to do the resource calculation [spark] - posted by "wbo4958 (via GitHub)" <gi...@apache.org> on 2024/02/18 23:33:29 UTC, 0 replies.
- Re: [PR] [SPARK-45881][SQL] support Higher Order aggregate functions from SQL [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/19 00:18:59 UTC, 1 replies.
- Re: [PR] [SPARK-45880][SQL] Introduce a new TableCatalog.listTable overload th… [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/19 00:19:00 UTC, 12 replies.
- Re: [PR] [SPARK-45821][CONNECT]make SparkSession._apply_options throw exception [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/19 00:19:01 UTC, 0 replies.
- Re: [PR] SPARK-45872 Update plugin for SBOM generation to 2.7.10 [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/19 00:19:02 UTC, 1 replies.
- Re: [PR] [SPARK-45716][PYTHON][CONNECT] Add StructType.treeString to Python [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/19 00:19:03 UTC, 0 replies.
- Re: [PR] [SPARK-45278] [YARN] Allow configuring Yarn executor bind address in Yarn [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/19 00:19:04 UTC, 1 replies.
- Re: [PR] [MINOR][INFRA][DOCS] Remove undated comment in build_and_test.yml [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/19 02:14:00 UTC, 1 replies.
- Re: [PR] [SPARK-47084][BUILD] Upgrade joda-time to 2.12.7 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/19 02:15:44 UTC, 1 replies.
- Re: [PR] [SPARK-47066][INFRA][FOLLOW-UP] Deduplicate Apple Silicon Maven build definition [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/19 02:53:12 UTC, 3 replies.
- Re: [PR] [SPARK-47083][BUILD] Upgrade `commons-codec` to 1.16.1 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/19 02:56:51 UTC, 2 replies.
- Re: [PR] [SPARK-46938][BUILD][CORE][SQL][UI] Migrate from Jetty 10 to Jetty 11 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/19 02:58:34 UTC, 30 replies.
- [PR] [WIP][SPARK-47089][BUILD][TESTS] Migrate `mockito 4` to `mockito 5` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/19 03:29:44 UTC, 0 replies.
- [PR] [SPARK-47090][INFRA] Skip JDK 17/21 Maven compilation in branch-3.4 job [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/19 03:34:36 UTC, 13 replies.
- Re: [PR] [SPARK-47085][SQL] reduce the complexity of toTRowSet from n^2 to n [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/19 03:41:17 UTC, 22 replies.
- [PR] [MINOR][INFRA] Rename build_maven.yml and build_maven_java21.yml [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/19 03:41:42 UTC, 2 replies.
- Re: [PR] [SPARK-47087][SQL] Raise Spark's exception with an error class in config value check [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/19 04:24:52 UTC, 4 replies.
- Re: [PR] [SPARK-46654][SQL] Make `to_csv` explicitly indicate that it does not support complex types of data [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/19 05:10:34 UTC, 2 replies.
- [PR] [MINOR][SQL] Remove `unsupportedOperationMsg` from `CaseInsensitiveStringMap` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/19 05:16:10 UTC, 3 replies.
- [PR] [SPARK-47067][INFRA] Add Daily Apple Silicon Github Action Job with Maven (Java/Scala) [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/19 05:18:46 UTC, 4 replies.
- [PR] [SPARK-46972][SQL][TESTS][FOLLOWUP] Remove the assertion for table existence [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/19 06:23:13 UTC, 3 replies.
- Re: [PR] [SPARK-47081][CONNECT] Support Query Execution Progress [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/19 07:16:33 UTC, 8 replies.
- Re: [PR] [SPARK-47089][BUILD][TESTS] Migrate `mockito 4` to `mockito 5` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/19 09:05:25 UTC, 4 replies.
- [PR] [SPARK-44826][TEST][CONNECT] Re-enable `test_series_iloc_setitem` [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/02/19 10:01:32 UTC, 3 replies.
- Re: [PR] [SPARK-47088] Using BigDecimal to do the resource calculation [spark] - posted by "wbo4958 (via GitHub)" <gi...@apache.org> on 2024/02/19 10:23:12 UTC, 13 replies.
- [PR] [WIP] Use checkInputDataTypes to check the parameter types of the function `to_xml` & remove _LEGACY_ERROR_TEMP_3234 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/19 10:45:45 UTC, 0 replies.
- Re: [PR] [SPARK-24578][Core] Cap sub-region's size of returned nio buffer [spark] - posted by "Anubisxcw (via GitHub)" <gi...@apache.org> on 2024/02/19 12:10:34 UTC, 0 replies.
- Re: [PR] [SPARK-47015][SQL] Disable partitioning on collated columns [spark] - posted by "stefankandic (via GitHub)" <gi...@apache.org> on 2024/02/19 12:16:10 UTC, 21 replies.
- Re: [PR] [SPARK-46862][SQL] Disable CSV column pruning in the multi-line mode [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/19 15:51:06 UTC, 3 replies.
- [PR] [SPARK-47092][CORE][SQL][K8S] Add `getUriBuilder` to `o.a.s.u.Utils` and use it [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/19 18:49:48 UTC, 2 replies.
- [PR] [SPARK-47093][TESTS] Upgrade `mockito` to 5.10.0 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/19 21:09:27 UTC, 4 replies.
- [PR] [SPARK-37434][TESTS] Disable unsupported `ExtendedLevelDBTest` on `MacOS/aarch64` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/19 21:41:18 UTC, 4 replies.
- [PR] [SPARK-47095][INFRA] Uses proper options for command script in macos-14 build [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/20 00:02:14 UTC, 2 replies.
- [PR] [SPARK-47096][INFRA] Upgrade Python version in Maven build in macos-14 build [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/20 00:17:47 UTC, 4 replies.
- Re: [PR] [SPARK-43258][SQL] Assign names to error _LEGACY_ERROR_TEMP_202[3,4,5] [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/20 00:17:52 UTC, 5 replies.
- Re: [PR] [SPARK-44493][SQL] Support for translating catalyst expressions into partial datasource filters [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/20 00:17:55 UTC, 1 replies.
- Re: [PR] [SPARK-42601][SQL] New physical type Decimal128 for DecimalType [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/20 00:17:57 UTC, 1 replies.
- Re: [PR] [SPARK-47096][INFRA] Upgrade Python version in Maven builds [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/20 00:57:48 UTC, 3 replies.
- [PR] [SPARK-47097][CONNECT][TESTS] Deflake interrupt tag at SparkSessionE2ESuite [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/20 01:57:44 UTC, 2 replies.
- [PR] [SPARK-47016][BUILD] Upgrade scalatest related dependencies to the 3.2.18 series [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/20 02:25:11 UTC, 3 replies.
- [PR] [SPARK-47098][INFRA] Migrate from AppVeyor to GitHub Actions for SparkR tests on Windows [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/20 02:35:47 UTC, 7 replies.
- [PR] [SPARK-46973][SQL] Add new SessionCatalog APIs to support V2 table cache [spark] - posted by "allisonwang-db (via GitHub)" <gi...@apache.org> on 2024/02/20 02:52:18 UTC, 2 replies.
- [PR] [SPARK-47099][SQL] The `start` value of `paramIndex` for the error class `UNEXPECTED_INPUT_TYPE` should be `1` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/20 03:36:17 UTC, 6 replies.
- Re: [PR] [SPARK-47080][CORE][TESTS] Fix `HistoryServerSuite` to use `constant value` and `getNumJobsRestful` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/20 04:27:14 UTC, 4 replies.
- Re: [PR] [SPARK-47052] Separate state tracking variables from MicroBatchExecution/StreamExecution [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/20 05:26:05 UTC, 0 replies.
- Re: [PR] [SPARK-47052][SS] Separate state tracking variables from MicroBatchExecution/StreamExecution [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/20 05:38:18 UTC, 4 replies.
- [PR] [SPARK-47100][BUILD] Upgrade `netty` to 4.1.107.Final and `netty-tcnative` to 2.0.62.Final [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/20 05:43:35 UTC, 3 replies.
- [PR] [SPARK-45615][BUILD] Remove undated "Auto-application to `()` is deprecated" compile suppression rules [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/20 07:02:43 UTC, 2 replies.
- Re: [PR] [SPARK-47044][SQL] Add executed query for JDBC external datasources to explain output [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/20 09:41:59 UTC, 4 replies.
- [PR] [SPARK-47101][SQL] Make HiveExternalCatalog.verifyDataSchema comply with hive column name rules [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/20 10:01:44 UTC, 6 replies.
- Re: [PR] [SPARK-47085][SQL][3.5] reduce the complexity of toTRowSet from n^2 to n [spark] - posted by "igreenfield (via GitHub)" <gi...@apache.org> on 2024/02/20 10:04:56 UTC, 4 replies.
- Re: [PR] [SPARK-47079][PYTHON][SQL][CONNECT] Add Variant type info to PySpark [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/20 11:21:48 UTC, 12 replies.
- [PR] [SPARK-46992]make dataset.cache() return new ds instance [spark] - posted by "doki23 (via GitHub)" <gi...@apache.org> on 2024/02/20 12:45:14 UTC, 21 replies.
- [PR] [WIP] Make the default storage level of intermediate datasets for MLlib configurable [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/02/20 12:45:22 UTC, 2 replies.
- Re: [PR] [WIP][SPARK-47103][ML] Make the default storage level of intermediate datasets for MLlib configurable [spark] - posted by "srowen (via GitHub)" <gi...@apache.org> on 2024/02/20 13:07:44 UTC, 11 replies.
- Re: [PR] [SPARK-35878][CORE] Revert S3A endpoint fixup logic of SPARK-35878 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/20 16:08:12 UTC, 3 replies.
- Re: [PR] [SPARK-46097][SQL] Push down limit 1 through Union and Aggregate [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/20 16:52:38 UTC, 0 replies.
- Re: [PR] [SPARK-47069][PYTHON][CONNECT] Introduce `spark.profile.show/dump` for SparkSession-based profiling [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/02/20 18:50:30 UTC, 6 replies.
- Re: [PR] [WIP][SPARK-24815] [CORE] Trigger Interval based DRA for Structured Streaming [spark] - posted by "vitgorbunov (via GitHub)" <gi...@apache.org> on 2024/02/20 19:02:03 UTC, 8 replies.
- Re: [PR] [SPARK-47085][SQL][3.4] reduce the complexity of toTRowSet from n^2 to n [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/20 20:42:44 UTC, 2 replies.
- [PR] [SPARK-42328][SQL] Remove _LEGACY_ERROR_TEMP_1175 from error classes [spark] - posted by "nikolamand-db (via GitHub)" <gi...@apache.org> on 2024/02/20 21:39:01 UTC, 8 replies.
- [PR] [MINOR][SQL] Remove `toLowerCase(Locale.ROOT)` check for `CATALOG_IMPLEMENTATION` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/20 22:15:33 UTC, 0 replies.
- [PR] [SPARK-47108][CORE] Set `derby.connection.requireAuthentication` to `false` explicitly in CLIs [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/20 22:33:37 UTC, 2 replies.
- [PR] [SPARK-47095][INFRA][FOLLOW-UP] Fix Maven test syntax errors [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/20 23:58:27 UTC, 1 replies.
- [PR] Bump org.postgresql:postgresql from 42.7.0 to 42.7.2 [spark] - posted by "dependabot[bot] (via GitHub)" <gi...@apache.org> on 2024/02/21 00:06:48 UTC, 3 replies.
- [PR] Bump org.apache.commons:commons-compress from 1.25.0 to 1.26.0 [spark] - posted by "dependabot[bot] (via GitHub)" <gi...@apache.org> on 2024/02/21 00:18:04 UTC, 3 replies.
- Re: [PR] [SPARK-44814][CONNECT][PYTHON]Test to protect from faulty protobuf versions [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/21 00:18:12 UTC, 1 replies.
- [PR] [SPARK-47109][BUILD] Upgrade `commons-compress` to 1.26.0 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/21 00:19:39 UTC, 5 replies.
- Re: [PR] [SPARK-44662] Perf improvement in BroadcastHashJoin queries with stream side join key on non partition columns [spark] - posted by "ahshahid (via GitHub)" <gi...@apache.org> on 2024/02/21 00:48:23 UTC, 2 replies.
- Re: [PR] [MINOR][SQL] Remove `toLowerCase(Locale.ROOT)` for `CATALOG_IMPLEMENTATION` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/21 01:00:13 UTC, 2 replies.
- [PR] [SPARK-47111][SQL][TESTS] Upgrade `PostgreSQL` JDBC driver to 42.7.2 and docker image to 16.2 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/21 03:08:17 UTC, 4 replies.
- Re: [PR] [SPARK-47095][INFRA][FOLLOW-UP] Remove TTY specific workaround in Maven build [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/21 03:08:41 UTC, 9 replies.
- Re: [PR] [SPARK-47101][SQL] Make `HiveExternalCatalog.verifyDataSchema` comply with hive column name rules [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/21 03:24:56 UTC, 4 replies.
- [PR] [SPARK-47112][INFRA] Write logs into a file in SparkR Windows build [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/21 03:31:31 UTC, 7 replies.
- Re: [PR] [SPARK-46928][SS] Add support for ListState in Arbitrary State API v2. [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/21 05:43:35 UTC, 0 replies.
- [PR] [SPARK-47113][CORE] Revert S3A endpoint fixup logic of SPARK-35878 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/21 06:29:49 UTC, 8 replies.
- [PR] [SPARK-46938][BUILD][CORE] Remove javax-servlet-api exclusion rule for SBT [spark] - posted by "HiuKwok (via GitHub)" <gi...@apache.org> on 2024/02/21 06:32:43 UTC, 4 replies.
- Re: [PR] [SPARK-36832][Kubernetes][Launcher]: implement launcher protocol for K8s client to manage app using SparkAppHandle [spark] - posted by "Vensence (via GitHub)" <gi...@apache.org> on 2024/02/21 06:42:55 UTC, 0 replies.
- [PR] [SPARK-47115][INFRA] Use larger memory for Maven builds [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/21 06:46:36 UTC, 10 replies.
- Re: [PR] [SPARK-47099][SQL] Use `ordinalNumber` to uniformly set the value of `paramIndex` for the error class `UNEXPECTED_INPUT_TYPE` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/21 07:27:16 UTC, 6 replies.
- [PR] [SPARK-47116][INFRA][R] Install proper Python version in SparkR Windows build to avoid warnings [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/21 07:35:50 UTC, 4 replies.
- Re: [PR] [SPARK-47101][SQL] Allow comma to be used in top-level column names and use `TypeInfoUtils.getTypeInfoFromTypeString` to check nested type definition in `HiveExternalCatalog.verifyDataSchema` [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/21 08:01:41 UTC, 5 replies.
- Re: [PR] [SPARK-46938][BUILD] Remove `javax-servlet-api` exclusion rule for SBT [spark] - posted by "pan3793 (via GitHub)" <gi...@apache.org> on 2024/02/21 08:33:05 UTC, 6 replies.
- Re: [PR] [SPARK-47101][SQL] Allow comma to be used in top-level column names and remove check nested type definition in `HiveExternalCatalog.verifyDataSchema` [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/21 11:58:58 UTC, 15 replies.
- [PR] Correct docstring for pyspark's dataframe.head [spark] - posted by "wunderalbert (via GitHub)" <gi...@apache.org> on 2024/02/21 13:08:56 UTC, 2 replies.
- [PR] [WIP][SPARK-43256][SQL] Assign a name to the error class _LEGACY_ERROR_TEMP_2021 [spark] - posted by "andrej-db (via GitHub)" <gi...@apache.org> on 2024/02/21 13:09:50 UTC, 4 replies.
- Re: [PR] [SPARK-47118][BUILD][CORE][SQL][UI] Migrate from Jetty 10 to Jetty 11 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/21 15:33:30 UTC, 17 replies.
- Re: [PR] [SPARK-44719][SQL] Fix NoClassDefFoundError when using Hive UDF [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/21 17:44:33 UTC, 0 replies.
- [PR] [SPARK-47104][SQL] `TakeOrderedAndProjectExec` should initialize the unsafe projection [spark] - posted by "bersprockets (via GitHub)" <gi...@apache.org> on 2024/02/21 18:25:59 UTC, 2 replies.
- [PR] Pass table identifier to row data source scan exec for V2 strategy. [spark] - posted by "urosstan-db (via GitHub)" <gi...@apache.org> on 2024/02/21 18:45:22 UTC, 0 replies.
- [PR] [SPARK-47119][BUILD] Add `hive-jackson-provided` profile [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/21 18:52:03 UTC, 14 replies.
- Re: [PR] [SPARK-46947][CORE] Delay memory manager initialization until Driver plugin is loaded [spark] - posted by "mridulm (via GitHub)" <gi...@apache.org> on 2024/02/21 18:54:46 UTC, 35 replies.
- [PR] [SPARK-47120] Null comparison push down data filter from subquery produces in NPE in Parquet filter [spark] - posted by "cosmind-db (via GitHub)" <gi...@apache.org> on 2024/02/21 19:16:31 UTC, 0 replies.
- [PR] [SPARK-47121][CORE] Avoid RejectedExecutionExceptions during StandaloneSchedulerBackend shutdown [spark] - posted by "JoshRosen (via GitHub)" <gi...@apache.org> on 2024/02/21 20:06:18 UTC, 2 replies.
- [PR] TEST buf-setup-action [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/21 21:01:38 UTC, 1 replies.
- [PR] Buf setup action 2 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/21 21:03:35 UTC, 0 replies.
- Re: [PR] [SPARK-47122][INFRA] Pin `buf-setup-action` to `v1.29.0` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/21 21:12:41 UTC, 3 replies.
- [PR] [Backport][Spark-3.5][SPARK-47036][SS] Cleanup RocksDB file tracking for previously uploaded files if files were deleted from local directory [spark] - posted by "sahnib (via GitHub)" <gi...@apache.org> on 2024/02/21 21:19:28 UTC, 0 replies.
- Re: [PR] [SPARK-46743][SQL] Count bug after constant folding [spark] - posted by "jchen5 (via GitHub)" <gi...@apache.org> on 2024/02/21 21:37:32 UTC, 0 replies.
- [PR] [SPARK-31745][INFRA][R] Eanble Hive related tests at SparkR on Windows [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/21 23:35:57 UTC, 3 replies.
- [PR] [SPARK-47124][R][INFRA] Skip scheduled SparkR on Windows in fork repositories by default [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/21 23:43:07 UTC, 6 replies.
- [PR] [SPARK-47123][CORE] JDBCRDD does not correctly handle errors in getQueryOutputSchema [spark] - posted by "planga82 (via GitHub)" <gi...@apache.org> on 2024/02/21 23:50:36 UTC, 5 replies.
- [PR] [SPARK-47125][SQL] Return null if Univocity never triggers parsing [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/22 00:18:01 UTC, 4 replies.
- Re: [PR] [SPARK-44493][SQL] Translate catalyst expression into partial datasource filter [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/22 00:18:07 UTC, 1 replies.
- Re: [PR] [SPARK-40129][SQL] Fix Decimal multiply can produce the wrong answer [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/22 00:18:10 UTC, 1 replies.
- Re: [PR] [SPARK-43025][SQL] Eliminate Union if filters have the same child plan [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/22 00:18:12 UTC, 1 replies.
- [PR] [SPARK-47115][INFRA][FOLLOW-UP] Use larger runner for Maven build (macos-14) [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/22 00:50:09 UTC, 2 replies.
- Re: [PR] [SPARK-47115][INFRA][FOLLOW-UP] Use larger runner for Maven build (macos-14-large) [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/22 00:53:11 UTC, 7 replies.
- Re: [PR] [SPARK-47036][SS][3.5] Cleanup RocksDB file tracking for previously uploaded files if files were deleted from local directory [spark] - posted by "HeartSaVioR (via GitHub)" <gi...@apache.org> on 2024/02/22 01:58:48 UTC, 1 replies.
- Re: [PR] [SPARK-47120][SQL] Null comparison push down data filter from subquery produces in NPE in Parquet filter [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/22 03:06:54 UTC, 16 replies.
- [PR] [WIP] Update SKIP_SPARK_RELEASE_VERSIONS in Maven CI [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/22 05:51:43 UTC, 3 replies.
- Re: [PR] [SPARK-47127][INFRA] Update `SKIP_SPARK_RELEASE_VERSIONS` in Maven CIs [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/22 06:18:44 UTC, 2 replies.
- [PR] [SPARK-47128][SQL] Improve `spark.sql.hive.metastore.sharedPrefixes` default value [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/22 07:52:33 UTC, 4 replies.
- [PR] [SPARK-47129][CONNECT][SQL] Make `ResolveRelations` handle plan id properly [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/22 08:03:16 UTC, 0 replies.
- [PR] [SPARK-47130][CORE] Use listStatus to bypass block location info when cleaning driver logs [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/22 08:14:12 UTC, 3 replies.
- [PR] Collation support for built-in string functions: contains, startswith, endswith [spark] - posted by "uros-db (via GitHub)" <gi...@apache.org> on 2024/02/22 09:05:49 UTC, 0 replies.
- Re: [PR] [WIP] Collation support for built-in string functions: contains, startswith, endswith [spark] - posted by "dbatomic (via GitHub)" <gi...@apache.org> on 2024/02/22 10:07:06 UTC, 3 replies.
- [PR] [SPARK-47127][INFRA][FOLLOWUP] Remove `3.5.1` from `SKIP_SPARK_RELEASE_VERSIONS` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/22 10:27:55 UTC, 3 replies.
- [PR] [SPARK-47102][SQL] Adding COLLATION_ENABLED config [WIP] [spark] - posted by "mihailom-db (via GitHub)" <gi...@apache.org> on 2024/02/22 13:55:35 UTC, 0 replies.
- Re: [PR] [SPARK-47132][DOCS][PYTHON] Correct docstring for pyspark's dataframe.head [spark] - posted by "wunderalbert (via GitHub)" <gi...@apache.org> on 2024/02/22 14:20:07 UTC, 5 replies.
- Re: [PR] [SPARK-47131][SQL] Collations - support for built-in string functions: contains, startswith, endswith [spark] - posted by "mitkedb (via GitHub)" <gi...@apache.org> on 2024/02/22 14:26:28 UTC, 3 replies.
- Re: [PR] [SPARK-47102][SQL] Adding COLLATION_ENABLED config [spark] - posted by "dbatomic (via GitHub)" <gi...@apache.org> on 2024/02/22 16:09:02 UTC, 9 replies.
- Re: [PR] [SPARK-47001][SQL] Pushdown verification in optimizer [spark] - posted by "holdenk (via GitHub)" <gi...@apache.org> on 2024/02/22 18:27:53 UTC, 8 replies.
- Re: [PR] [SPARK-46975][PS] Support dedicated fallback methods [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/02/22 19:10:11 UTC, 2 replies.
- [PR] [WIP] Fix error class not exist issue in create_data_source.py [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/02/22 20:07:19 UTC, 1 replies.
- [PR] [SPARK-47136][CORE][TESTS] Use `ivyPath` param of `MavenUtils.loadIvySettings` in `MavenUtilsSuite` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/22 21:45:42 UTC, 0 replies.
- Re: [PR] [SPARK-47136][CORE][TESTS] Fix `MavenUtilsSuite` to use `MavenUtils.resolveMavenCoordinates` properly [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/22 22:02:50 UTC, 8 replies.
- Re: [PR] [SPARK-46913][SS] Add support for processing/event time based timers with transformWithState operator [spark] - posted by "anishshri-db (via GitHub)" <gi...@apache.org> on 2024/02/22 22:16:49 UTC, 13 replies.
- [PR] [SPARK-47135][SS] Implement error classes for Kafka data loss exceptions [spark] - posted by "micheal-o (via GitHub)" <gi...@apache.org> on 2024/02/22 22:19:41 UTC, 19 replies.
- [PR] [SPARK-47137][PYTHON][CONNECT] Add getAll to spark.conf for feature parity with Scala [spark] - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2024/02/22 22:20:38 UTC, 3 replies.
- [PR] [SPARK-47138] Implementing StateTTL for Value State [spark] - posted by "ericm-db (via GitHub)" <gi...@apache.org> on 2024/02/22 22:40:48 UTC, 0 replies.
- [PR] Refactor file listing with ScanFileListing interface [spark] - posted by "costas-db (via GitHub)" <gi...@apache.org> on 2024/02/22 23:00:36 UTC, 2 replies.
- [PR] [SPARK-47099][SQL][FOLLOW-UP] Uses ordinalNumber in UNEXPECTED_INPUT_TYPE [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/22 23:24:01 UTC, 3 replies.
- [PR] [SPARK-43259][SQL][FOLLOWUP] Regenerate sql-error-conditions.md to re… [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/23 01:18:24 UTC, 0 replies.
- Re: [PR] [SPARK-43259][SQL][FOLLOWUP] Regenerate `sql-error-conditions.md` to recover `SparkThrowableSuite` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/23 01:22:08 UTC, 5 replies.
- [PR] [SPARK-47140][SPARK47139][INFRA][PYTHON] Upgrade Python verion and codecov action in Coverage job [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/23 01:23:07 UTC, 3 replies.
- Re: [PR] [SPARK-46654][SQL][Python] Make `to_csv` explicitly indicate that it does not support complex types of data [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/23 01:37:19 UTC, 3 replies.
- [PR] [SPARK-47141] [Core]: Support shuffle migration to external storage. [spark] - posted by "maheshk114 (via GitHub)" <gi...@apache.org> on 2024/02/23 03:56:06 UTC, 1 replies.
- [PR] [SPARK-47142][K8S][TESTS] Use `spark.jars.ivy` instead `spark.driver.extraJavaOptions` in `DepsTestsSuite` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/23 04:25:36 UTC, 3 replies.
- [PR] [SPARK-47143][CONNECT][TESTS] Fix `ArtifactSuite` to use unique `MavenCoordinate`s [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/23 04:52:14 UTC, 0 replies.
- Re: [PR] [SPARK-47143][CONNECT][TESTS] Improve `ArtifactSuite` to use unique `MavenCoordinate`s [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/23 05:14:20 UTC, 2 replies.
- Re: [PR] [SPARK-47102][SQL][COLLATION] Adding COLLATION_ENABLED config [spark] - posted by "dbatomic (via GitHub)" <gi...@apache.org> on 2024/02/23 10:08:56 UTC, 7 replies.
- [PR] [WIP] DeduplicateRelations keeps original expressions if possible [spark] - posted by "peter-toth (via GitHub)" <gi...@apache.org> on 2024/02/23 10:33:56 UTC, 0 replies.
- [PR] [SPARK-46812][CONNECT][PYTHON] Make mapInPandas / mapInArrow support ResourceProfile [spark] - posted by "wbo4958 (via GitHub)" <gi...@apache.org> on 2024/02/23 12:06:55 UTC, 6 replies.
- [PR] [SPARK-47144][CONNECT][SQL] Fix Spark Connect collation error by adding collateId protobuf field [spark] - posted by "nikolamand-db (via GitHub)" <gi...@apache.org> on 2024/02/23 14:32:22 UTC, 16 replies.
- Re: [PR] [SPARK-47129][CONNECT][SQL] Make `ResolveRelations` cache connect plan properly [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/23 17:26:19 UTC, 1 replies.
- [PR] [SPARK-47148][SQL] Avoid to materialize AQE ShuffleQueryStage on the cancellation [spark] - posted by "erenavsarogullari (via GitHub)" <gi...@apache.org> on 2024/02/23 18:26:06 UTC, 7 replies.
- Re: [PR] [SPARK-45862][PYTHON][DOCS] Add user guide for basic dataframe operations [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/02/23 19:34:58 UTC, 0 replies.
- [PR] [SPARK-47099][SQL][FOLLOWUP] Regenerate try_arithmetic.sql.out.java21 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/23 20:25:09 UTC, 1 replies.
- Re: [PR] [SPARK-47099][SQL][FOLLOWUP] Regenerate `try_arithmetic.sql.out.java21` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/23 21:44:47 UTC, 1 replies.
- [PR] [SPARK-47151][PS] Upgrade to `pandas` 2.2.1 [spark] - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2024/02/23 23:32:12 UTC, 0 replies.
- Re: [PR] [SPARK-45908][Python] Add support for writing empty DataFrames to parquet with partitions [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/24 00:17:08 UTC, 1 replies.
- Re: [PR] [SPARK-45658][SQL] Fix canonicalization of DynamicPruningSubquery to canonicalize build keys relative to build query output [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/24 00:17:09 UTC, 1 replies.
- Re: [PR] [WIP][SPARK-44098][INFRA] Introduce python breaking change detection [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/24 00:17:10 UTC, 1 replies.
- [PR] [SPARK-47152][SQL][BUILD] Provide Apache Hive Jackson dependency via a new optional directory [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/24 00:32:01 UTC, 1 replies.
- Re: [PR] [SPARK-47151][PYTHON][PS][BUILD] Upgrade to `pandas` 2.2.1 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/24 01:14:00 UTC, 2 replies.
- Re: [PR] [SPARK-47152][SQL][BUILD] Provide `CodeHaus Jackson` dependencies via a new optional directory [spark] - posted by "viirya (via GitHub)" <gi...@apache.org> on 2024/02/24 01:20:50 UTC, 46 replies.
- [PR] [SPARK-47153] Guard serialize/deserialize in JavaSerializer with try-with-resource block [spark] - posted by "jwang0306 (via GitHub)" <gi...@apache.org> on 2024/02/24 02:15:17 UTC, 0 replies.
- [PR] [SPARK-47154][SS][TESTS] Fix `kafka-0-10-sql` to use `ResetSystemProperties` if `KafkaTestUtils` is used [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/24 11:08:44 UTC, 4 replies.
- [PR] Enable user to override base overhead memory [spark] - posted by "jpcorreia99 (via GitHub)" <gi...@apache.org> on 2024/02/24 16:14:45 UTC, 0 replies.
- Re: [PR] [SARK-45866][SQL] Fix for Reuse of Exchange in AQE not happening when DPP filters are pushed down to the underlying Scan (like iceberg) [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/25 00:19:54 UTC, 1 replies.
- Re: [PR] [SPARK-44924][SS] Add config for FileStreamSource cached files [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/25 00:19:57 UTC, 1 replies.
- [PR] [FOLLOWUP][SPARK-47152][SQL][BUILD] Provide `CodeHaus Jackson` dependencies via a new optional directory [spark] - posted by "bjornjorgensen (via GitHub)" <gi...@apache.org> on 2024/02/25 19:04:50 UTC, 3 replies.
- [PR] [SPARK-47137][PYTHON][CONNECT][FOLLOW-UP] Uses assertEqual instead of assertEquals for Python 3.12 build [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/25 23:53:41 UTC, 2 replies.
- Re: [PR] [SPARK-47157][SQL] Refactor file listing with ScanFileListing interface [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/26 00:16:05 UTC, 1 replies.
- Re: [PR] [MINOR][DOCS] Clarify collect_list and collect_set -> ArrayType [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/26 00:18:42 UTC, 1 replies.
- Re: [PR] [SPARK-45849][SQL]: Avoid uneccessary copy when encoding Set [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/26 00:18:44 UTC, 1 replies.
- [PR] [MINOR][CONNECT][TESTS] Chain waitFor after destroyForcibly in SparkConnectServerUtils [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/26 00:42:34 UTC, 2 replies.
- [PR] [WIP][SPARK-47158][SQL] Assign proper name and `sqlState` to `_LEGACY_ERROR_TEMP_(2134|2231)` [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/02/26 01:37:11 UTC, 0 replies.
- [PR] [SPARK-46802][PYTHON][TESTS] Remove obsolete comment in run-tests-with-coverage [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/26 03:47:39 UTC, 0 replies.
- Re: [PR] [SPARK-46802][PYTHON][TESTS][FOLLOWUP] Remove obsolete comment in run-tests-with-coverage [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/26 03:59:42 UTC, 1 replies.
- Re: [PR] [SPARK-47152][SQL][BUILD][FOLLOWUP] Provide `CodeHaus Jackson` dependencies via a new optional directory [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/26 04:01:06 UTC, 5 replies.
- [PR] Update package-lock.json [spark] - posted by "Vishnukumark-48 (via GitHub)" <gi...@apache.org> on 2024/02/26 04:54:28 UTC, 2 replies.
- [PR] Update README [spark] - posted by "Vishnukumark-48 (via GitHub)" <gi...@apache.org> on 2024/02/26 04:57:11 UTC, 4 replies.
- [PR] [SPARK-47159][INFRA] Set `OBJC_DISABLE_INITIALIZE_FORK_SAFETY=YES` in `MacOS` GitHub Action Job [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/26 05:11:41 UTC, 3 replies.
- [PR] [SPARK-47160][K8S] Update K8s `Dockerfile` to include `hive-jackson` directory if exists [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/26 05:43:58 UTC, 3 replies.
- [PR] [SPARK-47161][INFRA][R] Uses hash key properly for SparkR build on Windows [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/26 05:50:36 UTC, 2 replies.
- [PR] [SPARK-47163][BUILD] Fix `make-distribution.sh` to check `jackson-*-asl-*.jar` existence first [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/26 06:14:03 UTC, 0 replies.
- Re: [PR] [SPARK-47163][BUILD] Fix `make-distribution.sh` to check `jackson-core-asl-1.9.13.jar` existence first [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/26 07:54:25 UTC, 2 replies.
- [PR] [SPARK-47164][SQL] Make Default Value From Wider Type Narrow Literal of v2 behave the same as v1 [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/26 08:02:02 UTC, 4 replies.
- Re: [PR] [SPARK-47158][SQL] Assign proper name and `sqlState` to `_LEGACY_ERROR_TEMP_(2134|2231)` [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/02/26 09:01:33 UTC, 11 replies.
- [PR] [SPARK-47165][SQL][DOCKER][TESTS] Pull docker image only when its' absent [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/26 09:44:30 UTC, 1 replies.
- [PR] Bold/Red/Upper [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/26 10:44:45 UTC, 0 replies.
- [PR] [SPARK-47147][PYTHON] Fix Pyspark collated string conversion error [spark] - posted by "nikolamand-db (via GitHub)" <gi...@apache.org> on 2024/02/26 11:42:33 UTC, 0 replies.
- Re: [PR] [SPARK-47147][PYTHON][SQL] Fix Pyspark collated string conversion error [spark] - posted by "dbatomic (via GitHub)" <gi...@apache.org> on 2024/02/26 11:52:18 UTC, 3 replies.
- [PR] [SPARK-47118][BUILD] Optimise dependencies scope for `jakarta.servlet-api` and `javax.servlet-api` [spark] - posted by "HiuKwok (via GitHub)" <gi...@apache.org> on 2024/02/26 11:55:53 UTC, 0 replies.
- [PR] [SPARK-47167][SQL] Add DescriptiveRelation class [spark] - posted by "urosstan-db (via GitHub)" <gi...@apache.org> on 2024/02/26 11:59:48 UTC, 11 replies.
- Re: [PR] [SPARK-47170][BUILD] Optimise dependencies scope for `jakarta.servlet-api` and `javax.servlet-api` [spark] - posted by "HiuKwok (via GitHub)" <gi...@apache.org> on 2024/02/26 12:36:24 UTC, 0 replies.
- [PR] [SPARK-47169][SQL] Disable bucketing on collated columns [spark] - posted by "mihailom-db (via GitHub)" <gi...@apache.org> on 2024/02/26 13:51:07 UTC, 0 replies.
- Re: [PR] [SPARK-47131][SQL][COLLATION] String function support: contains, startswith, endswith [spark] - posted by "uros-db (via GitHub)" <gi...@apache.org> on 2024/02/26 14:48:28 UTC, 35 replies.
- [PR] [SPARK-46077][SQL] Consider the type generated by TimestampNTZConverter in JdbcDialect.compileValue. [spark] - posted by "planga82 (via GitHub)" <gi...@apache.org> on 2024/02/26 15:08:23 UTC, 14 replies.
- [PR] Disable parquet filter pushdown when working with non default collated strings [spark] - posted by "stefankandic (via GitHub)" <gi...@apache.org> on 2024/02/26 16:12:33 UTC, 0 replies.
- Re: [PR] [SPARK-47170][BUILD][CONNECT] Remove `jakarta.servlet-api` and `javax.servlet-api` dependency scope in `connect/server` module [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/26 16:57:40 UTC, 1 replies.
- Re: [PR] [SPARK-47165][SQL][TESTS] Pull docker image only when its' absent [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/26 18:22:29 UTC, 3 replies.
- Re: [PR] [SPARK-47166][INFRA] Improves the HINTs of merge_spark_pr.py [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/26 18:24:10 UTC, 4 replies.
- [PR] [SPARK-47173][SS][UI][MINOR] Fix a typo in streaming UI explanation [spark] - posted by "WweiL (via GitHub)" <gi...@apache.org> on 2024/02/26 18:29:38 UTC, 0 replies.
- Re: [PR] [SPARK-47173][SS][UI] Fix a typo in streaming UI explanation [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/26 18:51:19 UTC, 3 replies.
- Re: [PR] [SPARK-46639][SQL] Add WindowExec SQLMetrics [spark] - posted by "erenavsarogullari (via GitHub)" <gi...@apache.org> on 2024/02/26 19:34:38 UTC, 0 replies.
- [PR] [SPARK-45527][CORE][TESTS][FOLLOWUP] Reduce the number of threads from 1k to 100 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/26 22:05:50 UTC, 1 replies.
- Re: [PR] [SPARK-45527][CORE][TESTS][FOLLOWUP] Reduce the number of threads from 1k to 100 in `TaskSchedulerImplSuite` [spark] - posted by "tgravescs (via GitHub)" <gi...@apache.org> on 2024/02/26 22:18:42 UTC, 3 replies.
- [PR] [SPARK-47175][SS][TESTS] Remove ZOOKEEPER-1844 comment from `KafkaTestUtils` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/26 23:21:40 UTC, 3 replies.
- [PR] [SPARK-39771] Add a warning msg in `Dependency` when a too large number of shuffle blocks is to be created. [spark] - posted by "y-wei (via GitHub)" <gi...@apache.org> on 2024/02/26 23:23:53 UTC, 2 replies.
- Re: [PR] [SPARK-47032][Python] Add UDTF API for "analyze" method to identify pass-through columns to output table [spark] - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2024/02/27 00:00:18 UTC, 7 replies.
- Re: [PR] [SPARK-46881][CORE] Support `spark.deploy.workerSelectionPolicy` [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/27 00:06:43 UTC, 3 replies.
- Re: [PR] [MINOR][CORE] Rename scheduler ref for readability [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/27 00:18:27 UTC, 1 replies.
- Re: [PR] [SPARK-42399] [SQL] Support big numbers for conv function (get rid of overflow) [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/27 00:18:31 UTC, 1 replies.
- [PR] [SPARK-47094][SQL] SPJ : Dynamically rebalance number of buckets when they are not equal [spark] - posted by "szehon-ho (via GitHub)" <gi...@apache.org> on 2024/02/27 00:41:17 UTC, 9 replies.
- [PR] [SPARK-45527][CORE][TESTS][FOLLOW-UP] Reduce the number of test cases in fraction resource calculation [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/27 00:54:09 UTC, 12 replies.
- [PR] [WIP] Documentation for SparkSession-based Profilers [spark] - posted by "xinrong-meng (via GitHub)" <gi...@apache.org> on 2024/02/27 00:56:14 UTC, 1 replies.
- [PR] [SPARK-47176][SQL] Have a ResolveAllExpressionsUpWithPruning helper function [spark] - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2024/02/27 01:12:50 UTC, 5 replies.
- Re: [PR] [SPARK-39771][CORE] Add a warning msg in `Dependency` when a too large number of shuffle blocks is to be created. [spark] - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2024/02/27 01:38:26 UTC, 21 replies.
- Re: [PR] [SPARK-47166][INFRA] Improves the HINTs of merge_spark_pr.py [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/27 02:25:23 UTC, 2 replies.
- [PR] [SPARK-47178][PYTHON][TESTS]Add a test case for createDataFrame with dataclasses [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/27 02:31:30 UTC, 0 replies.
- Re: [PR] [SPARK-47178][PYTHON][TESTS] Add a test case for createDataFrame with dataclasses [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/27 02:33:51 UTC, 4 replies.
- [PR] [SPARK-47179][SQL] Improve error message from SparkThrowableSuite for better debuggability [spark] - posted by "itholic (via GitHub)" <gi...@apache.org> on 2024/02/27 02:56:17 UTC, 12 replies.
- [PR] [SPARK-47181][CORE][TESTS] Fix `MasterSuite` to validate the number of registered workers [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/27 03:32:17 UTC, 3 replies.
- [PR] [MINOR][SS] Minor string representation improvement in AssertOnQuery [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/27 03:50:31 UTC, 5 replies.
- [PR] [MINOR][SQL] Tweak column error names and text [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/27 03:52:22 UTC, 2 replies.
- [PR] [SPARK-41811][PYTHON][CONNECT] Implement `SQLStringFormatter` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/27 05:23:19 UTC, 9 replies.
- [PR] [SPARK-47182][BUILD] Exclude `commons-(io|lang3)` transitive dependencies from `commons-compress` and `avro-*` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/27 05:39:50 UTC, 0 replies.
- Re: [PR] [SPARK-47182][BUILD] Exclude `commons-(io|lang3)` transitive dependencies from `commons-compress` and `avro*` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/27 06:01:00 UTC, 9 replies.
- Re: [PR] [SPARK-47153][CORE] Guard serialize/deserialize in JavaSerializer with try-with-resource block [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/27 06:12:58 UTC, 2 replies.
- [PR] [SPARK-41873][PYTHON][CONNECT][TESTS] Enable `DataFrameParityTests.test_pandas_api` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/27 07:45:48 UTC, 2 replies.
- [PR] [SPARK-47183][PYTHON] Fix the error class for `sameSemantics` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/27 08:49:16 UTC, 2 replies.
- [PR] [SPARK-47184][PYTHON][CONNECT][TESTS] Make `test_repartitionByRange_dataframe` reusable [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/27 09:11:59 UTC, 4 replies.
- [PR] [SPARK-47177][SQL] Cached SQL plan do not display final AQE plan in explain string [spark] - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2024/02/27 09:44:32 UTC, 4 replies.
- [PR] [SPARK-47185][SS][TESTS] Increase timeout between actions in KafkaContinuousSourceSuite [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/27 10:20:19 UTC, 3 replies.
- [PR] [SPARK-47186][DOCKER][TESTS] Add some timeouts options and logs to improve the debuggability for docker integration test [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/27 11:43:40 UTC, 4 replies.
- [PR] [SPARK-47102][SQL][COLLATION] Add COLLATION_ENABLED config flag [spark] - posted by "mihailom-db (via GitHub)" <gi...@apache.org> on 2024/02/27 11:47:48 UTC, 39 replies.
- [PR] [SPARK-47187][SQL][3.4] Fix hive compress output config does not work [spark] - posted by "ulysses-you (via GitHub)" <gi...@apache.org> on 2024/02/27 11:48:01 UTC, 3 replies.
- [PR] [SPARK-47188][SQL] Add configuration to determine whether to exclude hive statistics properties [spark] - posted by "huangxiaopingRD (via GitHub)" <gi...@apache.org> on 2024/02/27 12:14:58 UTC, 0 replies.
- [PR] [SPARK-43258][SQL] Assign names to error _LEGACY_ERROR_TEMP_202[3,5] [spark] - posted by "dengziming (via GitHub)" <gi...@apache.org> on 2024/02/27 12:21:33 UTC, 1 replies.
- Re: [PR] [SPARK-47145][SQL] Pass table identifier to row data source scan exec for V2 strategy. [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/27 13:32:54 UTC, 4 replies.
- Re: [PR] [SPARK-47189][SQL] Tweak column error names and text [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/27 14:29:23 UTC, 2 replies.
- Re: [PR] [SPARK-47147][PYTHON][SQL] Fix PySpark collated string conversion error [spark] - posted by "nikolamand-db (via GitHub)" <gi...@apache.org> on 2024/02/27 14:51:31 UTC, 3 replies.
- [PR] [SPARK-47191][SQL] Avoid unnecessary relation lookup when uncaching table/view [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/27 15:22:57 UTC, 2 replies.
- Re: [PR] [SPARK-43256][SQL] Remove error class _LEGACY_ERROR_TEMP_2021 [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/27 16:40:15 UTC, 3 replies.
- [PR] [SPARK-46834][SQL][Collations] Support for aggregates [spark] - posted by "dbatomic (via GitHub)" <gi...@apache.org> on 2024/02/27 16:56:20 UTC, 17 replies.
- [PR] [SPARK-47192] Convert some _LEGACY_ERROR_TEMP_0035 errors [spark] - posted by "srielau (via GitHub)" <gi...@apache.org> on 2024/02/27 17:25:02 UTC, 6 replies.
- [PR] [SPARK-47194][BUILD] Upgrade log4j to 2.23.0 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/27 17:40:49 UTC, 13 replies.
- [PR] [WIP][SPARK-47033][SQL] Fix EXECUTE IMMEDIATE USING does not recognize session variable names [spark] - posted by "andrej-db (via GitHub)" <gi...@apache.org> on 2024/02/27 18:08:02 UTC, 19 replies.
- [PR] [SPARK-47063][SQL] CAST long to timestamp has different behavior for codegen vs interpreted [spark] - posted by "planga82 (via GitHub)" <gi...@apache.org> on 2024/02/27 20:26:23 UTC, 4 replies.
- [PR] [SPARK-47196][CORE][BUILD][3.4] Fix `core` module to succeed SBT tests [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/27 21:24:40 UTC, 5 replies.
- Re: [PR] [SPARK-43157][SQL] Clone InMemoryRelation cached plan to prevent cloned plan from referencing same objects [spark] - posted by "liuzqt (via GitHub)" <gi...@apache.org> on 2024/02/27 21:45:46 UTC, 2 replies.
- [PR] [SPARK-45527][SPARK-47185][TESTS][FOLLOW-UP] Increase timeout more, and reduce the resource usage [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/27 23:43:09 UTC, 1 replies.
- [PR] [SPARK-47199][PYTHON][TESTS] Add prefix into TemporaryDirectory to avoid flakiness [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/28 01:34:53 UTC, 3 replies.
- Re: [PR] [SPARK-45527][SPARK-47185][SS][TESTS][FOLLOW-UP] Increase timeout more, and reduce the resource usage [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/28 01:36:47 UTC, 1 replies.
- Re: [PR] [WIP][SPARK-45880][SQL] Introduce a new TableCatalog.listTable overload th… [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/28 01:53:03 UTC, 13 replies.
- [PR] [SPARK-47200][SS] Error class for Foreach batch sink user function error [spark] - posted by "micheal-o (via GitHub)" <gi...@apache.org> on 2024/02/28 01:53:25 UTC, 5 replies.
- [PR] [SPARK-47201][PYTHON][CONNECT] `sameSemantics` checks input types [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/28 02:27:42 UTC, 3 replies.
- Re: [PR] [SPARK-45599][CORE][3.5] Use object equality in OpenHashSet [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/28 02:31:24 UTC, 1 replies.
- [PR] [SPARK-47202][PySpark] Fix typo breaking datetimes with tzinfo [spark] - posted by "arzavj (via GitHub)" <gi...@apache.org> on 2024/02/28 02:38:14 UTC, 4 replies.
- [PR] [SPARK-43255][SQL]Replace the error class _LEGACY_ERROR_TEMP_2020 by an internal error [spark] - posted by "JinHelin404 (via GitHub)" <gi...@apache.org> on 2024/02/28 02:43:07 UTC, 2 replies.
- Re: [PR] [SPARK-47202][PYTHON] Fix typo breaking datetimes with tzinfo [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/28 02:58:08 UTC, 6 replies.
- [PR] [SPARK-46525][FOLLOWUP] Cleanup http client deps for spotify docker client [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/28 03:19:21 UTC, 0 replies.
- [PR] [SPARK-47203][DOCKER][TEST] Use gvenzl/oracle-free:23.3-slim to reduce disk usage for docker it [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/28 03:29:22 UTC, 2 replies.
- [PR] [WIP] implement Python streaming data sink [spark] - posted by "chaoqin-li1123 (via GitHub)" <gi...@apache.org> on 2024/02/28 04:43:41 UTC, 0 replies.
- Re: [PR] [SPARK-47144][CONNECT][SQL][PYTHON] Fix Spark Connect collation error by adding collateId protobuf field [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/28 05:06:36 UTC, 1 replies.
- Re: [PR] [SPARK-46525][BUILD][TESTS][FOLLOWUP] Cleanup http client deps for spotify docker client [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/28 05:31:14 UTC, 1 replies.
- [PR] [SPARK-47155] Fix Error Class Issue [spark] - posted by "sunan135 (via GitHub)" <gi...@apache.org> on 2024/02/28 06:21:06 UTC, 1 replies.
- [PR] [SPARK-47205][DOCKER][TESTS] Upgrade docker-java to 3.3.5 [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/28 06:21:17 UTC, 0 replies.
- [PR] [SPARK-47206] Add official image Dockerfile for Apache Spark 3.5.1 [spark-docker] - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2024/02/28 06:24:19 UTC, 4 replies.
- [PR] [SPARK-47202][PYTHON][TESTS][FOLLOW-UP]Test timestamp with tzinfo in toPandas and createDataFrame [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/28 06:31:30 UTC, 0 replies.
- [PR] [SPARK-47197] Failed to connect HiveMetastore when using iceberg with HiveCatalog on spark-sql or spark-shell [spark] - posted by "eubnara (via GitHub)" <gi...@apache.org> on 2024/02/28 06:35:13 UTC, 7 replies.
- [PR] [SPARK-42929][CONNECT][PYTHON][TEST] test barrier mode for mapInPandas/mapInArrow [spark] - posted by "wbo4958 (via GitHub)" <gi...@apache.org> on 2024/02/28 07:30:40 UTC, 3 replies.
- Re: [PR] [SPARK-47202][PYTHON][TESTS][FOLLOW-UP] Test timestamp with tzinfo in toPandas and createDataFrame with Arrow optimized [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/28 07:56:44 UTC, 1 replies.
- [PR] minor: Remove out-of-dated comment in `CollectLimitExec` [spark] - posted by "viirya (via GitHub)" <gi...@apache.org> on 2024/02/28 08:38:41 UTC, 0 replies.
- Re: [PR] [SPARK-47205][BUILD][TESTS] Upgrade docker-java to 3.3.5 [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/28 08:38:59 UTC, 1 replies.
- Re: [PR] [MINOR][SQL] Remove out-of-dated comment in `CollectLimitExec` [spark] - posted by "viirya (via GitHub)" <gi...@apache.org> on 2024/02/28 08:41:01 UTC, 3 replies.
- [PR] [BUILD] Test sbt 1.9.9 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/28 08:55:40 UTC, 2 replies.
- [PR] [SPARK-47207][CORE] Support `spark.driver.timeout` and `DriverTimeoutPlugin` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/28 09:10:33 UTC, 7 replies.
- Re: [PR] [SPARK-47208] Allow overriding base overhead memory [spark] - posted by "jpcorreia99 (via GitHub)" <gi...@apache.org> on 2024/02/28 10:39:26 UTC, 1 replies.
- [PR] SPARK-42040: SPJ: Introduce a new API for V2 input partition to …t partition size [spark] - posted by "zhuqi-lucas (via GitHub)" <gi...@apache.org> on 2024/02/28 11:31:33 UTC, 1 replies.
- [PR] [SPARK-47209][BUILD] Upgrade slf4j to 2.0.12 [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/28 13:15:45 UTC, 2 replies.
- Re: [PR] [SPARK-46919][BUILD][INFRA][CONNECT] Upgrade `grpcio*` and `grpc-java` to 1.62.x [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/28 13:32:43 UTC, 0 replies.
- Re: [PR] [SPARK-47168][SQL] Disable parquet filter pushdown when working with non default collated strings [spark] - posted by "mkaravel (via GitHub)" <gi...@apache.org> on 2024/02/28 14:36:01 UTC, 1 replies.
- Re: [PR] [SPARK-41392][BUILD] Make maven build Spark master with Hadoop 3.4.0-SNAPSHOT successful [spark] - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2024/02/28 14:46:01 UTC, 0 replies.
- [PR] [SPARK-47211][CONNECT][PYTHON] Fix ignored PySpark Connect string collation [spark] - posted by "nikolamand-db (via GitHub)" <gi...@apache.org> on 2024/02/28 15:52:55 UTC, 10 replies.
- [PR] [SPARK-41392][BUILD] Spark master to build with Hadoop 3.4.0 [spark] - posted by "steveloughran (via GitHub)" <gi...@apache.org> on 2024/02/28 16:06:53 UTC, 5 replies.
- Re: [PR] [SPARK-41392][BUILD][TESTS] Add `bouncy-castle` test dependencies to `sql/core` module for Hadoop 3.4.0 [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/28 16:22:14 UTC, 4 replies.
- [PR] [MINOR][DOCS] Remove extraneous whitespace from SQL tuning page [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/28 18:34:06 UTC, 3 replies.
- Re: [PR] [SPARK-47167][SQL] Add concrete class for JDBC anonymous relation [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/28 18:59:53 UTC, 1 replies.
- Re: [PR] [SPARK-47169][SQL] Disable bucketing on collated columns [WIP] [spark] - posted by "MaxGekk (via GitHub)" <gi...@apache.org> on 2024/02/28 19:09:59 UTC, 0 replies.
- [PR] [SPARK-47214][Python] Create API for 'analyze' method to differentiate constant NULL arguments and other types of arguments [spark] - posted by "dtenedor (via GitHub)" <gi...@apache.org> on 2024/02/28 19:13:20 UTC, 1 replies.
- [PR] [SPARK-47215][CORE][TESTS] Reduce the number of required threads in M… [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/28 19:13:53 UTC, 0 replies.
- Re: [PR] [SPARK-47214][Python] Create UDTF API for 'analyze' method to differentiate constant NULL arguments and other types of arguments [spark] - posted by "ueshin (via GitHub)" <gi...@apache.org> on 2024/02/28 20:02:43 UTC, 8 replies.
- [PR] [SPARK-47176][SQL][FOLLOW-UP] Rename resolveExpressionsWithPruning to resolveExpressionsDownWithPruning [spark] - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2024/02/28 21:42:34 UTC, 2 replies.
- [PR] [SPARK-47216][DOCS] Refine layout of SQL performance tuning page [spark] - posted by "nchammas (via GitHub)" <gi...@apache.org> on 2024/02/28 23:23:09 UTC, 0 replies.
- [PR] [SPARK-44265] Ignore commented row tags in XML tokenizer [spark] - posted by "yhosny (via GitHub)" <gi...@apache.org> on 2024/02/28 23:52:18 UTC, 1 replies.
- [PR] [SPARK-47202][PYTHON][TESTS][FOLLOW-UP] Run the test only with Python 3.9+ [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/28 23:56:00 UTC, 2 replies.
- Re: [PR] [SPARK-46010][BUILD] Upgrade `Node.js` to 18.X [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/29 00:18:05 UTC, 0 replies.
- Re: [PR] [SPARK-45977][BUILD] Make sbt doc execute successfully [spark] - posted by "github-actions[bot] (via GitHub)" <gi...@apache.org> on 2024/02/29 00:18:06 UTC, 0 replies.
- Re: [PR] [SPARK-47215][CORE][TESTS] Reduce the number of required threads in `MasterSuite` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/29 00:21:13 UTC, 1 replies.
- [PR] [SPARK-47218] XML: Ignore commented row tags in XML tokenizer [spark] - posted by "yhosny (via GitHub)" <gi...@apache.org> on 2024/02/29 00:39:49 UTC, 0 replies.
- [PR] [WIP][SPARK-47194][BUILD] Upgrade log4j to 2.23.0 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/29 01:39:40 UTC, 1 replies.
- [PR] [SPARK-47206][FOLLOWUP] Fix wrong path version [spark-docker] - posted by "Yikun (via GitHub)" <gi...@apache.org> on 2024/02/29 01:44:30 UTC, 2 replies.
- Re: [PR] [SPARK-43744][CONNECT] Fix class loading problem caused by stub user classes not found on the server classpath [spark] - posted by "tenstriker (via GitHub)" <gi...@apache.org> on 2024/02/29 01:45:39 UTC, 0 replies.
- Re: [PR] [SPARK-47078][DOCS][PYTHON] Documentation for SparkSession-based Profilers [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/29 02:01:46 UTC, 4 replies.
- Re: [PR] [SPARK-47218][SQL] XML: Ignore commented row tags in XML tokenizer [spark] - posted by "sandip-db (via GitHub)" <gi...@apache.org> on 2024/02/29 02:15:52 UTC, 0 replies.
- [PR] [SPARK-47146][CORE] Possible thread leak when doing sort merge join [spark] - posted by "JacobZheng0927 (via GitHub)" <gi...@apache.org> on 2024/02/29 02:31:08 UTC, 12 replies.
- [PR] [SPARK-47221][SQL] Uses signatures from CsvParser to AbstractParser [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/29 02:47:18 UTC, 2 replies.
- [PR] [SPARK-47222][SQL] fileCompressionFactor also applied to the size of the table [spark] - posted by "wangyum (via GitHub)" <gi...@apache.org> on 2024/02/29 03:28:54 UTC, 1 replies.
- [PR] [SPARK-47186][DOCKER][FOLLOWUP] Reduce test time for docker ITs [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/29 03:53:59 UTC, 2 replies.
- Re: [PR] [SPARK-47176][SQL][FOLLOW-UP] resolveExpressions should have three versions which is the same as resolveOperators [spark] - posted by "amaliujia (via GitHub)" <gi...@apache.org> on 2024/02/29 04:02:46 UTC, 2 replies.
- Re: [PR] [SPARK-47148][SQL] Avoid to materialize AQE QueryStages on the cancellation [spark] - posted by "erenavsarogullari (via GitHub)" <gi...@apache.org> on 2024/02/29 04:16:58 UTC, 0 replies.
- [PR] [SPARK-47223][MINOR][CORE] Update usage of deprecated Thread.getId() to Thread.threadId() [spark] - posted by "neilagupta (via GitHub)" <gi...@apache.org> on 2024/02/29 04:32:06 UTC, 0 replies.
- [PR] [SPARK-47224][PS][TESTS] Split `test_split_apply_basic` [spark] - posted by "zhengruifeng (via GitHub)" <gi...@apache.org> on 2024/02/29 04:42:37 UTC, 0 replies.
- Re: [PR] [SPARK-47148][SQL] Avoid to materialize AQE ExchangeQueryStageExec on the cancellation [spark] - posted by "erenavsarogullari (via GitHub)" <gi...@apache.org> on 2024/02/29 04:44:10 UTC, 2 replies.
- [PR] [MINOR][PYTHON][DOCS] Clarify verifySchema at createDataFrame not working with pandas DataFrame with Arrow optimization [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/29 05:50:41 UTC, 2 replies.
- [PR] [MINOR] Update outdated comments for class `o.a.s.s.functions` [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/29 08:45:47 UTC, 2 replies.
- Re: [PR] [SPARK-37932][SQL]Wait to resolve missing attributes before applying DeduplicateRelations [spark] - posted by "martinf-moodys (via GitHub)" <gi...@apache.org> on 2024/02/29 09:59:54 UTC, 0 replies.
- [PR] [SPARK-47227] Improve documentation for Spark Connect [spark] - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2024/02/29 10:19:22 UTC, 0 replies.
- [PR] [CORE][TEST][MINOR] FakeTask should reference its TaskMetrics to avoid TaskMetrics accumulators being GCed before stage completion [spark] - posted by "cloud-fan (via GitHub)" <gi...@apache.org> on 2024/02/29 10:36:35 UTC, 1 replies.
- [PR] [SPARK-42627][SPARK-26494][SQL] Support Oracle TIMESTAMP WITH LOCAL TIME ZONE [spark] - posted by "yaooqinn (via GitHub)" <gi...@apache.org> on 2024/02/29 11:22:05 UTC, 2 replies.
- Re: [PR] [SPARK-47224][PS][TESTS] Split `test_split_apply_basic` and `test_split_apply_adv` [spark] - posted by "HyukjinKwon (via GitHub)" <gi...@apache.org> on 2024/02/29 12:52:19 UTC, 1 replies.
- [PR] [SPARK-47229][CORE][SQL][YARN] Change the never changed `var` to `val` [spark] - posted by "LuciferYang (via GitHub)" <gi...@apache.org> on 2024/02/29 13:10:49 UTC, 3 replies.
- Re: [PR] [SPARK-47227][DOCS] Improve documentation for Spark Connect [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/29 15:45:16 UTC, 3 replies.
- Re: [PR] [SPARK-47231][CORE][TESTS] FakeTask should reference its TaskMetrics to avoid TaskMetrics accumulators being GCed before stage completion [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/29 15:49:19 UTC, 2 replies.
- [PR] [SPARK-47227][FOLLOW][DOCS] Improve Spark Connect Documentation [spark] - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2024/02/29 16:02:08 UTC, 2 replies.
- [PR] [WIP][SPARK-47227][FOLLOW][DOCS] Building Extensions [spark] - posted by "grundprinzip (via GitHub)" <gi...@apache.org> on 2024/02/29 16:19:36 UTC, 0 replies.
- Re: [PR] [SPARK-47223][SQL][CORE] Update usage of deprecated Thread.getId() to Thread.threadId() [spark] - posted by "neilagupta (via GitHub)" <gi...@apache.org> on 2024/02/29 16:52:24 UTC, 5 replies.
- Re: [PR] [SPARK-47229][CORE][SQL][SS][YARN][CONNECT][TESTS] Change the never changed `var` to `val` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/29 18:41:57 UTC, 0 replies.
- Re: [PR] [SPARK-47229][CORE][SQL][SS][YARN][CONNECT] Change the never changed `var` to `val` [spark] - posted by "dongjoon-hyun (via GitHub)" <gi...@apache.org> on 2024/02/29 18:42:40 UTC, 3 replies.
- [PR] [SS] Add MapState implementation for State API v2. [spark] - posted by "jingz-db (via GitHub)" <gi...@apache.org> on 2024/02/29 23:12:37 UTC, 0 replies.
- [PR] [WIP][SPARK-47234][BUILD] Upgrade Scala to 2.13.13 [spark] - posted by "panbingkun (via GitHub)" <gi...@apache.org> on 2024/02/29 23:41:53 UTC, 0 replies.