
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2022/05/05 08:45:26 UTC

[GitHub] [spark] cloud-fan commented on a diff in pull request #36377: [SPARK-39043][SQL] Spark SQL Hive client should not gather statistic by default.

cloud-fan commented on code in PR #36377:
URL: https://github.com/apache/spark/pull/36377#discussion_r865682677


##########
docs/sql-migration-guide.md:
##########
@@ -22,6 +22,10 @@ license: |
 * Table of contents
 {:toc}
 
+## Upgrading from Spark SQL 3.3 to 3.4
+  
+  - Since Spark 3.4, Spark disable `hive.stats.autogather` when create `HiveClientImpl`. To restore the behavior before Spark 3.4, you can set `spark.hadoop.hive.stats.autogather` to `true`.

Review Comment:
   ```suggestion
     - Since Spark 3.4, Spark disables `hive.stats.autogather` by default, which means Hive tables won't automatically update statistics that can be consumed by Hive (not Spark). To restore the behavior before Spark 3.4, you can set `spark.hadoop.hive.stats.autogather` to `true`.
   ```
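
   For illustration only, a minimal sketch of how a user might restore the pre-3.4 behavior. It assumes the property is supplied at application startup through `spark-defaults.conf`, so that it is in place before the Hive client is created; the file location and deployment details are assumptions, not part of this PR.

   ```properties
   # conf/spark-defaults.conf (assumed location; adjust for your deployment)
   # Re-enable Hive's automatic statistics gathering, which Spark 3.4 turns off by default.
   spark.hadoop.hive.stats.autogather  true
   ```

   The same key could presumably be passed per application instead, e.g. `--conf spark.hadoop.hive.stats.autogather=true` on `spark-submit`.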




