Posted to issues@spark.apache.org by "Michael Armbrust (JIRA)" <ji...@apache.org> on 2015/05/18 22:04:00 UTC
[jira] [Resolved] (SPARK-6241) hiveql ANALYZE TABLE doesn't work for external tables
[ https://issues.apache.org/jira/browse/SPARK-6241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael Armbrust resolved SPARK-6241.
-------------------------------------
Resolution: Won't Fix
Data source tables have their own mechanism for reporting statistics, which does not rely on "ANALYZE TABLE". Please reopen if that is not working for you.
> hiveql ANALYZE TABLE doesn't work for external tables
> -----------------------------------------------------
>
> Key: SPARK-6241
> URL: https://issues.apache.org/jira/browse/SPARK-6241
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 1.3.0
> Reporter: Kai Zeng
> Priority: Critical
>
> "ANALYZE TABLE" does not collect statistics for external tables, but works well for tables created by "CREATE TABLE AS SELECT".
> I also tried "REFRESH TABLE" to refresh the metadata cache, but got a NullPointerException:
> java.util.concurrent.ExecutionException: java.lang.NullPointerException
> at com.google.common.util.concurrent.AbstractFuture$Sync.getValue(AbstractFuture.java:299)
> at com.google.common.util.concurrent.AbstractFuture$Sync.get(AbstractFuture.java:286)
> at com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:116)
> at com.google.common.util.concurrent.Uninterruptibles.getUninterruptibly(Uninterruptibles.java:135)
> at com.google.common.cache.LocalCache$Segment.getAndRecordStats(LocalCache.java:2344)
> at com.google.common.cache.LocalCache$Segment$1.run(LocalCache.java:2327)
> at com.google.common.util.concurrent.MoreExecutors$SameThreadExecutorService.execute(MoreExecutors.java:297)
> at com.google.common.util.concurrent.ExecutionList.executeListener(ExecutionList.java:156)
> at com.google.common.util.concurrent.ExecutionList.add(ExecutionList.java:101)
> at com.google.common.util.concurrent.AbstractFuture.addListener(AbstractFuture.java:170)
> at com.google.common.cache.LocalCache$Segment.loadAsync(LocalCache.java:2322)
> at com.google.common.cache.LocalCache$Segment.refresh(LocalCache.java:2385)
> at com.google.common.cache.LocalCache.refresh(LocalCache.java:4085)
> at com.google.common.cache.LocalCache$LocalLoadingCache.refresh(LocalCache.java:4825)
> at org.apache.spark.sql.hive.HiveMetastoreCatalog.refreshTable(HiveMetastoreCatalog.scala:108)
> at org.apache.spark.sql.sources.RefreshTable.run(ddl.scala:404)
> at org.apache.spark.sql.execution.ExecutedCommand.sideEffectResult$lzycompute(commands.scala:55)
> at org.apache.spark.sql.execution.ExecutedCommand.sideEffectResult(commands.scala:55)
> at org.apache.spark.sql.execution.ExecutedCommand.execute(commands.scala:65)
> at org.apache.spark.sql.SQLContext$QueryExecution.toRdd$lzycompute(SQLContext.scala:1092)
> at org.apache.spark.sql.SQLContext$QueryExecution.toRdd(SQLContext.scala:1092)
> at org.apache.spark.sql.DataFrame.<init>(DataFrame.scala:134)
> at org.apache.spark.sql.DataFrame.<init>(DataFrame.scala:117)
> at org.apache.spark.sql.DataFrame$.apply(DataFrame.scala:51)
> at org.apache.spark.sql.hive.HiveContext.sql(HiveContext.scala:92)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)