Posted to issues@spark.apache.org by "Apache Spark (JIRA)" <ji...@apache.org> on 2016/12/17 05:36:58 UTC

[jira] [Assigned] (SPARK-18911) Decouple Statistics and CatalogTable

     [ https://issues.apache.org/jira/browse/SPARK-18911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-18911:
------------------------------------

    Assignee: Apache Spark

> Decouple Statistics and CatalogTable
> ------------------------------------
>
>                 Key: SPARK-18911
>                 URL: https://issues.apache.org/jira/browse/SPARK-18911
>             Project: Spark
>          Issue Type: Sub-task
>          Components: SQL
>            Reporter: Zhenhua Wang
>            Assignee: Apache Spark
>
> Statistics in LogicalPlan should refer to columns by attribute rather than by column name, because two columns from two different relations can have the same name. CatalogTable, however, has no concept of attributes, nor of the broadcast hint carried in Statistics, so putting Statistics in CatalogTable is confusing. We need to define a separate statistics structure in CatalogTable that is responsible only for interacting with the metastore and is converted to Statistics in LogicalPlan when it is used.
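
The split described above might look roughly like the following. This is only an illustrative sketch in Python, not Spark's Scala code: every class, field, and method name here (CatalogStatistics, to_plan_stats, attribute_stats, expr_id, and so on) is an assumption made for the example. It shows catalog-side statistics keyed by column name being converted into plan-side statistics keyed by attribute, so that two relations with a same-named column no longer collide.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass(frozen=True)
class ColumnStat:
    """Per-column statistics (hypothetical, reduced to two fields)."""
    distinct_count: int
    null_count: int


@dataclass(frozen=True)
class Attribute:
    """A column reference in a plan; expr_id makes it unique across relations."""
    name: str
    expr_id: int


@dataclass(frozen=True)
class Statistics:
    """Plan-side statistics: keyed by attribute, carries the broadcast hint."""
    size_in_bytes: int
    row_count: Optional[int] = None
    attribute_stats: Dict[Attribute, ColumnStat] = field(default_factory=dict)
    is_broadcastable: bool = False


@dataclass(frozen=True)
class CatalogStatistics:
    """Catalog-side statistics: keyed by column name, no plan-level concepts."""
    size_in_bytes: int
    row_count: Optional[int] = None
    col_stats: Dict[str, ColumnStat] = field(default_factory=dict)

    def to_plan_stats(self, output: List[Attribute]) -> Statistics:
        # Re-key name-based stats by the attributes of the relation using them.
        matched = {a: self.col_stats[a.name]
                   for a in output if a.name in self.col_stats}
        return Statistics(self.size_in_bytes, self.row_count, matched)


# Two tables that both have a column named "id".
t1 = CatalogStatistics(100, 10, {"id": ColumnStat(10, 0)})
t2 = CatalogStatistics(200, 20, {"id": ColumnStat(5, 1)})
a1, a2 = Attribute("id", 1), Attribute("id", 2)

s1 = t1.to_plan_stats([a1])
s2 = t2.to_plan_stats([a2])

# Keyed by attribute, both "id" columns keep their own statistics.
merged = {**s1.attribute_stats, **s2.attribute_stats}
```

Had the merged map been keyed by column name, the second "id" entry would have overwritten the first, which is the ambiguity the issue describes.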


