You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Amogh Margoor (Jira)" <ji...@apache.org> on 2019/09/26 13:09:00 UTC

[jira] [Commented] (SPARK-29059) [SPIP] Support for Hive Materialized Views in Spark SQL.

    [ https://issues.apache.org/jira/browse/SPARK-29059?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16938588#comment-16938588 ] 

Amogh Margoor commented on SPARK-29059:
---------------------------------------

Pull Request here with all the optimizer changes: [GitHub Pull Request #25773|https://github.com/apache/spark/pull/25773]

> [SPIP] Support for Hive Materialized Views in Spark SQL.
> --------------------------------------------------------
>
>                 Key: SPARK-29059
>                 URL: https://issues.apache.org/jira/browse/SPARK-29059
>             Project: Spark
>          Issue Type: New Feature
>          Components: Spark Core
>    Affects Versions: 3.0.0
>            Reporter: Amogh Margoor
>            Priority: Minor
>
> Materialized view was introduced in Apache Hive 3.0.0. Currently, Spark Catalyst does not optimize queries against Hive tables using Materialized View the way Apache Calcite does it for Hive. This Jira is to add support for the same.
> We have developed it in our internal trunk and would like to open source it. It would consist of 3 major parts:
>  # Reading MV related Hive Metadata
>  # Implication Engine which would figure out if an expression exp1 implies another expression exp2 i.e., if exp1 => exp2 is a tautology. This is similar to RexImplication checker in Apache Calcite.
>  # Catalyst rule to replace tables by it's Materialized view using Implication Engine. For e.g., if MV 'mv' has been created in Hive using query 'select * from foo where x > 10 && x <110'  then query 'select * from foo where x > 70 and x < 100' will be transformed into 'select * from mv where x >70 and x < 100'
> Note that Implication Engine and Catalyst Rule is generic can be used even when Spark decides to have it's own Materialized View.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org