You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Apache Spark (JIRA)" <ji...@apache.org> on 2015/10/13 06:43:05 UTC

[jira] [Assigned] (SPARK-11077) Join elimination in Catalyst

     [ https://issues.apache.org/jira/browse/SPARK-11077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-11077:
------------------------------------

    Assignee: Apache Spark  (was: Ankur Dave)

> Join elimination in Catalyst
> ----------------------------
>
>                 Key: SPARK-11077
>                 URL: https://issues.apache.org/jira/browse/SPARK-11077
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>            Reporter: Ankur Dave
>            Assignee: Apache Spark
>
> Join elimination is a query optimization where certain joins can be eliminated when followed by projections that only keep columns from one side of the join, and when certain columns are known to be unique or foreign keys. This can be very useful for queries involving views and machine-generated queries.
> Adding join elimination to Catalyst requires (1) support for unique and foreign key hints in logical plans, (2) methods in the DataFrame API to let users provide these hints, and (3) an optimizer rule that eliminates unique key outer joins and referential integrity joins when followed by an appropriate projection.
> This proposal is described in detail here: https://docs.google.com/document/d/1-YgQSQywHfAo4PhAT-zOOkFZtVcju99h3dYQq-i9GWQ/edit?usp=sharing



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org