You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Kousuke Saruta (JIRA)" <ji...@apache.org> on 2014/10/07 14:40:34 UTC
[jira] [Updated] (SPARK-3831) Filter rule Improvement and bool
expression optimization.
[ https://issues.apache.org/jira/browse/SPARK-3831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kousuke Saruta updated SPARK-3831:
----------------------------------
Description:
If we write the filter which is always FALSE like
{code}
SELECT * from person WHERE FALSE;
{code}
200 tasks will run. I think, 1 task is enough.
And current optimizer cannot optimize the case NOT is duplicated like
{code}
SELECT * from person WHERE NOT ( NOT (age > 30));
{code}
The filter rule above should be simplified
was:
If we write the filter which is always FALSE like
{code}
SELECT * from person WHERE FALSE;
{code}
200 tasks will run. I think, 1 task is enough.
And current optimizer cannot optimize the case NOT is duplicated like
{code}
SELECT * from person WHERE NOT ( NOT (age > 30));
{code}
The filter rule above should be simplify.
> Filter rule Improvement and bool expression optimization.
> ---------------------------------------------------------
>
> Key: SPARK-3831
> URL: https://issues.apache.org/jira/browse/SPARK-3831
> Project: Spark
> Issue Type: Improvement
> Reporter: Kousuke Saruta
>
> If we write the filter which is always FALSE like
> {code}
> SELECT * from person WHERE FALSE;
> {code}
> 200 tasks will run. I think, 1 task is enough.
> And current optimizer cannot optimize the case NOT is duplicated like
> {code}
> SELECT * from person WHERE NOT ( NOT (age > 30));
> {code}
> The filter rule above should be simplified
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org