You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues-all@impala.apache.org by "Quanlong Huang (Jira)" <ji...@apache.org> on 2021/07/12 14:12:00 UTC

[jira] [Updated] (IMPALA-9972) Use defined referential constraints for join cardinality calculations

     [ https://issues.apache.org/jira/browse/IMPALA-9972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Quanlong Huang updated IMPALA-9972:
-----------------------------------
    Fix Version/s:     (was: Impala 4.0)

> Use defined referential constraints for join cardinality calculations
> ---------------------------------------------------------------------
>
>                 Key: IMPALA-9972
>                 URL: https://issues.apache.org/jira/browse/IMPALA-9972
>             Project: IMPALA
>          Issue Type: Sub-task
>          Components: Frontend
>            Reporter: Shant Hovsepian
>            Assignee: Shant Hovsepian
>            Priority: Major
>
> Currently an estimation technique is used to determine if the join predicates consistent a foreign key -> primary key type of functional dependency. These types of joins are common in "star schemas" and allow for certain query planning optimization.
> The current technique however can produce both false negatives and false positives given the reliance on table stats which can be out of date or incorrect due to the statistical methods used to derive stats. For example higher variability in the error rates of the HyperLogLog algorithm used by stats computation to calculate the number of distinct values for a specific column.
> In case swhere a referential integrity constraint exists and is defined in the table metadata, this information should be used instead of the stats based estimation to determine the type and cardinality of a join.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscribe@impala.apache.org
For additional commands, e-mail: issues-all-help@impala.apache.org