You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Rahul Challapalli (JIRA)" <ji...@apache.org> on 2015/04/23 22:46:39 UTC
[jira] [Closed] (DRILL-1887) Wrong result for hash join on null
valued columns
[ https://issues.apache.org/jira/browse/DRILL-1887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Rahul Challapalli closed DRILL-1887.
------------------------------------
Verified and added the below testcase
Functional/Passing/joins/inner/q1_DRILL-1887.sql
> Wrong result for hash join on null valued columns
> -------------------------------------------------
>
> Key: DRILL-1887
> URL: https://issues.apache.org/jira/browse/DRILL-1887
> Project: Apache Drill
> Issue Type: Bug
> Reporter: Rahul Challapalli
> Assignee: Aman Sinha
> Fix For: 0.8.0
>
> Attachments: 0001-DRILL-1887-Add-code-gen-for-explicitly-comparing-nul.patch
>
>
> git.commit.id.abbrev=9dfa4a1
> Dataset1:
> {code}
> {
> "col1":1,
> "col2":"abc"
> }
> {
> "col1":2,
> "col2":null
> }
> {code}
> Dataset 2:
> {code}
> {
> "col1":1,
> "col2":null
> }
> {
> "col1":2,
> "col2":null
> }
> {
> "col1":2,
> "col2":"abc"
> }
> {code}
> Query :
> {code}
> select * from `a.json` a join `b.json` b on a.col2=b.col2;
> +------------+------------+------------+------------+
> | col1 | col2 | col10 | col20 |
> +------------+------------+------------+------------+
> | 1 | abc | 2 | abc |
> | 2 | null | 1 | null |
> | 2 | null | 2 | null |
> +------------+------------+------------+------------+
> {code}
> Most database engines treat 2 NULL values as not being equal. This makes it hard to compare DRILL's results against any Standard SQL engine. However I am not sure whether drill is intentionally designed to behave this way.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)