You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Neeraja (JIRA)" <ji...@apache.org> on 2014/08/19 03:04:20 UTC

[jira] [Created] (DRILL-1311) Hash join does not support schema changes error

Neeraja created DRILL-1311:
------------------------------

             Summary: Hash join does not support schema changes error
                 Key: DRILL-1311
                 URL: https://issues.apache.org/jira/browse/DRILL-1311
             Project: Apache Drill
          Issue Type: Bug
            Reporter: Neeraja


- Create a directory with a couple of JSON files. One with columns a,b and second with columns a,b,c. 
- a & b attributes have same data types across both the files
- create a view by selecting columns a, b from the directory
- Join the view with any other table

An error shows up indicating that 'Hash join does not support schema changes'.
There is a schema change across the files with a new element being added, however given that specific columns a,b are selected in the view, expect that query works fine.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Re: [jira] [Created] (DRILL-1311) Hash join does not support schema changes error

Posted by Jinfeng Ni <ji...@gmail.com>.
If possible, can you post the physical plan for the join query you tried?

W/o seeing the plan, all I can guess is that this SchemaChange error might
be related to Project Pushdown. Currently, Drill does not support project
pushdown for JSON storage. As a result, all the columns will be retrieved
from the storage, and given to the parent operator, HashJoin, which will
see SchemaChange, since one file contains 2 columns, and the other one
contains 3 columns.

There is a patch on the way to support Project pushdown for JSON. Once that
patch is merged, it probably will solve this issue.




On Mon, Aug 18, 2014 at 6:04 PM, Neeraja (JIRA) <ji...@apache.org> wrote:

> Neeraja created DRILL-1311:
> ------------------------------
>
>              Summary: Hash join does not support schema changes error
>                  Key: DRILL-1311
>                  URL: https://issues.apache.org/jira/browse/DRILL-1311
>              Project: Apache Drill
>           Issue Type: Bug
>             Reporter: Neeraja
>
>
> - Create a directory with a couple of JSON files. One with columns a,b and
> second with columns a,b,c.
> - a & b attributes have same data types across both the files
> - create a view by selecting columns a, b from the directory
> - Join the view with any other table
>
> An error shows up indicating that 'Hash join does not support schema
> changes'.
> There is a schema change across the files with a new element being added,
> however given that specific columns a,b are selected in the view, expect
> that query works fine.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.2#6252)
>