Posted to issues@drill.apache.org by "Neeraja (JIRA)" <ji...@apache.org> on 2014/10/28 19:08:34 UTC

[jira] [Updated] (DRILL-1593) Hash join does not support schema changes in joins on JSON files

     [ https://issues.apache.org/jira/browse/DRILL-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Neeraja updated DRILL-1593:
---------------------------
    Description: 
Tried the following query, and it fails with the "Hash join does not support schema changes" error. The columns queried in this case do not necessarily change schemas.

0: jdbc:drill:zk=local> SELECT  b.name, sum(r.votes.cool)
. . . . . . . . . . . > FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
. . . . . . . . . . . >  r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
. . . . . . . . . . . > ON (r.business_id = b.business_id)
. . . . . . . . . . . > GROUP BY  b.name
. . . . . . . . . . . > ORDER BY sum(r.votes.cool) DESC
. . . . . . . . . . . > LIMIT 10;
Query failed: Failure while running fragment. Hash join does not support schema changes [0114f4fe-8051-4a4b-999e-3f60828c9cff]

Error: exception while executing query: Failure while trying to get next result batch. (state=,code=0)

Adding the explain plan.

0: jdbc:drill:zk=local> explain plan for SELECT  b.name, sum(r.votes.cool)
. . . . . . . . . . . > FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
. . . . . . . . . . . >  r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
. . . . . . . . . . . > ON (r.business_id = b.business_id)
. . . . . . . . . . . > GROUP BY  b.name
. . . . . . . . . . . > ORDER BY sum(r.votes.cool) DESC
. . . . . . . . . . . > LIMIT 10
. . . . . . . . . . . > ;
+------------+------------+
|    text    |    json    |
+------------+------------+
| 00-00    Screen
00-01      Project(name=[$0], EXPR$1=[$1])
00-02        SelectionVectorRemover
00-03          Limit(fetch=[10])
00-04            SelectionVectorRemover
00-05     |
+------------+------------+

  was:
Tried the following query, and it fails with the "Hash join does not support schema changes" error. The columns queried in this case do not necessarily change schemas.

0: jdbc:drill:zk=local> SELECT  b.name, sum(r.votes.cool)
. . . . . . . . . . . > FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
. . . . . . . . . . . >  r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
. . . . . . . . . . . > ON (r.business_id = b.business_id)
. . . . . . . . . . . > GROUP BY  b.name
. . . . . . . . . . . > ORDER BY sum(r.votes.cool) DESC
. . . . . . . . . . . > LIMIT 10;
Query failed: Failure while running fragment. Hash join does not support schema changes [0114f4fe-8051-4a4b-999e-3f60828c9cff]

Error: exception while executing query: Failure while trying to get next result batch. (state=,code=0)



> Hash join does not support schema changes in joins on JSON files
> ----------------------------------------------------------------
>
>                 Key: DRILL-1593
>                 URL: https://issues.apache.org/jira/browse/DRILL-1593
>             Project: Apache Drill
>          Issue Type: Bug
>            Reporter: Neeraja
>            Priority: Critical
>
> Tried the following query, and it fails with the "Hash join does not support schema changes" error. The columns queried in this case do not necessarily change schemas.
> 0: jdbc:drill:zk=local> SELECT  b.name, sum(r.votes.cool)
> . . . . . . . . . . . > FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
> . . . . . . . . . . . >  r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
> . . . . . . . . . . . > ON (r.business_id = b.business_id)
> . . . . . . . . . . . > GROUP BY  b.name
> . . . . . . . . . . . > ORDER BY sum(r.votes.cool) DESC
> . . . . . . . . . . . > LIMIT 10;
> Query failed: Failure while running fragment. Hash join does not support schema changes [0114f4fe-8051-4a4b-999e-3f60828c9cff]
> Error: exception while executing query: Failure while trying to get next result batch. (state=,code=0)
> Adding the explain plan.
> 0: jdbc:drill:zk=local> explain plan for SELECT  b.name, sum(r.votes.cool)
> . . . . . . . . . . . > FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
> . . . . . . . . . . . >  r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
> . . . . . . . . . . . > ON (r.business_id = b.business_id)
> . . . . . . . . . . . > GROUP BY  b.name
> . . . . . . . . . . . > ORDER BY sum(r.votes.cool) DESC
> . . . . . . . . . . . > LIMIT 10
> . . . . . . . . . . . > ;
> +------------+------------+
> |    text    |    json    |
> +------------+------------+
> | 00-00    Screen
> 00-01      Project(name=[$0], EXPR$1=[$1])
> 00-02        SelectionVectorRemover
> 00-03          Limit(fetch=[10])
> 00-04            SelectionVectorRemover
> 00-05     |
> +------------+------------+
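
The claim in the description that the queried columns do not change schema can be checked directly against the two input files. The following is a minimal sketch, not part of the original report. It assumes each file holds one JSON object per line, and the helper name field_types is hypothetical. It records which JSON value types appear for each queried field (business_id, name, votes.cool) and flags records where the field is absent.

```python
import json
from collections import defaultdict

# Sentinel for "field not present in this record" (distinct from JSON null).
_MISSING = object()


def field_types(lines, paths):
    """Map each dotted field path to the set of value types seen across records.

    Assumption: `lines` yields one JSON object per line (newline-delimited JSON).
    """
    seen = defaultdict(set)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        for path in paths:
            value = record
            # Walk nested objects, e.g. "votes.cool" -> record["votes"]["cool"].
            for key in path.split("."):
                if isinstance(value, dict) and key in value:
                    value = value[key]
                else:
                    value = _MISSING
                    break
            seen[path].add("missing" if value is _MISSING else type(value).__name__)
    return dict(seen)
```

To use it, open each file and pass the file object, for example field_types(open(path), ["business_id", "votes.cool"]) for the review file and ["business_id", "name"] for the business file. A field that shows more than one type, or that is missing in some records, is a candidate source of a schema change between batches. Whether that is what triggers the error in this issue is not established by the report.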


