Posted to issues@drill.apache.org by "Rahul Challapalli (JIRA)" <ji...@apache.org> on 2015/04/29 21:31:08 UTC

[jira] [Commented] (DRILL-1593) Hash join does not support schema changes in joins on JSON files

    [ https://issues.apache.org/jira/browse/DRILL-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14520029#comment-14520029 ] 

Rahul Challapalli commented on DRILL-1593:
------------------------------------------

Verified that the query is working. However, I did not add an automated test case, since the data set is on the larger side.



> Hash join does not support schema changes in joins on JSON files
> ----------------------------------------------------------------
>
>                 Key: DRILL-1593
>                 URL: https://issues.apache.org/jira/browse/DRILL-1593
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Data Types
>            Reporter: Neeraja
>            Assignee: Hanifi Gunes
>            Priority: Critical
>             Fix For: 0.7.0
>
>
> Tried the following query and it fails with the "Hash join does not support schema changes" error. The columns queried in this case do not necessarily change schemas.
> {code:sql}
> SELECT  b.name, sum(r.votes.cool)
> FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
>  r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
> ON (r.business_id = b.business_id)
> GROUP BY  b.name
> ORDER BY sum(r.votes.cool) DESC
> LIMIT 10;
> {code}
> 0: jdbc:drill:zk=local> SELECT  b.name, sum(r.votes.cool)
> . . . . . . . . . . . > FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
> . . . . . . . . . . . >  r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
> . . . . . . . . . . . > ON (r.business_id = b.business_id)
> . . . . . . . . . . . > GROUP BY  b.name
> . . . . . . . . . . . > ORDER BY sum(r.votes.cool) DESC
> . . . . . . . . . . . > LIMIT 10;
> Query failed: Failure while running fragment. Hash join does not support schema changes [0114f4fe-8051-4a4b-999e-3f60828c9cff]
> Error: exception while executing query: Failure while trying to get next result batch. (state=,code=0)
> Adding the explain plan.
> 0: jdbc:drill:zk=local> explain plan for SELECT  b.name, sum(r.votes.cool)
> . . . . . . . . . . . > FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
> . . . . . . . . . . . >  r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
> . . . . . . . . . . . > ON (r.business_id = b.business_id)
> . . . . . . . . . . . > GROUP BY  b.name
> . . . . . . . . . . . > ORDER BY sum(r.votes.cool) DESC
> . . . . . . . . . . . > LIMIT 10
> . . . . . . . . . . . > ;
> +------------+------------+
> |    text    |    json    |
> +------------+------------+
> | 00-00    Screen
> 00-01      Project(name=[$0], EXPR$1=[$1])
> 00-02        SelectionVectorRemover
> 00-03          Limit(fetch=[10])
> 00-04            SelectionVectorRemover
> 00-05     |
> +------------+------------+
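
The description states that the queried columns do not necessarily change schemas. One way to check that claim on the input files is to scan them and record which JSON types appear for each queried field. The following is a minimal sketch, not part of Drill: it assumes the files are newline-delimited JSON, the helper names (json_type, lookup, observed_types) are hypothetical, and the sample records are made up for illustration. Only the field names business_id and votes.cool come from the query above. Which of these variations Drill treats as a schema change is not stated in this issue.

```python
import json
from collections import defaultdict


def json_type(value):
    """Return a coarse JSON type name for a decoded value."""
    if value is None:
        return "null"
    if isinstance(value, bool):  # check bool before int: bool is a subclass of int
        return "boolean"
    if isinstance(value, int):
        return "integer"
    if isinstance(value, float):
        return "float"
    if isinstance(value, str):
        return "string"
    if isinstance(value, list):
        return "array"
    return "object"


def lookup(record, dotted_path):
    """Follow a dotted path such as 'votes.cool'; return (found, value)."""
    current = record
    for key in dotted_path.split("."):
        if not isinstance(current, dict) or key not in current:
            return False, None
        current = current[key]
    return True, current


def observed_types(lines, paths):
    """Map each path to the set of types seen across newline-delimited JSON records."""
    seen = defaultdict(set)
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        for path in paths:
            found, value = lookup(record, path)
            seen[path].add(json_type(value) if found else "missing")
    return dict(seen)


# Made-up sample records (assumption); real input would be the lines of a data file.
sample = [
    '{"business_id": "a1", "votes": {"cool": 1}}',
    '{"business_id": "b2", "votes": {"cool": 2.5}}',
    '{"business_id": "c3"}',
]
report = observed_types(sample, ["business_id", "votes.cool"])
for path, types in sorted(report.items()):
    print(path, sorted(types))
```

To run it against a real file, pass an open file handle in place of the sample list, since iterating a text file yields one line at a time. A field that reports more than one type, or reports "missing" for some records, is a candidate for closer inspection.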


