You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Rahul Challapalli (JIRA)" <ji...@apache.org> on 2015/04/29 21:31:08 UTC
[jira] [Commented] (DRILL-1593) Hash join does not support schema
changes in joins on JSON files
[ https://issues.apache.org/jira/browse/DRILL-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14520029#comment-14520029 ]
Rahul Challapalli commented on DRILL-1593:
------------------------------------------
Verified that the query is working. However did not add an automated testcase since the data is on the larger side
> Hash join does not support schema changes in joins on JSON files
> ----------------------------------------------------------------
>
> Key: DRILL-1593
> URL: https://issues.apache.org/jira/browse/DRILL-1593
> Project: Apache Drill
> Issue Type: Bug
> Components: Execution - Data Types
> Reporter: Neeraja
> Assignee: Hanifi Gunes
> Priority: Critical
> Fix For: 0.7.0
>
>
> Tried the following query and it is failing with the hash join doesnt support schema changes error. The columns queries in this case do not necessarily change schemas.
> {code:sql}
> SELECT b.name, sum(r.votes.cool)
> FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
> r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
> ON (r.business_id = b.business_id)
> GROUP BY b.name
> ORDER BY sum(r.votes.cool) DESC
> LIMIT 10;
> {code}
> 0: jdbc:drill:zk=local> SELECT b.name, sum(r.votes.cool)
> . . . . . . . . . . . > FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
> . . . . . . . . . . . > r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
> . . . . . . . . . . . > ON (r.business_id = b.business_id)
> . . . . . . . . . . . > GROUP BY b.name
> . . . . . . . . . . . > ORDER BY sum(r.votes.cool) DESC
> . . . . . . . . . . . > LIMIT 10;
> Query failed: Failure while running fragment. Hash join does not support schema changes [0114f4fe-8051-4a4b-999e-3f60828c9cff]
> Error: exception while executing query: Failure while trying to get next result batch. (state=,code=0)
> Adding the explain plan.
> 0: jdbc:drill:zk=local> explain plan for SELECT b.name, sum(r.votes.cool)
> . . . . . . . . . . . > FROM dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_review.json`
> . . . . . . . . . . . > r JOIN dfs.`/users/nrentachintala/Downloads/yelp_dataset_challenge_academic_dataset/yelp_academic_dataset_business.json` b
> . . . . . . . . . . . > ON (r.business_id = b.business_id)
> . . . . . . . . . . . > GROUP BY b.name
> . . . . . . . . . . . > ORDER BY sum(r.votes.cool) DESC
> . . . . . . . . . . . > LIMIT 10
> . . . . . . . . . . . > ;
> +------------+------------+
> | text | json |
> +------------+------------+
> | 00-00 Screen
> 00-01 Project(name=[$0], EXPR$1=[$1])
> 00-02 SelectionVectorRemover
> 00-03 Limit(fetch=[10])
> 00-04 SelectionVectorRemover
> 00-05 |
> +------------+------------+
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)