You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Jacques Nadeau (JIRA)" <ji...@apache.org> on 2014/11/11 04:59:33 UTC

[jira] [Commented] (DRILL-1652) flatten causing the sqlline to hang indefinitely when dealing with 0.1 GB data

    [ https://issues.apache.org/jira/browse/DRILL-1652?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14205914#comment-14205914 ] 

Jacques Nadeau commented on DRILL-1652:
---------------------------------------

With other fixes, select * works fine.  However, it looks like the flatten is failing due to another issue: RepeatedMapVector doesn't yet support split and transfer.

{code}
java.lang.UnsupportedOperationException
	at org.apache.drill.exec.vector.complex.RepeatedMapVector$MapTransferPair.splitAndTransfer(RepeatedMapVector.java:381) ~[classes/:na]
	at org.apache.drill.exec.test.generated.ProjectorGen5.projectRecords(ProjectorTemplate.java:70) ~[na:na]
	at org.apache.drill.exec.physical.impl.project.ProjectRecordBatch.doWork(ProjectRecordBatch.java:149) ~[classes/:na]
{code}

> flatten causing the sqlline to hang indefinitely when dealing with 0.1 GB data
> ------------------------------------------------------------------------------
>
>                 Key: DRILL-1652
>                 URL: https://issues.apache.org/jira/browse/DRILL-1652
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Functions - Drill, Storage - JSON
>            Reporter: Rahul Challapalli
>         Attachments: error.log, select_star_error.log
>
>
> git.commit.id.abbrev=60aa446
> I ran the below test against the private branch of Jason which has some patches for bugs related to flatten which are not yet merged into the master.
> I used the below json record and copied it over 100000 times
> {code}
> {
>     "uid" : 1,
>     "events" : [
>         { "evnt_id":"e1", "campaign_id":"c1", "event_name":"e1_name", "event_time":1000000},
>         { "evnt_id":"e2", "campaign_id":"c1", "event_name":"e2_name", "event_time":2000000},
>         { "evnt_id":"e3", "campaign_id":"c1", "event_name":"e3_name", "event_time":3000000},
>         { "evnt_id":"e4", "campaign_id":"c1", "event_name":"e4_name", "event_time":4000000},
>         { "evnt_id":"e5", "campaign_id":"c2", "event_name":"e5_name", "event_time":5000000},
>         { "evnt_id":"e6", "campaign_id":"c1", "event_name":"e6_name", "event_time":6000000},
>         { "evnt_id":"e7", "campaign_id":"c1", "event_name":"e7_name", "event_time":7000000},
>         { "evnt_id":"e8", "campaign_id":"c2", "event_name":"e8_name", "event_time":8000000},
>         { "evnt_id":"e9", "campaign_id":"c2", "event_name":"e9_name", "event_time":9000000}
>     ],
>    "transactions" : [
>         { "trans_id":"t1", "amount":100, "trans_time":7777777},
>         { "trans_id":"t2", "amount":100, "trans_time":8888888}
>     ]
> }
> {code}
> The below query which applies a flatten function hangs after returning a few thousand records
> {code}
>  select uid, flatten(transactions) from `json_kvgenflatten/transactions.json`;
> {code}
> Select * from the file also fails after displaying a bunch of records
> {code}
> select * from `json_kvgenflatten/transactions.json`;
> {code}
> The below query works
> {code}
> select uid from `json_kvgenflatten/transactions.json`;
> {code}
> I attached the error logs for the 2 failing queries



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)