Posted to issues@drill.apache.org by "Khurram Faraaz (JIRA)" <ji...@apache.org> on 2017/03/31 07:17:41 UTC

[jira] [Commented] (DRILL-3562) Query fails when using flatten on JSON data where some documents have an empty array

    [ https://issues.apache.org/jira/browse/DRILL-3562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15950452#comment-15950452 ] 

Khurram Faraaz commented on DRILL-3562:
---------------------------------------

[~arina] Is this the expected result for the second SQL query below?

{noformat}
0: jdbc:drill:schema=dfs.tmp> select * from `drill_3562.json`;
+-----------------+
|        a        |
+-----------------+
| {"b":{"c":[]}}  |
+-----------------+
1 row selected (0.138 seconds)
0: jdbc:drill:schema=dfs.tmp> select FLATTEN(t.a.b.c) AS c from `drill_3562.json` t;
+----+
| c  |
+----+
+----+
No rows selected (0.181 seconds)
{noformat}
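
For reference, the semantics in question can be sketched in a few lines of Python. This is only an illustration under a stated assumption, namely that flatten emits one output row per array element, so a record whose array is empty contributes no rows. The `flatten` helper and its `path` argument are hypothetical names for this sketch and are not part of Drill's code or API.

```python
import json

def flatten(records, path):
    """Yield one value per element of the array found at `path` in each record.

    Hypothetical helper: models the assumed flatten semantics, not Drill's implementation.
    """
    for record in records:
        node = record
        for key in path:
            # Walk down the nested maps; a missing key is treated as "no array".
            node = node.get(key, {}) if isinstance(node, dict) else {}
        # An empty (or missing) array yields nothing, so the record produces no rows.
        for element in node if isinstance(node, list) else []:
            yield element

# The two sample documents from the issue description.
docs = [
    json.loads('{ "a": { "b": { "c": [ { "d": { "e": "f" } } ] } } }'),
    json.loads('{ "a": { "b": { "c": [] } } }'),
]

rows = list(flatten(docs, ["a", "b", "c"]))
# Equivalent of the outer filter: WHERE flat.c.d.e = 'f'
matches = [r for r in rows if r.get("d", {}).get("e") == "f"]
print(len(rows), len(matches))
```

Under that assumption, a file containing only the empty-array document yields no rows at all, which is what the second query above returns.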

> Query fails when using flatten on JSON data where some documents have an empty array
> ------------------------------------------------------------------------------------
>
>                 Key: DRILL-3562
>                 URL: https://issues.apache.org/jira/browse/DRILL-3562
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - JSON
>    Affects Versions: 1.1.0
>            Reporter: Philip Deegan
>            Assignee: Serhii Harnyk
>             Fix For: 1.10.0
>
>
> A Drill query that uses FLATTEN fails when some records contain an empty array:
> {noformat}
> SELECT COUNT(*) FROM (SELECT FLATTEN(t.a.b.c) AS c FROM dfs.`flat.json` t) flat WHERE flat.c.d.e = 'f' limit 1;
> {noformat}
> Succeeds on:
> { "a": { "b": { "c": [  { "d": {  "e": "f" } } ] } } }
> Fails on:
> { "a": { "b": { "c": [] } } }
> Error:
> {noformat}
> Error: SYSTEM ERROR: ClassCastException: Cannot cast org.apache.drill.exec.vector.NullableIntVector to org.apache.drill.exec.vector.complex.RepeatedValueVector
> {noformat}
> Is it possible to ignore the empty arrays, or do they need to be populated with dummy data?


