You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Khurram Faraaz (JIRA)" <ji...@apache.org> on 2015/12/10 22:21:10 UTC
[jira] [Commented] (DRILL-4185) UNION ALL involving empty directory
on any side of union all results in Failed query
[ https://issues.apache.org/jira/browse/DRILL-4185?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15051667#comment-15051667 ]
Khurram Faraaz commented on DRILL-4185:
---------------------------------------
On the other hand if we use an empty file (for example, an empty JSON file named empty.json) in the directory named empty. UNION ALL returns results from the non-empty input.
{code}
[root@centos-01 ~]# hadoop fs -ls /tmp/empty
Found 1 items
-rwxr-xr-x 3 root root 0 2015-11-04 23:43 /tmp/empty/empty.json
{code}
{code}
0: jdbc:drill:schema=dfs.tmp> select key1 from empty UNION ALL select EID from Emp;
+-------+
| key1 |
+-------+
| 100 |
| 10 |
| 2 |
| 50 |
| 55 |
| 67 |
| 113 |
| 119 |
| 89 |
| 57 |
| 61 |
+-------+
11 rows selected (0.42 seconds)
{code}
> UNION ALL involving empty directory on any side of union all results in Failed query
> ------------------------------------------------------------------------------------
>
> Key: DRILL-4185
> URL: https://issues.apache.org/jira/browse/DRILL-4185
> Project: Apache Drill
> Issue Type: Bug
> Components: Execution - Relational Operators
> Affects Versions: 1.4.0
> Reporter: Khurram Faraaz
>
> UNION ALL query that involves an empty directory on either side of UNION ALL operator results in FAILED query. We should return the results for the non-empty side (input) of UNION ALL.
> Note that empty_DIR is an empty directory, the directory exists, but it has no files in it.
> Drill 1.4 git.commit.id=b9068117
> 4 node cluster on CentOS
> {code}
> 0: jdbc:drill:schema=dfs.tmp> select columns[0] from empty_DIR UNION ALL select cast(columns[0] as int) c1 from `testWindow.csv`;
> Error: VALIDATION ERROR: From line 1, column 24 to line 1, column 32: Table 'empty_DIR' not found
> [Error Id: 5c024786-6703-4107-8a4a-16c96097be08 on centos-01.qa.lab:31010] (state=,code=0)
> 0: jdbc:drill:schema=dfs.tmp> select cast(columns[0] as int) c1 from `testWindow.csv` UNION ALL select columns[0] from empty_DIR;
> Error: VALIDATION ERROR: From line 1, column 90 to line 1, column 98: Table 'empty_DIR' not found
> [Error Id: 58c98bc4-99df-425c-aa07-c8c5faec4748 on centos-01.qa.lab:31010] (state=,code=0)
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)