Posted to issues@drill.apache.org by "Nick Amato (JIRA)" <ji...@apache.org> on 2015/04/10 00:15:12 UTC
[jira] [Updated] (DRILL-2739) Drill stops reading columns > 4095
[ https://issues.apache.org/jira/browse/DRILL-2739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Amato updated DRILL-2739:
------------------------------
Attachment: mcols.csv
This file contains 1 row of 10000 comma-separated integers (columns). I first encountered the issue with a 20000-row file of the same data, but I was able to reproduce it with this one-line file, which saves some space.
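For anyone without access to the attachment, a comparable file can probably be generated locally. The following is only a minimal sketch, assuming Python 3; the function names and the output file name `manycols_sketch.csv` are hypothetical, and the values are random integers rather than the ones in mcols.csv. It also prints the fields around the reported boundary using zero-based indexes, which is how `columns[n]` is indexed in the queries quoted below (index 4094 corresponds to awk's `$4095`).

```python
import csv
import os
import random
import tempfile

NUM_COLUMNS = 10000  # column count described in the comment above


def write_wide_csv(path, num_columns=NUM_COLUMNS, num_rows=1, seed=0):
    """Write num_rows rows of num_columns random integers to path."""
    rng = random.Random(seed)  # fixed seed so the output is repeatable
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle)
        for _ in range(num_rows):
            writer.writerow([rng.randint(0, 9999) for _ in range(num_columns)])


def read_first_row(path):
    """Return the first row of the CSV file as a list of strings."""
    with open(path, newline="") as handle:
        return next(csv.reader(handle))


if __name__ == "__main__":
    # Hypothetical output location; adjust to wherever Drill can read it.
    path = os.path.join(tempfile.gettempdir(), "manycols_sketch.csv")
    write_wide_csv(path)
    row = read_first_row(path)
    print("fields in first row:", len(row))
    # Zero-based indexes 4094, 4095, 4096 are the 4095th to 4097th fields.
    print(row[4094], row[4095], row[4096])
```

Passing `num_rows=20000` should give a file shaped like the larger one in which the problem was first seen.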
> Drill stops reading columns > 4095
> ----------------------------------
>
> Key: DRILL-2739
> URL: https://issues.apache.org/jira/browse/DRILL-2739
> Project: Apache Drill
> Issue Type: Bug
> Components: Execution - Flow, Storage - Text & CSV
> Affects Versions: 0.7.0
> Environment: CentOS 6.4
> sqlline 1.1.6
> Drill 0.7
> Reporter: Nick Amato
> Assignee: Chris Westin
> Attachments: mcols.csv
>
>
> When querying a CSV file with 10000 columns, I can't read past column 4095. The last readable value is columns[4094], which is the 4095th column because the index is zero-based.
> 0: jdbc:drill:localhost:5181> SELECT columns[4094] FROM `dfs`.`tmp`.`manycols.csv` limit 1;
> +------------+
> | EXPR$0 |
> +------------+
> | 833 |
> +------------+
> 1 row selected (0.269 seconds)
> 0: jdbc:drill:localhost:5181> SELECT columns[4095] FROM `dfs`.`tmp`.`manycols.csv` limit 1;
> +--+
> | |
> +--+
> +--+
> No rows selected (0.198 seconds)
> 0: jdbc:drill:localhost:5181>
> [mapr@drilltwitter tmp]$ awk -F, '{ print $4095,$4096,$4097 }' manycols.csv | head -1
> 833 2552 2222
> [mapr@drilltwitter tmp]$
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)