You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Khurram Faraaz (JIRA)" <ji...@apache.org> on 2015/04/10 00:35:13 UTC

[jira] [Commented] (DRILL-2739) Drill stops reading columns > 4095

    [ https://issues.apache.org/jira/browse/DRILL-2739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14488416#comment-14488416 ] 

Khurram Faraaz commented on DRILL-2739:
---------------------------------------

On 0.9.0 I tried with a CSV file that had 5000 columns and querying for the columns[4096] did return correct results. I had integer data stored in each of the columns in the csv file.

{code}
0: jdbc:drill:> select columns[4096] from `manyColumns.csv`;
+------------+
|   EXPR$0   |
+------------+
| 4096       |
+------------+
1 row selected (0.13 seconds)

0: jdbc:drill:> select columns[4097] from `manyColumns.csv`;
+------------+
|   EXPR$0   |
+------------+
| 4097       |
+------------+
1 row selected (0.103 seconds)
{code}

> Drill stops reading columns > 4095
> ----------------------------------
>
>                 Key: DRILL-2739
>                 URL: https://issues.apache.org/jira/browse/DRILL-2739
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Execution - Flow, Storage - Text & CSV
>    Affects Versions: 0.7.0
>         Environment: CentOS 6.4
> sqlline 1.1.6
> Drill 0.7
>            Reporter: Nick Amato
>            Assignee: Chris Westin
>         Attachments: mcols.csv
>
>
> Querying a CSV file with 10000 columns, it looks like I can't query past column 4095. 
> 0: jdbc:drill:localhost:5181> SELECT columns[4094] FROM `dfs`.`tmp`.`manycols.csv` limit 1;
> +------------+
> |   EXPR$0   |
> +------------+
> | 833        |
> +------------+
> 1 row selected (0.269 seconds)
> 0: jdbc:drill:localhost:5181> SELECT columns[4095] FROM `dfs`.`tmp`.`manycols.csv` limit 1;
> +--+
> |  |
> +--+
> +--+
> No rows selected (0.198 seconds)
> 0: jdbc:drill:localhost:5181>
> [mapr@drilltwitter tmp]$ awk -F, '{ print $4095,$4096,$4097 }' manycols.csv | head -1
> 833 2552 2222
> [mapr@drilltwitter tmp]$



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)