Posted to issues@drill.apache.org by "Steven Phillips (JIRA)" <ji...@apache.org> on 2015/03/14 03:02:38 UTC

[jira] [Updated] (DRILL-2458) Extra hash column added when running CTAS with order by

     [ https://issues.apache.org/jira/browse/DRILL-2458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Steven Phillips updated DRILL-2458:
-----------------------------------
    Description: 
I created the table via the command:

{code:sql}
create table dfs.tmp.lineitem_sort as select * from dfs.`/drill/SF10/lineitem` order by l_extendedprice desc
{code}

This resulted in an extra column when reading the data back:

{code}
0: jdbc:drill:> select * from `lineitem_sort/0_0_0.parquet` limit 1;
+---------------------------+------------+--------------+------------+-----------------+--------------+--------------+------------+------------+------------+---------------+--------------+------------+----------------+------------+------------+------------+------------------+
| E_X_P_R_H_A_S_H_F_I_E_L_D | L_COMMENT  | L_COMMITDATE | L_DISCOUNT | L_EXTENDEDPRICE | L_LINENUMBER | L_LINESTATUS | L_ORDERKEY | L_PARTKEY  | L_QUANTITY | L_RECEIPTDATE | L_RETURNFLAG | L_SHIPDATE | L_SHIPINSTRUCT | L_SHIPMODE | L_SUPPKEY  |   L_TAX    | l_extendedprice0 |
+---------------------------+------------+--------------+------------+-----------------+--------------+--------------+------------+------------+------------+---------------+--------------+------------+----------------+------------+------------+------------+------------------+
| -1909175176               | [B@187a06b6 | [B@734ea347  | 0.02       | 104949.5        | 2            | [B@2fc1c575  | 16734176   | 199999     | 50.0       | [B@5a8a9cd1   | [B@423d8bc7  | [B@56a3d7ca | [B@1eac3b36    | [B@3d6365f5 | 50002      | 0.05       | 104949.5         |
+---------------------------+------------+--------------+------------+-----------------+--------------+--------------+------------+------------+------------+---------------+--------------+------------+----------------+------------+------------+------------+------------------+
{code}

  was:
I created the table via the command:

<code>
create table dfs.tmp.lineitem_sort as select * from dfs.`/drill/SF10/lineitem` order by l_extendedprice desc
</code>

This resulted in an extra column when reading the data back:

<code>
0: jdbc:drill:> select * from `lineitem_sort/0_0_0.parquet` limit 1;
+---------------------------+------------+--------------+------------+-----------------+--------------+--------------+------------+------------+------------+---------------+--------------+------------+----------------+------------+------------+------------+------------------+
| E_X_P_R_H_A_S_H_F_I_E_L_D | L_COMMENT  | L_COMMITDATE | L_DISCOUNT | L_EXTENDEDPRICE | L_LINENUMBER | L_LINESTATUS | L_ORDERKEY | L_PARTKEY  | L_QUANTITY | L_RECEIPTDATE | L_RETURNFLAG | L_SHIPDATE | L_SHIPINSTRUCT | L_SHIPMODE | L_SUPPKEY  |   L_TAX    | l_extendedprice0 |
+---------------------------+------------+--------------+------------+-----------------+--------------+--------------+------------+------------+------------+---------------+--------------+------------+----------------+------------+------------+------------+------------------+
| -1909175176               | [B@187a06b6 | [B@734ea347  | 0.02       | 104949.5        | 2            | [B@2fc1c575  | 16734176   | 199999     | 50.0       | [B@5a8a9cd1   | [B@423d8bc7  | [B@56a3d7ca | [B@1eac3b36    | [B@3d6365f5 | 50002      | 0.05       | 104949.5         |
+---------------------------+------------+--------------+------------+-----------------+--------------+--------------+------------+------------+------------+---------------+--------------+------------+----------------+------------+------------+------------+------------------+
</code>


> Extra hash column added when running CTAS with order by
> -------------------------------------------------------
>
>                 Key: DRILL-2458
>                 URL: https://issues.apache.org/jira/browse/DRILL-2458
>             Project: Apache Drill
>          Issue Type: Bug
>            Reporter: Steven Phillips


