Posted to issues@drill.apache.org by "Khurram Faraaz (JIRA)" <ji...@apache.org> on 2017/07/06 07:10:00 UTC

[jira] [Updated] (DRILL-5498) CSV text reader does not handle duplicate header names

     [ https://issues.apache.org/jira/browse/DRILL-5498?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Khurram Faraaz updated DRILL-5498:
----------------------------------
    Component/s: Storage - Text & CSV

> CSV text reader does not handle duplicate header names
> ------------------------------------------------------
>
>                 Key: DRILL-5498
>                 URL: https://issues.apache.org/jira/browse/DRILL-5498
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Storage - Text & CSV
>    Affects Versions: 1.8.0
>            Reporter: Paul Rogers
>            Assignee: Paul Rogers
>            Priority: Minor
>              Labels: ready-to-commit
>             Fix For: 1.11.0
>
>
> Consider the following CSV file:
> {code}
> h,h,h
> a,b,c
> d,e,f
> {code}
> Parse this with the CSV storage plugin configured to extract column names from the header row. The result:
> {code}
> 2 row(s):
> h
> c
> f
> {code}
> Expected a runtime error for the duplicate column names, or automatic "uniqification" of the names. Certainly did not expect the first two columns to be dropped.
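
For illustration only, here is a minimal Python sketch of the "uniqification" behavior the report asks for. It is not Drill's implementation or the fix that was committed. The function name `uniquify_headers`, the `_2`, `_3` suffix scheme, and the case-insensitive comparison are all assumptions made for this example.

```python
import csv
import io


def uniquify_headers(headers):
    """Return a new list in which repeated names get a numeric suffix.

    Hypothetical helper: the suffix format and the case-insensitive
    comparison are assumptions for this sketch, not Drill behavior.
    """
    seen = set()
    result = []
    for name in headers:
        candidate = name
        suffix = 1
        # Keep bumping the suffix until the candidate name is unused.
        while candidate.lower() in seen:
            suffix += 1
            candidate = f"{name}_{suffix}"
        seen.add(candidate.lower())
        result.append(candidate)
    return result


# The sample file from the report.
sample = "h,h,h\na,b,c\nd,e,f\n"
rows = list(csv.reader(io.StringIO(sample)))

# Keying each row by the raw header names keeps only the last duplicate,
# which resembles the symptom described in the report.
naive_records = [dict(zip(rows[0], row)) for row in rows[1:]]

# Keying by uniquified names keeps all three columns.
names = uniquify_headers(rows[0])
records = [dict(zip(names, row)) for row in rows[1:]]

print(names)
print(naive_records)
print(records)
```

With unique names, every value in the data rows stays addressable. The alternative the report mentions, failing with a runtime error on the first duplicate, would amount to raising an exception in place of the suffix loop.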



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)