You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@drill.apache.org by "Khurram Faraaz (JIRA)" <ji...@apache.org> on 2016/01/07 10:29:40 UTC

[jira] [Commented] (DRILL-951) CSV header row should be parsed

    [ https://issues.apache.org/jira/browse/DRILL-951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15087110#comment-15087110 ] 

Khurram Faraaz commented on DRILL-951:
--------------------------------------

Verified on Drill 1.4 git commit ID : 32b85160

Added skipFirstLine to DFS storage plugin and set it to true.

1) skipFirstLine set to true, header is not displayed as part of results returned by SELECT query.
2) skipFirstLine set to false, header is displayed as part of results returned by SELECT query.

{noformat}
"formats": {
...
"csv": {
      "type": "text",
      "extensions": [
        "csv"
      ],
      "skipFirstLine": true,
      "delimiter": ","
    },
...
{noformat}

> CSV header row should be parsed
> -------------------------------
>
>                 Key: DRILL-951
>                 URL: https://issues.apache.org/jira/browse/DRILL-951
>             Project: Apache Drill
>          Issue Type: New Feature
>          Components: Storage - Text & CSV
>            Reporter: Tomer Shiran
>            Assignee: Abhijit Pol
>             Fix For: 1.3.0
>
>
> CSV reader is currently treating header names like regular rows. There should be a way to treat the header row as the column names (optional?).
> I exported this dataset to a CSV: https://data.sfgov.org/Public-Safety/SFPD-Incidents-Previous-Three-Months/tmnf-yvry



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)