Posted to issues@drill.apache.org by "Roger Dielrton (JIRA)" <ji...@apache.org> on 2016/05/09 16:20:12 UTC

[jira] [Updated] (DRILL-4659) Specify, as part of the query, table information: data format (CSV, parquet, JSON, etc.), field delimiter, etc.

     [ https://issues.apache.org/jira/browse/DRILL-4659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Roger Dielrton updated DRILL-4659:
----------------------------------
    Description: 
I have a file that I would like to use in a query, and it can have one or more of the following properties:
* It has no extension ==> Drill is unable to handle it.
* I know it contains data in CSV format, but the field separator is a non-standard character ==> Drill is unable to parse it (without modifying the storage plugin configuration).
* It is located in an Amazon S3 bucket ==> I can't rename it.
* It is large ==> It would be expensive to make a copy of it.

It would be nice to be able to specify, as part of the "select" query, relevant table information as metadata, such as:
* Data format (CSV, Parquet, JSON, etc.)
* Field delimiter.

  was:
I have a file that I would like to use in a query, and it can have one or more of the
following properties:
* It has no extension ==> Drill is unable to handle it.
* I know it contains data in CSV format, but with a non-standard character as field separator ==>
Drill is unable to parse it (without modifying the storage plugin configuration).
* It is located in an Amazon S3 bucket ==> I can rename it.
* It is large ==> It would be expensive to make a copy of it.

It would be nice to be able to specify, as part of the "select" query, relevant table
information as metadata, such as:
* Data format (CSV, Parquet, JSON, etc.)
* Field delimiter.


> Specify, as part of the query, table information: data format (CSV, parquet, JSON, etc.), field delimiter, etc.
> ---------------------------------------------------------------------------------------------------------------
>
>                 Key: DRILL-4659
>                 URL: https://issues.apache.org/jira/browse/DRILL-4659
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Query Planning & Optimization, SQL Parser
>            Reporter: Roger Dielrton
>            Priority: Minor
>
> I have a file that I would like to use in a query, and it can have one or more of the following properties:
> * It has no extension ==> Drill is unable to handle it.
> * I know it contains data in CSV format, but the field separator is a non-standard character ==> Drill is unable to parse it (without modifying the storage plugin configuration).
> * It is located in an Amazon S3 bucket ==> I can't rename it.
> * It is large ==> It would be expensive to make a copy of it.
>
> It would be nice to be able to specify, as part of the "select" query, relevant table information as metadata, such as:
> * Data format (CSV, Parquet, JSON, etc.)
> * Field delimiter.
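
As a rough illustration of the idea in the request above, the following Python sketch parses a file using a data format and field delimiter supplied by the caller, instead of inferring them from the file extension. It is not Drill code or Drill syntax: the function name `read_table` and its parameters `data_format` and `field_delimiter` are hypothetical, chosen only to mirror the two pieces of metadata listed in the issue.

```python
import csv
import io
import json


def read_table(stream, data_format, field_delimiter=","):
    """Parse rows using caller-supplied metadata (hypothetical helper).

    The format and delimiter come from the caller, so the file name
    and its extension are never consulted.
    """
    if data_format == "csv":
        # csv.reader accepts any single-character delimiter.
        return [row for row in csv.reader(stream, delimiter=field_delimiter)]
    if data_format == "json":
        # Assumes one JSON object per line.
        return [json.loads(line) for line in stream if line.strip()]
    raise ValueError(f"unsupported data format: {data_format}")


# An extension-less "file" whose fields are separated by '|'.
sample = io.StringIO("1|alice\n2|bob\n")
rows = read_table(sample, data_format="csv", field_delimiter="|")
print(rows)
```

Because the metadata travels with the call, the file can stay where it is (for example, in an S3 bucket) without being renamed or copied.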


