You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@drill.apache.org by "Roger Dielrton (JIRA)" <ji...@apache.org> on 2016/05/09 16:17:12 UTC

[jira] [Created] (DRILL-4659) Specify, as part of the query, table information: data format (CSV, parquet, JSON. etc.), field delimiter, etc.

Roger Dielrton created DRILL-4659:
-------------------------------------

             Summary: Specify, as part of the query, table information: data format (CSV, parquet, JSON. etc.), field delimiter, etc.
                 Key: DRILL-4659
                 URL: https://issues.apache.org/jira/browse/DRILL-4659
             Project: Apache Drill
          Issue Type: Improvement
          Components: Query Planning & Optimization, SQL Parser
            Reporter: Roger Dielrton
            Priority: Minor


I have a file, that I would like to use in a query, and it can have one or more of the
following properties:
* Has not extension ==> Drill is unable to handle it.
* I know it contains data in CSV format, but with an non standard character as field separator ==>
Drill is unable to parse it (without modify the storage plugin configuration).
* Is located in an Amazon S3 bucket ==> I can rename it.
* Has a big size ==> It would be expensive to make a copy of it. 

It would be nice if you can specify, as part of the "select" query, as metadata, relevant table
information as:
* Data format (CSV, parquet, JSON. etc.)
* Field delimiter.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)