Posted to issues@drill.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2019/10/21 14:32:00 UTC

[jira] [Commented] (DRILL-5674) Drill should support .zip compression

    [ https://issues.apache.org/jira/browse/DRILL-5674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16956157#comment-16956157 ] 

ASF GitHub Bot commented on DRILL-5674:
---------------------------------------

arina-ielchiieva commented on pull request #1879: DRILL-5674: Support ZIP compression
URL: https://github.com/apache/drill/pull/1879
 
 
   1. Added a ZipCodec implementation that can read and write a single file.
   2. Revisited Drill plugin formats to ensure that the 'openPossiblyCompressedStream' method is used in those that support compression.
   3. Added unit tests.
   4. General refactoring.
   
   Jira - [DRILL-5674](https://issues.apache.org/jira/browse/DRILL-5674).
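
To illustrate what "read / write single file" in item 1 can mean, here is a minimal sketch using only the JDK's `java.util.zip` classes. It is not the ZipCodec code from this pull request: the class name `SingleEntryZipSketch` and both helper methods are hypothetical, and the sketch assumes the archive holds exactly one entry of interest (the first one).

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical illustration only; not the ZipCodec added by the pull request.
public class SingleEntryZipSketch {

  // Writes `content` as the only entry of a new in-memory ZIP archive.
  static byte[] zipSingleEntry(String entryName, byte[] content) throws IOException {
    ByteArrayOutputStream bytes = new ByteArrayOutputStream();
    try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
      zip.putNextEntry(new ZipEntry(entryName));
      zip.write(content);
      zip.closeEntry();
    }
    return bytes.toByteArray();
  }

  // Positions the stream at the first entry and returns its uncompressed bytes.
  // Any further entries in the archive are ignored (assumption of this sketch).
  static byte[] unzipFirstEntry(InputStream in) throws IOException {
    try (ZipInputStream zip = new ZipInputStream(in)) {
      if (zip.getNextEntry() == null) {
        throw new IOException("ZIP archive contains no entries");
      }
      ByteArrayOutputStream out = new ByteArrayOutputStream();
      byte[] buffer = new byte[8192];
      int n;
      // read() returns -1 at the end of the current entry.
      while ((n = zip.read(buffer)) != -1) {
        out.write(buffer, 0, n);
      }
      return out.toByteArray();
    }
  }

  public static void main(String[] args) throws IOException {
    byte[] csv = "id,name\n1,drill\n".getBytes(StandardCharsets.UTF_8);
    byte[] zipped = zipSingleEntry("data.csv", csv);
    byte[] restored = unzipFirstEntry(new ByteArrayInputStream(zipped));
    System.out.print(new String(restored, StandardCharsets.UTF_8));
  }
}
```

Restricting the sketch to the first entry keeps the stream-in, stream-out shape that a text reader expects; an archive with several entries would need a policy that this sketch does not attempt to define.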
 


> Drill should support .zip compression
> -------------------------------------
>
>                 Key: DRILL-5674
>                 URL: https://issues.apache.org/jira/browse/DRILL-5674
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Storage - Text & CSV
>    Affects Versions: 1.10.0
>            Reporter: Paul Rogers
>            Assignee: Arina Ielchiieva
>            Priority: Major
>              Labels: doc-impacting
>             Fix For: 1.17.0
>
>
> Zip is a very common compression format. Create a compressed CSV file with column headers: data.csv.zip.
> Define a storage plugin config for the file, call it "dfs.myws", set delimiter = ",", extract header = true, skip header = false.
> Run a simple query:
> SELECT * FROM dfs.myws.`data.csv.zip`
> The result is garbage because the CSV reader tries to parse the zipped data as if it were plain text.
> DRILL-5506 asks how to do this; the responder said to add a library to the path. It would be better to simply support zip out of the box as a default format.


