Posted to issues@drill.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2021/01/04 00:22:00 UTC

[jira] [Commented] (DRILL-7834) Add Utility Functions for Compressed Files

    [ https://issues.apache.org/jira/browse/DRILL-7834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17257881#comment-17257881 ] 

ASF GitHub Bot commented on DRILL-7834:
---------------------------------------

cgivre opened a new pull request #2133:
URL: https://github.com/apache/drill/pull/2133


   # [DRILL-7834](https://issues.apache.org/jira/browse/DRILL-7834): Add Utility Functions for Compressed Files
   
   ## Description
   
   Some format plugins that use third-party parsers throw errors when they receive compressed input streams from Drill. This PR introduces three utility methods in `DrillFileSystem`:
   * `isCompressed(<path>)`: Returns `true` if the input file is compressed, `false` otherwise.
   * `getCodec(<path>)`: Returns the codec of the file, if any.
   * `openDecompressedInputStream(<path>)`: Returns an `InputStream` that should be readable by parsers that read raw bytes. This method first reads the original `InputStream` into a `byte[]`, then returns it wrapped in a `ByteArrayInputStream`.
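
To illustrate the behavior described in the list above, here is a minimal standalone sketch that uses only the JDK. It is not the code in this PR: the class name `CompressedFileSketch` is hypothetical, the codec is assumed to be inferred from the file extension, and only gzip is recognized, whereas the actual methods live in `DrillFileSystem`.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Locale;
import java.util.Optional;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical stand-in for the three proposed helpers; not the DrillFileSystem code.
final class CompressedFileSketch {

  // Assumption: the codec is inferred from the file extension, and only gzip is known here.
  static Optional<String> getCodec(Path path) {
    String name = path.getFileName().toString().toLowerCase(Locale.ROOT);
    return name.endsWith(".gz") ? Optional.of("gzip") : Optional.empty();
  }

  // A file counts as compressed when a codec can be resolved for it.
  static boolean isCompressed(Path path) {
    return getCodec(path).isPresent();
  }

  // Reads the whole file (decompressing it if needed) into a byte[] and
  // returns it wrapped in a ByteArrayInputStream, as the description outlines.
  static InputStream openDecompressedInputStream(Path path) throws IOException {
    try (InputStream raw = Files.newInputStream(path);
         InputStream in = isCompressed(path) ? new GZIPInputStream(raw) : raw) {
      return new ByteArrayInputStream(in.readAllBytes());
    }
  }

  public static void main(String[] args) throws IOException {
    // Write a small gzip-compressed CSV file, then read it back as plain bytes.
    Path file = Files.createTempFile("sketch", ".csv.gz");
    try (OutputStream out = new GZIPOutputStream(Files.newOutputStream(file))) {
      out.write("a,b\n1,2\n".getBytes(StandardCharsets.UTF_8));
    }
    try (InputStream in = openDecompressedInputStream(file)) {
      System.out.print(new String(in.readAllBytes(), StandardCharsets.UTF_8));
    } finally {
      Files.delete(file);
    }
  }
}
```

Because the stream is fully buffered before it is returned, a parser that expects raw bytes never sees the compressed form. The trade-off is that the entire decompressed file is held in memory at once, which may matter for large inputs.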
   
   ## Documentation
   No user-facing changes.
   
   ## Testing
   Tested manually.




> Add Utility Functions for Compressed Files
> ------------------------------------------
>
>                 Key: DRILL-7834
>                 URL: https://issues.apache.org/jira/browse/DRILL-7834
>             Project: Apache Drill
>          Issue Type: Improvement
>          Components: Storage - Text & CSV
>    Affects Versions: 1.17.0
>            Reporter: Charles Givre
>            Assignee: Charles Givre
>            Priority: Minor
>             Fix For: 1.19.0
>
>
> Some format plugins that use third-party parsers throw errors when they receive compressed input streams from Drill. This PR proposes to introduce three utility functions to the DrillFileSystem:
> # isCompressed(<path>): Returns true if the input file is compressed, false otherwise.
> # getCodec(<path>): Returns the codec of the file, if any.
> # openDecompressedInputStream(<path>): Returns an InputStream that should be readable by parsers that read raw bytes. This method first reads the original InputStream into a byte[], then returns it wrapped in a ByteArrayInputStream.


