Posted to dev@crunch.apache.org by "Jim McStanton (JIRA)" <ji...@apache.org> on 2017/01/09 20:23:58 UTC

[jira] [Created] (CRUNCH-632) Add compression support for CSVFileSource

Jim McStanton created CRUNCH-632:
------------------------------------

             Summary: Add compression support for CSVFileSource
                 Key: CRUNCH-632
                 URL: https://issues.apache.org/jira/browse/CRUNCH-632
             Project: Crunch
          Issue Type: Improvement
            Reporter: Jim McStanton
            Priority: Minor


Currently, CSVFileSource cannot read compressed input. Its record reader simply opens the file and starts reading the raw contents, with no decompression step: https://github.com/apache/crunch/blob/6280983179e9c690af69c2bf0e296b054122d724/crunch-core/src/main/java/org/apache/crunch/io/text/csv/CSVRecordReader.java#L127.

Adding this support would bring CSVFileSource closer in behavior to TextFileSource. Hadoop's {{LineRecordReader}} already supports this behavior [here|http://grepcode.com/file/repo1.maven.org/maven2/org.apache.hadoop/hadoop-mapreduce-client-core/2.7.1/org/apache/hadoop/mapreduce/lib/input/LineRecordReader.java?av=f#87].
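
To illustrate the requested behavior, here is a minimal sketch that uses only the JDK, not the Hadoop or Crunch APIs. It picks a decompressor from the file extension and wraps the raw stream before any lines are read, which is roughly what {{LineRecordReader}} does by looking up a codec through Hadoop's {{CompressionCodecFactory}}. The class and method names below ({{CompressedCsvOpenSketch}}, {{openMaybeCompressed}}, {{readFirstLine}}, {{gzip}}) are hypothetical, and gzip stands in for whatever codecs an actual patch would support.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

public class CompressedCsvOpenSketch {

    // Hypothetical helper: choose a decompressor from the file extension.
    // Assumption: only ".gz" is recognized here; anything else is read as-is.
    static InputStream openMaybeCompressed(String fileName, InputStream raw) throws IOException {
        if (fileName.endsWith(".gz")) {
            return new GZIPInputStream(raw);
        }
        return raw;
    }

    // Reads the first line of the (possibly compressed) contents as UTF-8 text.
    static String readFirstLine(String fileName, byte[] contents) {
        try (BufferedReader reader = new BufferedReader(new InputStreamReader(
                openMaybeCompressed(fileName, new ByteArrayInputStream(contents)),
                StandardCharsets.UTF_8))) {
            return reader.readLine();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Test utility: gzip a string in memory so the example needs no files on disk.
    static byte[] gzip(String text) {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (GZIPOutputStream out = new GZIPOutputStream(buffer)) {
            out.write(text.getBytes(StandardCharsets.UTF_8));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return buffer.toByteArray();
    }

    public static void main(String[] args) {
        String csv = "id,name\n1,crunch\n";
        // The same reading code handles both the plain and the gzipped input.
        System.out.println(readFirstLine("data.csv", csv.getBytes(StandardCharsets.UTF_8)));
        System.out.println(readFirstLine("data.csv.gz", gzip(csv)));
    }
}
```

The point of the sketch is that the decompression decision sits at the place where the stream is opened, so the record-parsing logic that follows does not need to change.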

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)