You are viewing a plain text version of this content. The canonical link for it is here.
Posted to mapreduce-dev@hadoop.apache.org by "Greg Roelofs (JIRA)" <ji...@apache.org> on 2010/06/10 23:59:14 UTC

[jira] Resolved: (MAPREDUCE-1795) add error option if file-based record-readers fail to consume all input (e.g., concatenated gzip, bzip2)

     [ https://issues.apache.org/jira/browse/MAPREDUCE-1795?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Greg Roelofs resolved MAPREDUCE-1795.
-------------------------------------

    Resolution: Won't Fix

Per previous comment, we're going to fix the underlying issue instead (i.e., make decompressors support concatenated streams).  See MAPREDUCE-469.

> add error option if file-based record-readers fail to consume all input (e.g., concatenated gzip, bzip2)
> --------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1795
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1795
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>            Reporter: Greg Roelofs
>            Assignee: Greg Roelofs
>
> When running MapReduce with concatenated gzip files as input, only the first part ("member" in gzip spec parlance, http://www.ietf.org/rfc/rfc1952.txt) is read; the remainder is silently ignored.  As a first step toward fixing that, this issue will add a configurable option to throw an error in such cases.
> MAPREDUCE-469 is the tracker for the more complete fix/feature, whenever that occurs.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.