You are viewing a plain text version of this content. The canonical link for it is here.
Posted to common-dev@hadoop.apache.org by "Milind Bhandarkar (JIRA)" <ji...@apache.org> on 2007/03/09 01:25:24 UTC

[jira] Created: (HADOOP-1096) Rename InputArchive and OutputArchive and make them public

Rename InputArchive and OutputArchive and make them public
----------------------------------------------------------

                 Key: HADOOP-1096
                 URL: https://issues.apache.org/jira/browse/HADOOP-1096
             Project: Hadoop
          Issue Type: Improvement
          Components: record
    Affects Versions: 0.12.0
         Environment: All
            Reporter: Milind Bhandarkar
         Assigned To: Milind Bhandarkar
             Fix For: 0.13.0


Currently hadoop.record.RecordReader and RecordWriter act as factories for various InputArchive and OutputArchive recently. In the original design, this was done in order to have tight control over various serialization formats. This has proven to be counterproductive. For wider usage of record I/O one should be able to use their own serialization formats. The proposed changes make it possible. They are as follows: 

1. Eliminate current record.RecordReader and record.RecordWriter. 

2. rename InputArchive as RecordInput, and OutputArchive as RecordOutput. 

3. rename various archives accordingly. e..g. BinaryInputArchive -> BinaryRecordInput etc.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1096) Rename InputArchive and OutputArchive and make them public

Posted by "Milind Bhandarkar (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Milind Bhandarkar updated HADOOP-1096:
--------------------------------------

    Attachment:     (was: jute-patch.txt)

> Rename InputArchive and OutputArchive and make them public
> ----------------------------------------------------------
>
>                 Key: HADOOP-1096
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1096
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: record
>    Affects Versions: 0.12.0
>         Environment: All
>            Reporter: Milind Bhandarkar
>         Assigned To: Milind Bhandarkar
>             Fix For: 0.12.1
>
>
> Currently hadoop.record.RecordReader and RecordWriter act as factories for various InputArchive and OutputArchive recently. In the original design, this was done in order to have tight control over various serialization formats. This has proven to be counterproductive. For wider usage of record I/O one should be able to use their own serialization formats. The proposed changes make it possible. They are as follows: 
> 1. Eliminate current record.RecordReader and record.RecordWriter. 
> 2. rename InputArchive as RecordInput, and OutputArchive as RecordOutput. 
> 3. rename various archives accordingly. e..g. BinaryInputArchive -> BinaryRecordInput etc.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1096) Rename InputArchive and OutputArchive and make them public

Posted by "Milind Bhandarkar (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Milind Bhandarkar updated HADOOP-1096:
--------------------------------------

    Fix Version/s:     (was: 0.13.0)
                   0.12.1

This patch also contains a bug fix for XML serialization of strings (HADOOP-1097). Thats why I hope it gets into 0.12.1.


> Rename InputArchive and OutputArchive and make them public
> ----------------------------------------------------------
>
>                 Key: HADOOP-1096
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1096
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: record
>    Affects Versions: 0.12.0
>         Environment: All
>            Reporter: Milind Bhandarkar
>         Assigned To: Milind Bhandarkar
>             Fix For: 0.12.1
>
>         Attachments: jute-patch.txt
>
>
> Currently hadoop.record.RecordReader and RecordWriter act as factories for various InputArchive and OutputArchive recently. In the original design, this was done in order to have tight control over various serialization formats. This has proven to be counterproductive. For wider usage of record I/O one should be able to use their own serialization formats. The proposed changes make it possible. They are as follows: 
> 1. Eliminate current record.RecordReader and record.RecordWriter. 
> 2. rename InputArchive as RecordInput, and OutputArchive as RecordOutput. 
> 3. rename various archives accordingly. e..g. BinaryInputArchive -> BinaryRecordInput etc.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1096) Rename InputArchive and OutputArchive and make them public

Posted by "Tom White (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tom White updated HADOOP-1096:
------------------------------

    Resolution: Fixed
        Status: Resolved  (was: Patch Available)

I've just committed this. Thanks Milind!

> Rename InputArchive and OutputArchive and make them public
> ----------------------------------------------------------
>
>                 Key: HADOOP-1096
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1096
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: record
>    Affects Versions: 0.12.0
>         Environment: All
>            Reporter: Milind Bhandarkar
>         Assigned To: Milind Bhandarkar
>             Fix For: 0.12.1
>
>         Attachments: jute-patch.txt
>
>
> Currently hadoop.record.RecordReader and RecordWriter act as factories for various InputArchive and OutputArchive recently. In the original design, this was done in order to have tight control over various serialization formats. This has proven to be counterproductive. For wider usage of record I/O one should be able to use their own serialization formats. The proposed changes make it possible. They are as follows: 
> 1. Eliminate current record.RecordReader and record.RecordWriter. 
> 2. rename InputArchive as RecordInput, and OutputArchive as RecordOutput. 
> 3. rename various archives accordingly. e..g. BinaryInputArchive -> BinaryRecordInput etc.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1096) Rename InputArchive and OutputArchive and make them public

Posted by "Milind Bhandarkar (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Milind Bhandarkar updated HADOOP-1096:
--------------------------------------

    Attachment: jute-patch.txt

> Rename InputArchive and OutputArchive and make them public
> ----------------------------------------------------------
>
>                 Key: HADOOP-1096
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1096
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: record
>    Affects Versions: 0.12.0
>         Environment: All
>            Reporter: Milind Bhandarkar
>         Assigned To: Milind Bhandarkar
>             Fix For: 0.13.0
>
>         Attachments: jute-patch.txt
>
>
> Currently hadoop.record.RecordReader and RecordWriter act as factories for various InputArchive and OutputArchive recently. In the original design, this was done in order to have tight control over various serialization formats. This has proven to be counterproductive. For wider usage of record I/O one should be able to use their own serialization formats. The proposed changes make it possible. They are as follows: 
> 1. Eliminate current record.RecordReader and record.RecordWriter. 
> 2. rename InputArchive as RecordInput, and OutputArchive as RecordOutput. 
> 3. rename various archives accordingly. e..g. BinaryInputArchive -> BinaryRecordInput etc.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1096) Rename InputArchive and OutputArchive and make them public

Posted by "Milind Bhandarkar (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Milind Bhandarkar updated HADOOP-1096:
--------------------------------------

    Attachment: jute-patch.txt

This patch applies cleanly with -E -p0.

> Rename InputArchive and OutputArchive and make them public
> ----------------------------------------------------------
>
>                 Key: HADOOP-1096
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1096
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: record
>    Affects Versions: 0.12.0
>         Environment: All
>            Reporter: Milind Bhandarkar
>         Assigned To: Milind Bhandarkar
>             Fix For: 0.12.1
>
>         Attachments: jute-patch.txt
>
>
> Currently hadoop.record.RecordReader and RecordWriter act as factories for various InputArchive and OutputArchive recently. In the original design, this was done in order to have tight control over various serialization formats. This has proven to be counterproductive. For wider usage of record I/O one should be able to use their own serialization formats. The proposed changes make it possible. They are as follows: 
> 1. Eliminate current record.RecordReader and record.RecordWriter. 
> 2. rename InputArchive as RecordInput, and OutputArchive as RecordOutput. 
> 3. rename various archives accordingly. e..g. BinaryInputArchive -> BinaryRecordInput etc.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (HADOOP-1096) Rename InputArchive and OutputArchive and make them public

Posted by "Milind Bhandarkar (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Milind Bhandarkar updated HADOOP-1096:
--------------------------------------

    Status: Patch Available  (was: In Progress)

> Rename InputArchive and OutputArchive and make them public
> ----------------------------------------------------------
>
>                 Key: HADOOP-1096
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1096
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: record
>    Affects Versions: 0.12.0
>         Environment: All
>            Reporter: Milind Bhandarkar
>         Assigned To: Milind Bhandarkar
>             Fix For: 0.13.0
>
>         Attachments: jute-patch.txt
>
>
> Currently hadoop.record.RecordReader and RecordWriter act as factories for various InputArchive and OutputArchive recently. In the original design, this was done in order to have tight control over various serialization formats. This has proven to be counterproductive. For wider usage of record I/O one should be able to use their own serialization formats. The proposed changes make it possible. They are as follows: 
> 1. Eliminate current record.RecordReader and record.RecordWriter. 
> 2. rename InputArchive as RecordInput, and OutputArchive as RecordOutput. 
> 3. rename various archives accordingly. e..g. BinaryInputArchive -> BinaryRecordInput etc.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Work started: (HADOOP-1096) Rename InputArchive and OutputArchive and make them public

Posted by "Milind Bhandarkar (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HADOOP-1096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on HADOOP-1096 started by Milind Bhandarkar.

> Rename InputArchive and OutputArchive and make them public
> ----------------------------------------------------------
>
>                 Key: HADOOP-1096
>                 URL: https://issues.apache.org/jira/browse/HADOOP-1096
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: record
>    Affects Versions: 0.12.0
>         Environment: All
>            Reporter: Milind Bhandarkar
>         Assigned To: Milind Bhandarkar
>             Fix For: 0.13.0
>
>
> Currently hadoop.record.RecordReader and RecordWriter act as factories for various InputArchive and OutputArchive recently. In the original design, this was done in order to have tight control over various serialization formats. This has proven to be counterproductive. For wider usage of record I/O one should be able to use their own serialization formats. The proposed changes make it possible. They are as follows: 
> 1. Eliminate current record.RecordReader and record.RecordWriter. 
> 2. rename InputArchive as RecordInput, and OutputArchive as RecordOutput. 
> 3. rename various archives accordingly. e..g. BinaryInputArchive -> BinaryRecordInput etc.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.