Posted to common-dev@hadoop.apache.org by "brian (JIRA)" <ji...@apache.org> on 2009/04/16 19:09:15 UTC

[jira] Issue Comment Edited: (HADOOP-398) refactor the mapred package into small pieces

    [ https://issues.apache.org/jira/browse/HADOOP-398?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12699765#action_12699765 ] 

brian edited comment on HADOOP-398 at 4/16/09 10:09 AM:
--------------------------------------------------------

In addition to the splits Owen proposed above, I would also suggest the following package restructuring:

package org.apache.hadoop.mapred.sequence;

SequenceFileAsBinaryInputFormat
SequenceFileAsBinaryOutputFormat
SequenceFileAsTextInputFormat
SequenceFileAsTextRecordReader
SequenceFileInputFilter
SequenceFileInputFormat
SequenceFileOutputFormat
SequenceFileRecordReader


package org.apache.hadoop.mapred.file;

FileAlreadyExistsException
FileInputFormat
FileOutputCommitter
FileOutputFormat
FileSplit
IFile
IFileInputStream
IFileOutputStream
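
To make the grouping above concrete, here is a rough Java sketch of the naming rule it implies. Everything in it is an assumption for illustration only: the helper class ProposedMapredPackages and its methods are hypothetical, and the .sequence and .file packages are only proposed in this comment, they do not exist in the codebase. Classes not covered by either list are simply left in the base package in this sketch.

```java
// Hypothetical helper, for illustration only: maps a class's simple name
// to the package suggested in this comment. None of these sub-packages
// exist yet; they are proposals.
final class ProposedMapredPackages {
    static final String BASE = "org.apache.hadoop.mapred";

    // SequenceFile* classes go to .sequence; File* and IFile* classes go
    // to .file; everything else stays in the base package (an assumption).
    static String proposedPackageFor(String simpleName) {
        if (simpleName.startsWith("SequenceFile")) {
            return BASE + ".sequence";
        }
        if (simpleName.startsWith("File") || simpleName.startsWith("IFile")) {
            return BASE + ".file";
        }
        return BASE;
    }

    // Fully qualified name a client would import under the proposal.
    static String proposedQualifiedName(String simpleName) {
        return proposedPackageFor(simpleName) + "." + simpleName;
    }

    public static void main(String[] args) {
        String[] names = {"SequenceFileInputFormat", "FileSplit", "IFile", "JobConf"};
        for (String name : names) {
            System.out.println(name + " -> " + proposedQualifiedName(name));
        }
    }
}
```

Under this rule a client would import, for example, org.apache.hadoop.mapred.file.FileSplit instead of org.apache.hadoop.mapred.FileSplit, so existing user code would need its imports updated.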


      was (Author: brianmackay):
    I would also suggest the following package restructuring:

package org.apache.hadoop.mapred.sequence;

SequenceFileAsBinaryInputFormat
SequenceFileAsBinaryOutputFormat
SequenceFileAsTextInputFormat
SequenceFileAsTextRecordReader
SequenceFileInputFilter
SequenceFileInputFormat
SequenceFileOutputFormat
SequenceFileRecordReader


package org.apache.hadoop.mapred.file;

FileAlreadyExistsException
FileInputFormat
FileOutputCommitter
FileOutputFormat
FileSplit
IFile
IFileInputStream
IFileOutputStream

  
> refactor the mapred package into small pieces
> ---------------------------------------------
>
>                 Key: HADOOP-398
>                 URL: https://issues.apache.org/jira/browse/HADOOP-398
>             Project: Hadoop Core
>          Issue Type: Improvement
>          Components: mapred
>    Affects Versions: 0.4.0
>            Reporter: Owen O'Malley
>            Assignee: Owen O'Malley
>
> The mapred package has gotten too big, so I propose splitting it into parts.
> I propose the following splits:
> org.apache.hadoop.mapred = client API
> org.apache.hadoop.mapred.task = code for task tracker
> org.apache.hadoop.mapred.job = code for job tracker
> org.apache.hadoop.mapred.utils = non-public code that is shared between the servers
> Does anyone have any other divisions that would help?
> I would make the classes sent through RPC public classes in the server's package.
> Thoughts?
