Posted to common-dev@hadoop.apache.org by "Owen O'Malley (JIRA)" <ji...@apache.org> on 2008/03/07 00:56:58 UTC

[jira] Commented: (HADOOP-2885) Restructure the hadoop.dfs package

    [ https://issues.apache.org/jira/browse/HADOOP-2885?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12575965#action_12575965 ] 

Owen O'Malley commented on HADOOP-2885:
---------------------------------------

I think that org.apache.hadoop.hdfs.* is better than org.apache.hadoop.fs.hdfs.*. However, I'm not adamant about it.

I do feel strongly about how this interacts with src directory splitting.

core:
   org.apache.hadoop.{io,conf,ipc,util,fs}

hdfs:
   org.apache.hadoop.hdfs (or fs.hdfs)

mapreduce:
   org.apache.hadoop.mapred

You can't put DistributedFileSystem and DFSClient in separate src directories without creating a cyclic dependency, and that is very bad. Therefore, I think they both need to be in the hdfs src tree. I think it is less confusing to have the src trees *not* overlap packages, so it would be better to have them in org.apache.hadoop.hdfs. I would even propose merging DFSClient and DistributedFileSystem into a single class...
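
To make the cyclic-dependency argument concrete, here is a minimal sketch in Java. It is an illustration, not a build check from the Hadoop source: the class SrcTreeDeps and its helpers are hypothetical, and it assumes that DistributedFileSystem and DFSClient each refer to the other, which is what the argument above implies. It lifts class-level references to src-tree-level dependencies and looks for a cycle with a depth-first search.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

public class SrcTreeDeps {

    /** Returns true if the graph (tree -> trees it depends on) contains a cycle. */
    static boolean hasCycle(Map<String, Set<String>> deps) {
        Set<String> done = new HashSet<>();
        Set<String> onPath = new HashSet<>();
        for (String node : deps.keySet()) {
            if (visit(node, deps, done, onPath)) {
                return true;
            }
        }
        return false;
    }

    // Depth-first search; revisiting a node that is still on the current path means a cycle.
    private static boolean visit(String node, Map<String, Set<String>> deps,
                                 Set<String> done, Set<String> onPath) {
        if (onPath.contains(node)) {
            return true;
        }
        if (done.contains(node)) {
            return false;
        }
        onPath.add(node);
        for (String next : deps.getOrDefault(node, Collections.<String>emptySet())) {
            if (visit(next, deps, done, onPath)) {
                return true;
            }
        }
        onPath.remove(node);
        done.add(node);
        return false;
    }

    /** Lifts class-level references to tree-level dependencies, given a class -> tree placement. */
    static Map<String, Set<String>> treeDeps(Map<String, String> placement,
                                             Map<String, Set<String>> classRefs) {
        Map<String, Set<String>> out = new HashMap<>();
        for (Map.Entry<String, Set<String>> e : classRefs.entrySet()) {
            String from = placement.get(e.getKey());
            for (String target : e.getValue()) {
                String to = placement.get(target);
                if (!from.equals(to)) {
                    out.computeIfAbsent(from, k -> new HashSet<>()).add(to);
                }
            }
        }
        return out;
    }

    /** ASSUMPTION for illustration: the two classes reference each other. */
    static Map<String, Set<String>> assumedClassRefs() {
        Map<String, Set<String>> refs = new HashMap<>();
        refs.put("DistributedFileSystem", new HashSet<>(Arrays.asList("DFSClient")));
        refs.put("DFSClient", new HashSet<>(Arrays.asList("DistributedFileSystem")));
        return refs;
    }

    /** Places the two classes in the named src trees. */
    static Map<String, String> placement(String dfsTree, String clientTree) {
        Map<String, String> p = new HashMap<>();
        p.put("DistributedFileSystem", dfsTree);
        p.put("DFSClient", clientTree);
        return p;
    }

    public static void main(String[] args) {
        // Split across trees: core -> hdfs and hdfs -> core, so the trees depend on each other.
        System.out.println(hasCycle(treeDeps(placement("core", "hdfs"), assumedClassRefs()))); // true
        // Both in the hdfs tree: no cross-tree edge between them remains.
        System.out.println(hasCycle(treeDeps(placement("hdfs", "hdfs"), assumedClassRefs()))); // false
    }
}
```

Under that assumption, any placement that separates the two classes makes the core and hdfs trees depend on each other, while keeping both in hdfs leaves only the one-way hdfs -> core dependency on the shared fs classes.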

The kfs and s3 file systems could stay in core because they are very thin wrappers over their respective native file systems.

> Restructure the hadoop.dfs package
> ----------------------------------
>
>                 Key: HADOOP-2885
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2885
>             Project: Hadoop Core
>          Issue Type: Sub-task
>          Components: dfs
>            Reporter: Sanjay Radia
>            Assignee: Sanjay Radia
>            Priority: Minor
>             Fix For: 0.17.0
>
>         Attachments: Prototype dfs package.png
>
>
> This Jira proposes restructuring the package hadoop.dfs.
> 1. Move all server-side and internal protocols (NN-DN etc.) to hadoop.dfs.server.*
> 2. Further breakdown of dfs.server.
> - dfs.server.namenode.*
> - dfs.server.datanode.*
> - dfs.server.balancer.*
> - dfs.server.common.* - stuff shared between the various servers
> - dfs.protocol.*  - internal protocol between DN, NN and Balancer etc.
> 3. Client interface:
> - hadoop.dfs.DistributedFileSystem.java
> - hadoop.dfs.ChecksumDistributedFileSystem.java
> - hadoop.dfs.HftpFileSystem.java
> - hadoop.dfs.protocol.* - the client side protocol
