You are viewing a plain text version of this content. The canonical link for it is here.

Posted to common-user@hadoop.apache.org by Yonik Seeley <ys...@gmail.com> on 2006/03/14 17:42:19 UTC

DFS vs GFS

So, I've just worked my way through the GFS whitepaper (mapreduce is
next), and was wondering what the major design/architecture
differences are compared with Hadoop's DFS.

The JavaDoc points out one: single-writer, stream only (no record
append, no writing to specific spot in file, etc).  Is that a
different design decision, or simply something that hasn't been
implemented yet?

Have you guys checked out the BigTable talk?
http://norfolk.cs.washington.edu/htbin-post/unrestricted/colloq/details.cgi?id=437

-Yonik

Re: DFS vs GFS

Posted by Doug Cutting <cu...@apache.org>.

Yonik Seeley wrote:
> The JavaDoc points out one: single-writer, stream only (no record
> append, no writing to specific spot in file, etc).  Is that a
> different design decision, or simply something that hasn't been
> implemented yet?

It's a simplification.  We may add appends & multiple writers later.

> Have you guys checked out the BigTable talk?
> http://norfolk.cs.washington.edu/htbin-post/unrestricted/colloq/details.cgi?id=437

Yeah, that's good stuff.  The compression in particular.

Doug