You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Erik Hatcher <er...@ehatchersolutions.com> on 2005/08/30 22:20:37 UTC

Re: [Nutch-dev] Re: Another NDFS question

What you've just described, Doug, is WebDAV!   There is an  
implementation of it built into Tomcat, but a more full-featured  
version is Slide - http://jakarta.apache.org/slide/ .

There is also JSR (#170) for a content repository, being implemented  
open-source as Jackrabbit: http://incubator.apache.org/projects/ 
jackrabbit.html

Apache's mod_dav is also well worth mentioning, as it is extensible  
and surely quite fast.

I'm not sure how well any of these that I've mentioned jive with the  
goals of NDFS.  I have done a fair bit of homework on WebDAV in the  
past, once even implementing a prototype server before Slide was viable.

     Erik



On Aug 30, 2005, at 12:08 PM, Doug Cutting wrote:

> Ian C. Blenke wrote:
>
>> When NDFS is exposed to userspace for scripts to use, admins types  
>> will embrace it for managing the cluster.
>>
>
> Our intent is to add some servlets which run on each datanode  
> providing access to the filesystem for non-Java programs.
>
> Most operations would be quite simple, e.g.:
>
> - to write a file, post its content to a url like:
>   http://datanode:XXXX/write?name=my.file
>
> - to read a file, get file content from urls like:
>   http://datanode:XXXX/read?name=my.file
>   http://datanode:XXXX/read?name=my.file&start=2048&length=1024
>
> - to remove a file:
>   http://datanode:XXX/remove?name=my.file
>
> Similarly for rename, copy, etc.
>
> The only somewhat complicated thing would be directory listings.   
> These would be handled with a simple REST interface, where some  
> simple XML is returned.  Ideally a stylesheet could be specified so  
> that one can use the directory listing url to view the filesystem  
> from a brower.
>
> These servlets could easily be implemented in terms of the  
> NutchFileSystem API, and deployed with Jetty.  To my knowledge, no  
> one is currently working on this.  A volunteer would be welcome.
>
> Doug
>
>
> -------------------------------------------------------
> SF.Net email is Sponsored by the Better Software Conference & EXPO
> September 19-22, 2005 * San Francisco, CA * Development Lifecycle  
> Practices
> Agile & Plan-Driven Development * Managing Projects & Teams *  
> Testing & QA
> Security * Process Improvement & Measurement * http://www.sqe.com/ 
> bsce5sf
> _______________________________________________
> Nutch-developers mailing list
> Nutch-developers@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nutch-developers
>