You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by Clemens Marschner <cm...@lanlab.de> on 2002/07/28 13:16:43 UTC

Book on crawlers (and almost all other features of current search engines)

 Soumen Chakrabarti will have a chapter on crawlers in his new book "Mining
 the Web: Discovering Knowledge from Hypertext Data" (Morgan Kauffman),
which
 will be the first one I know off about this topic.
 http://www.cse.iitb.ac.in/soumen/main/book-toc.ps

http://www.amazon.com/exec/obidos/ASIN/1558607544/qid%3D1023770182/sr%3D1-3/
 ref%3Dsr%5F1%5F3/002-7943686-4436045


 --Clemens




 --------------------------------------
 http://www.cmarschner.net



--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>


Re: Book on crawlers (and almost all other features of current search engines)

Posted by Peter Carlson <ca...@bookandhammer.com>.
There is another book out there called Programing Spiders, Bots, and
Aggregators in Java by Jeff Heaton.

It's published by Sybex.

It includes a CD with a complete crawling code that handles many issues,
although not all the ones the LARM project covers.

--Peter

On 7/28/02 4:16 AM, "Clemens Marschner" <cm...@lanlab.de> wrote:

> Soumen Chakrabarti will have a chapter on crawlers in his new book "Mining
> the Web: Discovering Knowledge from Hypertext Data" (Morgan Kauffman),
> which
> will be the first one I know off about this topic.
> http://www.cse.iitb.ac.in/soumen/main/book-toc.ps
> 
> http://www.amazon.com/exec/obidos/ASIN/1558607544/qid%3D1023770182/sr%3D1-3/
> ref%3Dsr%5F1%5F3/002-7943686-4436045
> 
> 
> --Clemens
> 
> 
> 
> 
> --------------------------------------
> http://www.cmarschner.net
> 
> 
> 
> --
> To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
> For additional commands, e-mail: <ma...@jakarta.apache.org>
> 
> 


--
To unsubscribe, e-mail:   <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>