You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by Clemens Marschner <cm...@lanlab.de> on 2002/07/28 13:16:43 UTC
Book on crawlers (and almost all other features of current search engines)
Soumen Chakrabarti will have a chapter on crawlers in his new book "Mining
the Web: Discovering Knowledge from Hypertext Data" (Morgan Kauffman),
which
will be the first one I know off about this topic.
http://www.cse.iitb.ac.in/soumen/main/book-toc.ps
http://www.amazon.com/exec/obidos/ASIN/1558607544/qid%3D1023770182/sr%3D1-3/
ref%3Dsr%5F1%5F3/002-7943686-4436045
--Clemens
--------------------------------------
http://www.cmarschner.net
--
To unsubscribe, e-mail: <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>
Re: Book on crawlers (and almost all other features of current
search engines)
Posted by Peter Carlson <ca...@bookandhammer.com>.
There is another book out there called Programing Spiders, Bots, and
Aggregators in Java by Jeff Heaton.
It's published by Sybex.
It includes a CD with a complete crawling code that handles many issues,
although not all the ones the LARM project covers.
--Peter
On 7/28/02 4:16 AM, "Clemens Marschner" <cm...@lanlab.de> wrote:
> Soumen Chakrabarti will have a chapter on crawlers in his new book "Mining
> the Web: Discovering Knowledge from Hypertext Data" (Morgan Kauffman),
> which
> will be the first one I know off about this topic.
> http://www.cse.iitb.ac.in/soumen/main/book-toc.ps
>
> http://www.amazon.com/exec/obidos/ASIN/1558607544/qid%3D1023770182/sr%3D1-3/
> ref%3Dsr%5F1%5F3/002-7943686-4436045
>
>
> --Clemens
>
>
>
>
> --------------------------------------
> http://www.cmarschner.net
>
>
>
> --
> To unsubscribe, e-mail: <ma...@jakarta.apache.org>
> For additional commands, e-mail: <ma...@jakarta.apache.org>
>
>
--
To unsubscribe, e-mail: <ma...@jakarta.apache.org>
For additional commands, e-mail: <ma...@jakarta.apache.org>