You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by James liu <li...@gmail.com> on 2007/05/01 14:36:35 UTC
Re: i wanna find one crawl that can crawl with defined urls and defined data
2007/4/30, Graeme Merrall <da...@gmail.com>:
>
> > i wanna crawl http://www.amazone.com/ and just wanna product title ,
> > product information, writer, publisher.
> >
> > and other data i wanna ignore.
>
> How about
> http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html
i read it before this mail.
for example,
>
> i wanna crawl http://www.amazone.com/ and just wanna product title ,
> product information, writer, publisher.
>
> and other data i wanna ignore.
>
>
or if you're prepared to wait or help out there's
> http://svn.apache.org/repos/asf/labs/droids/README.TXT
>
--
regards
jl