Posted to solr-user@lucene.apache.org by James liu <li...@gmail.com> on 2007/05/01 14:36:35 UTC

Re: i wanna find one crawl that can crawl with defined urls and defined data

2007/4/30, Graeme Merrall <da...@gmail.com>:
>
> > I want to crawl http://www.amazone.com/ and extract only the product title,
> > product information, writer, and publisher.
> >
> > All other data should be ignored.
>
> How about
> http://blog.foofactory.fi/2007/02/online-indexing-integrating-nutch-with.html



I had already read it before sending this mail.


For example,
>
> I want to crawl http://www.amazone.com/ and extract only the product title,
> product information, writer, and publisher.
>
> All other data should be ignored.
>
>
> or if you're prepared to wait or help out, there's
> http://svn.apache.org/repos/asf/labs/droids/README.TXT
>



-- 
regards
jl