Posted to user@nutch.apache.org by Deepa Jayaveer <de...@tcs.com> on 2014/03/25 09:13:40 UTC

setting up depth and topN dynamically

Hi,
I need to crawl around 30000 URLs per day, and I need to set depth and
topN dynamically. Is there any configuration that lets me set depth and
topN dynamically for different URLs?


Thanks and Regards
Deepa Devi Jayaveer
____________________________________________
=====-----=====-----=====
Notice: The information contained in this e-mail
message and/or attachments to it may contain 
confidential or privileged information. If you are 
not the intended recipient, any dissemination, use, 
review, distribution, printing or copying of the 
information contained in this e-mail message 
and/or attachments to it are strictly prohibited. If 
you have received this communication in error, 
please notify us by reply e-mail or telephone and 
immediately and permanently delete the message 
and any attachments. Thank you



Re: setting up depth and topN dynamically

Posted by Talat Uyarer <ta...@uyarer.com>.
Hi Deepa,

Unfortunately, Nutch doesn't have a setting like the one you're asking for.
I had the same problem and developed a special algorithm for it. You can use
it if you want; I described how it works in my issue, NUTCH-1630.

Talat
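
Since Nutch has no per-URL depth/topN setting, one possible workaround is to
split the seed list into groups and crawl each group separately with its own
values. The sketch below is only an illustration of that idea. It is not the
algorithm from NUTCH-1630, and it assumes the Nutch 1.x command layout
(bin/nutch generate <crawldb> <segments_dir> -topN N). The CrawlSettings
type, the group_seeds and generate_command helpers, the host names, the
numbers, and the crawl directory names are all hypothetical.

```python
from collections import defaultdict
from typing import NamedTuple
from urllib.parse import urlparse


class CrawlSettings(NamedTuple):
    depth: int  # number of generate/fetch/parse/updatedb rounds to run
    top_n: int  # value passed to -topN in each generate step


def group_seeds(urls, overrides, default):
    """Group seed URLs by the settings that apply to their host.

    `overrides` maps a host name to CrawlSettings; hosts that are not
    listed fall back to `default`.
    """
    groups = defaultdict(list)
    for url in urls:
        host = urlparse(url).hostname or ""
        groups[overrides.get(host, default)].append(url)
    return dict(groups)


def generate_command(crawl_dir, settings):
    """Build the generate call for one round (assumes Nutch 1.x syntax)."""
    return [
        "bin/nutch", "generate",
        f"{crawl_dir}/crawldb", f"{crawl_dir}/segments",
        "-topN", str(settings.top_n),
    ]


if __name__ == "__main__":
    # Made-up example values, for illustration only.
    default = CrawlSettings(depth=2, top_n=1000)
    overrides = {"news.example.com": CrawlSettings(depth=5, top_n=5000)}
    seeds = [
        "http://news.example.com/a",
        "http://example.org/",
        "http://news.example.com/b",
    ]
    grouped = group_seeds(seeds, overrides, default)
    for i, (settings, urls) in enumerate(grouped.items()):
        # Each group would get its own seed directory and crawl directory,
        # and the generate step would be repeated settings.depth times.
        print(settings, len(urls), generate_command(f"crawl_{i}", settings))
```

Each group would then be injected into its own crawldb and run for
settings.depth rounds, with the fetch, parse and updatedb steps following
every generate call. Whether separate crawldbs are acceptable depends on how
the results are indexed afterwards.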
On 25 Mar 2014 10:14, "Deepa Jayaveer" <de...@tcs.com> wrote:

> Hi,
> I need to crawl around 30000 URLs per day. In which i need to set dynamic
> depth and
> topN .    Is there any configuration where i can set up depth and topN
> dynamically  for different
> URLs?
>
>
> Thanks and Regards
> Deepa Devi Jayaveer