Posted to dev@nutch.apache.org by "Julien Nioche (JIRA)" <ji...@apache.org> on 2014/07/15 11:41:09 UTC
[jira] [Updated] (NUTCH-1309) fetch queue management
[ https://issues.apache.org/jira/browse/NUTCH-1309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Julien Nioche updated NUTCH-1309:
---------------------------------
Fix Version/s: (was: 1.9)
> fetch queue management
> ----------------------
>
> Key: NUTCH-1309
> URL: https://issues.apache.org/jira/browse/NUTCH-1309
> Project: Nutch
> Issue Type: Improvement
> Components: fetcher
> Affects Versions: 1.4
> Reporter: behnam nikbakht
> Labels: fetch
>
> When a fetch runs in Hadoop with multiple concurrent mappers, each mapper has its own independent fetchQueues, which makes them hard to manage. I suggest constructing the fetchQueues once, before the run begins, with this line:
> feeder = new QueueFeeder(input, fetchQueues, threadCount * 50);
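
As a rough illustration of the suggestion above, the following is a minimal, self-contained sketch of per-host queues that are built once before the run and filled by a feeder capped at threadCount * 50 entries. The classes HostQueues and Feeder, the method fill(), and the example URLs are hypothetical stand-ins written for this sketch. They are not the Nutch classes, and the sketch only covers a single process, not queues spread across several mappers.

```java
import java.net.URI;
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Queue;

public class FetchQueueSketch {

    /** Hypothetical stand-in for per-host fetch queues (not the Nutch class). */
    static final class HostQueues {
        private final Map<String, Queue<String>> byHost = new LinkedHashMap<>();
        private int total;

        /** Puts the URL into the queue that belongs to its host. */
        synchronized void add(String url) {
            String host = URI.create(url).getHost();
            byHost.computeIfAbsent(host, h -> new ArrayDeque<>()).add(url);
            total++;
        }

        /** Total number of queued URLs across all hosts. */
        synchronized int size() {
            return total;
        }

        /** Number of distinct host queues. */
        synchronized int queueCount() {
            return byHost.size();
        }
    }

    /** Hypothetical feeder: tops the queues up to a fixed capacity. */
    static final class Feeder {
        private final Iterator<String> input;
        private final HostQueues queues;
        private final int capacity;

        Feeder(Iterator<String> input, HostQueues queues, int capacity) {
            this.input = input;
            this.queues = queues;
            this.capacity = capacity;
        }

        /** Moves URLs from the input into the queues until the cap is hit. */
        int fill() {
            int added = 0;
            while (queues.size() < capacity && input.hasNext()) {
                queues.add(input.next());
                added++;
            }
            return added;
        }
    }

    public static void main(String[] args) {
        int threadCount = 2; // assumed value, for illustration only

        // The queues are constructed once, before any fetching starts.
        HostQueues fetchQueues = new HostQueues();
        List<String> input = Arrays.asList(
                "http://a.example.org/1",
                "http://a.example.org/2",
                "http://b.example.org/1");

        // Mirrors the shape of the line quoted above: input, queues, capacity.
        Feeder feeder = new Feeder(input.iterator(), fetchQueues, threadCount * 50);
        int added = feeder.fill();
        System.out.println(added + " URLs queued in "
                + fetchQueues.queueCount() + " host queues");
    }
}
```

The cap passed to the feeder bounds how many URLs sit in memory at once, so fill() would need to be called again as fetcher threads drain the queues.
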
--
This message was sent by Atlassian JIRA
(v6.2#6252)