You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Johannes Dorn <jo...@johannet.de> on 2012/12/05 11:46:45 UTC

Fetcher hangs for a long time

Hello,

i have implemented a Protocol that allows me to parse "xyz://something:something" urls.
The parsing works fine, however, when fetching a large amount of URLs, it takes a very long time for the fetcher to finish.

All my URLs are stored in a single text file containing 63000 entries. I am working in local mode. 
When i set -topN to 10000, it takes about 10 minutes for the fetcher to finish. 
Now i have set -topN to 65000 but i am stuck at this point now for 20 hours.

...
fetching xyz://fzu11:vfjd
fetching xyz://iiur:nepw
-finishing thread FetcherThread, activeThreads=9
-finishing thread FetcherThread, activeThreads=8
-activeThreads=8, spinWaiting=0, fetchQueues.totalSize=0
-finishing thread FetcherThread, activeThreads=7
-finishing thread FetcherThread, activeThreads=6
-finishing thread FetcherThread, activeThreads=5
-finishing thread FetcherThread, activeThreads=4
-finishing thread FetcherThread, activeThreads=3
-finishing thread FetcherThread, activeThreads=2
-finishing thread FetcherThread, activeThreads=1
-finishing thread FetcherThread, activeThreads=0
-activeThreads=0, spinWaiting=0, fetchQueues.totalSize=0
-activeThreads=0

What is happening here? No log output is available even though i set the fetcher logging to verbose.

Is there any way to speed things up?

Johannes