You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by YourSoft <yo...@freemail.hu> on 2005/05/14 08:42:35 UTC
Number of searchabe pages
Dear List,
I counted the pages in the segments:
bin/nutch segread -fix -list -dir segments
the sum of results is: 11 million pages - 'dedup' removes 2 million = 9
million pages.
When I search in the frontend with "http" the result is 6 million, how to
find the missing 3 million pages?
How to count the total number of searchable pages in the search
server?
Best Regards,
Ferenc