You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by YourSoft <yo...@freemail.hu> on 2005/05/14 08:42:35 UTC

Number of searchabe pages

Dear List,

I counted the pages in the segments:
  bin/nutch segread -fix -list -dir segments
the sum of results is: 11 million pages - 'dedup' removes 2 million = 9 
million pages.

When I search in the frontend with "http" the result is 6 million, how to 
find the missing 3 million pages?

How to count the total number of searchable pages in the search 
server?

Best Regards,
    Ferenc