You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by "Doan, Tan" <td...@carsdirect.com> on 2007/08/15 19:38:33 UTC

How do I find similar pages?

How would I go about finding pages in my index that are similar to a
given url from command line?

 

Step 1, I'd have to index the url if it isn't in there yet.

Step 2... ?

 

Is there a way to use NutchSimilarity from command line?

Thanks