Posted to dev@nutch.apache.org by "Bhavya Sanghavi (JIRA)" <ji...@apache.org> on 2016/04/13 00:16:25 UTC
[jira] [Created] (NUTCH-2249) WordNet Integration for Cosine Similarity
Bhavya Sanghavi created NUTCH-2249:
--------------------------------------
Summary: WordNet Integration for Cosine Similarity
Key: NUTCH-2249
URL: https://issues.apache.org/jira/browse/NUTCH-2249
Project: Nutch
Issue Type: New Feature
Components: plugin, scoring
Reporter: Bhavya Sanghavi
Priority: Minor
Integrated the WordNet database to enhance the cosine similarity plugin.
Mapping synonymous words to the same entry in the vector reduces the size of the vectors used to calculate cosine similarity. Consequently, it should increase the accuracy of the scores given to the web pages to be crawled.
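The effect of collapsing synonyms can be illustrated with a minimal sketch. This is not the plugin's code: the `SYNONYMS` table is a hypothetical hand-written stand-in for WordNet synset lookups, and `term_vector` and `cosine_similarity` are illustrative helper names, not Nutch APIs.

```python
import math
from collections import Counter

# Hypothetical stand-in for a WordNet lookup: each word maps to one
# canonical term shared by all of its synonyms (an assumption for this sketch).
SYNONYMS = {
    "automobile": "car",
    "auto": "car",
    "purchase": "buy",
}


def term_vector(text, synonyms=None):
    """Build a term-frequency vector, optionally collapsing synonyms."""
    synonyms = synonyms or {}
    tokens = text.lower().split()
    return Counter(synonyms.get(token, token) for token in tokens)


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[term] * b[term] for term in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


goal = "buy car online"
page = "purchase automobile online"

# Without synonym mapping: the two texts share only the term "online".
plain_goal, plain_page = term_vector(goal), term_vector(page)
plain_vocab = set(plain_goal) | set(plain_page)
plain_score = cosine_similarity(plain_goal, plain_page)

# With synonym mapping: synonymous words fall into the same vector entry.
mapped_goal = term_vector(goal, SYNONYMS)
mapped_page = term_vector(page, SYNONYMS)
mapped_vocab = set(mapped_goal) | set(mapped_page)
mapped_score = cosine_similarity(mapped_goal, mapped_page)

print(len(plain_vocab), plain_score)
print(len(mapped_vocab), mapped_score)
```

In this toy case the combined vocabulary shrinks and the similarity score rises once synonyms share an entry, which is the behaviour the issue describes. A real integration would have to resolve words against WordNet synsets and deal with words that have several senses, which this sketch ignores.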
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)