You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/18 15:37:57 UTC

[jira] [Updated] (NUTCH-1051) Export WebGraph node scores for solr.ExternalFileField

     [ https://issues.apache.org/jira/browse/NUTCH-1051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-1051:
---------------------------------

    Patch Info: [Patch Available]
      Assignee: Markus Jelsma

> Export WebGraph node scores for solr.ExternalFileField
> ------------------------------------------------------
>
>                 Key: NUTCH-1051
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1051
>             Project: Nutch
>          Issue Type: Improvement
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 1.4
>
>
> The current webgraph.NodeDumper dumps a flat <url>\t<float>\n file, which is almost exactly what is needed for using ExternalFileField in Solr. This issue tracks the option to add to dump it in the proper format. Using EFF we can update scores without reindexing millions of documents. There's one caveat, Solr won't accept an equals-sign in the key but there's a small patch for this in SOLR-2545.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira