You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Furkan KAMACI (JIRA)" <ji...@apache.org> on 2013/08/21 19:02:51 UTC

[jira] [Updated] (NUTCH-1631) Display Document Count Added To Solr Server

     [ https://issues.apache.org/jira/browse/NUTCH-1631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Furkan KAMACI updated NUTCH-1631:
---------------------------------

    Attachment: NUTCH-1631.patch

This patch counts documents added to Solr Server and writes it to context as a Hadoop counter. So one can see how many documents are added simultaneously at Hadoop Map/Reduce Administration page. On the other hand SolrWriter logs how many documents are added at each batch (maximum of commit size) but does not log total count at the end of indexing process. This patch also logs total document count as well as writing to Hadoop context as a counter.
                
> Display Document Count Added To Solr Server
> -------------------------------------------
>
>                 Key: NUTCH-1631
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1631
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>    Affects Versions: 2.1, 2.2, 2.2.1
>            Reporter: Furkan KAMACI
>            Priority: Minor
>             Fix For: 2.3
>
>         Attachments: NUTCH-1631.patch
>
>
> Currently you can not see how many documents are added to Solr Server. One could see how many documents are added to Solr server simultaneously (as a hadoop counter) and after all documents are added total document count should be logged too.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira