Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2013/01/17 10:00:28 UTC

[jira] [Created] (NUTCH-1520) SegmentMerger loses records

Markus Jelsma created NUTCH-1520:
------------------------------------

             Summary: SegmentMerger loses records
                 Key: NUTCH-1520
                 URL: https://issues.apache.org/jira/browse/NUTCH-1520
             Project: Nutch
          Issue Type: Bug
    Affects Versions: 1.6
            Reporter: Markus Jelsma
            Priority: Critical
             Fix For: 1.7


It seems the SegmentMerger tool loses documents. You are likely to see fewer documents in an index if you index one or more already merged segments than if you index all of the unmerged segments.

This is really nasty!
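
One way to narrow this down might be to compare the set of URLs found across the unmerged segments with the set found in the merged segment. The sketch below is only an illustration: the function name find_lost_records and the sample URLs are hypothetical, and it assumes the URLs of each segment have already been extracted into plain lists by some other means. It does not use any Nutch API.

```python
from typing import Iterable, Set


def find_lost_records(unmerged_segments: Iterable[Iterable[str]],
                      merged_segment: Iterable[str]) -> Set[str]:
    """Return URLs present in at least one unmerged segment
    but absent from the merged segment."""
    expected: Set[str] = set()
    for segment_urls in unmerged_segments:
        # Duplicates across segments collapse into one expected record.
        expected.update(segment_urls)
    return expected - set(merged_segment)


if __name__ == "__main__":
    # Hypothetical URL lists, for illustration only.
    seg_a = ["http://example.org/a", "http://example.org/b"]
    seg_b = ["http://example.org/b", "http://example.org/c"]
    merged = ["http://example.org/a", "http://example.org/c"]

    lost = find_lost_records([seg_a, seg_b], merged)
    print(sorted(lost))  # ['http://example.org/b']
```

An empty result would suggest that every URL survived the merge; a non-empty result lists candidates worth inspecting by hand.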
