Posted to dev@nutch.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2012/08/30 22:53:07 UTC
[jira] [Closed] (NUTCH-1395) Show batchId when skipping within ParserJob
[ https://issues.apache.org/jira/browse/NUTCH-1395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lewis John McGibbney closed NUTCH-1395.
---------------------------------------
> Show batchId when skipping within ParserJob
> -------------------------------------------
>
> Key: NUTCH-1395
> URL: https://issues.apache.org/jira/browse/NUTCH-1395
> Project: Nutch
> Issue Type: Bug
> Components: crawldb, parser
> Affects Versions: nutchgora
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Priority: Minor
> Fix For: 2.1
>
> Attachments: NUTCH-1395.patch
>
>
> Although the ParserJob CLI has been smartened up, the logging still lets us down: for a URL which is skipped, we are only teased with 'different batch id' and never shown the batch id itself.
> {code}
> Parsing http://www.trancearoundtheworld.com/tatw/399
> Parsing http://www.trancearoundtheworld.com/index.php
> Skipping http://www.aboveandbeyond.nu/music; different batch id
> Parsing http://www.trancearoundtheworld.com/tatw/425
> Parsing http://www.trancearoundtheworld.com/tatw/398
> Parsing https://twitter.com/tatw
> Parsing http://www.trancearoundtheworld.com/tatw/401
> {code}
> I would like to see
> {code}
> Skipping http://www.aboveandbeyond.nu/music; different batch id ($batchId)
> {code}
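
The requested message format could be sketched roughly as below. This is only an illustration, not the attached NUTCH-1395.patch: the class name SkipLogSketch, the helper skipMessage, and the sample batch id are all hypothetical, and it assumes that the value printed in place of $batchId is the batch id stored on the skipped record.

```java
// Illustrative sketch only; not the actual Nutch ParserJob code.
class SkipLogSketch {

    // Hypothetical helper: builds the skip message with the record's
    // batch id appended in parentheses, as requested in the issue.
    static String skipMessage(String url, String recordBatchId) {
        return "Skipping " + url + "; different batch id (" + recordBatchId + ")";
    }

    public static void main(String[] args) {
        // The batch id below is a made-up placeholder value.
        System.out.println(
            skipMessage("http://www.aboveandbeyond.nu/music", "example-batch-id"));
    }
}
```

With the batch id in the message, a skipped URL can be matched against the batch id passed to the job without a separate lookup.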