Posted to dev@nutch.apache.org by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/06/14 09:20:42 UTC
[jira] [Updated] (NUTCH-1392) -force and -resume arguments being ignored in ParserJob
[ https://issues.apache.org/jira/browse/NUTCH-1392?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ferdy Galema updated NUTCH-1392:
--------------------------------
Attachment: NUTCH-1392.patch
> -force and -resume arguments being ignored in ParserJob
> -------------------------------------------------------
>
> Key: NUTCH-1392
> URL: https://issues.apache.org/jira/browse/NUTCH-1392
> Project: Nutch
> Issue Type: Bug
> Components: parser
> Affects Versions: nutchgora
> Reporter: Lewis John McGibbney
> Fix For: 2.1
>
> Attachments: NUTCH-1392.patch
>
>
> The log below shows that -resume and -force are both passed on the command line, yet ParserJob ignores them: the log output reports both as false.
> lewis@lewis:~/ASF/nutchgora/runtime/local$ ./bin/nutch parse
> Usage: ParserJob (<batchId> | -all) [-crawlId <id>] [-resume] [-force]
> <batchId> - symbolic batch ID created by Generator
> -crawlId <id> - the id to prefix the schemas to operate on,
> (default: storage.crawl.id)
> -all - consider pages from all crawl jobs
> -resume - resume a previous incomplete job
> -force - force re-parsing even if a page is already parsed
> lewis@lewis:~/ASF/nutchgora/runtime/local$ ./bin/nutch parse -all -resume -force
> ParserJob: starting
> ParserJob: resuming: false
> ParserJob: forced reparse: false
> ParserJob: parsing all
> Parsing http://www.trancearoundtheworld.com/
> ParserJob: success
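
For illustration only, here is a minimal sketch of how options of the shape shown in the usage message could be read so that "-all -resume -force" would log both flags as true. It is not the ParserJob source and not the contents of NUTCH-1392.patch; the class name ParseArgsSketch, the Options holder and the parse method are all hypothetical, and the cause of the bug is not stated in this message.

```java
// Hypothetical sketch: not the actual ParserJob code or the attached patch.
public class ParseArgsSketch {

    /** Holds the options named in the usage message (field names are assumptions). */
    static final class Options {
        String batchId;   // <batchId>, or the literal "-all"
        String crawlId;   // value following -crawlId, if given
        boolean resume;   // true when -resume is present
        boolean force;    // true when -force is present
    }

    /** Reads every argument, so flags are honoured wherever they appear. */
    static Options parse(String[] args) {
        Options opts = new Options();
        for (int i = 0; i < args.length; i++) {
            String arg = args[i];
            if ("-resume".equals(arg)) {
                opts.resume = true;
            } else if ("-force".equals(arg)) {
                opts.force = true;
            } else if ("-crawlId".equals(arg)) {
                if (i + 1 >= args.length) {
                    throw new IllegalArgumentException("-crawlId requires a value");
                }
                opts.crawlId = args[++i]; // consume the value as well
            } else if ("-all".equals(arg)) {
                opts.batchId = arg;
            } else if (arg.startsWith("-")) {
                throw new IllegalArgumentException("unknown option: " + arg);
            } else if (opts.batchId == null) {
                opts.batchId = arg; // first bare argument is taken as the batch ID
            } else {
                throw new IllegalArgumentException("unexpected argument: " + arg);
            }
        }
        if (opts.batchId == null) {
            throw new IllegalArgumentException("expected <batchId> or -all");
        }
        return opts;
    }

    public static void main(String[] args) {
        // Same arguments as the failing invocation in the log.
        Options opts = parse(new String[] {"-all", "-resume", "-force"});
        System.out.println("resuming: " + opts.resume);
        System.out.println("forced reparse: " + opts.force);
    }
}
```

With the arguments from the failing invocation, this sketch prints "resuming: true" and "forced reparse: true", which is the behaviour the report expects from ParserJob.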