You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Vu Hoang (JIRA)" <ji...@apache.org> on 2010/07/14 20:59:50 UTC
[jira] Closed: (NUTCH-780) Nutch crawler did not read configuration
files
[ https://issues.apache.org/jira/browse/NUTCH-780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Vu Hoang closed NUTCH-780.
--------------------------
this issue was resolved
> Nutch crawler did not read configuration files
> ----------------------------------------------
>
> Key: NUTCH-780
> URL: https://issues.apache.org/jira/browse/NUTCH-780
> Project: Nutch
> Issue Type: Bug
> Components: fetcher
> Affects Versions: 1.0.0
> Reporter: Vu Hoang
> Fix For: 1.2, 2.0
>
> Attachments: NUTCH-780.patch
>
>
> Nutch searcher can read properties at the constructor ...
> {code:java|title=NutchSearcher.java|borderStyle=solid}
> NutchBean bean = new NutchBean(getFilesystem().getConf(), fs);
> ... // put search engine code here
> {code}
> ... but Nutch crawler is not, it only reads data from arguments.
> {code:java|title=NutchCrawler.java|borderStyle=solid}
> StringBuilder builder = new StringBuilder();
> builder.append(domainlist + SPACE);
> builder.append(ARGUMENT_CRAWL_DIR);
> builder.append(domainlist + SUBFIX_CRAWLED + SPACE);
> builder.append(ARGUMENT_CRAWL_THREADS);
> builder.append(threads + SPACE);
> builder.append(ARGUMENT_CRAWL_DEPTH);
> builder.append(depth + SPACE);
> builder.append(ARGUMENT_CRAWL_TOPN);
> builder.append(topN + SPACE);
> Crawl.main(builder.toString().split(SPACE));
> {code}
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.