You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Sebastian Nagel (JIRA)" <ji...@apache.org> on 2014/09/30 22:12:34 UTC

[jira] [Updated] (NUTCH-1864) Bug in indexchecker CLI parsing and invocation of index-solr plugin by default

     [ https://issues.apache.org/jira/browse/NUTCH-1864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sebastian Nagel updated NUTCH-1864:
-----------------------------------
    Attachment: NUTCH-1864-trunk-v1.patch

> Bug in indexchecker CLI parsing and invocation of index-solr plugin by default
> ------------------------------------------------------------------------------
>
>                 Key: NUTCH-1864
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1864
>             Project: Nutch
>          Issue Type: Bug
>          Components: indexer
>    Affects Versions: 1.10
>            Reporter: Lewis John McGibbney
>             Fix For: 1.10
>
>         Attachments: NUTCH-1864-trunk-v1.patch
>
>
> I noticed ok that we have a bug in indexchecker tool where 
>  * the command line parsing is buggy, it expects the args.length -1 argument to be the URL IIRC.
>  * Even if indexer-solr is NOT activated, I get the following message
> lmcgibbn@LMC-032857 /usr/local/trunk/runtime/local $ ./bin/nutch indexchecker -dumpText http://nasa.gov
> fetching: http://nasa.gov
> Exception in thread "main" java.lang.RuntimeException: Missing SOLR URL. Should be set via -D solr.server.url
> SOLRIndexWriter
> 	solr.server.url : URL of the SOLR instance (mandatory)
> 	solr.commit.size : buffer size when sending to SOLR (default 1000)
> 	solr.mapping.file : name of the mapping file for fields (default solrindex-mapping.xml)
> 	solr.auth : use authentication (default false)
> 	solr.auth.username : username for authentication
> 	solr.auth.password : password for authentication
> 	at org.apache.nutch.indexwriter.solr.SolrIndexWriter.setConf(SolrIndexWriter.java:192)
> 	at org.apache.nutch.plugin.Extension.getExtensionInstance(Extension.java:159)
> 	at org.apache.nutch.indexer.IndexWriters.<init>(IndexWriters.java:57)
> 	at org.apache.nutch.indexer.IndexingFiltersChecker.run(IndexingFiltersChecker.java:98)
> 	at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
> 	at org.apache.nutch.indexer.IndexingFiltersChecker.main(IndexingFiltersChecker.java:178)
> These issues should be rectified as this is an extremely useful tool which is broken right now.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)