You are viewing a plain text version of this content. The canonical link for it is here.
Posted to solr-user@lucene.apache.org by Ridwan Naibi <ri...@gmail.com> on 2014/04/14 03:32:01 UTC

Error Arising from when I start to crawl

Hi there,

I get the following error after I run the following command. Can you 
please let me know what the problem is? I have exhausted online 
tutorials trying to solve this issue. Thanks

engr@engr-MacBookPro:~/NUTCH_HOME/apache-nutch-2.2.1/runtime/local$ 
bin/crawl urls/seed.txt TestCrawl http://localhost:8983/solr/ 2
InjectorJob: starting at 2014-04-14 02:28:56
InjectorJob: Injecting urlDir: urls/seed.txt
InjectorJob: org.apache.gora.util.GoraException: 
java.lang.RuntimeException: java.lang.IllegalArgumentException: Not a 
host:port pair: �3530@engr-MacBookProlocalhost,43200,1397436949832
at 
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:167)
at 
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:135)
at 
org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:75)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:221)
at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)
Caused by: java.lang.RuntimeException: 
java.lang.IllegalArgumentException: Not a host:port pair: 
�3530@engr-MacBookProlocalhost,43200,1397436949832
at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:127)
at 
org.apache.gora.store.DataStoreFactory.initializeDataStore(DataStoreFactory.java:102)
at 
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:161)
... 7 more
Caused by: java.lang.IllegalArgumentException: Not a host:port pair: 
�3530@engr-MacBookProlocalhost,43200,1397436949832
at org.apache.hadoop.hbase.HServerAddress.<init>(HServerAddress.java:60)
at 
org.apache.hadoop.hbase.MasterAddressTracker.getMasterAddress(MasterAddressTracker.java:63)
at 
org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getMaster(HConnectionManager.java:354)
at org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:94)
at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:109)
... 9 more


Re: Error Arising from when I start to crawl

Posted by Cihad Guzel <cg...@gmail.com>.
Hi Ridwan,

This error is not related to Solr. Solr is used in "IndexerJob" for
Nutch.  This error is thrown from "InjectorJob." It is related Nutch and
Gora. You check your hbase and nutch configuration. You ensure the HBase
run correctly and to use the correct version. For more accurate
information, you should ask questions to the "nutch user list" with more
information.


2014-04-14 5:11 GMT+03:00 Alexandre Rafalovitch <ar...@gmail.com>:

> This is most definitely not a Solr issue, so you may want to check with
> Gora's list.
>
> However as a quick general hint, you problem seems to be in thus
> part: 3530@engr-MacBookProlocalhost . I assume it should be a server name
> there, but it seems to be two name joined together. So I would check where
> that (possibly hbase listen address) is defined and ensure it is correct.
>
> Regards,
>      Alex
> On 14/04/2014 8:46 am, "Ridwan Naibi" <ri...@gmail.com> wrote:
>
> > Hi there,
> >
> > I get the following error after I run the following command. Can you
> > please let me know what the problem is? I have exhausted online tutorials
> > trying to solve this issue. Thanks
> >
> > engr@engr-MacBookPro:~/NUTCH_HOME/apache-nutch-2.2.1/runtime/local$
> > bin/crawl urls/seed.txt TestCrawl http://localhost:8983/solr/ 2
> > InjectorJob: starting at 2014-04-14 02:28:56
> > InjectorJob: Injecting urlDir: urls/seed.txt
> > InjectorJob: org.apache.gora.util.GoraException:
> > java.lang.RuntimeException: java.lang.IllegalArgumentException: Not a
> > host:port pair: � 3530@engr-MacBookProlocalhost,43200,1397436949832
> > at org.apache.gora.store.DataStoreFactory.createDataStore(
> > DataStoreFactory.java:167)
> > at org.apache.gora.store.DataStoreFactory.createDataStore(
> > DataStoreFactory.java:135)
> > at org.apache.nutch.storage.StorageUtils.createWebStore(
> > StorageUtils.java:75)
> > at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:221)
> > at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
> > at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
> > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
> > at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)
> > Caused by: java.lang.RuntimeException:
> java.lang.IllegalArgumentException:
> > Not a host:port pair: � 3530@engr-MacBookProlocalhost
> ,43200,1397436949832
> > at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:127)
> > at org.apache.gora.store.DataStoreFactory.initializeDataStore(
> > DataStoreFactory.java:102)
> > at org.apache.gora.store.DataStoreFactory.createDataStore(
> > DataStoreFactory.java:161)
> > ... 7 more
> > Caused by: java.lang.IllegalArgumentException: Not a host:port pair: �
> > 3530@engr-MacBookProlocalhost,43200,1397436949832
> > at org.apache.hadoop.hbase.HServerAddress.<init>(HServerAddress.java:60)
> > at org.apache.hadoop.hbase.MasterAddressTracker.getMasterAddress(
> > MasterAddressTracker.java:63)
> > at org.apache.hadoop.hbase.client.HConnectionManager$
> > HConnectionImplementation.getMaster(HConnectionManager.java:354)
> > at org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:94)
> > at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:109)
> > ... 9 more
> >
> >
>

Re: Error Arising from when I start to crawl

Posted by Alexandre Rafalovitch <ar...@gmail.com>.
This is most definitely not a Solr issue, so you may want to check with
Gora's list.

However as a quick general hint, you problem seems to be in thus
part: 3530@engr-MacBookProlocalhost . I assume it should be a server name
there, but it seems to be two name joined together. So I would check where
that (possibly hbase listen address) is defined and ensure it is correct.

Regards,
     Alex
On 14/04/2014 8:46 am, "Ridwan Naibi" <ri...@gmail.com> wrote:

> Hi there,
>
> I get the following error after I run the following command. Can you
> please let me know what the problem is? I have exhausted online tutorials
> trying to solve this issue. Thanks
>
> engr@engr-MacBookPro:~/NUTCH_HOME/apache-nutch-2.2.1/runtime/local$
> bin/crawl urls/seed.txt TestCrawl http://localhost:8983/solr/ 2
> InjectorJob: starting at 2014-04-14 02:28:56
> InjectorJob: Injecting urlDir: urls/seed.txt
> InjectorJob: org.apache.gora.util.GoraException:
> java.lang.RuntimeException: java.lang.IllegalArgumentException: Not a
> host:port pair: � 3530@engr-MacBookProlocalhost,43200,1397436949832
> at org.apache.gora.store.DataStoreFactory.createDataStore(
> DataStoreFactory.java:167)
> at org.apache.gora.store.DataStoreFactory.createDataStore(
> DataStoreFactory.java:135)
> at org.apache.nutch.storage.StorageUtils.createWebStore(
> StorageUtils.java:75)
> at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:221)
> at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
> at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
> at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)
> Caused by: java.lang.RuntimeException: java.lang.IllegalArgumentException:
> Not a host:port pair: � 3530@engr-MacBookProlocalhost,43200,1397436949832
> at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:127)
> at org.apache.gora.store.DataStoreFactory.initializeDataStore(
> DataStoreFactory.java:102)
> at org.apache.gora.store.DataStoreFactory.createDataStore(
> DataStoreFactory.java:161)
> ... 7 more
> Caused by: java.lang.IllegalArgumentException: Not a host:port pair: �
> 3530@engr-MacBookProlocalhost,43200,1397436949832
> at org.apache.hadoop.hbase.HServerAddress.<init>(HServerAddress.java:60)
> at org.apache.hadoop.hbase.MasterAddressTracker.getMasterAddress(
> MasterAddressTracker.java:63)
> at org.apache.hadoop.hbase.client.HConnectionManager$
> HConnectionImplementation.getMaster(HConnectionManager.java:354)
> at org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:94)
> at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:109)
> ... 9 more
>
>