Posted to user@nutch.apache.org by "a.ciccia04" <a....@gmail.com> on 2014/03/19 15:23:34 UTC
Probleme with nutch inject blocked
Hi, I am trying to set up Nutch, but I have a problem with injecting URLs.
I'm working with apache-nutch-2.2.1, hbase-0.90.4 & solr-4.7.0
When I run the command "bin/nutch inject urls", Nutch stays blocked.
InjectorJob: starting at 2014-03-19 15:19:48
InjectorJob: Injecting urlDir: urls
I have tried many things, but I can't find a solution.
Do you have any ideas?
Thank you!
--
View this message in context: http://lucene.472066.n3.nabble.com/Probleme-with-nutch-inject-blocked-tp4125454.html
Sent from the Nutch - User mailing list archive at Nabble.com.
Re: Probleme with nutch inject blocked
Posted by Talat Uyarer <ta...@uyarer.com>.
I think the problem is with your HBase installation. The guide below is for the latest version of HBase.
However, the installation steps are the same for the latest version and for HBase 0.90.x.
https://hbase.apache.org/book/quickstart.html
Re: Probleme with nutch inject blocked
Posted by "a.ciccia04" <a....@gmail.com>.
With a little patience, I get an error:
InjectorJob: starting at 2014-03-19 17:04:45
InjectorJob: Injecting urlDir: urls/seed.txt
InjectorJob: org.apache.gora.util.GoraException: java.lang.RuntimeException: org.apache.hadoop.hbase.MasterNotRunningException: localhost:49444
    at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:167)
    at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:135)
    at org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:75)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:221)
    at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)
Caused by: java.lang.RuntimeException: org.apache.hadoop.hbase.MasterNotRunningException: localhost:49444
    at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:127)
    at org.apache.gora.store.DataStoreFactory.initializeDataStore(DataStoreFactory.java:102)
    at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:161)
    ... 7 more
Caused by: org.apache.hadoop.hbase.MasterNotRunningException: localhost:49444
    at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getMaster(HConnectionManager.java:396)
    at org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:94)
    at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:109)
    ... 9 more
Re: Probleme with nutch inject blocked
Posted by "a.ciccia04" <a....@gmail.com>.
I spoke too soon. I get errors only when my HBase server is turned off.
When my HBase server is on, I get no errors.
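A quick way to confirm whether HBase is up before running the inject is to test whether its ZooKeeper port accepts connections. The sketch below is only an illustration and is not part of Nutch or HBase: the helper name port_is_open is hypothetical, and it assumes that HBase's ZooKeeper listens on localhost at the default client port 2181. Adjust both values if your setup differs.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection raises OSError (e.g. ConnectionRefusedError)
        # when nothing is listening or the host cannot be reached.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    # Assumption: ZooKeeper (as started by HBase) listens on localhost:2181.
    host, port = "localhost", 2181
    if port_is_open(host, port):
        print(f"{host}:{port} is accepting connections")
    else:
        print(f"{host}:{port} is not reachable; start HBase before running the inject")
```

An open port only shows that something is listening there. It does not prove that the HBase master itself is healthy, so treat a positive result as a first check rather than a guarantee.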
Re: Probleme with nutch inject blocked
Posted by "a.ciccia04" <a....@gmail.com>.
Good idea, thanks! Now I have an error.
2014-03-19 16:41:12,910 INFO crawl.InjectorJob - InjectorJob: starting at 2014-03-19 16:41:12
2014-03-19 16:41:12,910 INFO crawl.InjectorJob - InjectorJob: Injecting urlDir: urls
2014-03-19 16:41:13,213 WARN zookeeper.ClientCnxn - Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:597)
    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1119)
2014-03-19 16:41:13,593 WARN zookeeper.ClientCnxn - Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect
Re: Probleme with nutch inject blocked
Posted by Talat Uyarer <ta...@uyarer.com>.
Hi Antonia,
Can you change your log level in the log4j properties?
--
Talat UYARER
Website: http://talat.uyarer.com
Twitter: http://twitter.com/talatuyarer
Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304
Re: Probleme with nutch inject blocked
Posted by "a.ciccia04" <a....@gmail.com>.
I have only these 2 lines in my hadoop.log :(
2014-03-19 15:35:03,049 INFO crawl.InjectorJob - InjectorJob: starting at 2014-03-19 15:35:03
2014-03-19 15:35:03,050 INFO crawl.InjectorJob - InjectorJob: Injecting urlDir: urls
Re: Probleme with nutch inject blocked
Posted by Talat Uyarer <ta...@uyarer.com>.
Hey,
Can you share your hadoop.log?