Posted to user@nutch.apache.org by "a.ciccia04" <a....@gmail.com> on 2014/03/19 15:23:34 UTC

Probleme with nutch inject blocked

Hi, I'm trying to set up Nutch, but I have a problem with the injection of URLs.

I'm working with apache-nutch-2.2.1, hbase-0.90.4 & solr-4.7.0.

When I run the command "bin/nutch inject urls", Nutch stays blocked.

InjectorJob: starting at 2014-03-19 15:19:48
InjectorJob: Injecting urlDir: urls

I tried many things, but I can't find a solution.
Do you have any ideas? 
Thank you!



--
View this message in context: http://lucene.472066.n3.nabble.com/Probleme-with-nutch-inject-blocked-tp4125454.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Re: Probleme with nutch inject blocked

Posted by Talat Uyarer <ta...@uyarer.com>.
I think the problem is with your HBase installation. The guide below is for
the latest version of HBase, but the installation is the same for the latest
version and for HBase 0.90.x.

https://hbase.apache.org/book/quickstart.html
On 19 Mar 2014 at 18:10, "a.ciccia04" <a....@gmail.com> wrote:

> With a little patience, I get an error:
>
> InjectorJob: starting at 2014-03-19 17:04:45
> InjectorJob: Injecting urlDir: urls/seed.txt
> InjectorJob: org.apache.gora.util.GoraException: java.lang.RuntimeException: org.apache.hadoop.hbase.MasterNotRunningException: localhost:49444
>         at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:167)
>         at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:135)
>         at org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:75)
>         at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:221)
>         at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
>         at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>         at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)
> Caused by: java.lang.RuntimeException: org.apache.hadoop.hbase.MasterNotRunningException: localhost:49444
>         at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:127)
>         at org.apache.gora.store.DataStoreFactory.initializeDataStore(DataStoreFactory.java:102)
>         at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:161)
>         ... 7 more
> Caused by: org.apache.hadoop.hbase.MasterNotRunningException: localhost:49444
>         at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getMaster(HConnectionManager.java:396)
>         at org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:94)
>         at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:109)
>         ... 9 more

Re: Probleme with nutch inject blocked

Posted by "a.ciccia04" <a....@gmail.com>.
With a little patience, I get an error:

InjectorJob: starting at 2014-03-19 17:04:45
InjectorJob: Injecting urlDir: urls/seed.txt
InjectorJob: org.apache.gora.util.GoraException: java.lang.RuntimeException: org.apache.hadoop.hbase.MasterNotRunningException: localhost:49444
        at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:167)
        at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:135)
        at org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:75)
        at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:221)
        at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)
        at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)
Caused by: java.lang.RuntimeException: org.apache.hadoop.hbase.MasterNotRunningException: localhost:49444
        at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:127)
        at org.apache.gora.store.DataStoreFactory.initializeDataStore(DataStoreFactory.java:102)
        at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:161)
        ... 7 more
Caused by: org.apache.hadoop.hbase.MasterNotRunningException: localhost:49444
        at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getMaster(HConnectionManager.java:396)
        at org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:94)
        at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:109)
        ... 9 more
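
The trace above nests three exceptions, and the last "Caused by:" entry names
the underlying one. As a rough sketch (the helper name root_cause is
hypothetical and not part of Nutch or HBase), the innermost exception class
can be pulled out of a pasted trace like this, even when mail clients have
wrapped the lines:

```python
import re


def root_cause(trace: str) -> str:
    """Return the class named by the last 'Caused by:' entry, or '' if none."""
    # Mail clients wrap long lines, so collapse all whitespace first.
    flat = " ".join(trace.split())
    # Each 'Caused by:' is followed by a fully qualified Java class name.
    causes = re.findall(r"Caused by: ([\w.$]+)", flat)
    return causes[-1] if causes else ""


# Shortened copy of the trace from this message, wrapped as it was posted.
sample = """
InjectorJob: org.apache.gora.util.GoraException: java.lang.RuntimeException:
org.apache.hadoop.hbase.MasterNotRunningException: localhost:49444
Caused by: java.lang.RuntimeException:
org.apache.hadoop.hbase.MasterNotRunningException: localhost:49444
Caused by: org.apache.hadoop.hbase.MasterNotRunningException:
localhost:49444
"""

print(root_cause(sample))
```

Here the innermost cause is MasterNotRunningException, which suggests the
client could not reach an HBase master at the address shown.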





Re: Probleme with nutch inject blocked

Posted by "a.ciccia04" <a....@gmail.com>.
I went too fast: I got those errors when my HBase server was turned off.
When my HBase server is on, I get no errors.




Re: Probleme with nutch inject blocked

Posted by "a.ciccia04" <a....@gmail.com>.
Good idea, thanks! Now I have an error.

2014-03-19 16:41:12,910 INFO  crawl.InjectorJob - InjectorJob: starting at 2014-03-19 16:41:12
2014-03-19 16:41:12,910 INFO  crawl.InjectorJob - InjectorJob: Injecting urlDir: urls
2014-03-19 16:41:13,213 WARN  zookeeper.ClientCnxn - Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect
java.net.ConnectException: Connection refused
        at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
        at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:597)
        at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1119)
2014-03-19 16:41:13,593 WARN  zookeeper.ClientCnxn - Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect
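
"Connection refused" from the ZooKeeper client usually means nothing is
listening on the port it tried. A minimal sketch for checking this before
running the inject command is below. It assumes ZooKeeper is on localhost at
its default client port 2181; the host, the port and the helper name
port_is_open are assumptions and may differ on your setup.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts and name resolution failures.
        return False


if __name__ == "__main__":
    # Assumption: ZooKeeper listens on localhost at its default client port.
    host, port = "localhost", 2181
    if port_is_open(host, port):
        print(f"{host}:{port} is reachable")
    else:
        print(f"{host}:{port} is not reachable (is HBase running?)")
```

If the port is not reachable, starting HBase first and then re-running
"bin/nutch inject urls" is the obvious next thing to try.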





Re: Probleme with nutch inject blocked

Posted by Talat Uyarer <ta...@uyarer.com>.
Hi Antonia,

Can you change the log level in your log4j properties?



2014-03-19 16:41 GMT+02:00 a.ciccia04 <a....@gmail.com>:

> I have only these 2 lines in my hadoop.log :(
>
> 2014-03-19 15:35:03,049 INFO  crawl.InjectorJob - InjectorJob: starting at 2014-03-19 15:35:03
> 2014-03-19 15:35:03,050 INFO  crawl.InjectorJob - InjectorJob: Injecting urlDir: urls



-- 
Talat UYARER
Website: http://talat.uyarer.com
Twitter: http://twitter.com/talatuyarer
Linkedin: http://tr.linkedin.com/pub/talat-uyarer/10/142/304

Re: Probleme with nutch inject blocked

Posted by "a.ciccia04" <a....@gmail.com>.
I have only these 2 lines in my hadoop.log :(

2014-03-19 15:35:03,049 INFO  crawl.InjectorJob - InjectorJob: starting at 2014-03-19 15:35:03
2014-03-19 15:35:03,050 INFO  crawl.InjectorJob - InjectorJob: Injecting urlDir: urls




Re: Probleme with nutch inject blocked

Posted by Talat Uyarer <ta...@uyarer.com>.
Hey,

Can you share your hadoop.log?


2014-03-19 16:23 GMT+02:00 a.ciccia04 <a....@gmail.com>:

> Hi, I'm trying to set up Nutch, but I have a problem with the injection of URLs.
>
> I'm working with apache-nutch-2.2.1, hbase-0.90.4 & solr-4.7.0.
>
> When I run the command "bin/nutch inject urls", Nutch stays blocked.
>
> InjectorJob: starting at 2014-03-19 15:19:48
> InjectorJob: Injecting urlDir: urls
>
> I tried many things, but I can't find a solution.
> Do you have any ideas?
> Thank you!


