You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Jorge Luis Betancourt González <jl...@uci.cu> on 2016/05/03 16:52:54 UTC

Re: [MASSMAIL]crawl with nutch 1.11

Actually, executing bin/crawl shows this:

    -i|--index	Indexes crawl results into a configured indexer

So you could use the bin/crawl command to index each iteration in your configured indexer (Solr/ES), for instance you could use this command:

$ bin/crawl -i -D solr.server.url=http://localhost:8983/solr/ urls/ mycrawl/  2

More info could be found in [1]

Regards,

[1] https://wiki.apache.org/nutch/NutchTutorial

----- Mensaje original -----
De: "Shani Chaushu" <sh...@intel.com>
Para: user@nutch.apache.org
Enviados: Lunes, 2 de Mayo 2016 8:46:58
Asunto: [MASSMAIL]crawl with nutch 1.11

Hi,
I want to upgrade from nutch 1.9 to nutch 1.11
I saw that in bin/crawl script there is no step of solrindex
Do I need to run command for solr index separately after all the crawl is complete?
There is another way to run the whole process in one command?

Thanks,
Shani



---------------------------------------------------------------------
Intel Electronics Ltd.

This e-mail and any attachments may contain confidential material for
the sole use of the intended recipient(s). Any review or distribution
by others is strictly prohibited. If you are not the intended
recipient, please contact the sender and delete all copies.
La UCI presente este 1ro. de Mayo en la Plaza de la Revoluci�n
junto a todo el pueblo.�Por Cuba: Unidad y Compromiso!

RE: [MASSMAIL]crawl with nutch 1.11

Posted by "Chaushu, Shani" <sh...@intel.com>.
Great thanks!

-----Original Message-----
From: Jorge Luis Betancourt González [mailto:jlbetancourt@uci.cu] 
Sent: Tuesday, May 03, 2016 17:53
To: user@nutch.apache.org
Subject: Re: [MASSMAIL]crawl with nutch 1.11

Actually, executing bin/crawl shows this:

    -i|--index	Indexes crawl results into a configured indexer

So you could use the bin/crawl command to index each iteration in your configured indexer (Solr/ES), for instance you could use this command:

$ bin/crawl -i -D solr.server.url=http://localhost:8983/solr/ urls/ mycrawl/  2

More info could be found in [1]

Regards,

[1] https://wiki.apache.org/nutch/NutchTutorial

----- Mensaje original -----
De: "Shani Chaushu" <sh...@intel.com>
Para: user@nutch.apache.org
Enviados: Lunes, 2 de Mayo 2016 8:46:58
Asunto: [MASSMAIL]crawl with nutch 1.11

Hi,
I want to upgrade from nutch 1.9 to nutch 1.11 I saw that in bin/crawl script there is no step of solrindex Do I need to run command for solr index separately after all the crawl is complete?
There is another way to run the whole process in one command?

Thanks,
Shani



---------------------------------------------------------------------
Intel Electronics Ltd.

This e-mail and any attachments may contain confidential material for the sole use of the intended recipient(s). Any review or distribution by others is strictly prohibited. If you are not the intended recipient, please contact the sender and delete all copies.
La UCI presente este 1ro. de Mayo en la Plaza de la Revolución junto a todo el pueblo.¡Por Cuba: Unidad y Compromiso!
---------------------------------------------------------------------
Intel Electronics Ltd.

This e-mail and any attachments may contain confidential material for
the sole use of the intended recipient(s). Any review or distribution
by others is strictly prohibited. If you are not the intended
recipient, please contact the sender and delete all copies.