You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Cam Bazz <ca...@gmail.com> on 2011/07/18 23:26:20 UTC

reparsing and already parsed segment.

Hello,

Any ideas on how I can reparse a segment? I am running experiments -
and it is taking way long time for each inject/generate/fetch/parse
cycle.

Best Regards,
-C.B.

Re: reparsing and already parsed segment.

Posted by Markus Jelsma <ma...@openindex.io>.
It's per segment.

> Hello,
> 
> And how about solrindex? does nutch mark what is indexed and what is not,
> or is it a read-only from segment to solr operation?
> 
> best.
> 
> On Tue, Jul 19, 2011 at 5:05 PM, Julien Nioche <
> 
> lists.digitalpebble@gmail.com> wrote:
> > You can. Simply delete parse_text, parse_data and crawl_parse from the
> > segment before calling the parse command on it
> > 
> > On 19 July 2011 14:57, Markus Jelsma <ma...@openindex.io> wrote:
> >> You cannot reparse a segment IIRC.
> >> 
> >> On Monday 18 July 2011 23:26:20 Cam Bazz wrote:
> >> > Hello,
> >> > 
> >> > Any ideas on how I can reparse a segment? I am running experiments -
> >> > and it is taking way long time for each inject/generate/fetch/parse
> >> > cycle.
> >> > 
> >> > Best Regards,
> >> > -C.B.
> >> 
> >> --
> >> Markus Jelsma - CTO - Openindex
> >> http://www.linkedin.com/in/markus17
> >> 050-8536620 / 06-50258350
> > 
> > --
> > *
> > *Open Source Solutions for Text Engineering
> > 
> > http://digitalpebble.blogspot.com/
> > http://www.digitalpebble.com

Re: reparsing and already parsed segment.

Posted by Cam Bazz <ca...@gmail.com>.
Hello,

And how about solrindex? does nutch mark what is indexed and what is not, or
is it a read-only from segment to solr operation?

best.

On Tue, Jul 19, 2011 at 5:05 PM, Julien Nioche <
lists.digitalpebble@gmail.com> wrote:

> You can. Simply delete parse_text, parse_data and crawl_parse from the
> segment before calling the parse command on it
>
>
> On 19 July 2011 14:57, Markus Jelsma <ma...@openindex.io> wrote:
>
>> You cannot reparse a segment IIRC.
>>
>> On Monday 18 July 2011 23:26:20 Cam Bazz wrote:
>> > Hello,
>> >
>> > Any ideas on how I can reparse a segment? I am running experiments -
>> > and it is taking way long time for each inject/generate/fetch/parse
>> > cycle.
>> >
>> > Best Regards,
>> > -C.B.
>>
>> --
>> Markus Jelsma - CTO - Openindex
>> http://www.linkedin.com/in/markus17
>> 050-8536620 / 06-50258350
>>
>
>
>
> --
> *
> *Open Source Solutions for Text Engineering
>
> http://digitalpebble.blogspot.com/
> http://www.digitalpebble.com
>

Re: reparsing and already parsed segment.

Posted by Julien Nioche <li...@gmail.com>.
You can. Simply delete parse_text, parse_data and crawl_parse from the
segment before calling the parse command on it

On 19 July 2011 14:57, Markus Jelsma <ma...@openindex.io> wrote:

> You cannot reparse a segment IIRC.
>
> On Monday 18 July 2011 23:26:20 Cam Bazz wrote:
> > Hello,
> >
> > Any ideas on how I can reparse a segment? I am running experiments -
> > and it is taking way long time for each inject/generate/fetch/parse
> > cycle.
> >
> > Best Regards,
> > -C.B.
>
> --
> Markus Jelsma - CTO - Openindex
> http://www.linkedin.com/in/markus17
> 050-8536620 / 06-50258350
>



-- 
*
*Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com

Re: reparsing and already parsed segment.

Posted by Markus Jelsma <ma...@openindex.io>.
You cannot reparse a segment IIRC. 

On Monday 18 July 2011 23:26:20 Cam Bazz wrote:
> Hello,
> 
> Any ideas on how I can reparse a segment? I am running experiments -
> and it is taking way long time for each inject/generate/fetch/parse
> cycle.
> 
> Best Regards,
> -C.B.

-- 
Markus Jelsma - CTO - Openindex
http://www.linkedin.com/in/markus17
050-8536620 / 06-50258350