You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by Doug Cutting <cu...@nutch.org> on 2005/11/23 18:33:05 UTC
Re: MapRed Generator
Are you crawling only a single host? If so, I can see how this would
happen. Using two hosts to crawl a single host is probably not a good
idea anyway, no?
Doug
Anton Potehin wrote:
> Class Generator
>
>
>
> We have 2 Reduce Tasks
>
>
>
> Limit = TopN / 2;
>
>
>
> Generator.Selector.Reduce for first task receive all K,V pairs from
> maps, but select only half of them (work limit)
>
> Generator.Selector.Reduce for second task doesn't receive pairs at all!
>
>
>
> In result on output we have half of mapped pairs!
>
>
>
> This is bug? Or I'm don't understand something?
>
>