You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by jyoti aditya <jy...@gmail.com> on 2016/12/06 11:13:11 UTC

page size

Hi team,

I am getting only around half page of data from a web page when i crawl.
Is there any size limit property to be configured?

With Regards
Jyoti Aditya

Re: page size

Posted by Vincent <vi...@openindex.io>.
Hi Jyoti,

There is a property http.content.limit in the conf. Beyond that size the 
content will be truncated.

Cheers,
Vincent

On 06-12-16 12:13, jyoti aditya wrote:
> Hi team,
>
> I am getting only around half page of data from a web page when i crawl.
> Is there any size limit property to be configured?
>
> With Regards
> Jyoti Aditya
>