Posted to user@nutch.apache.org by jyoti aditya <jy...@gmail.com> on 2016/12/01 11:42:14 UTC

bindata

Hi team,

When I crawl a specific site using Nutch, it gives me the content as bindata.
When I decode this page, it contains mainly the JavaScript of that page and
metadata information.

I wanted to know: can we crawl and extract all the information on any web page?
And is it possible to get the content in string format directly?

PFA.



-- 
With Regards
Jyoti Aditya