You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by jyoti aditya <jy...@gmail.com> on 2016/12/06 13:01:20 UTC

log file

Hi team,

I have configured Nutch 2.3.1 with mongodb.
I am not able to see any log file after crawling, though my data is going
to mongodb.

In my db i can see bindata and text.
When i decode bibdata it gives me html,and javascript info of that webpage.

I just wanted to know is there any other way by which I can get the content
of webpage in proper format. Like as the html tag vs its value.


-- 
With Regards
Jyoti Aditya