You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by jyoti aditya <jy...@gmail.com> on 2016/12/06 13:01:20 UTC
log file
Hi team,
I have configured Nutch 2.3.1 with mongodb.
I am not able to see any log file after crawling, though my data is going
to mongodb.
In my db i can see bindata and text.
When i decode bibdata it gives me html,and javascript info of that webpage.
I just wanted to know is there any other way by which I can get the content
of webpage in proper format. Like as the html tag vs its value.
--
With Regards
Jyoti Aditya