Posted to user@nutch.apache.org by Maciej Szwajcowski <ma...@softwaremind.pl> on 2006/03/08 13:42:27 UTC

Content of page

Hello, 

I'm using Nutch, version 0.8-dev.
I'm calling the Nutch API like this:

Hits hits = nutchBean.search(query, numHits);
HitDetails hitDetails = nutchBean.getDetails(hits.getHit(0));
byte[] content = nutchBean.getContent(hitDetails);

Now I want to retrieve the content of a given URL rather than specifying a query.
I simply need a method like:

getPageContentByUrl(String url)

I would be grateful if somebody could help me with this.

Thanks