You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by "abobrobo@gmail.com" <ab...@gmail.com> on 2006/09/13 19:10:33 UTC

How to crawl this website

Hey,I am confused with the crawling with nutch.
as you know,there are some website which can not be accessed becaused
they are the "post"method,that means,even if you know the web site's
url,when you input the url into the address bar on the IE or Mozilla,the
website 's some important content has lost.
what should I do,should I do a plugin to extend the crawling ?
eg:
http://www.51job.com/hot/show_job_detail.php?id=100655204&jobiduni=(102344234)