You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by Adamantios Corais <ad...@gmail.com> on 2015/01/24 23:03:59 UTC

How to collect links with Apache Nutch 2.3

Hi,

How to tell Apache Nutch 2.3 to go through all http://URL/?pg={X} pages,
with {X} going from 1 to 348, collect all http://URL/view/{Y}/ links, with
{Y} an arbitrary long number, and finally dump all these links into a
single file?

//Adam