Posted to dev@nutch.apache.org by "Zuber (JIRA)" <ji...@apache.org> on 2016/10/01 12:11:20 UTC

[jira] [Created] (NUTCH-2319) Link with "rel=alternate" doesn't return in crawl

Zuber created NUTCH-2319:
----------------------------

             Summary: Link with "rel=alternate" doesn't return in crawl 
                 Key: NUTCH-2319
                 URL: https://issues.apache.org/jira/browse/NUTCH-2319
             Project: Nutch
          Issue Type: Bug
            Reporter: Zuber


I am using nutch-1.4. The problem is that Nutch does not return the URLs found in link elements with rel="alternate".
For example, I am trying to crawl the URL http://rssfeeds.azcentral.com/phoenix/asu, which contains the link below, but that link does not appear in the crawl results:
<link rel="alternate" type="application/atom+xml" href="http://rssfeeds.azcentral.com/phoenix/asu&amp;x=1" title="Phoenix - ASU">
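To make the expected result concrete, here is a minimal standalone sketch of the extraction I would expect. It is not Nutch code and says nothing about how Nutch's HTML parser works internally; it only uses Python's standard html.parser module, and the names AlternateLinkParser and extract_alternate_links are made up for this illustration.

```python
from html.parser import HTMLParser


class AlternateLinkParser(HTMLParser):
    """Collects href values of <link> tags whose rel contains "alternate"."""

    def __init__(self):
        super().__init__()
        self.alternate_links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        if tag != "link":
            return
        attr_map = dict(attrs)
        # rel is a space-separated token list; compare case-insensitively.
        rel_tokens = (attr_map.get("rel") or "").lower().split()
        href = attr_map.get("href")
        if "alternate" in rel_tokens and href:
            self.alternate_links.append(href)


def extract_alternate_links(html_text):
    """Hypothetical helper: return all rel="alternate" hrefs in html_text."""
    parser = AlternateLinkParser()
    parser.feed(html_text)
    parser.close()
    return parser.alternate_links


# The link element from the page mentioned in this report.
SAMPLE = (
    '<link rel="alternate" type="application/atom+xml" '
    'href="http://rssfeeds.azcentral.com/phoenix/asu&amp;x=1" '
    'title="Phoenix - ASU">'
)

if __name__ == "__main__":
    for url in extract_alternate_links(SAMPLE):
        print(url)
```

Note that the href contains the entity &amp;, which an HTML parser decodes to a plain &, so the URL I would expect among the outlinks ends in asu&x=1.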

Could you please help?



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)