You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Sertac TURKEL (JIRA)" <ji...@apache.org> on 2014/02/12 17:50:19 UTC
[jira] [Updated] (NUTCH-1727) Length of the Tlds
[ https://issues.apache.org/jira/browse/NUTCH-1727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sertac TURKEL updated NUTCH-1727:
---------------------------------
Attachment: NUTCH-1727.patch
I had a look domain-suffix.xml and I saw the longest domain suffix can include 8 characters(.internal). By default value, I picked 8 for this reason and I prepared a patch. Could you review my patch?
> Length of the Tlds
> ------------------
>
> Key: NUTCH-1727
> URL: https://issues.apache.org/jira/browse/NUTCH-1727
> Project: Nutch
> Issue Type: Bug
> Reporter: Sertac TURKEL
> Priority: Minor
> Fix For: 2.1
>
> Attachments: NUTCH-1727.patch
>
>
> Length of the tld should be selectable, there is some available tld's like .travel and url-validator plugin filters this type of urls.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)