You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Sertac TURKEL (JIRA)" <ji...@apache.org> on 2014/02/12 17:50:19 UTC

[jira] [Updated] (NUTCH-1727) Length of the Tlds

     [ https://issues.apache.org/jira/browse/NUTCH-1727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sertac TURKEL updated NUTCH-1727:
---------------------------------

    Attachment: NUTCH-1727.patch

I had a look domain-suffix.xml  and I saw the longest domain suffix can include 8 characters(.internal). By default value, I picked 8 for this reason and I prepared a patch.  Could you review my patch?

> Length of the Tlds
> ------------------
>
>                 Key: NUTCH-1727
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1727
>             Project: Nutch
>          Issue Type: Bug
>            Reporter: Sertac TURKEL
>            Priority: Minor
>             Fix For: 2.1
>
>         Attachments: NUTCH-1727.patch
>
>
> Length of the tld  should be selectable, there is some available tld's like .travel and url-validator plugin filters this type of urls.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)