Posted to dev@nutch.apache.org by "Markus Jelsma (Created) (JIRA)" <ji...@apache.org> on 2012/03/08 14:35:57 UTC

[jira] [Created] (NUTCH-1305) Domain(blacklist)URLFilter to trim entries

Domain(blacklist)URLFilter to trim entries
------------------------------------------

                 Key: NUTCH-1305
                 URL: https://issues.apache.org/jira/browse/NUTCH-1305
             Project: Nutch
          Issue Type: Bug
    Affects Versions: 1.4
            Reporter: Markus Jelsma
            Assignee: Markus Jelsma
            Priority: Minor
             Fix For: 1.5
         Attachments: NUTCH-1305-1.5-1.patch

Both filters should handle entries with trailing whitespace.
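
As an illustration only (this is not the code from the attached patch), the behaviour asked for above could be sketched as follows. The class name DomainEntryReader and the method readEntries are hypothetical, and skipping '#' comment lines and lower-casing entries are assumptions about how such a file is read, not something stated in this issue.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.LinkedHashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper for illustration; not part of Nutch.
class DomainEntryReader {

  // Reads one entry per line and trims leading/trailing whitespace,
  // so "example.org " and "example.org" end up as the same entry.
  static Set<String> readEntries(Reader in) throws IOException {
    Set<String> entries = new LinkedHashSet<String>();
    BufferedReader reader = new BufferedReader(in);
    String line;
    while ((line = reader.readLine()) != null) {
      String entry = line.trim();
      // Assumption: blank lines and lines starting with '#' are ignored.
      if (entry.isEmpty() || entry.startsWith("#")) {
        continue;
      }
      // Assumption: entries are compared case-insensitively.
      entries.add(entry.toLowerCase(Locale.ROOT));
    }
    return entries;
  }

  public static void main(String[] args) throws IOException {
    // The first entry has trailing whitespace, the last has leading whitespace.
    String file = "example.org \t\n# comment\n\n  apache.org\n";
    Set<String> entries = readEntries(new StringReader(file));
    System.out.println(entries.contains("example.org"));
  }
}
```

Without the trim() call, the first entry would be stored with its trailing space and tab, and a lookup for "example.org" would not match it.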

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (NUTCH-1305) Domain(blacklist)URLFilter to trim entries

Posted by "Markus Jelsma (Updated) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-1305:
---------------------------------

    Attachment: NUTCH-1305-1.5-1.patch

Patch for 1.5. Fixes the issue.
                

[jira] [Resolved] (NUTCH-1305) Domain(blacklist)URLFilter to trim entries

Posted by "Markus Jelsma (Resolved) (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma resolved NUTCH-1305.
----------------------------------

    Resolution: Fixed

Committed for 1.5 in rev. 1298394.
                

[jira] [Commented] (NUTCH-1305) Domain(blacklist)URLFilter to trim entries

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13230031#comment-13230031 ] 

Hudson commented on NUTCH-1305:
-------------------------------

Integrated in nutch-trunk-maven #196 (See [https://builds.apache.org/job/nutch-trunk-maven/196/])
    NUTCH-1305 missing in CHANGES (Revision 1300871)

     Result = SUCCESS
markus : 
Files : 
* /nutch/trunk/CHANGES.txt

                

[jira] [Commented] (NUTCH-1305) Domain(blacklist)URLFilter to trim entries

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13225232#comment-13225232 ] 

Hudson commented on NUTCH-1305:
-------------------------------

Integrated in nutch-trunk-maven #187 (See [https://builds.apache.org/job/nutch-trunk-maven/187/])
    NUTCH-1305 Domain(blacklist)URLFilter to trim entries (Revision 1298394)

     Result = SUCCESS
markus : 
Files : 
* /nutch/trunk/src/plugin/urlfilter-domainblacklist/src/java/org/apache/nutch/urlfilter/domainblacklist/DomainBlacklistURLFilter.java

                

[jira] [Commented] (NUTCH-1305) Domain(blacklist)URLFilter to trim entries

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13225994#comment-13225994 ] 

Hudson commented on NUTCH-1305:
-------------------------------

Integrated in Nutch-trunk #1781 (See [https://builds.apache.org/job/Nutch-trunk/1781/])
    NUTCH-1305 Domain(blacklist)URLFilter to trim entries (Revision 1298394)

     Result = SUCCESS
markus : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1298394
Files : 
* /nutch/trunk/src/plugin/urlfilter-domainblacklist/src/java/org/apache/nutch/urlfilter/domainblacklist/DomainBlacklistURLFilter.java

                

[jira] [Commented] (NUTCH-1305) Domain(blacklist)URLFilter to trim entries

Posted by "Lewis John McGibbney (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13225206#comment-13225206 ] 

Lewis John McGibbney commented on NUTCH-1305:
---------------------------------------------

+1
                

[jira] [Commented] (NUTCH-1305) Domain(blacklist)URLFilter to trim entries

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13239174#comment-13239174 ] 

Hudson commented on NUTCH-1305:
-------------------------------

Integrated in Nutch-trunk #1799 (See [https://builds.apache.org/job/Nutch-trunk/1799/])
    NUTCH-1305 DomainFilter missing (Revision 1305381)

     Result = SUCCESS
markus : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1305381
Files : 
* /nutch/trunk/src/plugin/urlfilter-domain/src/java/org/apache/nutch/urlfilter/domain/DomainURLFilter.java

                

[jira] [Commented] (NUTCH-1305) Domain(blacklist)URLFilter to trim entries

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13230903#comment-13230903 ] 

Hudson commented on NUTCH-1305:
-------------------------------

Integrated in Nutch-trunk #1788 (See [https://builds.apache.org/job/Nutch-trunk/1788/])
    NUTCH-1305 missing in CHANGES (Revision 1300871)

     Result = SUCCESS
markus : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1300871
Files : 
* /nutch/trunk/CHANGES.txt

                

[jira] [Commented] (NUTCH-1305) Domain(blacklist)URLFilter to trim entries

Posted by "Hudson (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13238510#comment-13238510 ] 

Hudson commented on NUTCH-1305:
-------------------------------

Integrated in nutch-trunk-maven #213 (See [https://builds.apache.org/job/nutch-trunk-maven/213/])
    NUTCH-1305 DomainFilter missing (Revision 1305381)

     Result = FAILURE
markus : 
Files : 
* /nutch/trunk/src/plugin/urlfilter-domain/src/java/org/apache/nutch/urlfilter/domain/DomainURLFilter.java

                

[jira] [Commented] (NUTCH-1305) Domain(blacklist)URLFilter to trim entries

Posted by "Markus Jelsma (Commented) (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13225209#comment-13225209 ] 

Markus Jelsma commented on NUTCH-1305:
--------------------------------------

Thanks Lewis.
                