You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@nutch.apache.org by sn...@apache.org on 2017/11/05 21:16:48 UTC

[nutch] branch master updated: NUTCH-2452 Allow nutch to retrieve Ftp URLs that contain UrlEncoded characters, closes #237

This is an automated email from the ASF dual-hosted git repository.

snagel pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/nutch.git


The following commit(s) were added to refs/heads/master by this push:
     new 517dbdf  NUTCH-2452 Allow nutch to retrieve Ftp URLs that contain UrlEncoded characters, closes #237
517dbdf is described below

commit 517dbdf3261d42e90883d07320b7991ff8e2bcf8
Author: Hiran Chaudhuri <hi...@mail.de>
AuthorDate: Sat Oct 28 01:15:22 2017 +0200

    NUTCH-2452 Allow nutch to retrieve Ftp URLs that contain UrlEncoded characters, closes #237
---
 .../src/java/org/apache/nutch/protocol/ftp/FtpResponse.java             | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/src/plugin/protocol-ftp/src/java/org/apache/nutch/protocol/ftp/FtpResponse.java b/src/plugin/protocol-ftp/src/java/org/apache/nutch/protocol/ftp/FtpResponse.java
index f7c7c6d..51bc6bf 100644
--- a/src/plugin/protocol-ftp/src/java/org/apache/nutch/protocol/ftp/FtpResponse.java
+++ b/src/plugin/protocol-ftp/src/java/org/apache/nutch/protocol/ftp/FtpResponse.java
@@ -252,6 +252,8 @@ public class FtpResponse {
       }
 
       this.content = null;
+      
+      path = java.net.URLDecoder.decode(path, "UTF-8");
 
       if (path.endsWith("/")) {
         getDirAsHttpResponse(path, datum.getModifiedTime());

-- 
To stop receiving notification emails like this one, please contact
['"commits@nutch.apache.org" <co...@nutch.apache.org>'].