You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Tejas Patil (JIRA)" <ji...@apache.org> on 2013/05/18 06:05:15 UTC

[jira] [Comment Edited] (NUTCH-1566) bin/nutch to allow whitespace in paths

    [ https://issues.apache.org/jira/browse/NUTCH-1566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13661264#comment-13661264 ] 

Tejas Patil edited comment on NUTCH-1566 at 5/18/13 4:05 AM:
-------------------------------------------------------------

Hi Seb,
I tried the patch over a windows machine with cygwin and it worked (I have not ran all possible scenarios exhaustively...just tried few).

One minor suggestion:
With the current patch, I see this error message (on cygwin console) while running nutch in local mode: 
{noformat}cygpath: can't convert empty path{noformat}

I figured out the responsible place (line 115) in the nutch script:
{noformat}NUTCH_JOB=`cygpath -p -w "$NUTCH_JOB"`{noformat}

As the NUTCH_JOB value is empty while running in local mode, it gave that error message. The if case for adjusting NUTCH_JOB at lines 113-116 in [nutch script|http://svn.apache.org/viewvc/nutch/trunk/src/bin/nutch?view=markup] could be moved in the block just above it to address that. What say ?
                
      was (Author: tejasp):
    Hi Seb,
I tried the patch over a windows machine with cygwin and it worked (I have not ran all possible scenarios exhaustively...just tried few).

One minor suggestion:
With the current patch, I see this 
{noformat}cygpath: can't convert empty path{noformat}

I figured out the responsible place (line 115) in the nutch script:
{noformat}NUTCH_JOB=`cygpath -p -w "$NUTCH_JOB"`{noformat}

As the NUTCH_JOB value is empty while running in local mode, it gave that error message. The if case for adjusting NUTCH_JOB at lines 113-116 in [nutch script|http://svn.apache.org/viewvc/nutch/trunk/src/bin/nutch?view=markup] could be moved in the block just above it to address that. What say ?
                  
> bin/nutch to allow whitespace in paths
> --------------------------------------
>
>                 Key: NUTCH-1566
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1566
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 1.6, 2.1
>            Reporter: Sebastian Nagel
>            Priority: Minor
>             Fix For: 1.7, 2.3
>
>         Attachments: NUTCH-1566-trunk.patch
>
>
> bin/nutch and bin/crawl choke if a path contains white space, eg, if JAVA_HOME is "{{C:\Program Files\jdk}}". If you don't have the permission to change the path it is impossible to run Nutch. This has been reported frequently ([1|http://stackoverflow.com/questions/9345629/nutch-cygwin-how-to-set-java-home], [2|http://lucene.472066.n3.nabble.com/Problem-running-Nutch-on-Win-7-Cygwin-td3487163.html], and [3|http://nutchinstall.blogspot.de/2007/07/setting-up-cygwin-and-nutch.html]), see also NUTCH-19.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira