You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Andrzej Bialecki (JIRA)" <ji...@apache.org> on 2007/12/27 14:08:44 UTC

[jira] Created: (NUTCH-595) "Target file:/.... already exists"

"Target file:/.... already exists"
----------------------------------

                 Key: NUTCH-595
                 URL: https://issues.apache.org/jira/browse/NUTCH-595
             Project: Nutch
          Issue Type: Bug
    Affects Versions: 1.0.0
         Environment: Cygwin, Win XP, LocalJobTracker and LocalFileSystem.
            Reporter: Andrzej Bialecki 


This is related to the upgrade to Hadoop 0.15.0. I'm unable to run any Hadoop jobs in local mode under Cygwin:

{noformat}
2007-12-27 13:54:24,468 WARN  mapred.LocalJobRunner - job_local_1
java.io.IOException: Target file:/c:/tmp/hadoop-abial/mapred/temp/inject-temp-19350068/_reduce_kmsua5/part-00000 already exists
        at org.apache.hadoop.fs.FileUtil.checkDest(FileUtil.java:246)
        at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:125)
        at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:116)
        at org.apache.hadoop.fs.RawLocalFileSystem.rename(RawLocalFileSystem.java:180)
        at org.apache.hadoop.fs.ChecksumFileSystem.rename(ChecksumFileSystem.java:394)
        at org.apache.hadoop.mapred.Task.moveTaskOutputs(Task.java:452)
        at org.apache.hadoop.mapred.Task.moveTaskOutputs(Task.java:469)
        at org.apache.hadoop.mapred.Task.saveTaskOutput(Task.java:426)
        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:165)
2007-12-27 13:54:24,843 FATAL crawl.Injector - Injector: java.io.IOException: Job failed!
        at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:831)
        at org.apache.nutch.crawl.Injector.inject(Injector.java:162)
        at org.apache.nutch.crawl.Injector.run(Injector.java:192)
        at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
        at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:54)
        at org.apache.nutch.crawl.Injector.main(Injector.java:182)
{noformat}

AFAIK this should be fixed in HADOOP-2228, which is a part of 0.15.2.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (NUTCH-595) "Target file:/.... already exists"

Posted by "Emmanuel Joke (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12554571 ] 

Emmanuel Joke commented on NUTCH-595:
-------------------------------------

I had a similar issue and i follow the instruction done by Dennis and it solved my pb.
http://www.nabble.com/File-Paths-2C-Hadoop--3E-3D-0.15-and-Local-Jobs-to13184356.html

Its just a workaround but at least you can run your crawler.

> "Target file:/.... already exists"
> ----------------------------------
>
>                 Key: NUTCH-595
>                 URL: https://issues.apache.org/jira/browse/NUTCH-595
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 1.0.0
>         Environment: Cygwin, Win XP, LocalJobTracker and LocalFileSystem.
>            Reporter: Andrzej Bialecki 
>
> This is related to the upgrade to Hadoop 0.15.0. I'm unable to run any Hadoop jobs in local mode under Cygwin:
> {noformat}
> 2007-12-27 13:54:24,468 WARN  mapred.LocalJobRunner - job_local_1
> java.io.IOException: Target file:/c:/tmp/hadoop-abial/mapred/temp/inject-temp-19350068/_reduce_kmsua5/part-00000 already exists
>         at org.apache.hadoop.fs.FileUtil.checkDest(FileUtil.java:246)
>         at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:125)
>         at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:116)
>         at org.apache.hadoop.fs.RawLocalFileSystem.rename(RawLocalFileSystem.java:180)
>         at org.apache.hadoop.fs.ChecksumFileSystem.rename(ChecksumFileSystem.java:394)
>         at org.apache.hadoop.mapred.Task.moveTaskOutputs(Task.java:452)
>         at org.apache.hadoop.mapred.Task.moveTaskOutputs(Task.java:469)
>         at org.apache.hadoop.mapred.Task.saveTaskOutput(Task.java:426)
>         at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:165)
> 2007-12-27 13:54:24,843 FATAL crawl.Injector - Injector: java.io.IOException: Job failed!
>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:831)
>         at org.apache.nutch.crawl.Injector.inject(Injector.java:162)
>         at org.apache.nutch.crawl.Injector.run(Injector.java:192)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>         at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:54)
>         at org.apache.nutch.crawl.Injector.main(Injector.java:182)
> {noformat}
> AFAIK this should be fixed in HADOOP-2228, which is a part of 0.15.2.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (NUTCH-595) "Target file:/.... already exists"

Posted by "armand rayman (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12560775#action_12560775 ] 

armand rayman commented on NUTCH-595:
-------------------------------------

You say this is only happening on Cygwin/Win XP but this is also a major problem in Linux too (Ubuntu) at least using the nightly build (around 15th Jan 2008 build)

> "Target file:/.... already exists"
> ----------------------------------
>
>                 Key: NUTCH-595
>                 URL: https://issues.apache.org/jira/browse/NUTCH-595
>             Project: Nutch
>          Issue Type: Bug
>    Affects Versions: 1.0.0
>         Environment: Cygwin, Win XP, LocalJobTracker and LocalFileSystem.
>            Reporter: Andrzej Bialecki 
>
> This is related to the upgrade to Hadoop 0.15.0. I'm unable to run any Hadoop jobs in local mode under Cygwin:
> {noformat}
> 2007-12-27 13:54:24,468 WARN  mapred.LocalJobRunner - job_local_1
> java.io.IOException: Target file:/c:/tmp/hadoop-abial/mapred/temp/inject-temp-19350068/_reduce_kmsua5/part-00000 already exists
>         at org.apache.hadoop.fs.FileUtil.checkDest(FileUtil.java:246)
>         at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:125)
>         at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:116)
>         at org.apache.hadoop.fs.RawLocalFileSystem.rename(RawLocalFileSystem.java:180)
>         at org.apache.hadoop.fs.ChecksumFileSystem.rename(ChecksumFileSystem.java:394)
>         at org.apache.hadoop.mapred.Task.moveTaskOutputs(Task.java:452)
>         at org.apache.hadoop.mapred.Task.moveTaskOutputs(Task.java:469)
>         at org.apache.hadoop.mapred.Task.saveTaskOutput(Task.java:426)
>         at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:165)
> 2007-12-27 13:54:24,843 FATAL crawl.Injector - Injector: java.io.IOException: Job failed!
>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:831)
>         at org.apache.nutch.crawl.Injector.inject(Injector.java:162)
>         at org.apache.nutch.crawl.Injector.run(Injector.java:192)
>         at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
>         at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:54)
>         at org.apache.nutch.crawl.Injector.main(Injector.java:182)
> {noformat}
> AFAIK this should be fixed in HADOOP-2228, which is a part of 0.15.2.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.