Posted to dev@nutch.apache.org by "Markus Jelsma (JIRA)" <ji...@apache.org> on 2011/07/25 22:58:09 UTC

[jira] [Created] (NUTCH-1069) readlinkdb throws exception

readlinkdb throws exception
---------------------------

                 Key: NUTCH-1069
                 URL: https://issues.apache.org/jira/browse/NUTCH-1069
             Project: Nutch
          Issue Type: Bug
    Affects Versions: 1.4
            Reporter: Markus Jelsma
            Assignee: Markus Jelsma
             Fix For: 1.4, 2.0


Reading the linkdb doesn't work on Hadoop 0.20+. The reader tries to read data from the _SUCCESS file that newer Hadoop versions write to the output directory.

A quick fix is to remove the _SUCCESS file.
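
The workaround of removing the marker can be sketched as follows. This is only an illustration under stated assumptions: it assumes the linkdb sits on the local filesystem (on HDFS the Hadoop filesystem tools would be used instead), and the class and method names are hypothetical, not part of Nutch.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Stream;

// Hypothetical helper, not part of Nutch or Hadoop.
public class SuccessMarkerCleaner {
    // Name of the marker file that newer Hadoop versions write on job success.
    public static final String MARKER = "_SUCCESS";

    /** Deletes every regular file named _SUCCESS under root and returns the deleted paths. */
    public static List<Path> removeMarkers(Path root) throws IOException {
        List<Path> markers = new ArrayList<>();
        // Collect first, delete afterwards, so the directory walk is not disturbed.
        try (Stream<Path> walk = Files.walk(root)) {
            walk.filter(Files::isRegularFile)
                .filter(p -> p.getFileName().toString().equals(MARKER))
                .forEach(markers::add);
        }
        for (Path marker : markers) {
            Files.delete(marker);
        }
        return markers;
    }

    public static void main(String[] args) throws IOException {
        // Assumption: the first argument is the path to a local linkdb directory.
        Path root = Paths.get(args.length > 0 ? args[0] : ".");
        for (Path removed : removeMarkers(root)) {
            System.out.println("removed " + removed);
        }
    }
}
```

Data files and part directories are left untouched; only files whose name is exactly _SUCCESS are deleted.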

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (NUTCH-1069) Readlinkdb broken on Hadoop > 0.20

Posted by "Markus Jelsma (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-1069:
---------------------------------

      Component/s: linkdb
         Priority: Minor  (was: Major)
       Patch Info: [Patch Available]
    Fix Version/s:     (was: 2.0)
          Summary: Readlinkdb broken on Hadoop > 0.20  (was: readlinkdb throws exception)

> Readlinkdb broken on Hadoop > 0.20
> ----------------------------------
>
>                 Key: NUTCH-1069
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1069
>             Project: Nutch
>          Issue Type: Bug
>          Components: linkdb
>    Affects Versions: 1.4
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Minor
>             Fix For: 1.4
>
>         Attachments: NUTCH-1069-1.4-1.patch
>
>
> Reading the linkdb doesn't work on Hadoop 0.20+. The reader tries to read data from the _SUCCESS file that newer Hadoop versions write to the output directory.
> A quick fix is to remove the _SUCCESS file.


        

[jira] [Closed] (NUTCH-1069) Readlinkdb broken on Hadoop > 0.20

Posted by "Markus Jelsma (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma closed NUTCH-1069.
--------------------------------




        

[jira] [Resolved] (NUTCH-1069) Readlinkdb broken on Hadoop > 0.20

Posted by "Markus Jelsma (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma resolved NUTCH-1069.
----------------------------------

    Resolution: Fixed

Tested, confirmed and committed for 1.4 in rev. 1156665.



        

[jira] [Updated] (NUTCH-1069) readlinkdb throws exception

Posted by "Markus Jelsma (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1069?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Markus Jelsma updated NUTCH-1069:
---------------------------------

    Attachment: NUTCH-1069-1.4-1.patch

Fix to prevent creation of the _SUCCESS file in the LinkDB merge phase.
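
The patch itself is not reproduced in this thread. One common way to suppress the marker in Hadoop is to set the job property mapreduce.fileoutputcommitter.marksuccessfuljobs to false; whether the attached patch does exactly this is an assumption. The sketch below uses java.util.Properties as a stand-in for a Hadoop job configuration so that it runs without Hadoop on the classpath; the class and method names are hypothetical.

```java
import java.util.Properties;

// Hypothetical illustration; java.util.Properties stands in for a Hadoop JobConf.
public class DisableSuccessMarker {
    // Hadoop property consulted by FileOutputCommitter before writing _SUCCESS.
    public static final String MARK_SUCCESSFUL_JOBS =
        "mapreduce.fileoutputcommitter.marksuccessfuljobs";

    /** Returns a copy of the given settings with the _SUCCESS marker switched off. */
    public static Properties withoutSuccessMarker(Properties base) {
        Properties copy = new Properties();
        copy.putAll(base);
        copy.setProperty(MARK_SUCCESSFUL_JOBS, "false");
        return copy;
    }

    public static void main(String[] args) {
        Properties job = withoutSuccessMarker(new Properties());
        // With Hadoop available, the equivalent call on a JobConf would be
        // job.setBoolean(MARK_SUCCESSFUL_JOBS, false).
        System.out.println(MARK_SUCCESSFUL_JOBS + "=" + job.getProperty(MARK_SUCCESSFUL_JOBS));
    }
}
```

Setting the property only on the merge job leaves the marker behaviour of other jobs unchanged.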

> readlinkdb throws exception
> ---------------------------
>
>                 Key: NUTCH-1069
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1069
>             Project: Nutch
>          Issue Type: Bug
>          Components: linkdb
>    Affects Versions: 1.4
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>             Fix For: 1.4
>
>         Attachments: NUTCH-1069-1.4-1.patch
>
>
> Reading the linkdb doesn't work on Hadoop 0.20+. The reader tries to read data from the _SUCCESS file that newer Hadoop versions write to the output directory.
> A quick fix is to remove the _SUCCESS file.
