Posted to dev@nutch.apache.org by "Ferdy Galema (JIRA)" <ji...@apache.org> on 2012/05/10 09:54:54 UTC

[jira] [Created] (NUTCH-1365) Fix crawlId functionality by making use of new Gora configuration

Ferdy Galema created NUTCH-1365:
-----------------------------------

             Summary: Fix crawlId functionality by making use of new Gora configuration
                 Key: NUTCH-1365
                 URL: https://issues.apache.org/jira/browse/NUTCH-1365
             Project: Nutch
          Issue Type: Bug
            Reporter: Ferdy Galema
             Fix For: 2.1


With GORA-126 it is finally possible to use crawlId correctly throughout Nutch. This patch changes StorageUtils so that the preferred schema name (crawlId + "_" + schema) is correctly set on Gora.
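
For illustration, the naming rule described above could be sketched roughly as follows. This is not the code from the attached patch: a plain Map stands in for the Hadoop Configuration, the class and method names are hypothetical, and the two key names as well as the fallback to the bare schema name when no crawlId is set are assumptions.

```java
import java.util.HashMap;
import java.util.Map;

/** Sketch only: a plain Map stands in for the Hadoop Configuration that Nutch uses. */
class SchemaNameSketch {

    // Assumed key names, chosen for illustration; check the patch for the real ones.
    static final String CRAWL_ID_KEY = "storage.crawl.id";
    static final String PREFERRED_SCHEMA_KEY = "preferred.schema.name";

    /** Builds crawlId + "_" + schema; assumes the bare schema is used when crawlId is empty. */
    static String preferredSchemaName(String crawlId, String schema) {
        if (crawlId == null || crawlId.isEmpty()) {
            return schema;
        }
        return crawlId + "_" + schema;
    }

    /** Stores the preferred schema name in the configuration so the store layer can read it. */
    static void applyPreferredSchema(Map<String, String> conf, String schema) {
        String crawlId = conf.getOrDefault(CRAWL_ID_KEY, "");
        conf.put(PREFERRED_SCHEMA_KEY, preferredSchemaName(crawlId, schema));
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        conf.put(CRAWL_ID_KEY, "testcrawl");
        applyPreferredSchema(conf, "webpage");
        System.out.println(conf.get(PREFERRED_SCHEMA_KEY)); // prints "testcrawl_webpage"
    }
}
```

With this rule, two crawls with different crawlIds would end up in separate tables (for example testcrawl_webpage and othercrawl_webpage) instead of sharing one.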

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Closed] (NUTCH-1365) Fix crawlId functionality by making use of new Gora configuration

Posted by "Ferdy Galema (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ferdy Galema closed NUTCH-1365.
-------------------------------

    Resolution: Fixed

Committed.
                
> Fix crawlId functionality by making use of new Gora configuration
> -----------------------------------------------------------------
>
>                 Key: NUTCH-1365
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1365
>             Project: Nutch
>          Issue Type: Bug
>            Reporter: Ferdy Galema
>             Fix For: 2.1
>
>         Attachments: NUTCH-1365.patch, NUTCH-1365-v2.patch, NUTCH-1365-v3.patch, NUTCH-1365-v4.patch
>
>
> With GORA-126 it is finally possible to use crawlId correctly throughout Nutch. This patch changes StorageUtils so that the preferred schema name (crawlId + "_" + schema) is correctly set on Gora.


        

[jira] [Updated] (NUTCH-1365) Fix crawlId functionality by making use of new Gora configuration

Posted by "Ferdy Galema (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ferdy Galema updated NUTCH-1365:
--------------------------------

    Attachment: NUTCH-1365-v4.patch

New patch fixes crawlId functionality for HostInjectorJob too.
                

        

[jira] [Updated] (NUTCH-1365) Fix crawlId functionality by making use of new Gora configuration

Posted by "Ferdy Galema (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ferdy Galema updated NUTCH-1365:
--------------------------------

    Attachment: NUTCH-1365-v3.patch

Small improvement to the patch: the crawlId is now shown in the jobName.
                

        

[jira] [Updated] (NUTCH-1365) Fix crawlId functionality by making use of new Gora configuration

Posted by "Ferdy Galema (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ferdy Galema updated NUTCH-1365:
--------------------------------

    Attachment: NUTCH-1365-v2.patch

Updated patch for new version of GORA-150.
                

        

[jira] [Updated] (NUTCH-1365) Fix crawlId functionality by making use of new Gora configuration

Posted by "Ferdy Galema (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ferdy Galema updated NUTCH-1365:
--------------------------------

    Attachment: NUTCH-1365.patch

The updated patch (needed because the corresponding Gora issue was split up).
                

        

[jira] [Updated] (NUTCH-1365) Fix crawlId functionality by making use of new Gora configuration

Posted by "Ferdy Galema (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ferdy Galema updated NUTCH-1365:
--------------------------------

    Attachment:     (was: NUTCH-1365.patch)
    

        

[jira] [Updated] (NUTCH-1365) Fix crawlId functionality by making use of new Gora configuration

Posted by "Ferdy Galema (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ferdy Galema updated NUTCH-1365:
--------------------------------

    Attachment: NUTCH-1365.patch
    

        

[jira] [Commented] (NUTCH-1365) Fix crawlId functionality by making use of new Gora configuration

Posted by "Ferdy Galema (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13417120#comment-13417120 ] 

Ferdy Galema commented on NUTCH-1365:
-------------------------------------

When we update Gora to 0.3, we can commit this.
                

        

[jira] [Commented] (NUTCH-1365) Fix crawlId functionality by making use of new Gora configuration

Posted by "Ferdy Galema (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13427226#comment-13427226 ] 

Ferdy Galema commented on NUTCH-1365:
-------------------------------------

Nutch should be updated to Gora 0.2.1.
                