Posted to dev@nutch.apache.org by "Lewis John McGibbney (JIRA)" <ji...@apache.org> on 2013/03/30 23:49:15 UTC

[jira] [Updated] (NUTCH-1551) Improve WebTableReader field order and display batchId

     [ https://issues.apache.org/jira/browse/NUTCH-1551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lewis John McGibbney updated NUTCH-1551:
----------------------------------------

    Attachment: NUTCH-1551.patch

patch for 2.x HEAD
                
> Improve WebTableReader field order and display batchId
> ------------------------------------------------------
>
>                 Key: NUTCH-1551
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1551
>             Project: Nutch
>          Issue Type: Bug
>          Components: crawldb
>    Affects Versions: 2.1
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>             Fix For: 2.2
>
>         Attachments: NUTCH-1551.patch
>
>
> I've made slight modifications to WebTableReader so that it emits more appropriately structured fields when dumping the webdb. The field order now more closely reflects the layout of the webpage.avsc file.
> Additionally, I've added the batchId; however, for backwards compatibility with existing webdbs, it is only appended to the string buffer if it is not null.
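
The null-guarded append described above can be sketched roughly as follows. This is not the code from NUTCH-1551.patch: the class name, the formatRow helper, the label strings, and the choice of fields are all hypothetical, used only to illustrate how a batchId line could be left out for rows written before the field existed.

```java
// Illustrative sketch only; not taken from the attached patch.
// WebPageDumpSketch and formatRow are hypothetical names.
class WebPageDumpSketch {

    // Builds a text dump of one row. The fields and labels are assumptions;
    // the real reader dumps the fields defined in webpage.avsc.
    static String formatRow(String key, String baseUrl, String status,
                            CharSequence batchId) {
        StringBuilder sb = new StringBuilder();
        sb.append("key:\t").append(key).append('\n');
        sb.append("baseUrl:\t").append(baseUrl).append('\n');
        sb.append("status:\t").append(status).append('\n');
        // Rows from an older webdb may have no batchId, so the line is
        // only appended when a value is present.
        if (batchId != null) {
            sb.append("batchId:\t").append(batchId).append('\n');
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // A row with a batchId, then a row from an older webdb without one.
        System.out.print(formatRow("org.example:http/", "http://example.org/", "2", "123-456"));
        System.out.print(formatRow("org.example:http/old", "http://example.org/old", "2", null));
    }
}
```

With this guard, dumps of an existing webdb keep their previous shape, and only rows that actually carry a batchId gain the extra line.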

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira