You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Julien Nioche (JIRA)" <ji...@apache.org> on 2010/02/01 12:43:50 UTC

[jira] Created: (NUTCH-782) Ability to order htmlparsefilters

Ability to order htmlparsefilters
---------------------------------

                 Key: NUTCH-782
                 URL: https://issues.apache.org/jira/browse/NUTCH-782
             Project: Nutch
          Issue Type: New Feature
            Reporter: Julien Nioche
            Assignee: Julien Nioche
             Fix For: 1.1
         Attachments: NUTCH-782.patch

Patch which adds a new parameter 'htmlparsefilter.order' which specifies the order in which HTMLParse filters are applied. HTMLParse filter ordering MAY have an impact on end result, as some filters could rely on the metadata generated by a previous filter.



-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (NUTCH-782) Ability to order htmlparsefilters

Posted by "Julien Nioche (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Nioche updated NUTCH-782:
--------------------------------

    Attachment: NUTCH-782.patch

> Ability to order htmlparsefilters
> ---------------------------------
>
>                 Key: NUTCH-782
>                 URL: https://issues.apache.org/jira/browse/NUTCH-782
>             Project: Nutch
>          Issue Type: New Feature
>            Reporter: Julien Nioche
>            Assignee: Julien Nioche
>             Fix For: 1.1
>
>         Attachments: NUTCH-782.patch
>
>
> Patch which adds a new parameter 'htmlparsefilter.order' which specifies the order in which HTMLParse filters are applied. HTMLParse filter ordering MAY have an impact on end result, as some filters could rely on the metadata generated by a previous filter.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Closed: (NUTCH-782) Ability to order htmlparsefilters

Posted by "Julien Nioche (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Nioche closed NUTCH-782.
-------------------------------

    Resolution: Fixed

Committed revision 917557

> Ability to order htmlparsefilters
> ---------------------------------
>
>                 Key: NUTCH-782
>                 URL: https://issues.apache.org/jira/browse/NUTCH-782
>             Project: Nutch
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Julien Nioche
>            Assignee: Julien Nioche
>             Fix For: 1.1
>
>         Attachments: NUTCH-782.patch
>
>
> Patch which adds a new parameter 'htmlparsefilter.order' which specifies the order in which HTMLParse filters are applied. HTMLParse filter ordering MAY have an impact on end result, as some filters could rely on the metadata generated by a previous filter.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Commented: (NUTCH-782) Ability to order htmlparsefilters

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/NUTCH-782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12840002#action_12840002 ] 

Hudson commented on NUTCH-782:
------------------------------

Integrated in Nutch-trunk #1083 (See [http://hudson.zones.apache.org/hudson/job/Nutch-trunk/1083/])
    : Ability to order htmlparsefilters


> Ability to order htmlparsefilters
> ---------------------------------
>
>                 Key: NUTCH-782
>                 URL: https://issues.apache.org/jira/browse/NUTCH-782
>             Project: Nutch
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Julien Nioche
>            Assignee: Julien Nioche
>             Fix For: 1.1
>
>         Attachments: NUTCH-782.patch
>
>
> Patch which adds a new parameter 'htmlparsefilter.order' which specifies the order in which HTMLParse filters are applied. HTMLParse filter ordering MAY have an impact on end result, as some filters could rely on the metadata generated by a previous filter.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (NUTCH-782) Ability to order htmlparsefilters

Posted by "Julien Nioche (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/NUTCH-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Julien Nioche updated NUTCH-782:
--------------------------------

    Component/s: parser

> Ability to order htmlparsefilters
> ---------------------------------
>
>                 Key: NUTCH-782
>                 URL: https://issues.apache.org/jira/browse/NUTCH-782
>             Project: Nutch
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Julien Nioche
>            Assignee: Julien Nioche
>             Fix For: 1.1
>
>         Attachments: NUTCH-782.patch
>
>
> Patch which adds a new parameter 'htmlparsefilter.order' which specifies the order in which HTMLParse filters are applied. HTMLParse filter ordering MAY have an impact on end result, as some filters could rely on the metadata generated by a previous filter.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.