Posted to dev@nutch.apache.org by "Markus Jelsma (Jira)" <ji...@apache.org> on 2024/02/06 11:14:00 UTC

[jira] [Commented] (NUTCH-3028) WARCExported to support filtering by JEXL

    [ https://issues.apache.org/jira/browse/NUTCH-3028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17814731#comment-17814731 ] 

Markus Jelsma commented on NUTCH-3028:
--------------------------------------

Any objections to this one before I get it in?

> WARCExported to support filtering by JEXL
> -----------------------------------------
>
>                 Key: NUTCH-3028
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3028
>             Project: Nutch
>          Issue Type: Improvement
>            Reporter: Markus Jelsma
>            Assignee: Markus Jelsma
>            Priority: Minor
>         Attachments: NUTCH-3028.patch
>
>
> Filtering segment data to WARC is now possible using JEXL expressions. In the following example, only records whose parseData metadata has SOME_KEY set to SOME_VALUE are exported to WARC.
> -expr 'parseData.getParseMeta().get("SOME_KEY").equals("SOME_VALUE")'
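
To illustrate what the expression above selects, here is a minimal plain-Java sketch of the same predicate. It does not use JEXL or any Nutch class: the class name ParseMetaFilterSketch, the helper methods, and the use of a Map<String, String> as a stand-in for the parse metadata are all assumptions made for illustration. It also treats a missing SOME_KEY as "no match"; how the real expression behaves when the key is absent depends on how the JEXL engine is configured, which is not stated in this issue.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for illustration only; not part of Nutch or the patch.
final class ParseMetaFilterSketch {

    // Same test as parseData.getParseMeta().get("SOME_KEY").equals("SOME_VALUE"),
    // written constant-first so a missing key yields false rather than a
    // call to equals() on null (an assumption about the desired behavior).
    static boolean matches(Map<String, String> parseMeta) {
        return "SOME_VALUE".equals(parseMeta.get("SOME_KEY"));
    }

    // Keeps only the records whose metadata satisfies the predicate,
    // preserving their original order.
    static List<Map<String, String>> filter(List<Map<String, String>> records) {
        List<Map<String, String>> kept = new ArrayList<>();
        for (Map<String, String> record : records) {
            if (matches(record)) {
                kept.add(record);
            }
        }
        return kept;
    }
}
```

Calling ParseMetaFilterSketch.filter on a list of metadata maps returns only those that contain SOME_KEY with the exact value SOME_VALUE; records with a different value or without the key are dropped.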



--
This message was sent by Atlassian Jira
(v8.20.10#820010)