You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@datafu.apache.org by "Russell Jurney (JIRA)" <ji...@apache.org> on 2014/08/05 12:15:14 UTC

[jira] [Updated] (DATAFU-65) Aho-Corasick Pig UDF

     [ https://issues.apache.org/jira/browse/DATAFU-65?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Russell Jurney updated DATAFU-65:
---------------------------------

    Attachment: DATAFU-65.diff

Working implementation of AhoCorasickMatch with options and working unit tests that test them.

> Aho-Corasick Pig UDF
> --------------------
>
>                 Key: DATAFU-65
>                 URL: https://issues.apache.org/jira/browse/DATAFU-65
>             Project: DataFu
>          Issue Type: Bug
>    Affects Versions: 1.3.0
>         Environment: Drought
>            Reporter: Russell Jurney
>             Fix For: 1.3.0
>
>         Attachments: DATAFU-65.diff
>
>   Original Estimate: 8h
>  Remaining Estimate: 8h
>
> I need to use the Aho-Corasick algorithm for efficient sub-string matching. A java implementation is available at https://github.com/robert-bor/aho-corasick and is available on maven central: http://maven-repository.com/artifact/org.arabidopsis.ahocorasick/ahocorasick/2.x A Pig UDF will be very helpful to me.
> How do I add a maven dependency with gradle?



--
This message was sent by Atlassian JIRA
(v6.2#6252)