Posted to dev@mahout.apache.org by "Emin Aksehirli (JIRA)" <ji...@apache.org> on 2014/04/15 11:17:24 UTC

[jira] [Commented] (MAHOUT-1355) Frequent Pattern Mining algorithms for Mahout

    [ https://issues.apache.org/jira/browse/MAHOUT-1355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969361#comment-13969361 ] 

Emin Aksehirli commented on MAHOUT-1355:
----------------------------------------

Hello, yes, it doesn't make sense to add more MapReduce code. Maybe we can port this code to the new platform. What will be the base platform for Mahout? Hadoop 2 + YARN?

> Frequent Pattern Mining algorithms for Mahout
> ---------------------------------------------
>
>                 Key: MAHOUT-1355
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1355
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Frequent Itemset/Association Rule Mining
>    Affects Versions: 0.9
>            Reporter: Sandy Moens
>            Priority: Minor
>         Attachments: MAHOUT-1355.patch, MAHOUT-1355_V2.patch
>
>   Original Estimate: 0h
>  Remaining Estimate: 0h
>
> We implemented frequent pattern mining algorithms for Hadoop and adapted them to Mahout. We used "PFP" (now deprecated) as a benchmark, and these implementations perform better in terms of speed and memory footprint. The details of the implementations can be found in the paper Frequent Pattern Mining for BigData ( http://adrem.ua.ac.be/bigfim ).
> We have been maintaining the project for a while on GitLab ( https://gitlab.com/adrem/bigfim ). Documentation for the adaptation ( Readme-Mahout.md ) and usage in Mahout ( Mahout-wiki.md ) can be found there.
> We are open to any modification and/or improvement requests to make it more worthwhile for the Mahout project. We, as the research group, volunteer to maintain FPM algorithms as well.



--
This message was sent by Atlassian JIRA
(v6.2#6252)