You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@mahout.apache.org by "Emin Aksehirli (JIRA)" <ji...@apache.org> on 2014/04/15 11:17:24 UTC
[jira] [Commented] (MAHOUT-1355) Frequent Pattern Mining algorithms
for Mahout
[ https://issues.apache.org/jira/browse/MAHOUT-1355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969361#comment-13969361 ]
Emin Aksehirli commented on MAHOUT-1355:
----------------------------------------
Hello, yes, it doesn't make sense to add more MR Code. Maybe we can port this code to the new platform. What will be the base platform for Mahout? Hadoop 2 + YARN?
> Frequent Pattern Mining algorithms for Mahout
> ---------------------------------------------
>
> Key: MAHOUT-1355
> URL: https://issues.apache.org/jira/browse/MAHOUT-1355
> Project: Mahout
> Issue Type: New Feature
> Components: Frequent Itemset/Association Rule Mining
> Affects Versions: 0.9
> Reporter: Sandy Moens
> Priority: Minor
> Attachments: MAHOUT-1355.patch, MAHOUT-1355_V2.patch
>
> Original Estimate: 0h
> Remaining Estimate: 0h
>
> We implemented frequent pattern mining algorithms for Hadoop and adapted them to Mahout. We used "PFP" (now deprecated) as a benchmark and these implementations perform better in terms of speed and memory footprint. The details of the implementations can be found in the paper Frequent Pattern Mining for BigData ( http://adrem.ua.ac.be/bigfim )
> We have been maintaining the project for a while in GitLab ( https://gitlab.com/adrem/bigfim ). Documentation for adaptation ( Readme-Mahout.md ) and usage in mahout ( Mahout-wiki.md ) can be found there.
> We are open to any modification and/or improvement requests to make it more worthwhile for the Mahout project. We, as the research group, volunteer to maintain FPM algorithms as well.
--
This message was sent by Atlassian JIRA
(v6.2#6252)