Posted to dev@mahout.apache.org by "Sandy Moens (JIRA)" <ji...@apache.org> on 2013/11/13 17:59:25 UTC

[jira] [Created] (MAHOUT-1355) Frequent Pattern Mining algorithms for Mahout

Sandy Moens created MAHOUT-1355:
-----------------------------------

             Summary: Frequent Pattern Mining algorithms for Mahout
                 Key: MAHOUT-1355
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1355
             Project: Mahout
          Issue Type: New Feature
          Components: Frequent Itemset/Association Rule Mining
    Affects Versions: 0.9
            Reporter: Sandy Moens
            Priority: Minor
             Fix For: 1.0


We implemented frequent pattern mining algorithms for Hadoop and adapted them to Mahout. Benchmarked against "PFP" (now deprecated), these implementations are faster and have a smaller memory footprint. Details of the implementations can be found in the paper Frequent Pattern Mining for BigData ( http://adrem.ua.ac.be/bigfim ).

We have been maintaining the project on GitLab for a while ( https://gitlab.com/adrem/bigfim ). Documentation on the adaptation ( Readme-Mahout.md ) and on usage in Mahout ( Mahout-wiki.md ) can be found there.

We are open to any modification or improvement requests that would make this contribution more worthwhile for the Mahout project. As a research group, we also volunteer to maintain the FPM algorithms.



--
This message was sent by Atlassian JIRA
(v6.1#6144)