Posted to issues@madlib.apache.org by "Frank McQuillan (JIRA)" <ji...@apache.org> on 2016/10/28 00:20:58 UTC

[jira] [Created] (MADLIB-1031) Improve performance of Apriori

Frank McQuillan created MADLIB-1031:
---------------------------------------

             Summary: Improve performance of Apriori
                 Key: MADLIB-1031
                 URL: https://issues.apache.org/jira/browse/MADLIB-1031
             Project: Apache MADlib
          Issue Type: Improvement
          Components: Module: Association Rules
            Reporter: Frank McQuillan


Two potential improvements:

1) Limit itemset size: see the maxlen parameter on p. 10 of
https://cran.r-project.org/web/packages/arules/arules.pdf
as an example.

2) Something like a WHERE clause for the LHS and RHS to reduce execution time. The filtered transactions must still be counted when computing support and confidence. (That is, you can't filter them out ahead of time, because that would skew support and confidence.)
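
A rough sketch of how the two ideas might behave, written in plain Python rather than against the MADlib API. The names frequent_itemsets, rules, max_len, lhs_allowed and rhs_allowed are hypothetical and chosen only for illustration. The point is that the LHS/RHS restriction is applied to the candidate rules, while support and confidence are still counted over every transaction.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support, max_len=None):
    """Level-wise Apriori that stops growing itemsets at max_len (hypothetical name)."""
    tsets = [frozenset(t) for t in transactions]
    n = len(tsets)
    current = [frozenset([i]) for i in sorted({i for t in tsets for i in t})]
    result = {}
    k = 1
    while current and (max_len is None or k <= max_len):
        # Support is always counted over ALL transactions.
        level = {}
        for cand in current:
            support = sum(1 for t in tsets if cand <= t) / n
            if support >= min_support:
                level[cand] = support
        result.update(level)
        # Join step: build (k+1)-item candidates from frequent k-itemsets.
        current = list({a | b for a, b in combinations(list(level), 2)
                        if len(a | b) == k + 1})
        k += 1
    return result


def rules(itemsets, min_confidence, lhs_allowed=None, rhs_allowed=None):
    """Generate rules, keeping only those whose sides fit the allowed item sets."""
    out = []
    for itemset, support in itemsets.items():
        for r in range(1, len(itemset)):
            for lhs in map(frozenset, combinations(sorted(itemset), r)):
                rhs = itemset - lhs
                # The "WHERE clause": restrict rules, not transactions.
                if lhs_allowed is not None and not lhs <= lhs_allowed:
                    continue
                if rhs_allowed is not None and not rhs <= rhs_allowed:
                    continue
                confidence = support / itemsets[lhs]
                if confidence >= min_confidence:
                    out.append((lhs, rhs, support, confidence))
    return out


transactions = [
    ["bread", "milk"],
    ["bread", "diapers", "beer"],
    ["milk", "diapers", "beer"],
    ["bread", "milk", "diapers"],
    ["bread", "milk", "beer"],
]
itemsets = frequent_itemsets(transactions, min_support=0.4, max_len=2)
beer_rules = rules(itemsets, min_confidence=0.6, rhs_allowed=frozenset(["beer"]))
for lhs, rhs, support, confidence in beer_rules:
    print(sorted(lhs), "=>", sorted(rhs), support, round(confidence, 3))
```

In this toy data, dropping the transactions without beer before mining would leave three transactions instead of five, so the support of {diapers, beer} would become 2/3 rather than 2/5. This sketch only filters at rule-generation time; an actual speedup would presumably need the constraint pushed into candidate generation as well.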



