Posted to user@spark.apache.org by matd <ma...@gmail.com> on 2016/01/06 19:28:55 UTC

fp growth - clean up repetitions in input

Hi folks,

I'm interested in using FP-growth to identify sequence patterns.

Unfortunately, my input sequences contain cycles, i.e. the same items
appear more than once:
...1,2,4,1,2,5...

This is not supported by FP-growth. I get a SparkException:
"Items in a transaction must be unique but got WrappedArray"

Do you know of a way to identify and clean up these cycles before passing
the sequences to FP-growth?
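
To make the question concrete, here is a minimal sketch of the simplest
clean-up I can think of: drop repeated items from each sequence and keep only
the first occurrence. The function name dedupe_preserving_order and the sample
data are just placeholders for illustration, and this obviously throws away
the information about the repetition itself, which may or may not be
acceptable.

```python
def dedupe_preserving_order(transaction):
    """Return the items of `transaction` with repeats dropped.

    Only the first occurrence of each item is kept, so the relative
    order of first appearances is preserved.
    """
    seen = set()
    unique = []
    for item in transaction:
        if item not in seen:
            seen.add(item)
            unique.append(item)
    return unique


# Hypothetical sample input; the first sequence is the one from this post.
sequences = [[1, 2, 4, 1, 2, 5], [3, 3, 7]]
cleaned = [dedupe_preserving_order(s) for s in sequences]
print(cleaned)  # [[1, 2, 4, 5], [3, 7]]
```

With an RDD of sequences, I assume the same function could be applied via
rdd.map(dedupe_preserving_order) before the data is handed to FP-growth, but I
have not verified that this is the recommended approach.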

Thanks for your input.
Mat



