Posted to user@spark.apache.org by Monish R <mo...@gmail.com> on 2020/02/22 00:48:46 UTC
PowerIterationClustering
Hi guys,
I am new to MLlib and am trying out PowerIterationClustering, following the
example below:
https://github.com/apache/spark/blob/master/examples/src/main/java/org/apache/spark/examples/mllib/JavaPowerIterationClusteringExample.java
I am having trouble understanding how the output is produced.
For instance, if I change the input as shown below, I would like to
understand how the algorithm arrived at grouping 0 and 2 together, while
keeping the rest in another cluster.
k = 2
Input:
new Tuple3<>(0L, 1L, 0.9),
new Tuple3<>(1L, 2L, 0.7),
new Tuple3<>(2L, 3L, 0.3),
new Tuple3<>(3L, 4L, 0.5),
new Tuple3<>(4L, 5L, 0.2)));
Output (vertex id -> cluster id):
4 -> 0
0 -> 1
1 -> 0
3 -> 0
5 -> 0
2 -> 1
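For reference, the core idea behind power iteration clustering can be sketched in plain Java as below. This is only a rough illustration under stated assumptions, not Spark's implementation: the class name PicSketch and all of its methods are hypothetical, the edges are treated as undirected, the start vector is degree-based, and the iteration count of 10 is arbitrary. The values in the resulting vector would then be split into k groups (for example with k-means on the one-dimensional values), so the starting vector and the k-means initialization can both affect the final assignment.

```java
import java.util.Arrays;

// Illustrative sketch only; NOT Spark's implementation. All names are hypothetical.
class PicSketch {
    // Edges from the post: {src, dst, similarity}.
    static final double[][] EDGES = {
        {0, 1, 0.9}, {1, 2, 0.7}, {2, 3, 0.3}, {3, 4, 0.5}, {4, 5, 0.2}
    };

    // Build a symmetric affinity matrix A (assumption: edges are undirected).
    static double[][] affinity(int n, double[][] edges) {
        double[][] a = new double[n][n];
        for (double[] e : edges) {
            int i = (int) e[0], j = (int) e[1];
            a[i][j] = e[2];
            a[j][i] = e[2];
        }
        return a;
    }

    // Row-normalize: W = D^-1 A, so every non-empty row sums to 1.
    static double[][] rowNormalize(double[][] a) {
        int n = a.length;
        double[][] w = new double[n][n];
        for (int i = 0; i < n; i++) {
            double d = Arrays.stream(a[i]).sum();
            for (int j = 0; j < n; j++) {
                w[i][j] = (d == 0) ? 0 : a[i][j] / d;
            }
        }
        return w;
    }

    // Degree-based start vector: v0[i] = degree(i) / sum of all degrees.
    static double[] degreeInit(double[][] a) {
        double[] v = new double[a.length];
        double total = 0;
        for (int i = 0; i < a.length; i++) {
            v[i] = Arrays.stream(a[i]).sum();
            total += v[i];
        }
        for (int i = 0; i < v.length; i++) v[i] /= total;
        return v;
    }

    // One power-iteration step: v <- W v, rescaled to unit L1 norm.
    static double[] step(double[][] w, double[] v) {
        int n = v.length;
        double[] next = new double[n];
        double norm = 0;
        for (int i = 0; i < n; i++) {
            for (int j = 0; j < n; j++) next[i] += w[i][j] * v[j];
            norm += Math.abs(next[i]);
        }
        for (int i = 0; i < n; i++) next[i] /= norm;
        return next;
    }

    public static void main(String[] args) {
        double[][] a = affinity(6, EDGES);
        double[][] w = rowNormalize(a);
        double[] v = degreeInit(a);
        for (int iter = 0; iter < 10; iter++) v = step(w, v); // arbitrary count
        // One value per vertex; vertices with similar values would end up together.
        System.out.println(Arrays.toString(v));
    }
}
```

Printing the vector after a few iterations shows which vertices have drifted toward similar values, which is what the final clustering step groups on.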
Kindly guide me if you have any information on using the algorithm, or point
me to some materials on this topic that are suitable for beginners.
Regards.