You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@spark.apache.org by Kybe67 <be...@gmail.com> on 2015/10/15 16:58:54 UTC

MLlib Contribution

Hi, i made a clustering algorithm in Scala/Spark during my internship, i
would like to contribute to MLlib, but i don't know how, i do my best to
follow this instructions :

https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark#ContributingtoSpark-MLlib-specificContributionGuidelines
https://cwiki.apache.org/confluence/display/SPARK/Spark+Code+Style+Guide

The algorithm is the Mean Shift. It works fine on multivariate
muldimensional datasets, especially on image. I think some works should be
done but i don't know what i should do.

Thank you for your support and for the amazing Spark project.



--
View this message in context: http://apache-spark-developers-list.1001551.n3.nabble.com/MLlib-Contribution-tp14626.html
Sent from the Apache Spark Developers List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@spark.apache.org
For additional commands, e-mail: dev-help@spark.apache.org

Re: MLlib Contribution

Posted by Joseph Bradley <jo...@databricks.com>.

Hi, it'd be great to share your implementation with the community.  I'd
recommend:

(1) Share it immediately by creating a Spark package:
http://spark-packages.org/

You can use this helper package to create your own:
http://spark-packages.org/package/databricks/sbt-spark-package
After you create and test it, I'd recommend emailing the user list to
announce it and see if others have feedback.

(2) If you'd like to get it into MLlib itself, I'd recommend creating a
JIRA first to discuss the algorithm, its design, and a timeline/priority
for getting it into Spark.  (I'd say it looks useful, but it's hard to
guess the use cases & priority within the whole community.  We'd want to
collect user feedback on JIRA, the mailing list, and/or your Spark Package
in order to gauge use cases and priority.)

Thanks!
Joseph

On Thu, Oct 15, 2015 at 7:58 AM, Kybe67 <be...@gmail.com> wrote:

> Hi, i made a clustering algorithm in Scala/Spark during my internship, i
> would like to contribute to MLlib, but i don't know how, i do my best to
> follow this instructions :
>
>
> https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark#ContributingtoSpark-MLlib-specificContributionGuidelines
> https://cwiki.apache.org/confluence/display/SPARK/Spark+Code+Style+Guide
>
> The algorithm is the Mean Shift. It works fine on multivariate
> muldimensional datasets, especially on image. I think some works should be
> done but i don't know what i should do.
>
> Thank you for your support and for the amazing Spark project.
>
>
>
> --
> View this message in context:
> http://apache-spark-developers-list.1001551.n3.nabble.com/MLlib-Contribution-tp14626.html
> Sent from the Apache Spark Developers List mailing list archive at
> Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscribe@spark.apache.org
> For additional commands, e-mail: dev-help@spark.apache.org
>
>