You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@ignite.apache.org by Alexey Zinoviev <za...@gmail.com> on 2019/02/12 11:14:40 UTC
[Discussion] [ML] Future of Pipelines
Hi, Igniters!
Currently, we have in Ignite ML next Pipeline API in alpha version via
PipelineMDL and Pipeline classes.
I'm going to finish Pipeline API for the next release 2.8 and need you help
in brainstorming
You could find an example in master here
<https://github.com/apache/ignite/blob/master/examples/src/main/java/org/apache/ignite/examples/ml/tutorial/Step_5_Scaling_with_Pipeline.java>
The snippet is added here
PipelineMdl<Integer, Object[]> mdl = new Pipeline<Integer, Object[],
Object[]>()
.addFeatureExtractor(featureExtractor)
.addLabelExtractor(lbExtractor)
.addPreprocessor(new EncoderTrainer<Integer, Object[]>()
.withEncoderType(EncoderType.STRING_ENCODER)
.withEncodedFeature(1)
.withEncodedFeature(6))
.addPreprocessor(new ImputerTrainer<Integer,
Object[]>())
.addPreprocessor(new MinMaxScalerTrainer<Integer,
Object[]>())
.addPreprocessor(new NormalizationTrainer<Integer,
Object[]>()
.withP(1))
.addTrainer(new DecisionTreeClassificationTrainer(5, 0))
.fit(ignite, dataCache);
If you have any experience with Pipeline APi in another ML framework as
scikit-learn, Spark, .NET ML and etc, please have a look at this API and
write any suggestion about possible features and use-cases
Feel free to add any suggestions or comments.
--
Sent from: http://apache-ignite-developers.2346864.n4.nabble.com/