You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Weihua Jiang (JIRA)" <ji...@apache.org> on 2018/12/07 09:43:00 UTC

[jira] [Created] (FLINK-11096) Create a new table based flink ML package

Weihua Jiang created FLINK-11096:
------------------------------------

             Summary: Create a new table based flink ML package
                 Key: FLINK-11096
                 URL: https://issues.apache.org/jira/browse/FLINK-11096
             Project: Flink
          Issue Type: Sub-task
          Components: Machine Learning Library, Table API &amp; SQL
            Reporter: Weihua Jiang


Currently,  the DataSet based ML library is under org.apache._flink.ml_ scala package and under _flink-libraries/flink-ml directory._

 

There are two questions related to packaging:
 # Shall we create a new scala/java package, e.g. org.apache.flink.table.ml? Or still stay in org.apache.flink.ml?
 # Shall we still put new code in flink-libraries/flink-ml directory or create a new one, e.g. flink-libraries/flink-table-ml and corresponding maven package?

 

I implemented a prototype for the design and found that the new design is very hard to fit into existing flink.ml codebase. The existing flink.ml code is tightly coupled with DataSet API. Thus, I have to rewrite almost all parts of flink.ml to get some sample case to work. The only reusable code from flink.ml are the base math classes under _org.apache.flink.ml.math_ and _org.apache.flink.ml.metrics.distance_ packages. 

Considering this fact, I will prefer to create a new package org.apache.flink.table.ml and a new maven package flink-table-ml.

 

Please feel free to give your feedbacks. 

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)