You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@systemml.apache.org by ja...@apache.org on 2020/06/14 03:52:14 UTC
[systemml] branch master updated: [DOC][1/2] imputeByFD and
discoverFD builtin func.
This is an automated email from the ASF dual-hosted git repository.
janardhan pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/systemml.git
The following commit(s) were added to refs/heads/master by this push:
new 0c2a2b3 [DOC][1/2] imputeByFD and discoverFD builtin func.
0c2a2b3 is described below
commit 0c2a2b393ae76da4068288c2be02fb753289b45b
Author: Tarun Kumar <ku...@gmail.com>
AuthorDate: Sun Jun 14 09:12:29 2020 +0530
[DOC][1/2] imputeByFD and discoverFD builtin func.
Closes #969.
---
dev/docs/builtins-reference.md | 48 ++++++++++++++++++++++++++++++++++++++++++
1 file changed, 48 insertions(+)
diff --git a/dev/docs/builtins-reference.md b/dev/docs/builtins-reference.md
index 8b29ee9..a9b832b 100644
--- a/dev/docs/builtins-reference.md
+++ b/dev/docs/builtins-reference.md
@@ -25,11 +25,13 @@ limitations under the License.
* [DML-Bodied Built-In functions](#dml-bodied-built-in-functions)
* [`confusionMatrix`-Function](#confusionmatrix-function)
* [`cvlm`-Function](#cvlm-function)
+ * [`discoverFD`-Function](#discoverFD-function)
* [`glm`-Function](#glm-function)
* [`gridSearch`-Function](#gridSearch-function)
* [`img_brightness`-Function](#img_brightness-function)
* [`img_crop`-Function](#img_crop-function)
* [`img_mirror`-Function](#img_mirror-function)
+ * [`imputeByFD`-Function](#imputeByFD-function)
* [`KMeans`-Function](#KMeans-function)
* [`lm`-Function](#lm-function)
* [`lmDS`-Function](#lmds-function)
@@ -196,6 +198,28 @@ y = X %*% rand(rows = ncol(X), cols = 1)
[predict, beta] = cvlm(X = X, y = y, k = 4)
```
+## `discoverFD`-Function
+
+The `discoverFD`-function finds the functional dependencies.
+
+### Usage
+```r
+discoverFD(X, Mask, threshold)
+```
+
+### Arguments
+| Name | Type | Default | Description |
+| :-------- | :----- | ------- | :---------- |
+| X | Double | -- | Input Matrix X, encoded Matrix if data is categorical |
+| Mask | Double | -- | A row vector for interested features i.e. Mask =[1, 0, 1] will exclude the second column from processing |
+| threshold | Double | -- | threshold value in interval [0, 1] for robust FDs |
+
+### Returns
+| Type | Description |
+| :----- | :---------- |
+| Double | matrix of functional dependencies |
+
+
## `glm`-Function
The `glm`-function is a flexible generalization of ordinary linear regression that allows for response variables that have
@@ -356,6 +380,30 @@ A = rand(rows = 3, cols = 3, min = 0, max = 255)
B = img_mirror(img_in = A, horizontal_axis = TRUE)
```
+## `imputeByFD`-Function
+
+The `imputeByFD`-function imputes missing values from observed values (if exist)
+using robust functional dependencies.
+
+### Usage
+```r
+imputeByFD(F, sourceAttribute, targetAttribute, threshold)
+```
+
+### Arguments
+| Name | Type | Default | Description |
+| :-------- | :------ | -------- | :---------- |
+| F | String | -- | A data frame |
+| source | Integer | -- | Source attribute to use for imputation and error correction |
+| target | Integer | -- | Attribute to be fixed |
+| threshold | Double | -- | threshold value in interval [0, 1] for robust FDs |
+
+### Returns
+| Type | Description |
+| :----- | :---------- |
+| String | Frame with possible imputations |
+
+
## `KMeans`-Function
The kmeans() implements the KMeans Clustering algorithm.