You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Miao Wang (JIRA)" <ji...@apache.org> on 2017/02/16 01:04:42 UTC

[jira] [Commented] (SPARK-19460) Update dataset used in R documentation, examples to reduce warning noise and confusions

    [ https://issues.apache.org/jira/browse/SPARK-19460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15868911#comment-15868911 ] 

Miao Wang commented on SPARK-19460:
-----------------------------------

Seems a lots of work. :) I can give a try.

> Update dataset used in R documentation, examples to reduce warning noise and confusions
> ---------------------------------------------------------------------------------------
>
>                 Key: SPARK-19460
>                 URL: https://issues.apache.org/jira/browse/SPARK-19460
>             Project: Spark
>          Issue Type: Bug
>          Components: SparkR
>    Affects Versions: 2.1.0
>            Reporter: Felix Cheung
>
> Running build we have a bunch of warnings from using the `iris` dataset, for example.
> Warning in FUN(X[[1L]], ...) :
> Use Sepal_Length instead of Sepal.Length as column name
> Warning in FUN(X[[2L]], ...) :
> Use Sepal_Width instead of Sepal.Width as column name
> Warning in FUN(X[[3L]], ...) :
> Use Petal_Length instead of Petal.Length as column name
> Warning in FUN(X[[4L]], ...) :
> Use Petal_Width instead of Petal.Width as column name
> Warning in FUN(X[[1L]], ...) :
> Use Sepal_Length instead of Sepal.Length as column name
> Warning in FUN(X[[2L]], ...) :
> Use Sepal_Width instead of Sepal.Width as column name
> Warning in FUN(X[[3L]], ...) :
> Use Petal_Length instead of Petal.Length as column name
> Warning in FUN(X[[4L]], ...) :
> Use Petal_Width instead of Petal.Width as column name
> Warning in FUN(X[[1L]], ...) :
> Use Sepal_Length instead of Sepal.Length as column name
> Warning in FUN(X[[2L]], ...) :
> Use Sepal_Width instead of Sepal.Width as column name
> Warning in FUN(X[[3L]], ...) :
> Use Petal_Length instead of Petal.Length as column name
> These are the results of having `.` in the column name. For reference, see SPARK-12191, SPARK-11976. Since it involves changing SQL, if we couldn't support that there then we should strongly consider using other dataset without `.`, eg. `cars`
> And we should update this in API doc (roxygen2 doc string), vignettes, programming guide, R code example.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org