Posted to issues@spark.apache.org by "Sen Fang (JIRA)" <ji...@apache.org> on 2015/12/26 06:13:49 UTC

[jira] [Created] (SPARK-12526) `ifelse`, `when`, `otherwise` unable to take Column as value

Sen Fang created SPARK-12526:
--------------------------------

             Summary: `ifelse`, `when`, `otherwise` unable to take Column as value
                 Key: SPARK-12526
                 URL: https://issues.apache.org/jira/browse/SPARK-12526
             Project: Spark
          Issue Type: Bug
          Components: SparkR
    Affects Versions: 1.5.2, 1.6.0
            Reporter: Sen Fang


When a Column is passed as a value to {{ifelse}}, {{when}}, or {{otherwise}}, the call fails with:

{code}
attempt to replicate an object of type 'environment'
{code}

The problem lies in the use of the base R {{ifelse}} function, which is a vectorized version of the {{if ... else ...}} idiom. It is unable to replicate a Column's job id, because that is an environment.
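The failure can be reproduced in base R without a Spark session. The sketch below is only an illustration: {{ref}} and its class name {{stub_ref}} are hypothetical stand-ins for the environment-backed reference inside a Column, not SparkR objects.

```r
# Hypothetical stand-in: an environment with a class attribute, playing the
# role of the environment-backed reference held by a Column (an assumption).
ref <- new.env()
class(ref) <- "stub_ref"

# Base R ifelse() builds its result with rep(yes, length.out = ...),
# and rep() cannot replicate an environment.
msg <- tryCatch(
  {
    ifelse(TRUE, ref, 0)
    "no error"
  },
  error = function(e) conditionMessage(e)
)
print(msg)
```

The captured message should match the error quoted above, since the call reaches {{rep}} with an environment as the {{yes}} argument.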

Considering that {{callJMethod}} was never designed to be vectorized, the safe option is to replace {{ifelse}} with {{if ... else ...}}. Technically, however, this is inconsistent with base R's {{ifelse}}, which is meant to be vectorized.
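A minimal sketch of the {{if ... else ...}} replacement, assuming a stub in place of a real Column. The names {{make_stub_column}}, {{stub_column}}, and {{unwrap_value}} are hypothetical and are not part of SparkR; the actual change in SparkR may look different.

```r
# Hypothetical stand-in for a Column: a classed list that carries an
# environment-backed reference (an assumption for illustration only).
make_stub_column <- function() {
  ref <- new.env()
  ref$id <- "1"
  structure(list(ref = ref), class = "stub_column")
}

# Scalar if ... else returns the selected object as-is. Unlike ifelse(),
# it never calls rep(), so an environment passes through unchanged.
unwrap_value <- function(value) {
  if (inherits(value, "stub_column")) value$ref else value
}

col <- make_stub_column()
picked <- unwrap_value(col)   # the environment inside the stub
plain <- unwrap_value(0)      # non-Column values are returned unchanged
print(is.environment(picked))
print(plain)
```

The trade-off is the one noted above: the condition must be a single logical value, so this form is not vectorized the way base R's {{ifelse}} is.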

I can send a PR for review first, and we can then discuss whether there is any scenario at all in which {{ifelse}}, {{when}}, or {{otherwise}} would be used in a vectorized way.

A dummy example is:
{code}
ifelse(lit(1) == lit(1), lit(2), lit(3))
{code}
A concrete example might be:
{code}
ifelse(df$mpg > 0, df$mpg, 0)
{code}



