Posted to issues@spark.apache.org by "Hyukjin Kwon (Jira)" <ji...@apache.org> on 2023/01/02 08:23:00 UTC

[jira] [Created] (SPARK-41814) Column.eqNullSafe fails on NaN comparison

Hyukjin Kwon created SPARK-41814:
------------------------------------

             Summary: Column.eqNullSafe fails on NaN comparison
                 Key: SPARK-41814
                 URL: https://issues.apache.org/jira/browse/SPARK-41814
             Project: Spark
          Issue Type: Sub-task
          Components: Connect
    Affects Versions: 3.4.0
            Reporter: Hyukjin Kwon


{code}
File "/.../spark/python/pyspark/sql/connect/column.py", line 115, in pyspark.sql.connect.column.Column.eqNullSafe
Failed example:
    df2.select(
        df2['value'].eqNullSafe(None),
        df2['value'].eqNullSafe(float('NaN')),
        df2['value'].eqNullSafe(42.0)
    ).show()
Expected:
    +----------------+---------------+----------------+
    |(value <=> NULL)|(value <=> NaN)|(value <=> 42.0)|
    +----------------+---------------+----------------+
    |           false|           true|           false|
    |           false|          false|            true|
    |            true|          false|           false|
    +----------------+---------------+----------------+
Got:
    +----------------+---------------+----------------+
    |(value <=> NULL)|(value <=> NaN)|(value <=> 42.0)|
    +----------------+---------------+----------------+
    |            true|          false|           false|
    |           false|          false|            true|
    |            true|          false|           false|
    +----------------+---------------+----------------+
{code}
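
For reference, the Expected table can be reproduced with a plain-Python model of null-safe equality. This is only a sketch, not the Connect implementation: `eq_null_safe` is a hypothetical helper, and the row values (NaN, 42.0, NULL) are inferred from the Expected output rather than shown in the failing example. It assumes Spark's documented NaN semantics, where NaN compares equal to NaN.

```python
import math


def eq_null_safe(a, b):
    """Hypothetical plain-Python model of Spark's null-safe equality (<=>)."""
    # NULL <=> NULL is true; NULL <=> anything else is false.
    if a is None or b is None:
        return a is None and b is None
    # Spark treats NaN as equal to NaN, unlike IEEE 754 comparison in Python.
    if isinstance(a, float) and isinstance(b, float):
        if math.isnan(a) and math.isnan(b):
            return True
    return a == b


# Assumed contents of df2['value'], inferred from the Expected table.
values = [float("nan"), 42.0, None]
# Right-hand sides used in the failing example: None, NaN, 42.0.
others = [None, float("nan"), 42.0]

expected = [[eq_null_safe(v, o) for o in others] for v in values]

# Same comparison if the NaN in the first row arrived as NULL instead.
nan_as_null = [eq_null_safe(None, o) for o in others]

for row in expected:
    print(row)
print(nan_as_null)
```

The first three printed rows match the Expected table. The last row matches the first row of the Got table, which is consistent with the NaN value being turned into NULL somewhere on the Connect path, though the output above does not confirm where.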


