You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Kazuaki Ishizaki (JIRA)" <ji...@apache.org> on 2018/01/01 18:17:17 UTC

[jira] [Created] (SPARK-22935) Dataset with Java Beans for java.sql.Date produces incorrect result

Kazuaki Ishizaki created SPARK-22935:
----------------------------------------

             Summary: Dataset with Java Beans for java.sql.Date produces incorrect result
                 Key: SPARK-22935
                 URL: https://issues.apache.org/jira/browse/SPARK-22935
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 2.2.1, 2.3.0
            Reporter: Kazuaki Ishizaki


The following code prints {{c=0}}. The value of {{c}} must be 2. {ds} must not be {{empty}}.

{code}
  public void SPARK22935() {
    Dataset<CDR> cdr = spark
            .read()
            .format("csv")
            .option("header", "true")
            .option("inferSchema", "true")
            .option("delimiter", ";")
            .csv("CDR_SAMPLE.csv")
            .as(Encoders.bean(CDR.class));
    Dataset<CDR> ds = cdr.filter((FilterFunction<CDR>) x -> (x.timestamp != null));
    long c = ds.count();
    cdr.show(2);
    ds.show(2);
    System.out.println("cnt=" + c);
  }

// CDR.java
public class CDR implements java.io.Serializable {
  public java.sql.Date timestamp;
}

// CDR_SAMPLE.csv
timestamp
2017-10-29T02:37:07.815Z
2017-10-29T02:38:07.815Z
{code}

result
{code}
+--------------------+
|           timestamp|
+--------------------+
|2017-10-29 11:37:...|
|2017-10-29 11:38:...|
+--------------------+

+---------+
|timestamp|
+---------+
+---------+

c=0
{code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org