You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Arvind Krishnan Iyer (JIRA)" <ji...@apache.org> on 2019/02/07 22:34:00 UTC

[jira] [Updated] (SPARK-26846) Empty Strings in dataframe are written as "" in CSV

     [ https://issues.apache.org/jira/browse/SPARK-26846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Arvind Krishnan Iyer updated SPARK-26846:
-----------------------------------------
    Description: 
 
{code:java}
import spark.implicits._
val sc = spark.sparkContext
val df = Seq((8,"100","sfd"),(0,"","sfd"),(8, null, "asfasd")).toDF("num","str_num","word").toDF().coalesce(1)
df.write.mode(SaveMode.Overwrite).csv("/Users/arvind.iyer/abcd.csv")
{code}
We are writing the contents of this CSV into a DB, and the contents of that column are going in as "". 

+Output+ 

8,100,sfd
 0,"",sfd
 8,"",asfasd

  was:
 
{code:java}
import spark.implicits._
val sc = spark.sparkContext
val df = Seq((8,"100","sfd"),(0,"","sfd"),(8, null, "asfasd")).toDF("num","str_num","word").toDF().coalesce(1)
df.write.mode(SaveMode.Overwrite).csv("/Users/arvind.iyer/abcd.csv")
{code}
+Output+ 

8,100,sfd
0,"",sfd
8,"",asfasd


> Empty Strings in dataframe are written as "" in CSV
> ---------------------------------------------------
>
>                 Key: SPARK-26846
>                 URL: https://issues.apache.org/jira/browse/SPARK-26846
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.4.0
>            Reporter: Arvind Krishnan Iyer
>            Priority: Major
>
>  
> {code:java}
> import spark.implicits._
> val sc = spark.sparkContext
> val df = Seq((8,"100","sfd"),(0,"","sfd"),(8, null, "asfasd")).toDF("num","str_num","word").toDF().coalesce(1)
> df.write.mode(SaveMode.Overwrite).csv("/Users/arvind.iyer/abcd.csv")
> {code}
> We are writing the contents of this CSV into a DB, and the contents of that column are going in as "". 
> +Output+ 
> 8,100,sfd
>  0,"",sfd
>  8,"",asfasd



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org