You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "ShivaKumar SS (Jira)" <ji...@apache.org> on 2019/11/25 09:21:00 UTC

[jira] [Created] (SPARK-30023) Spark partitionby saves as columnName={value} | Can it be only columnvalue

ShivaKumar SS created SPARK-30023:
-------------------------------------

             Summary: Spark partitionby saves as columnName={value} | Can it be only columnvalue
                 Key: SPARK-30023
                 URL: https://issues.apache.org/jira/browse/SPARK-30023
             Project: Spark
          Issue Type: Question
          Components: Spark Core, SQL
    Affects Versions: 2.4.3
            Reporter: ShivaKumar SS


I am using scala and spark.

This is using Dataframe and in dataframe i have a columns by name "year" "month" and "date" and many other columns which are not relevant here.

 

Code snippet.
  
 {{df.write.partitionBy("year", "month", "day").format("csv").option("header", "true").save(outPath)
 }}
  

and my expectation is to save in a hierarchy folder structure.
 
{{2016/11/15/file.csv}}

 

but the files are getting saved as 

 
 
{{year=2016/month=11/day=15/file.csv}}

{{}}

{{Is there any way i can remove the column name from the directory structure and save only the column value here. ? }}

{{}}

{{}}

{{}}

 

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org