You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2016/03/09 06:55:40 UTC
[jira] [Created] (SPARK-13766) Inconsistent file extensions and
omitting file extensions written by CSV, TEXT and JSON data sources
Hyukjin Kwon created SPARK-13766:
------------------------------------
Summary: Inconsistent file extensions and omitting file extensions written by CSV, TEXT and JSON data sources
Key: SPARK-13766
URL: https://issues.apache.org/jira/browse/SPARK-13766
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 2.0.0
Reporter: Hyukjin Kwon
Priority: Minor
Currently, the output (part-files) from CSV, TEXT and JSON data sources does not have file extensions such as .csv, .txt and .json (except for compression extensions such as .gz, .deflate and .bz4).
In addition, it looks Parquet has the extensions (in part-files) such as .gz.parquet or .snappy.parquet according to compression codecs whereas ORC does not have such extensions but it is just .orc.
It would be great if we have a consistent naming for them
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org