You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Bomi Kim (JIRA)" <ji...@apache.org> on 2016/04/19 10:33:25 UTC
[jira] [Created] (SPARK-14726) Support for sampling when inferring
schema in CSV data source
Bomi Kim created SPARK-14726:
--------------------------------
Summary: Support for sampling when inferring schema in CSV data source
Key: SPARK-14726
URL: https://issues.apache.org/jira/browse/SPARK-14726
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 2.0.0
Reporter: Bomi Kim
Currently, I am using CSV data source and trying to get used to Spark 2.0 because it has built-in CSV data source.
I realized that CSV data source infers schema with all the data. JSON data source supports sampling ratio option.
It would be great if CSV data source has this option too (or is this supported already?).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org