You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@spark.apache.org by "Bomi Kim (JIRA)" <ji...@apache.org> on 2016/04/19 10:33:25 UTC

[jira] [Created] (SPARK-14726) Support for sampling when inferring schema in CSV data source

Bomi Kim created SPARK-14726:
--------------------------------

             Summary: Support for sampling when inferring schema in CSV data source
                 Key: SPARK-14726
                 URL: https://issues.apache.org/jira/browse/SPARK-14726
             Project: Spark
          Issue Type: Improvement
          Components: SQL
    Affects Versions: 2.0.0
            Reporter: Bomi Kim


Currently, I am using CSV data source and trying to get used to Spark 2.0 because it has built-in CSV data source.

I realized that CSV data source infers schema with all the data. JSON data source supports sampling ratio option.

It would be great if CSV data source has this option too (or is this supported already?).




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org