You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2018/09/06 01:34:00 UTC
[jira] [Commented] (SPARK-25346) Document Spark builtin data
sources
[ https://issues.apache.org/jira/browse/SPARK-25346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16605140#comment-16605140 ]
Hyukjin Kwon commented on SPARK-25346:
--------------------------------------
[~mengxr], actually there are documentation for several datasources. For example,
Parquet - https://spark.apache.org/docs/latest/sql-programming-guide.html#parquet-files
ORC - https://spark.apache.org/docs/latest/sql-programming-guide.html#orc-files
JSON - https://spark.apache.org/docs/latest/sql-programming-guide.html#json-datasets
CSV - https://spark.apache.org/docs/latest/sql-programming-guide.html#manually-specifying-options (there were few tries for CSV documentation but they were failed for the sake of duplicated API documentation in DataFrameReader)
JDBC - https://spark.apache.org/docs/latest/sql-programming-guide.html#jdbc-to-other-databases
> Document Spark builtin data sources
> -----------------------------------
>
> Key: SPARK-25346
> URL: https://issues.apache.org/jira/browse/SPARK-25346
> Project: Spark
> Issue Type: Story
> Components: Documentation
> Affects Versions: 2.4.0
> Reporter: Xiangrui Meng
> Priority: Major
>
> It would be nice to list built-in data sources in the doc site. So users know what are available by default. However, I didn't find any from 2.3.1 docs.
>
> cc: [~hyukjin.kwon]
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@spark.apache.org
For additional commands, e-mail: issues-help@spark.apache.org