Posted to issues@spark.apache.org by "Hyukjin Kwon (JIRA)" <ji...@apache.org> on 2018/09/06 01:34:00 UTC

[jira] [Commented] (SPARK-25346) Document Spark builtin data sources

    [ https://issues.apache.org/jira/browse/SPARK-25346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16605140#comment-16605140 ] 

Hyukjin Kwon commented on SPARK-25346:
--------------------------------------

[~mengxr], actually there is documentation for several data sources. For example,

Parquet - https://spark.apache.org/docs/latest/sql-programming-guide.html#parquet-files
ORC - https://spark.apache.org/docs/latest/sql-programming-guide.html#orc-files
JSON - https://spark.apache.org/docs/latest/sql-programming-guide.html#json-datasets
CSV - https://spark.apache.org/docs/latest/sql-programming-guide.html#manually-specifying-options (there were a few attempts at CSV documentation, but they failed because they duplicated the API documentation in DataFrameReader)
JDBC - https://spark.apache.org/docs/latest/sql-programming-guide.html#jdbc-to-other-databases

> Document Spark builtin data sources
> -----------------------------------
>
>                 Key: SPARK-25346
>                 URL: https://issues.apache.org/jira/browse/SPARK-25346
>             Project: Spark
>          Issue Type: Story
>          Components: Documentation
>    Affects Versions: 2.4.0
>            Reporter: Xiangrui Meng
>            Priority: Major
>
> It would be nice to list built-in data sources in the doc site, so users know what is available by default. However, I didn't find any in the 2.3.1 docs.
>  
> cc: [~hyukjin.kwon]


