You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@spark.apache.org by gu...@apache.org on 2022/06/08 05:34:37 UTC

[spark] branch branch-3.3 updated: [SPARK-39394][DOCS][SS][3.3] Improve PySpark Structured Streaming page more readable

This is an automated email from the ASF dual-hosted git repository.

gurwls223 pushed a commit to branch branch-3.3
in repository https://gitbox.apache.org/repos/asf/spark.git


The following commit(s) were added to refs/heads/branch-3.3 by this push:
     new 86f1b6bfe39 [SPARK-39394][DOCS][SS][3.3] Improve PySpark Structured Streaming page more readable
86f1b6bfe39 is described below

commit 86f1b6bfe397646697c299ab5af2157cb1dd2e1e
Author: itholic <ha...@databricks.com>
AuthorDate: Wed Jun 8 14:34:25 2022 +0900

    [SPARK-39394][DOCS][SS][3.3] Improve PySpark Structured Streaming page more readable
    
    ### What changes were proposed in this pull request?
    
    Hotfix https://github.com/apache/spark/pull/36782 for branch-3.3.
    
    ### Why are the changes needed?
    
    The improvement of document readability will also improve the usability for PySpark Structured Streaming.
    
    ### Does this PR introduce _any_ user-facing change?
    
    Yes, now the documentation is categorized by its class or their own purpose more clearly as below:
    
    ![Screen Shot 2022-06-07 at 12 30 01 PM](https://user-images.githubusercontent.com/44108233/172289737-bd6ebf0e-601c-4a80-a16a-cf885302e7b6.png)
    
    ### How was this patch tested?
    
    The existing doc build in CI should cover.
    
    Closes #36797 from itholic/SPARK-39394-3.3.
    
    Authored-by: itholic <ha...@databricks.com>
    Signed-off-by: Hyukjin Kwon <gu...@apache.org>
---
 python/docs/source/reference/index.rst             |  2 +-
 .../{index.rst => pyspark.ss/core_classes.rst}     | 26 +++++-------
 .../source/reference/{ => pyspark.ss}/index.rst    | 23 ++++------
 .../{pyspark.ss.rst => pyspark.ss/io.rst}          | 47 ++-------------------
 .../query_management.rst}                          | 49 +---------------------
 5 files changed, 24 insertions(+), 123 deletions(-)

diff --git a/python/docs/source/reference/index.rst b/python/docs/source/reference/index.rst
index 1d2db3f4a15..127889afb07 100644
--- a/python/docs/source/reference/index.rst
+++ b/python/docs/source/reference/index.rst
@@ -29,7 +29,7 @@ Pandas API on Spark follows the API specifications of pandas 1.3.
 
    pyspark.sql/index
    pyspark.pandas/index
-   pyspark.ss
+   pyspark.ss/index
    pyspark.ml
    pyspark.streaming
    pyspark.mllib
diff --git a/python/docs/source/reference/index.rst b/python/docs/source/reference/pyspark.ss/core_classes.rst
similarity index 68%
copy from python/docs/source/reference/index.rst
copy to python/docs/source/reference/pyspark.ss/core_classes.rst
index 1d2db3f4a15..4160008881c 100644
--- a/python/docs/source/reference/index.rst
+++ b/python/docs/source/reference/pyspark.ss/core_classes.rst
@@ -16,22 +16,16 @@
     under the License.
 
 
-=============
-API Reference
-=============
+============
+Core Classes
+============
 
-This page lists an overview of all public PySpark modules, classes, functions and methods.
+.. currentmodule:: pyspark.sql.streaming
 
-Pandas API on Spark follows the API specifications of pandas 1.3.
+.. autosummary::
+    :toctree: api/
 
-.. toctree::
-   :maxdepth: 2
-
-   pyspark.sql/index
-   pyspark.pandas/index
-   pyspark.ss
-   pyspark.ml
-   pyspark.streaming
-   pyspark.mllib
-   pyspark
-   pyspark.resource
+    DataStreamReader
+    DataStreamWriter
+    StreamingQuery
+    StreamingQueryManager
diff --git a/python/docs/source/reference/index.rst b/python/docs/source/reference/pyspark.ss/index.rst
similarity index 70%
copy from python/docs/source/reference/index.rst
copy to python/docs/source/reference/pyspark.ss/index.rst
index 1d2db3f4a15..2cb0b1216ef 100644
--- a/python/docs/source/reference/index.rst
+++ b/python/docs/source/reference/pyspark.ss/index.rst
@@ -16,22 +16,15 @@
     under the License.
 
 
-=============
-API Reference
-=============
+====================
+Structured Streaming
+====================
 
-This page lists an overview of all public PySpark modules, classes, functions and methods.
-
-Pandas API on Spark follows the API specifications of pandas 1.3.
+This page gives an overview of all public Structed Streaming API.
 
 .. toctree::
-   :maxdepth: 2
+    :maxdepth: 2
 
-   pyspark.sql/index
-   pyspark.pandas/index
-   pyspark.ss
-   pyspark.ml
-   pyspark.streaming
-   pyspark.mllib
-   pyspark
-   pyspark.resource
+    core_classes
+    io
+    query_management
diff --git a/python/docs/source/reference/pyspark.ss.rst b/python/docs/source/reference/pyspark.ss/io.rst
similarity index 61%
copy from python/docs/source/reference/pyspark.ss.rst
copy to python/docs/source/reference/pyspark.ss/io.rst
index cace2d5a6bb..da476fb6fac 100644
--- a/python/docs/source/reference/pyspark.ss.rst
+++ b/python/docs/source/reference/pyspark.ss/io.rst
@@ -16,25 +16,9 @@
     under the License.
 
 
-====================
-Structured Streaming
-====================
-
-Core Classes
-------------
-
-.. currentmodule:: pyspark.sql.streaming
-
-.. autosummary::
-    :toctree: api/
-
-    DataStreamReader
-    DataStreamWriter
-    StreamingQuery
-    StreamingQueryManager
-
-Input and Output
-----------------
+============
+Input/Output
+============
 
 .. currentmodule:: pyspark.sql.streaming
 
@@ -61,28 +45,3 @@ Input and Output
     DataStreamWriter.queryName
     DataStreamWriter.start
     DataStreamWriter.trigger
-
-Query Management
-----------------
-
-.. currentmodule:: pyspark.sql.streaming
-
-.. autosummary::
-    :toctree: api/
-
-    StreamingQuery.awaitTermination
-    StreamingQuery.exception
-    StreamingQuery.explain
-    StreamingQuery.id
-    StreamingQuery.isActive
-    StreamingQuery.lastProgress
-    StreamingQuery.name
-    StreamingQuery.processAllAvailable
-    StreamingQuery.recentProgress
-    StreamingQuery.runId
-    StreamingQuery.status
-    StreamingQuery.stop
-    StreamingQueryManager.active
-    StreamingQueryManager.awaitAnyTermination
-    StreamingQueryManager.get
-    StreamingQueryManager.resetTerminated
diff --git a/python/docs/source/reference/pyspark.ss.rst b/python/docs/source/reference/pyspark.ss/query_management.rst
similarity index 59%
rename from python/docs/source/reference/pyspark.ss.rst
rename to python/docs/source/reference/pyspark.ss/query_management.rst
index cace2d5a6bb..b580015baa3 100644
--- a/python/docs/source/reference/pyspark.ss.rst
+++ b/python/docs/source/reference/pyspark.ss/query_management.rst
@@ -16,54 +16,9 @@
     under the License.
 
 
-====================
-Structured Streaming
-====================
-
-Core Classes
-------------
-
-.. currentmodule:: pyspark.sql.streaming
-
-.. autosummary::
-    :toctree: api/
-
-    DataStreamReader
-    DataStreamWriter
-    StreamingQuery
-    StreamingQueryManager
-
-Input and Output
-----------------
-
-.. currentmodule:: pyspark.sql.streaming
-
-.. autosummary::
-    :toctree: api/
-
-    DataStreamReader.csv
-    DataStreamReader.format
-    DataStreamReader.json
-    DataStreamReader.load
-    DataStreamReader.option
-    DataStreamReader.options
-    DataStreamReader.orc
-    DataStreamReader.parquet
-    DataStreamReader.schema
-    DataStreamReader.text
-    DataStreamWriter.foreach
-    DataStreamWriter.foreachBatch
-    DataStreamWriter.format
-    DataStreamWriter.option
-    DataStreamWriter.options
-    DataStreamWriter.outputMode
-    DataStreamWriter.partitionBy
-    DataStreamWriter.queryName
-    DataStreamWriter.start
-    DataStreamWriter.trigger
-
+================
 Query Management
-----------------
+================
 
 .. currentmodule:: pyspark.sql.streaming
 


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscribe@spark.apache.org
For additional commands, e-mail: commits-help@spark.apache.org