You are viewing a plain text version of this content. The canonical link for it is here.
Posted to reviews@spark.apache.org by GitBox <gi...@apache.org> on 2020/07/31 13:04:59 UTC

[GitHub] [spark] HyukjinKwon opened a new pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

HyukjinKwon opened a new pull request #29320:
URL: https://github.com/apache/spark/pull/29320


   ### What changes were proposed in this pull request?
   
   This PR proposes to write the main page of PySpark documentation.
   
   ### Why are the changes needed?
   
   For better usability and readability in PySpark documentation.
   
   ### Does this PR introduce _any_ user-facing change?
   
   Yes, it creates a new main page as below:
   
   ![Screen Shot 2020-07-31 at 10 02 44 PM](https://user-images.githubusercontent.com/6477701/89037618-d2d68880-d379-11ea-9a44-562f2aa0e3fd.png)
   
   ### How was this patch tested?
   
   Manually built the PySpark documentation.
   
   ```bash
   cd python
   make clean html
   ```


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins removed a comment on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667248957






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins commented on pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-668484972






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] BryanCutler commented on pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
BryanCutler commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-669310451


   Looks great, I think it will be very helpful for PySpark to have it's own main page. Thanks @HyukjinKwon !


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins removed a comment on pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-668468168






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] SparkQA commented on pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-668484431


   **[Test build #127046 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127046/testReport)** for PR 29320 at commit [`dae09ec`](https://github.com/apache/spark/commit/dae09ec82ec11af4f8109576a2adf88bc71aca41).
    * This patch passes all tests.
    * This patch merges cleanly.
    * This patch adds no public classes.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] HyukjinKwon commented on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
HyukjinKwon commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667772011


   > What's docs/img/pyspark-components.pptx for?
   
   It is for the image I used in the main page in case some people want to edit. There are other pptx files in `docs/img` as well for that purpose.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] SparkQA commented on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667809709


   **[Test build #126951 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126951/testReport)** for PR 29320 at commit [`6d5f6ef`](https://github.com/apache/spark/commit/6d5f6ef069cb8e0fbb65616ca98f919cdd367fda).
    * This patch passes all tests.
    * This patch merges cleanly.
    * This patch adds no public classes.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] viirya commented on a change in pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
viirya commented on a change in pull request #29320:
URL: https://github.com/apache/spark/pull/29320#discussion_r464194094



##########
File path: python/docs/source/index.rst
##########
@@ -21,8 +21,42 @@
 PySpark Documentation
 =====================
 
+PySpark is an interface for Apache Spark in Python. It not only allows you to write
+Spark applications using Python APIs, but also provides the PySpark shell for
+interactively analyzing your data in a distributed environment. PySpark supports most
+of Spark's features such as Spark SQL, DataFrmae, Streaming, MLlib

Review comment:
       DataFrmae -> DataFrame




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] SparkQA removed a comment on pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-668471350


   **[Test build #127046 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127046/testReport)** for PR 29320 at commit [`dae09ec`](https://github.com/apache/spark/commit/dae09ec82ec11af4f8109576a2adf88bc71aca41).


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins commented on pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-668468168






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] HyukjinKwon commented on pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
HyukjinKwon commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-668934219


   Thank you @viirya for approaching this. I am merging this to master.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] SparkQA commented on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667111686


   **[Test build #126889 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126889/testReport)** for PR 29320 at commit [`86be1f5`](https://github.com/apache/spark/commit/86be1f59a139e9c9be50d945ed28ee00cc0cbf46).


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins commented on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667112295






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins commented on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667248957






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins commented on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667802688






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins removed a comment on pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-668484972






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins commented on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667809951






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins removed a comment on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667802688






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] SparkQA removed a comment on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667111686


   **[Test build #126889 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126889/testReport)** for PR 29320 at commit [`86be1f5`](https://github.com/apache/spark/commit/86be1f59a139e9c9be50d945ed28ee00cc0cbf46).


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] SparkQA commented on pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-668471350


   **[Test build #127046 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/127046/testReport)** for PR 29320 at commit [`dae09ec`](https://github.com/apache/spark/commit/dae09ec82ec11af4f8109576a2adf88bc71aca41).


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] srowen commented on a change in pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
srowen commented on a change in pull request #29320:
URL: https://github.com/apache/spark/pull/29320#discussion_r464008257



##########
File path: python/docs/source/index.rst
##########
@@ -21,8 +21,42 @@
 PySpark Documentation
 =====================
 
+PySpark is an interface for Apache Spark in Python language. It not only offers for you

Review comment:
       in the Python language, or just "in Python"
   

##########
File path: python/docs/source/index.rst
##########
@@ -21,8 +21,42 @@
 PySpark Documentation
 =====================
 
+PySpark is an interface for Apache Spark in Python language. It not only offers for you
+to write an application in the Python APIs but also provides PySpark shell so you can
+interactively analyze your data in a distributed environment. PySpark supports most
+of Spark features such as Spark SQL, DataFrmae, Streaming, MLlib
+(Machine Learning) and Spark Core.
+
+.. image:: ../../../docs/img/pyspark-components.png
+  :alt: PySpark Compoenents
+
+**Spark SQL and DataFrame**
+
+Spark SQL is a Spark module for structured data processing. It provides
+a programming abstraction called DataFrame and can also act as distributed
+SQL query engine.
+
+**Streaming**
+
+Running on top of Spark, the streaming feature in Apache Spark enables powerful
+interactive and analytical applications across both streaming and historical data,
+while inheriting Spark’s ease of use and fault tolerance characteristics.
+
+**MLlib**
+
+Built on top of Spark, MLlib is a scalable machine learning library that provides
+a uniform set of high-level APIs that help users create and tune practical machine
+learning pipelines.
+
+**Spark Core**
+
+Spark Core is the underlying general execution engine for the Spark platform that all
+other functionality is built on top of. It provides an RDD (Resilient Disributed Data)

Review comment:
       Data -> Dataset

##########
File path: python/docs/source/index.rst
##########
@@ -21,8 +21,42 @@
 PySpark Documentation
 =====================
 
+PySpark is an interface for Apache Spark in Python language. It not only offers for you
+to write an application in the Python APIs but also provides PySpark shell so you can
+interactively analyze your data in a distributed environment. PySpark supports most
+of Spark features such as Spark SQL, DataFrmae, Streaming, MLlib

Review comment:
       most of Spark's features

##########
File path: python/docs/source/index.rst
##########
@@ -21,8 +21,42 @@
 PySpark Documentation
 =====================
 
+PySpark is an interface for Apache Spark in Python language. It not only offers for you
+to write an application in the Python APIs but also provides PySpark shell so you can

Review comment:
       Maybe "It not only allows you to write Spark applications using Python APIs, but also provides the PySpark shell for interactively analyzing ..."




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] SparkQA commented on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667802282


   **[Test build #126951 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126951/testReport)** for PR 29320 at commit [`6d5f6ef`](https://github.com/apache/spark/commit/6d5f6ef069cb8e0fbb65616ca98f919cdd367fda).


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] SparkQA commented on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

Posted by GitBox <gi...@apache.org>.
SparkQA commented on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667247831


   **[Test build #126889 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126889/testReport)** for PR 29320 at commit [`86be1f5`](https://github.com/apache/spark/commit/86be1f59a139e9c9be50d945ed28ee00cc0cbf46).
    * This patch passes all tests.
    * This patch merges cleanly.
    * This patch adds no public classes.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] viirya commented on a change in pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
viirya commented on a change in pull request #29320:
URL: https://github.com/apache/spark/pull/29320#discussion_r464195184



##########
File path: python/docs/source/index.rst
##########
@@ -21,8 +21,42 @@
 PySpark Documentation
 =====================
 
+PySpark is an interface for Apache Spark in Python. It not only allows you to write
+Spark applications using Python APIs, but also provides the PySpark shell for
+interactively analyzing your data in a distributed environment. PySpark supports most
+of Spark's features such as Spark SQL, DataFrmae, Streaming, MLlib
+(Machine Learning) and Spark Core.
+
+.. image:: ../../../docs/img/pyspark-components.png
+  :alt: PySpark Compoenents
+
+**Spark SQL and DataFrame**
+
+Spark SQL is a Spark module for structured data processing. It provides
+a programming abstraction called DataFrame and can also act as distributed
+SQL query engine.
+
+**Streaming**
+
+Running on top of Spark, the streaming feature in Apache Spark enables powerful
+interactive and analytical applications across both streaming and historical data,
+while inheriting Spark’s ease of use and fault tolerance characteristics.
+
+**MLlib**
+
+Built on top of Spark, MLlib is a scalable machine learning library that provides
+a uniform set of high-level APIs that help users create and tune practical machine
+learning pipelines.
+
+**Spark Core**
+
+Spark Core is the underlying general execution engine for the Spark platform that all
+other functionality is built on top of. It provides an RDD (Resilient Disributed Dataset)

Review comment:
       Disributed -> Distributed




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins removed a comment on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667809951






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] HyukjinKwon closed pull request #29320: [SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
HyukjinKwon closed pull request #29320:
URL: https://github.com/apache/spark/pull/29320


   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] SparkQA removed a comment on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main page for PySpark documentation

Posted by GitBox <gi...@apache.org>.
SparkQA removed a comment on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667802282


   **[Test build #126951 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/126951/testReport)** for PR 29320 at commit [`6d5f6ef`](https://github.com/apache/spark/commit/6d5f6ef069cb8e0fbb65616ca98f919cdd367fda).


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org


[GitHub] [spark] AmplabJenkins removed a comment on pull request #29320: [WIP][SPARK-32507][DOCS][PYTHON] Add main package for PySpark documentation

Posted by GitBox <gi...@apache.org>.
AmplabJenkins removed a comment on pull request #29320:
URL: https://github.com/apache/spark/pull/29320#issuecomment-667112295






----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org