You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@beam.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/08/03 15:56:00 UTC

[jira] [Work logged] (BEAM-7632) Update Python quickstart guide for Flink and Spark

     [ https://issues.apache.org/jira/browse/BEAM-7632?focusedWorklogId=465771&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-465771 ]

ASF GitHub Bot logged work on BEAM-7632:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 03/Aug/20 15:55
            Start Date: 03/Aug/20 15:55
    Worklog Time Spent: 10m 
      Work Description: aromanenko-dev commented on a change in pull request #12397:
URL: https://github.com/apache/beam/pull/12397#discussion_r464504070



##########
File path: website/www/site/content/en/get-started/wordcount-example.md
##########
@@ -387,18 +387,20 @@ python -m apache_beam.examples.wordcount --input YOUR_INPUT_FILE --output counts
 {{< /highlight >}}
 
 {{< highlight class="runner-flink-local" >}}
-Currently, running wordcount.py on Flink requires a full download of the Beam source code.
-See https://beam.apache.org/roadmap/portability/#python-on-flink for more information.
+python -m apache_beam.examples.wordcount --input /path/to/inputfile \
+                                         --output /path/to/write/counts \
+                                         --runner FlinkRunner
 {{< /highlight >}}
 
 {{< highlight class="runner-flink-cluster" >}}
-Currently, running wordcount.py on Flink requires a full download of the Beam source code.
-See https://beam.apache.org/documentation/runners/flink/ for more information.
+# Running Beam Python on a distributed Flink cluster requires additional configuration.

Review comment:
       Would it be more simple and clear for user to keep only one tab for every engine (e.g. just "Flink") and explain all additional instructions, like `local` and `cluster` mode, inside this tab? 




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 465771)
    Time Spent: 1h 40m  (was: 1.5h)

> Update Python quickstart guide for Flink and Spark
> --------------------------------------------------
>
>                 Key: BEAM-7632
>                 URL: https://issues.apache.org/jira/browse/BEAM-7632
>             Project: Beam
>          Issue Type: Improvement
>          Components: website
>            Reporter: Kyle Weaver
>            Priority: P3
>          Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> Currently, the documentation says "This runner is not yet available for the Python SDK.", which is out of date. [https://beam.apache.org/get-started/quickstart-py/]
> Edit: when Beam 2.15 is released, we should update the quickstart to use the new Python FlinkRunner [1].
>  
> [1] [https://github.com/apache/beam/blob/master/sdks/python/apache_beam/runners/portability/flink_runner.py]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)