Posted to commits@beam.apache.org by "Tibor Kiss (JIRA)" <ji...@apache.org> on 2017/03/12 18:35:04 UTC

[jira] [Created] (BEAM-1695) Improve Python-SDK's programming guide

Tibor Kiss created BEAM-1695:
--------------------------------

             Summary: Improve Python-SDK's programming guide
                 Key: BEAM-1695
                 URL: https://issues.apache.org/jira/browse/BEAM-1695
             Project: Beam
          Issue Type: Bug
          Components: website
            Reporter: Tibor Kiss
            Priority: Minor


Beam's programming guide provides a tutorial-like structure to introduce the user to the main concepts.

Because the snippets are incomplete, code copied from the guide must be altered before it will run.
Some of the problems, per section:
1) Section "Creating the pipeline"
    - The {{import apache_beam as beam}} statement is missing from the beginning
    - The command line arguments are not parsed
2) Section "Creating a PCollection from in-memory data"
    - {{pipeline_options}} variable is undefined
    - {{my_options}} variable is undefined
3) Section "ParDo":
    - It is not explained how to define the {{words}} variable
4) Sections "Advanced combinations using CombineFn" and "Combining a PCollection into a single value" have the same code snippet
5) Section "Combining values in a key-grouped collection":
   - It is not explained how to define {{player_accuracies}}
6) Section "Using Flatten and Partition"
   - The code snippet contains unnecessary markers ({{[START model_multiple_pcollections_tuple]}})
7) Section "partition":
   - {{students}} variable is undefined

This list might not be complete.
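
As an illustration of item 1, the following is a minimal sketch of how a snippet could parse command line arguments, using only the standard library's {{argparse}}. The {{--input}} flag and the {{parse_args}} helper are hypothetical names chosen for the example and are not taken from the guide. The Beam-specific step appears only as a comment, because the import path of {{PipelineOptions}} depends on the SDK version.

```python
import argparse


def parse_args(argv=None):
    """Split argv into script-level arguments and leftover pipeline arguments."""
    parser = argparse.ArgumentParser()
    # Hypothetical flag, used only for illustration.
    parser.add_argument('--input', default='input.txt',
                        help='Input file to process.')
    # parse_known_args returns the unrecognized arguments as a list
    # instead of exiting with an error.
    known_args, pipeline_args = parser.parse_known_args(argv)
    return known_args, pipeline_args


known_args, pipeline_args = parse_args(
    ['--input', 'words.txt', '--runner', 'DirectRunner'])

# In a Beam script, the leftover arguments would then be handed to the
# pipeline (assumption: PipelineOptions is imported from the module that
# matches the installed SDK version):
#   import apache_beam as beam
#   p = beam.Pipeline(options=PipelineOptions(pipeline_args))
```

With this split, the script keeps its own flags in {{known_args}} while everything else, such as the runner selection, is forwarded to the pipeline unchanged.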

The website's repo is located at: https://github.com/apache/beam-site
The snippets are taken from: https://github.com/apache/beam/blob/master/sdks/python/apache_beam/examples/snippets/snippets.py
