You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Stephen Sisk (JIRA)" <ji...@apache.org> on 2016/11/21 21:45:58 UTC

[jira] [Created] (BEAM-1025) User guide - "How to create Beam IO Transforms"

Stephen Sisk created BEAM-1025:
----------------------------------

             Summary: User guide - "How to create Beam IO Transforms"
                 Key: BEAM-1025
                 URL: https://issues.apache.org/jira/browse/BEAM-1025
             Project: Beam
          Issue Type: Task
            Reporter: Stephen Sisk


Beam has javadocs for how to create a read or write transform, but no friendly user guide on how to get started using BoundedSource/BoundedReader.

This should cover:
* background on beam's source/sink API design 
* design patterns
* evaluating different data sources (eg, what are the properties of a pub sub system that affect how you should write your UnboundedSource? What is the best design for reading from a NoSql style source?)
* testing - how to write unit, integration (and once we have them, performance tests)
* public API recommendations

This is related, but not strictly overlapping with: 
https://issues.apache.org/jira/browse/BEAM-193
- the Dataflow SDK documentation for "Custom Sources and Sinks"  contains some info about writing Sources/Sinks, but it is somewhat out of date, and doesn't reflect the things we've learned recently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)