You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Stephen Sisk (JIRA)" <ji...@apache.org> on 2016/11/21 21:45:58 UTC
[jira] [Created] (BEAM-1025) User guide - "How to create Beam IO
Transforms"
Stephen Sisk created BEAM-1025:
----------------------------------
Summary: User guide - "How to create Beam IO Transforms"
Key: BEAM-1025
URL: https://issues.apache.org/jira/browse/BEAM-1025
Project: Beam
Issue Type: Task
Reporter: Stephen Sisk
Beam has javadocs for how to create a read or write transform, but no friendly user guide on how to get started using BoundedSource/BoundedReader.
This should cover:
* background on beam's source/sink API design
* design patterns
* evaluating different data sources (eg, what are the properties of a pub sub system that affect how you should write your UnboundedSource? What is the best design for reading from a NoSql style source?)
* testing - how to write unit, integration (and once we have them, performance tests)
* public API recommendations
This is related, but not strictly overlapping with:
https://issues.apache.org/jira/browse/BEAM-193
- the Dataflow SDK documentation for "Custom Sources and Sinks" contains some info about writing Sources/Sinks, but it is somewhat out of date, and doesn't reflect the things we've learned recently.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)