You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Jean-Baptiste Onofré (JIRA)" <ji...@apache.org> on 2016/11/23 06:21:59 UTC

[jira] [Commented] (BEAM-1025) User guide - "How to create Beam IO Transforms"

    [ https://issues.apache.org/jira/browse/BEAM-1025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15689087#comment-15689087 ] 

Jean-Baptiste Onofré commented on BEAM-1025:
--------------------------------------------

I will be happy to help there (as I did some IOs ;)).

> User guide - "How to create Beam IO Transforms"
> -----------------------------------------------
>
>                 Key: BEAM-1025
>                 URL: https://issues.apache.org/jira/browse/BEAM-1025
>             Project: Beam
>          Issue Type: Task
>            Reporter: Stephen Sisk
>            Assignee: Stephen Sisk
>
> Beam has javadocs for how to create a read or write transform, but no friendly user guide on how to get started using BoundedSource/BoundedReader.
> This should cover:
> * background on beam's source/sink API design 
> * design patterns
> * evaluating different data sources (eg, what are the properties of a pub sub system that affect how you should write your UnboundedSource? What is the best design for reading from a NoSql style source?)
> * testing - how to write unit, integration (and once we have them, performance tests)
> * public API recommendations
> This is related, but not strictly overlapping with: 
> https://issues.apache.org/jira/browse/BEAM-193
> - the Dataflow SDK documentation for "Custom Sources and Sinks"  contains some info about writing Sources/Sinks, but it is somewhat out of date, and doesn't reflect the things we've learned recently.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)