You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@beam.apache.org by "Jean-Baptiste Onofré (JIRA)" <ji...@apache.org> on 2016/08/22 09:13:21 UTC

[jira] [Created] (BEAM-573) TextIO.read() should support sharding

Jean-Baptiste Onofré created BEAM-573:
-----------------------------------------

             Summary: TextIO.read() should support sharding
                 Key: BEAM-573
                 URL: https://issues.apache.org/jira/browse/BEAM-573
             Project: Beam
          Issue Type: Improvement
          Components: sdk-java-extensions
    Affects Versions: 0.1.0-incubating, 0.2.0-incubating
            Reporter: Jean-Baptiste Onofré
            Assignee: Jean-Baptiste Onofré


Currently, TextIO supports sharding (or {{withoutSharding()}} option) only in {{Write}}.

I think it makes sense to also supports sharding on the {{Read}} behaving like:
read the input in chunks of n MB (to avoid heap OutOfMemory with large files).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)