Posted to commits@beam.apache.org by "Jean-Baptiste Onofré (JIRA)" <ji...@apache.org> on 2016/09/23 15:55:20 UTC

[jira] [Created] (BEAM-671) Update Read PTransform to implicitly use maxNumRecords and maxReadTime

Jean-Baptiste Onofré created BEAM-671:
-----------------------------------------

             Summary: Update Read PTransform to implicitly use maxNumRecords and maxReadTime
                 Key: BEAM-671
                 URL: https://issues.apache.org/jira/browse/BEAM-671
             Project: Beam
          Issue Type: Improvement
          Components: sdk-java-core
            Reporter: Jean-Baptiste Onofré
            Assignee: Jean-Baptiste Onofré


Today, the {{Read}} {{PTransform}} of almost every IO that uses an {{UnboundedSource}} does the following in its {{apply()}} method:

{code}
      // Start from the plain unbounded read of the IO's source.
      org.apache.beam.sdk.io.Read.Unbounded<byte[]> unbounded =
          org.apache.beam.sdk.io.Read.from(getSource());

      PTransform<PBegin, PCollection<byte[]>> transform = unbounded;

      // Long.MAX_VALUE and null mean "no limit set"; a record limit takes
      // precedence over a time limit.
      if (maxNumRecords != Long.MAX_VALUE) {
        transform = unbounded.withMaxNumRecords(maxNumRecords);
      } else if (maxReadTime != null) {
        transform = unbounded.withMaxReadTime(maxReadTime);
      }

      return input.getPipeline().apply(transform);
{code}

To avoid duplicating this code in every IO, it would make sense to do it by default in {{org.apache.beam.sdk.io.Read}}.
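
As a rough illustration only, the decision that each IO repeats could live in one shared helper. The sketch below does not use the Beam API: {{ReadLimits}}, {{Kind}} and {{select}} are hypothetical names, and {{java.time.Duration}} stands in for whatever duration type the IO actually uses. It only models which limit would be applied, following the same precedence as the snippet above.

```java
import java.time.Duration;

/** Hypothetical stand-in (not part of Beam) modelling which limit a shared Read would apply. */
final class ReadLimits {

    /** The kind of read that results from the configured limits. */
    enum Kind { UNBOUNDED, MAX_NUM_RECORDS, MAX_READ_TIME }

    private ReadLimits() {}

    /**
     * Mirrors the if/else chain from the issue description:
     * Long.MAX_VALUE and null mean "no limit set", and a record
     * limit takes precedence over a time limit.
     */
    static Kind select(long maxNumRecords, Duration maxReadTime) {
        if (maxNumRecords != Long.MAX_VALUE) {
            return Kind.MAX_NUM_RECORDS;
        } else if (maxReadTime != null) {
            return Kind.MAX_READ_TIME;
        }
        return Kind.UNBOUNDED;
    }
}
```

With something along these lines, an IO would pass its two settings to the shared code once instead of carrying its own copy of the branch. How this would actually be exposed by {{org.apache.beam.sdk.io.Read}} is left open here.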



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)