Posted to commits@beam.apache.org by "Stas Levin (JIRA)" <ji...@apache.org> on 2017/04/25 08:57:04 UTC
[jira] [Created] (BEAM-2072) MicrobatchSource.reader stops reading after reaching maxNumRecords for the first time
Stas Levin created BEAM-2072:
--------------------------------
Summary: MicrobatchSource.reader stops reading after reaching maxNumRecords for the first time
Key: BEAM-2072
URL: https://issues.apache.org/jira/browse/BEAM-2072
Project: Beam
Issue Type: Bug
Components: runner-spark
Affects Versions: Not applicable
Reporter: Stas Levin
Assignee: Stas Levin
{{MicrobatchSource.Reader}} stops reading further data after the first batch in which it reaches {{maxNumRecords}}. Subsequent read requests check {{recordsRead}} and conclude that no more data can be read.
The root cause is that {{recordsRead}} is maintained throughout the reader's lifecycle, which can span multiple Spark batches.
We should reset {{recordsRead}} every time the reader starts reading a fresh batch.
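To illustrate the behaviour described above, here is a minimal, self-contained sketch. It is not the actual {{MicrobatchSource.Reader}} code: the class {{BatchLimitedReader}}, its {{resetPerBatch}} flag and its {{readBatch}} method are hypothetical names invented for this example. Only {{recordsRead}} and {{maxNumRecords}} come from the issue itself.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

/** Hypothetical stand-in for a reader that is capped at maxNumRecords per batch. */
class BatchLimitedReader {
    private final Iterator<Integer> source;
    private final long maxNumRecords;
    // Assumption for illustration: true models the proposed fix, false the reported bug.
    private final boolean resetPerBatch;
    private long recordsRead = 0;

    BatchLimitedReader(Iterator<Integer> source, long maxNumRecords, boolean resetPerBatch) {
        this.source = source;
        this.maxNumRecords = maxNumRecords;
        this.resetPerBatch = resetPerBatch;
    }

    /** Reads one batch, stopping once recordsRead reaches maxNumRecords. */
    List<Integer> readBatch() {
        if (resetPerBatch) {
            // Proposed fix: start every fresh batch with a clean counter.
            recordsRead = 0;
        }
        List<Integer> batch = new ArrayList<>();
        while (recordsRead < maxNumRecords && source.hasNext()) {
            batch.add(source.next());
            recordsRead++;
        }
        return batch;
    }
}
```

With {{resetPerBatch}} set to false, the second call to {{readBatch}} returns an empty list even though the source still has data, because the counter carried over from the first batch already equals the limit. With it set to true, each batch yields up to {{maxNumRecords}} records.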