You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@storm.apache.org by "Aaron Dossett (JIRA)" <ji...@apache.org> on 2015/07/23 14:50:05 UTC

[jira] [Created] (STORM-960) Hive-Bolt can lose tuples when flushing data

Aaron Dossett created STORM-960:
-----------------------------------

             Summary: Hive-Bolt can lose tuples when flushing data
                 Key: STORM-960
                 URL: https://issues.apache.org/jira/browse/STORM-960
             Project: Apache Storm
          Issue Type: Improvement
          Components: external
            Reporter: Aaron Dossett
            Priority: Minor


In HiveBolt's execute method tuples are ack'd as they are received.  When a batchsize of tuples has been received, the writers are flushed.  However, if the flush fails only the most recent tuple will be marked as failed.  All prior tuples will already have been ack'd.  This creates a window for data loss.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)