You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@storm.apache.org by "Rick Kellogg (JIRA)" <ji...@apache.org> on 2015/09/29 04:39:04 UTC

[jira] [Updated] (STORM-960) Hive-Bolt can lose tuples when flushing data

     [ https://issues.apache.org/jira/browse/STORM-960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rick Kellogg updated STORM-960:
-------------------------------
    Component/s:     (was: external)
                 storm-hive

> Hive-Bolt can lose tuples when flushing data
> --------------------------------------------
>
>                 Key: STORM-960
>                 URL: https://issues.apache.org/jira/browse/STORM-960
>             Project: Apache Storm
>          Issue Type: Improvement
>          Components: storm-hive
>            Reporter: Aaron Dossett
>            Assignee: Aaron Dossett
>            Priority: Minor
>             Fix For: 0.11.0
>
>
> In HiveBolt's execute method tuples are ack'd as they are received.  When a batchsize of tuples has been received, the writers are flushed.  However, if the flush fails only the most recent tuple will be marked as failed.  All prior tuples will already have been ack'd.  This creates a window for data loss.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)