Posted to dev@qpid.apache.org by "Ganesh Murthy (JIRA)" <ji...@apache.org> on 2019/04/12 15:16:00 UTC

[jira] [Closed] (DISPATCH-1136) Receiver crash due to data corruption on multicast presettled messages

     [ https://issues.apache.org/jira/browse/DISPATCH-1136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ganesh Murthy closed DISPATCH-1136.
-----------------------------------

> Receiver crash due to data corruption on multicast presettled messages
> ----------------------------------------------------------------------
>
>                 Key: DISPATCH-1136
>                 URL: https://issues.apache.org/jira/browse/DISPATCH-1136
>             Project: Qpid Dispatch
>          Issue Type: Bug
>          Components: Routing Engine
>    Affects Versions: 1.3.0
>         Environment: Fedora 27
> Three routers connected serially as described in DISPATCH-1124
>  
>            Reporter: Chuck Rolke
>            Assignee: Chuck Rolke
>            Priority: Major
>             Fix For: 1.4.0
>
>
> After applying the fixes from DISPATCH-1124 and DISPATCH-1129, receivers in long-running multicast presettled tests still fail with corrupted data sequences. There is no single symptom; several have been observed:
>  * Receivers use all system memory and cache and get hit by the OOM killer
>  * underrun
>  * illegal value for field
> Investigation shows that the function qdr_forward_drop_presettled_CT_LH routinely drops presettled deliveries that have already made forward progress in transmitting bytes to the wire. Once that happens, a race decides whether the message is transmitted successfully or torn down in the middle of transmission.
> To reproduce this error, the sender must supply messages significantly faster than the receiving router can forward them to the next router, which triggers the presettled drops. My test setup does this by running the sender and the receiving router on the same laptop and connecting the next router over a relatively slow WiFi link.
>  
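The failure mode described above can be illustrated with a minimal sketch. It is not the router's code and not necessarily the fix that went into 1.4.0: the delivery_t type, its fields, and drop_idle_presettled() are hypothetical stand-ins invented for illustration. The sketch only shows the kind of guard the description implies, namely that a presettled delivery is a candidate for dropping only while none of its bytes have been written toward the wire.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a router delivery; NOT the real qdr_delivery_t. */
typedef struct delivery {
    struct delivery *next;
    bool   presettled;   /* sender settled the delivery before sending it */
    size_t bytes_sent;   /* bytes already written toward the wire */
} delivery_t;

/*
 * Unlink droppable deliveries from the singly linked queue at *head and
 * return how many were unlinked. A delivery is droppable only if it is
 * presettled AND has not started transmitting; a delivery that has made
 * forward progress stays queued so it cannot be torn down mid-message.
 * Memory ownership stays with the caller in this sketch.
 */
size_t drop_idle_presettled(delivery_t **head)
{
    size_t dropped = 0;
    delivery_t **link = head;

    while (*link) {
        delivery_t *d = *link;
        if (d->presettled && d->bytes_sent == 0) {
            *link = d->next;     /* unlink the idle presettled delivery */
            d->next = NULL;
            dropped++;
        } else {
            link = &d->next;     /* keep in-flight or unsettled deliveries */
        }
    }
    return dropped;
}
```

In the real router the drop decision runs under a lock and the transmit progress is tracked by the message and link state, so the equivalent check would have to read that state consistently; the sketch ignores locking entirely.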



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@qpid.apache.org
For additional commands, e-mail: dev-help@qpid.apache.org