You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by "Weijie Guo (Jira)" <ji...@apache.org> on 2022/08/22 06:05:00 UTC

[jira] [Created] (FLINK-29053) Hybrid shuffle has concurrent modification of buffer when compression is enabled

Weijie Guo created FLINK-29053:
----------------------------------

             Summary: Hybrid shuffle has concurrent modification of buffer when compression is enabled
                 Key: FLINK-29053
                 URL: https://issues.apache.org/jira/browse/FLINK-29053
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Network
    Affects Versions: 1.16.0
            Reporter: Weijie Guo
             Fix For: 1.16.0


When the downstream thread obtains the buffer and consuming it, if the data is compressed in the spilling thread and copied to the original buffer in the same time, since the two threads share the same memory data, the consuming thread will consume incorrect data, causing problems such as deserialize the data disorder.
Considering that the downstream consumption is prohibited during compression, or block spilling thread when the downstream consumption is not completed will have a great impact on performance. I think we should move the compression operation to the write thread and store the compressed buffer directly in memory.

 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)