Posted to issues@beam.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/04/21 20:07:00 UTC

[jira] [Work logged] (BEAM-9014) Update CachingShuffleBatchReader to record weights by size in bytes

     [ https://issues.apache.org/jira/browse/BEAM-9014?focusedWorklogId=425865&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-425865 ]

ASF GitHub Bot logged work on BEAM-9014:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 21/Apr/20 20:06
            Start Date: 21/Apr/20 20:06
    Worklog Time Spent: 10m 
      Work Description: lukecwik opened a new pull request #11483:
URL: https://github.com/apache/beam/pull/11483


   This change negatively impacts shuffle performance for large iterables.
   
   R: @tudorm @tysonjh 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 425865)
    Time Spent: 50m  (was: 40m)

> Update CachingShuffleBatchReader to record weights by size in bytes
> -------------------------------------------------------------------
>
>                 Key: BEAM-9014
>                 URL: https://issues.apache.org/jira/browse/BEAM-9014
>             Project: Beam
>          Issue Type: Improvement
>          Components: runner-dataflow
>            Reporter: Luke Cwik
>            Assignee: Tyson Hamilton
>            Priority: Minor
>             Fix For: 2.21.0
>
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> Currently, the CachingShuffleBatchReader bounds its cache by the number of batches rather than by the size of those batches. This task updates CachingShuffleBatchReader to weigh cached batches by their size in bytes.
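
To illustrate the difference between bounding a cache by entry count and bounding it by size in bytes, here is a minimal sketch. It is not Beam's implementation: the class ByteWeightedCache and all of its methods are hypothetical names, and a batch is assumed to be a plain byte[] whose weight is its length. The sketch evicts least-recently-used entries until the total weight fits the byte budget.

```java
import java.util.Iterator;
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical sketch: an LRU cache bounded by total payload bytes, not by entry count. */
final class ByteWeightedCache<K> {
  private final long maxWeightBytes;
  private long currentWeightBytes = 0;
  // accessOrder = true: iteration runs from least to most recently used.
  private final LinkedHashMap<K, byte[]> entries = new LinkedHashMap<>(16, 0.75f, true);

  ByteWeightedCache(long maxWeightBytes) {
    this.maxWeightBytes = maxWeightBytes;
  }

  /** Stores a batch, weighing it by its length in bytes (an assumption of this sketch). */
  void put(K key, byte[] batch) {
    byte[] previous = entries.put(key, batch);
    if (previous != null) {
      currentWeightBytes -= previous.length;
    }
    currentWeightBytes += batch.length;
    evictUntilWithinBudget();
  }

  /** Returns the cached batch, or null if absent; marks the entry as recently used. */
  byte[] get(K key) {
    return entries.get(key);
  }

  /** Checks for presence without changing the recency order. */
  boolean contains(K key) {
    return entries.containsKey(key);
  }

  long weightBytes() {
    return currentWeightBytes;
  }

  // Drops least-recently-used entries until the total weight fits the budget.
  // A single batch larger than the whole budget is therefore evicted immediately.
  private void evictUntilWithinBudget() {
    Iterator<Map.Entry<K, byte[]>> it = entries.entrySet().iterator();
    while (currentWeightBytes > maxWeightBytes && it.hasNext()) {
      Map.Entry<K, byte[]> eldest = it.next();
      currentWeightBytes -= eldest.getValue().length;
      it.remove();
    }
  }
}
```

With a count-based bound, a few very large batches can occupy far more memory than many small ones while counting the same. Weighing by bytes keeps memory use predictable. In production code, a library cache that supports a maximum weight and a weigher function (for example Guava's CacheBuilder) would probably be preferable to a hand-rolled map.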



--
This message was sent by Atlassian Jira
(v8.3.4#803005)