Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2021/07/11 18:26:27 UTC

[GitHub] [arrow-datafusion] andygrove opened a new issue #711: Ballista: Executor must return statistics in CompletedTask / CompletedJob

andygrove opened a new issue #711:
URL: https://github.com/apache/arrow-datafusion/issues/711


   **Is your feature request related to a problem or challenge? Please describe what you are trying to do.**
   We cannot fix the shuffle mechanism until we have partition stats, or ShuffleReaderExec will attempt to read empty partitions, causing an error.
   
   **Describe the solution you'd like**
   The scheduler should receive partition stats and only try to read from non-empty shuffle partitions.
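
The filtering step could look roughly like the sketch below. The `PartitionStats` struct and the `non_empty_partitions` function are hypothetical names used for illustration only; they are not the actual Ballista types or protobuf messages. The sketch also assumes that a partition with an unknown row count should still be read.

```rust
/// Hypothetical per-partition statistics reported by an executor when a
/// task completes. Not the real Ballista type.
#[derive(Debug, Clone, PartialEq)]
pub struct PartitionStats {
    /// Index of the shuffle output partition.
    pub partition_id: usize,
    /// Number of rows written, or `None` if the executor did not report it.
    pub num_rows: Option<u64>,
}

/// Returns the ids of the partitions a shuffle reader should fetch.
/// Partitions known to hold zero rows are skipped. Partitions with an
/// unknown row count are kept (an assumption made to stay on the safe side).
pub fn non_empty_partitions(stats: &[PartitionStats]) -> Vec<usize> {
    stats
        .iter()
        .filter(|s| s.num_rows != Some(0))
        .map(|s| s.partition_id)
        .collect()
}
```

With stats like these, the scheduler would plan shuffle reads only against the returned partition ids, so the reader never tries to open a file for a partition known to be empty.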
   
   **Describe alternatives you've considered**
   As a workaround, we could write empty shuffle files for empty partitions.
   
   **Additional context**
   None
   

