You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by ppadma <gi...@git.apache.org> on 2018/04/20 16:55:00 UTC

[GitHub] drill issue #1227: Drill 6236: batch sizing for hash join

Github user ppadma commented on the issue:

    https://github.com/apache/drill/pull/1227
  
    @Ben-Zvi Thanks a lot for the review. updated PR with review comments taken care of. Please take a look.
    
    Regarding spill files, here are my thoughts. 
    For build side, I am using aggregate statistics i.e. average of all batches.  On probe side, I am using stats for each batch coming in and adjusting the output row count. So,  we can skip applying sizing for batches spilled from build side and continue to do what I am doing on the probe side. Once your code is merged in, I will refactor the code as needed.



---