You are viewing a plain text version of this content. The canonical link for it is here.

Posted to reviews@spark.apache.org by rxin <gi...@git.apache.org> on 2018/09/18 22:30:28 UTC

[GitHub] spark issue #16677: [SPARK-19355][SQL] Use map output statistics to improve ...

Github user rxin commented on the issue:

    https://github.com/apache/spark/pull/16677
  
    two questions about this (i just saw this from a different place):
    
    1. is numOutput about number of records?
    
    2. how much memory usage will be increased by, for the driver, at scale?



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscribe@spark.apache.org
For additional commands, e-mail: reviews-help@spark.apache.org