You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Jorge (Jira)" <ji...@apache.org> on 2020/08/18 06:13:00 UTC

[jira] [Created] (ARROW-9779) [Rust] [DataFusion] Increase stability of average accumulator

Jorge created ARROW-9779:
----------------------------

             Summary: [Rust] [DataFusion] Increase stability of average accumulator
                 Key: ARROW-9779
                 URL: https://issues.apache.org/jira/browse/ARROW-9779
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Rust, Rust - DataFusion
            Reporter: Jorge
            Assignee: Jorge


Currently, our method to compute the average is based on:

1. compute sum of all terms
2. compute count of all terms
3. compute sum / count

however, the sum may overflow.

There is a typical solution to this based on an online formula described e.g. [here|http://www.heikohoffmann.de/htmlthesis/node134.html] to keep the numbers small.




--
This message was sent by Atlassian Jira
(v8.3.4#803005)