Posted to commits@hudi.apache.org by "sivabalan narayanan (Jira)" <ji...@apache.org> on 2021/02/01 00:09:00 UTC

[jira] [Created] (HUDI-1570) Add Avg record size in commit metadata

sivabalan narayanan created HUDI-1570:
-----------------------------------------

             Summary: Add Avg record size in commit metadata
                 Key: HUDI-1570
                 URL: https://issues.apache.org/jira/browse/HUDI-1570
             Project: Apache Hudi
          Issue Type: Improvement
          Components: Utilities
            Reporter: sivabalan narayanan


Many users want to know the average record size of their tables. As of now, there is no easy way for an end user to fetch it. With hudi-cli it can be deciphered from the commit metadata, but only through a rough manual calculation. It would be better to store the average record size (total bytes written / total records written) in WriteStats, as well as in the commit metadata. hudi-cli could then expose this info along with "commit showpartitions", or through another command such as "commit showmetadata".
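
For illustration, the rough calculation mentioned above could look like the following Python sketch. It is not Hudi code: the function name is hypothetical, and the dictionary layout and the field names "totalWriteBytes" and "numWrites" are assumptions about how per-partition write stats appear in commit metadata, so check them against an actual commit file before relying on them.

```python
from typing import Dict, List


def avg_record_size(partition_to_write_stats: Dict[str, List[dict]]) -> float:
    """Return total bytes written / total records written across all partitions.

    Hypothetical helper: the input maps a partition path to a list of
    write-stat dictionaries. The keys "totalWriteBytes" and "numWrites"
    are assumed field names, not a confirmed Hudi contract.
    """
    total_bytes = 0
    total_records = 0
    for stats in partition_to_write_stats.values():
        for stat in stats:
            total_bytes += stat["totalWriteBytes"]
            total_records += stat["numWrites"]
    # Avoid division by zero for a commit that wrote no records.
    return total_bytes / total_records if total_records else 0.0


# Made-up sample data, only to show the expected input shape.
sample = {
    "2021/01/31": [
        {"totalWriteBytes": 1000, "numWrites": 10},
        {"totalWriteBytes": 3000, "numWrites": 30},
    ],
    "2021/02/01": [
        {"totalWriteBytes": 2000, "numWrites": 10},
    ],
}

print(avg_record_size(sample))
```

Summing bytes and records first and dividing once weights every record equally; averaging the per-file averages would instead give small files too much influence.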



--
This message was sent by Atlassian Jira
(v8.3.4#803005)