You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by "sivabalan narayanan (Jira)" <ji...@apache.org> on 2021/02/01 00:09:00 UTC
[jira] [Created] (HUDI-1570) Add Avg record size in commit metadata
sivabalan narayanan created HUDI-1570:
-----------------------------------------
Summary: Add Avg record size in commit metadata
Key: HUDI-1570
URL: https://issues.apache.org/jira/browse/HUDI-1570
Project: Apache Hudi
Issue Type: Improvement
Components: Utilities
Reporter: sivabalan narayanan
Many users want to understand what would be their avg record size. As of now, there is no easy way to fetch record size for the end user. Even w/ hudi-cli, we could decipher from commit metadata, but we need to make some rough calculation. So, it would be better if we store the avg record size w/ WriteStats (total bytes written/ total records written) , as well as in commit metadata. So, in hudi_cli, we could expose this info along w/ "commit showpartitions" or expose another command "commit showmetadata" or something.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)