Posted to dev@hive.apache.org by "Ádám Szita (Jira)" <ji...@apache.org> on 2019/10/02 19:23:00 UTC
[jira] [Created] (HIVE-22284) Improve LLAP CacheContentsTracker to collect and display correct statistics
Ádám Szita created HIVE-22284:
---------------------------------
Summary: Improve LLAP CacheContentsTracker to collect and display correct statistics
Key: HIVE-22284
URL: https://issues.apache.org/jira/browse/HIVE-22284
Project: Hive
Issue Type: Improvement
Components: llap
Reporter: Ádám Szita
Assignee: Ádám Szita
When keeping track of which buffers correspond to what Hive objects, CacheContentsTracker relies on cache tags.
Currently a tag is a plain String that ideally holds the DB name, the table name, and a partition spec, joined by '.' and '/'. This information is derived from the Path of the file being cached, so it sometimes produces a wrong tag, especially for external tables.
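To illustrate why a path-derived tag can be wrong, here is a minimal sketch. It is not the actual Hive code: the class PathTagSketch, the method tagFromPath, and the assumed directory layout (db directory, then table directory, then key=value partition directories) are all hypothetical.

```java
import java.util.ArrayDeque;
import java.util.Deque;

// Hypothetical illustration only; not the LLAP implementation.
final class PathTagSketch {
    private PathTagSketch() {}

    // Guesses "db.table/partspec" purely from the directory names of a file path.
    static String tagFromPath(String filePath) {
        String[] parts = filePath.split("/");
        Deque<String> partitionDirs = new ArrayDeque<>();

        // Walk upwards from the file's parent directory, collecting key=value dirs.
        int i = parts.length - 2;
        while (i >= 0 && parts[i].contains("=")) {
            partitionDirs.addFirst(parts[i]);
            i--;
        }

        // Assumption: the next directory up is the table, and the one above it is the DB.
        String table = i >= 0 ? parts[i] : "";
        String db = i >= 1 ? parts[i - 1] : "";
        if (db.endsWith(".db")) {
            db = db.substring(0, db.length() - ".db".length());
        }

        StringBuilder tag = new StringBuilder(db).append('.').append(table);
        for (String partitionDir : partitionDirs) {
            tag.append('/').append(partitionDir);
        }
        return tag.toString();
    }
}
```

Under these assumptions, a managed-table path such as /warehouse/sales.db/orders/dt=2019-10-02/000000_0 yields sales.orders/dt=2019-10-02, which matches the layout. An external table stored at /data/landing/events/dt=2019-10-02/part-0.orc yields landing.events/dt=2019-10-02, although the table could be registered under an entirely different DB and table name in the metastore.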
There is also a bug in calculating aggregated stats for a 'parent' tag (the table that a partition belongs to): the overall maxCount and maxSize do not add up to the sum of those values across the partitions. This happens when buffers are removed from the cache.
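The following sketch shows one way such a mismatch can arise. It assumes that maxCount and maxSize are high-water marks kept independently per tag and that each event is also applied to the parent tag; the real CacheContentsTracker logic may differ. TagStatsSketch and all of its members are hypothetical names.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration only; not the LLAP implementation.
final class TagStatsSketch {
    static final class Stats {
        long count, size, maxCount, maxSize;

        void add(long bytes) {
            count++;
            size += bytes;
            // High-water marks only ever grow.
            maxCount = Math.max(maxCount, count);
            maxSize = Math.max(maxSize, size);
        }

        void remove(long bytes) {
            count--;
            size -= bytes;
        }
    }

    private final Map<String, Stats> byTag = new HashMap<>();

    Stats stats(String tag) {
        return byTag.computeIfAbsent(tag, t -> new Stats());
    }

    // "db.table/part=1" -> "db.table"; a table-level tag has no parent.
    private static String parentOf(String tag) {
        int slash = tag.indexOf('/');
        return slash < 0 ? null : tag.substring(0, slash);
    }

    void bufferAdded(String tag, long bytes) {
        stats(tag).add(bytes);
        String parent = parentOf(tag);
        if (parent != null) {
            stats(parent).add(bytes);
        }
    }

    void bufferRemoved(String tag, long bytes) {
        stats(tag).remove(bytes);
        String parent = parentOf(tag);
        if (parent != null) {
            stats(parent).remove(bytes);
        }
    }

    // Sum of the partition-level maxCount values under a table tag.
    long sumOfPartitionMaxCounts(String tableTag) {
        long sum = 0;
        for (Map.Entry<String, Stats> e : byTag.entrySet()) {
            if (e.getKey().startsWith(tableTag + "/")) {
                sum += e.getValue().maxCount;
            }
        }
        return sum;
    }
}
```

With this sketch, adding two buffers to db.t/p=1, removing both, and then adding two buffers to db.t/p=2 leaves each partition with a maxCount of 2 (sum 4), while the parent db.t never held more than two buffers at once and so reports a maxCount of 2.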
--
This message was sent by Atlassian Jira
(v8.3.4#803005)