You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Peter Varga (Jira)" <ji...@apache.org> on 2020/12/14 20:42:00 UTC
[jira] [Created] (HIVE-24535) Cleanup AcidUtils.Directory and
remove unnecessary filesystem listings
Peter Varga created HIVE-24535:
----------------------------------
Summary: Cleanup AcidUtils.Directory and remove unnecessary filesystem listings
Key: HIVE-24535
URL: https://issues.apache.org/jira/browse/HIVE-24535
Project: Hive
Issue Type: Improvement
Reporter: Peter Varga
Assignee: Peter Varga
* AcidUtils.getAcidState is doing a recursive listing on S3 FileSystem, it already knows the content of each delta and base directory, this could be returned to OrcInputFormat, to avoid listing each delta directory again there.
* AcidUtils.getAcidstate submethods are collecting more and more infos about the state of the data directory. This could be done directly to the final Directory object to avoid 10+ parameters in methods.
* AcidUtils.Directory, OrcInputFormat.AcidDirInfo and AcidUtils.TxnBase can be merged to one class, to clean up duplications.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)