You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Peter Varga (Jira)" <ji...@apache.org> on 2020/12/14 20:42:00 UTC

[jira] [Created] (HIVE-24535) Cleanup AcidUtils.Directory and remove unnecessary filesystem listings

Peter Varga created HIVE-24535:
----------------------------------

             Summary: Cleanup AcidUtils.Directory and remove unnecessary filesystem listings
                 Key: HIVE-24535
                 URL: https://issues.apache.org/jira/browse/HIVE-24535
             Project: Hive
          Issue Type: Improvement
            Reporter: Peter Varga
            Assignee: Peter Varga


* AcidUtils.getAcidState is doing a recursive listing on S3 FileSystem, it already knows the content of each delta and base directory, this could be returned to OrcInputFormat, to avoid listing each delta directory again there.
* AcidUtils.getAcidstate submethods are collecting more and more infos about the state of the data directory. This could be done directly to the final Directory object to avoid 10+ parameters in methods.
* AcidUtils.Directory, OrcInputFormat.AcidDirInfo and AcidUtils.TxnBase can be merged to one class, to clean up duplications.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)