Posted to dev@hive.apache.org by "Marta Kuczora (Jira)" <ji...@apache.org> on 2020/12/08 15:31:00 UTC

[jira] [Created] (HIVE-24505) Investigate if the arrays in the FileSinkOperator could be replaced by Lists

Marta Kuczora created HIVE-24505:
------------------------------------

             Summary: Investigate if the arrays in the FileSinkOperator could be replaced by Lists
                 Key: HIVE-24505
                 URL: https://issues.apache.org/jira/browse/HIVE-24505
             Project: Hive
          Issue Type: Task
    Affects Versions: 4.0.0
            Reporter: Marta Kuczora
            Assignee: Marta Kuczora


The FileSinkOperator uses several array variables, such as:
    Path[] outPaths;
    Path[] outPathsCommitted;
    Path[] finalPaths;
    RecordWriter[] outWriters;
    RecordUpdater[] updaters;
Working with these arrays is not always convenient. For example, the createDynamicBucket method has to extend them with new elements. Another case is an UPDATE operation with direct insert enabled: the delete deltas have to be collected separately, because the outPaths array contains only the inserted deltas. These operations would be much easier with lists.
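
To illustrate the difference, here is a minimal sketch of growing a collection of output paths by one element, first as an array and then as a List. It is not the actual FileSinkOperator code: the class and method names are hypothetical, and paths are modeled as plain Strings instead of Path objects so that the example is self-contained.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for illustration only; not taken from FileSinkOperator.
// Paths are modeled as Strings (an assumption) to avoid a Hadoop dependency.
class ArrayVsListSketch {

    // Array version: adding one element means allocating a larger copy
    // and returning it, so every caller must re-assign its reference.
    static String[] extendArray(String[] outPaths, String newPath) {
        String[] extended = Arrays.copyOf(outPaths, outPaths.length + 1);
        extended[outPaths.length] = newPath;
        return extended;
    }

    // List version: appending is a single call on the existing collection.
    static List<String> extendList(List<String> outPaths, String newPath) {
        outPaths.add(newPath);
        return outPaths;
    }
}
```

With arrays, each of the parallel fields (outPaths, finalPaths, outWriters, and so on) would need its own copy-and-reassign step when a new element is added. With lists, each one needs only an add call.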


