You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@arrow.apache.org by "Ji Liu (JIRA)" <ji...@apache.org> on 2019/04/24 02:37:00 UTC

[jira] [Created] (ARROW-5207) [Java] add APIs to support vector

Ji Liu created ARROW-5207:
-----------------------------

             Summary: [Java] add APIs to support vector 
                 Key: ARROW-5207
                 URL: https://issues.apache.org/jira/browse/ARROW-5207
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Java
            Reporter: Ji Liu


In some scenarios we hope that ValueVector could be reused to reduce creation overhead. This is very common in shuffle stage, it's no need to create ValueVector or realloc buffers every time, suppose that the recordCount of ValueVector and capacity of its buffers is written in stream, when we deserialize it, we can simply judge whether realloc is needed through dataLength.

My proposal is that add APIs in ValueVector to process this logic, otherwise users have to implement by themselves if they want to reuse which is not user-friendly. 

If you agree with this, I would like to take this ticket. Thanks



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)