You are viewing a plain text version of this content. The canonical link for it is here.

Posted to jira@arrow.apache.org by "Micah Kornfield (Jira)" <ji...@apache.org> on 2021/09/12 21:25:00 UTC

[jira] [Resolved] (ARROW-13733) [Java] Allow JDBC adapters to reuse vector schema roots

     [ https://issues.apache.org/jira/browse/ARROW-13733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Micah Kornfield resolved ARROW-13733.
-------------------------------------
    Fix Version/s: 6.0.0
       Resolution: Fixed

Issue resolved by pull request 10983
[https://github.com/apache/arrow/pull/10983]

> [Java] Allow JDBC adapters to reuse vector schema roots
> -------------------------------------------------------
>
>                 Key: ARROW-13733
>                 URL: https://issues.apache.org/jira/browse/ARROW-13733
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: Java
>            Reporter: Liya Fan
>            Assignee: Liya Fan
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 6.0.0
>
>          Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> According to the current design of the JDBC adapter, it is not possible to reuse the vector schema roots. That is, a new vector schema root is created and released for each batch.
> This can cause performance problems, because in many scenarios, the client code only reads data in vector schema root. So the vector schema roots can be reused in the following cycle: populate data -> client use data -> populate data -> ...
> The current design has another problem. For most times, it has two alternating vector schema roots in memory, causing a large waste of memory, especially for large batches.
> We solve both problems by providing a flag in the config, which allows the user to reuse the vector shema roots. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)