You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Ben Kietzman (Jira)" <ji...@apache.org> on 2021/05/04 16:56:00 UTC

[jira] [Comment Edited] (ARROW-8928) [C++] Measure microperformance associated with ExecBatchIterator

    [ https://issues.apache.org/jira/browse/ARROW-8928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17339145#comment-17339145 ] 

Ben Kietzman edited comment on ARROW-8928 at 5/4/21, 4:55 PM:
--------------------------------------------------------------

ARROW-12010 adds KeyColumnData which is a simplified non-owning wrapper for non-nested ArrayData. Reusing it for ExecBatch's values would be a worthwhile optimization


was (Author: bkietz):
ARROW-12010 adds KeyColumnData which is a simplified non-owning wrapper for non-nested ArrayData. Reusing it for ExecBatch would be a worthwhile optimization

> [C++] Measure microperformance associated with ExecBatchIterator
> ----------------------------------------------------------------
>
>                 Key: ARROW-8928
>                 URL: https://issues.apache.org/jira/browse/ARROW-8928
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>            Reporter: Wes McKinney
>            Assignee: Wes McKinney
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 5.0.0
>
>          Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> {{arrow::compute::ExecBatch}} uses a vector of {{arrow::Datum}} to contain a collection of ArrayData and Scalar objects for kernel execution. It would be helpful to know how many nanoseconds of overhead is associated with basic interactions with this data structure to know the cost of using our vendored variant, and other such issues. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)