Posted to jira@arrow.apache.org by "Rok Mihevc (Jira)" <ji...@apache.org> on 2020/10/22 17:13:00 UTC

[jira] [Comment Edited] (ARROW-1614) [C++] Add a Tensor logical value type with constant dimensions, implemented using ExtensionType

    [ https://issues.apache.org/jira/browse/ARROW-1614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17219199#comment-17219199 ] 

Rok Mihevc edited comment on ARROW-1614 at 10/22/20, 5:12 PM:
--------------------------------------------------------------

As proposed by [~jorisvandenbossche], I've made a draft PR ([https://github.com/apache/arrow/pull/8510]) with a Python prototype of the logic. It was heavily inspired by [~bryanc]'s text-extensions-for-pandas. Once we agree on the design, we can rewrite it in C++.

Since this covers the case where all tensors in the array have the same shape, I propose we store the data in a single Tensor. Is there a good reason not to do that?
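
To make the single-Tensor idea concrete, here is a minimal standard-library sketch. It is not taken from the PR: the class name `FixedShapeTensorColumn` and its methods are hypothetical, and it assumes row-major (C-order) element layout. All values live in one flat buffer whose logical shape is (length, *element_shape), so element i is just a slice.

```python
from array import array
from math import prod


class FixedShapeTensorColumn:
    """Hypothetical container: N tensors of one shape in a single flat buffer."""

    def __init__(self, shape, typecode="d"):
        self.shape = tuple(shape)      # shape of each individual tensor
        self.size = prod(self.shape)   # number of elements per tensor
        self.data = array(typecode)    # one contiguous buffer for all tensors

    def append(self, flat_values):
        # Values are assumed to arrive flattened in row-major (C) order.
        if len(flat_values) != self.size:
            raise ValueError("tensor does not match the column's shape")
        self.data.extend(flat_values)

    def __len__(self):
        return len(self.data) // self.size

    def __getitem__(self, i):
        # Element i is a slice of the shared buffer.
        if not 0 <= i < len(self):
            raise IndexError(i)
        start = i * self.size
        return self.data[start:start + self.size]

    @property
    def tensor_shape(self):
        # Shape of the single backing tensor: (length, *element_shape).
        return (len(self),) + self.shape
```

With this layout, no per-element offsets are needed because every element has the same size; the trade-off is that a slice of the column must carry an offset into the shared buffer.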

I assume we should support non-contiguous tensors. I'll add that.
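
As an illustration of what non-contiguous support involves, the sketch below addresses a buffer through explicit strides, so a transposed view can be read without copying the underlying data. The helper names are hypothetical, strides are counted in elements rather than bytes, and the contiguity check is deliberately simplified (it ignores the special case of length-1 dimensions).

```python
from itertools import product


def c_contiguous_strides(shape):
    """Element strides (not byte strides) of a row-major tensor."""
    strides = []
    step = 1
    for dim in reversed(shape):
        strides.append(step)
        step *= dim
    return tuple(reversed(strides))


def is_contiguous(shape, strides):
    # Simplified check: strides must equal the row-major strides exactly.
    return tuple(strides) == c_contiguous_strides(shape)


def gather(buffer, shape, strides, offset=0):
    """Copy a strided view into a new row-major list."""
    out = []
    for index in product(*(range(d) for d in shape)):
        pos = offset + sum(i * s for i, s in zip(index, strides))
        out.append(buffer[pos])
    return out


# A 2x3 row-major buffer [[0, 1, 2], [3, 4, 5]] viewed as its 3x2 transpose.
buffer = list(range(6))
transposed = gather(buffer, shape=(3, 2), strides=(1, 3))
```

A design question this raises is whether non-contiguous inputs should be stored with their strides or normalized to row-major on write, as `gather` does here.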

Any comments at this point?

[~chrish42] - feel free to jump in any time.


> [C++] Add a Tensor logical value type with constant dimensions, implemented using ExtensionType
> -----------------------------------------------------------------------------------------------
>
>                 Key: ARROW-1614
>                 URL: https://issues.apache.org/jira/browse/ARROW-1614
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++, Format
>            Reporter: Wes McKinney
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 20m
>  Remaining Estimate: 0h
>
> In an Arrow table, we would like to add support for a column whose cells each contain a tensor value, with all tensors having the same dimensions. These would be stored as binary values, plus some metadata to store the type and shape/strides.
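
One way to picture the "binary value plus metadata" layout described in the issue is the standard-library sketch below. Everything in it is an assumption made for illustration: the JSON metadata format, the `FORMATS` table, the helper names, and the little-endian row-major packing are not part of the issue or of Arrow's API.

```python
import json
import struct
from math import prod

# Hypothetical mapping from a type name to a struct format character.
FORMATS = {"int32": "i", "float64": "d"}


def make_metadata(dtype, shape):
    # Assumed layout: one type and one shape shared by every cell in the column.
    return json.dumps({"dtype": dtype, "shape": list(shape)})


def encode_cell(values, metadata):
    """Pack one tensor's flattened values into a binary cell."""
    meta = json.loads(metadata)
    count = prod(meta["shape"])
    if len(values) != count:
        raise ValueError("value count does not match shape")
    # Little-endian, row-major packing.
    return struct.pack(f"<{count}{FORMATS[meta['dtype']]}", *values)


def decode_cell(blob, metadata):
    """Unpack a binary cell back into a flat list of values."""
    meta = json.loads(metadata)
    count = prod(meta["shape"])
    return list(struct.unpack(f"<{count}{FORMATS[meta['dtype']]}", blob))


meta = make_metadata("int32", (2, 2))
blob = encode_cell([1, 2, 3, 4], meta)
```

Because the shape is fixed for the whole column, the metadata is stored once rather than per cell, and every binary value has the same length.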


