You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "GregoryKimball (via GitHub)" <gi...@apache.org> on 2023/02/17 00:26:14 UTC

[GitHub] [arrow] GregoryKimball commented on pull request #33925: GH-33923: [Docs] Tensor canonical extension type specification

GregoryKimball commented on PR #33925:
URL: https://github.com/apache/arrow/pull/33925#issuecomment-1433923846

   Thank you everyone for this discussion. The proposal looks good to me, and I would like your help checking my understanding.
   
   In [RAPIDS libcudf](https://github.com/rapidsai/cudf), we would use an nested List type to represent the proposed Tensor type. In the case where `permutation` is not provided, I believe we could use the elements zero-copy and then create offsets based on the `shape` parameter. 
   
   So this Tensor example with 2 rows of (2,3) tensors:
   ```
   value_type: int32
   shape: (2,3)
   elements: [1,2,3,4,5,6,7,8,9,10,11,12]
   ```
   would correspond to these List column children:
   ```
   data_type: int32
   offsets: [0,2,4]
     offsets: [0,3,6,9,12]
     elements: [1,2,3,4,5,6,7,8,9,10,11,12]
   ```
   
   Would you please let me know if my conversion looks correct?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org