You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "westonpace (via GitHub)" <gi...@apache.org> on 2023/04/26 17:18:06 UTC

[GitHub] [arrow] westonpace commented on issue #34455: [Python] ArrowNotImplementedError: concatenation of extension>

westonpace commented on issue #34455:
URL: https://github.com/apache/arrow/issues/34455#issuecomment-1523785261

   The call to `combine_chunks` was introduced in https://github.com/huggingface/datasets/pull/5542 which explains why reverting to an older version of `datasets` fixes the issue.
   
   `combine_chunks` relies on array concatenation.  Support for concatenating extension type arrays was added in https://github.com/apache/arrow/pull/14463 which will be part of 12.0.0
   
   So your options are:
   
    * Keep `datasets` pinned
    * Upgrade to Arrow 12.0.0 once it releases
    * File a bug with `datasets` and ask them to stop calling `combine_chunks`
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org