Posted to issues@arrow.apache.org by "jorisvandenbossche (via GitHub)" <gi...@apache.org> on 2023/03/23 08:24:47 UTC

[GitHub] [arrow] jorisvandenbossche opened a new issue, #34701: [Python] Expose Run-End Encoded arrays in PyArrow

jorisvandenbossche opened a new issue, #34701:
URL: https://github.com/apache/arrow/issues/34701

   The initial bindings for Run-End Encoded arrays (the array and type classes) landed in https://github.com/apache/arrow/pull/34570. Some follow-ups:
   
   - [ ] https://github.com/apache/arrow/issues/34686
   - [ ] Support construction in `pa.array(..)`
   - [ ] Conversion pyarrow<->numpy/pandas


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] felipecrv commented on issue #34701: [Python] Expose Run-End Encoded arrays in PyArrow

Posted by "felipecrv (via GitHub)" <gi...@apache.org>.
felipecrv commented on issue #34701:
URL: https://github.com/apache/arrow/issues/34701#issuecomment-1579030034

   `RunEndEncoded` is one of the types listed here as "NotImplemented". No specific reason, just a matter of taking the time to write and test the implementation.
   
   https://github.com/apache/arrow/blob/main/python/pyarrow/src/arrow/python/arrow_to_pandas.cc#L1348
   
   The list of things to implement for a new format is huge, and this is one that I haven't had time to complete yet.




Re: [I] [Python] Expose Run-End Encoded arrays in PyArrow [arrow]

Posted by "AlenkaF (via GitHub)" <gi...@apache.org>.
AlenkaF closed issue #34701: [Python] Expose Run-End Encoded arrays in PyArrow
URL: https://github.com/apache/arrow/issues/34701




Re: [I] [Python] Expose Run-End Encoded arrays in PyArrow [arrow]

Posted by "AlenkaF (via GitHub)" <gi...@apache.org>.
AlenkaF commented on issue #34701:
URL: https://github.com/apache/arrow/issues/34701#issuecomment-2019568037

   Closing this issue as all subtasks are completed.




[GitHub] [arrow] jorisvandenbossche commented on issue #34701: [Python] Expose Run-End Encoded arrays in PyArrow

Posted by "jorisvandenbossche (via GitHub)" <gi...@apache.org>.
jorisvandenbossche commented on issue #34701:
URL: https://github.com/apache/arrow/issues/34701#issuecomment-1578938916

   Example of the current behaviour for conversion to numpy/pandas:
   
   ```
   # Assumes `import pyarrow.compute as pc` was run earlier in the session.
   In [36]: arr = pc.run_end_encode([1, 2, 2, 3, 3, 3])
   
   In [37]: arr.to_numpy()
   ---------------------------------------------------------------------------
   ArrowNotImplementedError                  Traceback (most recent call last)
   Cell In [37], line 1
   ----> 1 arr.to_numpy()
   
   File pyarrow/array.pxi:1530, in pyarrow.lib.Array.to_numpy()
   
   File pyarrow/error.pxi:121, in pyarrow.lib.check_status()
   
   ArrowNotImplementedError: No known equivalent Pandas block for Arrow data of type run_end_encoded<run_ends: int32, values: int64> is known.
   
   In [38]: arr.to_pandas()
   ---------------------------------------------------------------------------
   ArrowNotImplementedError                  Traceback (most recent call last)
   Cell In [38], line 1
   ----> 1 arr.to_pandas()
   
   File pyarrow/array.pxi:855, in pyarrow.lib._PandasConvertible.to_pandas()
   
   File pyarrow/array.pxi:1478, in pyarrow.lib.Array._to_pandas()
   
   File pyarrow/array.pxi:1682, in pyarrow.lib._array_like_to_pandas()
   
   File pyarrow/error.pxi:121, in pyarrow.lib.check_status()
   
   ArrowNotImplementedError: No known equivalent Pandas block for Arrow data of type run_end_encoded<run_ends: int32, values: int64> is known.
   ```
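
For readers unfamiliar with the layout named in the error message (`run_end_encoded<run_ends: int32, values: int64>`), the following is a minimal pure-Python sketch of what run-end encoding represents. It is not PyArrow's implementation: `encode_runs` and `decode_runs` are hypothetical helper names introduced here for illustration only. Each run is stored as one value plus the cumulative (exclusive) index at which that run ends.

```python
from itertools import groupby


def encode_runs(values):
    """Hypothetical helper: return (run_ends, run_values) for a sequence."""
    run_ends, run_values = [], []
    position = 0
    for value, group in groupby(values):
        # Advance by the length of this run of equal consecutive values.
        position += sum(1 for _ in group)
        run_ends.append(position)  # cumulative, exclusive end index of the run
        run_values.append(value)
    return run_ends, run_values


def decode_runs(run_ends, run_values):
    """Hypothetical helper: expand (run_ends, run_values) back to a flat list."""
    decoded = []
    start = 0
    for end, value in zip(run_ends, run_values):
        decoded.extend([value] * (end - start))
        start = end
    return decoded


run_ends, run_values = encode_runs([1, 2, 2, 3, 3, 3])
print(run_ends, run_values)                # [1, 3, 6] [1, 2, 3]
print(decode_runs(run_ends, run_values))   # [1, 2, 2, 3, 3, 3]
```

Until direct conversion is implemented, one likely workaround on the PyArrow side is to decode first with `pc.run_end_decode(arr)` and call `to_numpy()` or `to_pandas()` on the resulting plain array. This assumes a PyArrow version that ships the run-end compute functions.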

