You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "alippai (via GitHub)" <gi...@apache.org> on 2023/03/07 20:54:24 UTC

[GitHub] [arrow] alippai opened a new issue, #34491: [Python] Add pyarrow.TableGroupBy() subtables

alippai opened a new issue, #34491:
URL: https://github.com/apache/arrow/issues/34491

   ### Describe the enhancement requested
   
   ```python
   import pyarrow as pa
   t = pa.table([
         pa.array(["a", "a", "b", "b", "c"]),
         pa.array([1, 2, 3, 4, 5]),
   ], names=["keys", "values"])
   
   for subtable in pa.TableGroupBy(t,"keys"): # loop over the 3 tables
     subtable.to_pandas() # with the keys column
     
   # alternatively
   for key_name, subtable in pa.TableGroupBy(t,"keys"): # loop over the 3 tables
     subtable.to_pandas() # without the keys column
   ```
   
   
   ### Component(s)
   
   Python


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow] jorisvandenbossche commented on issue #34491: [Python] Add pyarrow.TableGroupBy() subtables

Posted by "jorisvandenbossche (via GitHub)" <gi...@apache.org>.
jorisvandenbossche commented on issue #34491:
URL: https://github.com/apache/arrow/issues/34491#issuecomment-1473513463

   Some related discussion at https://ursalabs.zulipchat.com/#narrow/stream/180245-dev/topic/Acero.20-.20GroupingSegmenter/near/342103440


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org