You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Joris Van den Bossche (Jira)" <ji...@apache.org> on 2021/07/05 10:05:00 UTC

[jira] [Assigned] (ARROW-13258) [Python] Improve the repr of ParquetFileFragment

     [ https://issues.apache.org/jira/browse/ARROW-13258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Joris Van den Bossche reassigned ARROW-13258:
---------------------------------------------

    Assignee: Joris Van den Bossche

> [Python] Improve the repr of ParquetFileFragment
> ------------------------------------------------
>
>                 Key: ARROW-13258
>                 URL: https://issues.apache.org/jira/browse/ARROW-13258
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Joris Van den Bossche
>            Assignee: Joris Van den Bossche
>            Priority: Major
>             Fix For: 5.0.0
>
>
> Compare with the legacy version:
> {code}
> In [5]: d1 = pq.ParquetDataset("test_partitioned")
> In [6]: d2 = pq.ParquetDataset("test_partitioned", use_legacy_dataset=False)
> In [7]: d1.pieces[0]
> <ipython-input-30-45c15fc36b93>:1: DeprecationWarning: ParquetDataset.pieces attribute is deprecated
>   d1.pieces[0]
> Out[7]: ParquetDatasetPiece('test_partitioned/B=a/1f73a8652e6242b58de7582b1a514907.parquet', row_group=None, partition_keys=[('B', 0)])
> In [8]: d2.fragments[0]
> Out[8]: <pyarrow._dataset.ParquetFileFragment at 0x7fec0f88c710>
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)