Posted to jira@arrow.apache.org by "Joris Van den Bossche (Jira)" <ji...@apache.org> on 2021/07/05 10:04:00 UTC

[jira] [Created] (ARROW-13258) [Python] Improve the repr of ParquetFileFragment

Joris Van den Bossche created ARROW-13258:
---------------------------------------------

             Summary: [Python] Improve the repr of ParquetFileFragment
                 Key: ARROW-13258
                 URL: https://issues.apache.org/jira/browse/ARROW-13258
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
            Reporter: Joris Van den Bossche


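The repr of ParquetFileFragment currently shows only the class name and memory address. Compare with the legacy ParquetDatasetPiece, whose repr includes the path, row group and partition keys: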

{code}
In [5]: d1 = pq.ParquetDataset("test_partitioned")

In [6]: d2 = pq.ParquetDataset("test_partitioned", use_legacy_dataset=False)

In [7]: d1.pieces[0]
<ipython-input-30-45c15fc36b93>:1: DeprecationWarning: ParquetDataset.pieces attribute is deprecated
  d1.pieces[0]
Out[7]: ParquetDatasetPiece('test_partitioned/B=a/1f73a8652e6242b58de7582b1a514907.parquet', row_group=None, partition_keys=[('B', 0)])

In [8]: d2.fragments[0]
Out[8]: <pyarrow._dataset.ParquetFileFragment at 0x7fec0f88c710>
{code}
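
As a rough illustration of the kind of information an improved repr could surface, here is a minimal sketch. It is not the pyarrow implementation: `fragment_summary` is a hypothetical helper, the output layout is only one possible choice, and the stand-in object (with a made-up partition expression string) replaces a real fragment so the snippet runs without pyarrow. It assumes only that a fragment exposes `path` and `partition_expression` attributes.

```python
from types import SimpleNamespace


def fragment_summary(fragment):
    """Build a short, informative description of a dataset fragment.

    Hypothetical helper: it only reads the ``path`` and
    ``partition_expression`` attributes and skips either one
    when it is missing.
    """
    cls_name = type(fragment).__name__
    parts = []

    path = getattr(fragment, "path", None)
    if path is not None:
        parts.append(f"path={path!r}")

    partition = getattr(fragment, "partition_expression", None)
    if partition is not None:
        parts.append(f"partition=[{partition}]")

    # Fall back to the bare class name if nothing useful is available.
    return f"<{cls_name} {' '.join(parts)}>" if parts else f"<{cls_name}>"


# Stand-in object (an assumption for illustration, not a real fragment).
fake = SimpleNamespace(
    path="test_partitioned/B=a/1f73a8652e6242b58de7582b1a514907.parquet",
    partition_expression="B == a",  # illustrative value only
)
print(fragment_summary(fake))
```

With a real dataset, the same helper could be called as `fragment_summary(d2.fragments[0])`; the exact text of the partition part would then depend on how the partition expression converts to a string.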


