You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "Joris Van den Bossche (Jira)" <ji...@apache.org> on 2021/07/05 10:05:00 UTC
[jira] [Updated] (ARROW-13258) [Python] Improve the repr of
ParquetFileFragment
[ https://issues.apache.org/jira/browse/ARROW-13258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Joris Van den Bossche updated ARROW-13258:
------------------------------------------
Description:
Compare with the legacy version:
{code}
In [5]: d1 = pq.ParquetDataset("test_partitioned")
In [6]: d2 = pq.ParquetDataset("test_partitioned", use_legacy_dataset=False)
In [7]: d1.pieces[0]
<ipython-input-30-45c15fc36b93>:1: DeprecationWarning: ParquetDataset.pieces attribute is deprecated
d1.pieces[0]
Out[7]: ParquetDatasetPiece('test_partitioned/B=a/1f73a8652e6242b58de7582b1a514907.parquet', row_group=None, partition_keys=[('B', 0)])
In [8]: d2.fragments[0]
Out[8]: <pyarrow._dataset.ParquetFileFragment at 0x7fec0f88c710>
{code}
was:
Compare with the legacy version:
{code}
In [5]: d1 = pq.ParquetDataset("test_partitioned")
In [6]: d2 = pq.ParquetDataset("test_partitioned", use_legacy_dataset=False)
In [7]: d1.pieces[0]
<ipython-input-30-45c15fc36b93>:1: DeprecationWarning: ParquetDataset.pieces attribute is deprecated
d1.pieces[0]
Out[7]: ParquetDatasetPiece('test_partitioned/B=a/1f73a8652e6242b58de7582b1a514907.parquet', row_group=None, partition_keys=[('B', 0)])
In [8]: d2.fragments[0]
Out[8]: <pyarrow._dataset.ParquetFileFragment at 0x7fec0f88c710>
}
> [Python] Improve the repr of ParquetFileFragment
> ------------------------------------------------
>
> Key: ARROW-13258
> URL: https://issues.apache.org/jira/browse/ARROW-13258
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Python
> Reporter: Joris Van den Bossche
> Priority: Major
>
> Compare with the legacy version:
> {code}
> In [5]: d1 = pq.ParquetDataset("test_partitioned")
> In [6]: d2 = pq.ParquetDataset("test_partitioned", use_legacy_dataset=False)
> In [7]: d1.pieces[0]
> <ipython-input-30-45c15fc36b93>:1: DeprecationWarning: ParquetDataset.pieces attribute is deprecated
> d1.pieces[0]
> Out[7]: ParquetDatasetPiece('test_partitioned/B=a/1f73a8652e6242b58de7582b1a514907.parquet', row_group=None, partition_keys=[('B', 0)])
> In [8]: d2.fragments[0]
> Out[8]: <pyarrow._dataset.ParquetFileFragment at 0x7fec0f88c710>
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)