Posted to commits@arrow.apache.org by jo...@apache.org on 2022/05/12 11:25:34 UTC
[arrow] branch master updated: ARROW-16526: [Python] test_partitioned_dataset fails when building with PARQUET but without DATASET
This is an automated email from the ASF dual-hosted git repository.
jorisvandenbossche pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/arrow.git
The following commit(s) were added to refs/heads/master by this push:
new 7a955f07b3 ARROW-16526: [Python] test_partitioned_dataset fails when building with PARQUET but without DATASET
7a955f07b3 is described below
commit 7a955f07b3472a36d9174eb71883f8f9c33083ae
Author: Weston Pace <we...@gmail.com>
AuthorDate: Thu May 12 13:25:26 2022 +0200
ARROW-16526: [Python] test_partitioned_dataset fails when building with PARQUET but without DATASET
One of the legacy Parquet dataset tests did not pass use_legacy_dataset to pq.write_to_dataset. As a result, the test attempted to use the new datasets module even when it was not enabled in the build.
Closes #13116 from westonpace/bugfix/MINOR--missing-dataset-mark
Authored-by: Weston Pace <we...@gmail.com>
Signed-off-by: Joris Van den Bossche <jo...@gmail.com>
---
python/pyarrow/tests/parquet/test_dataset.py | 3 ++-
1 file changed, 2 insertions(+), 1 deletion(-)
diff --git a/python/pyarrow/tests/parquet/test_dataset.py b/python/pyarrow/tests/parquet/test_dataset.py
index 7b6845bbc2..2c660a3f6e 100644
--- a/python/pyarrow/tests/parquet/test_dataset.py
+++ b/python/pyarrow/tests/parquet/test_dataset.py
@@ -1542,7 +1542,8 @@ def test_partitioned_dataset(tempdir, use_legacy_dataset):
     })
     table = pa.Table.from_pandas(df)
     pq.write_to_dataset(table, root_path=str(path),
-                        partition_cols=['one', 'two'])
+                        partition_cols=['one', 'two'],
+                        use_legacy_dataset=use_legacy_dataset)
     table = pq.ParquetDataset(
         path, use_legacy_dataset=use_legacy_dataset).read()
     pq.write_table(table, path / "output.parquet")
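
The failure mode described in the commit message can be illustrated with a small standalone sketch. It does not use pyarrow: write_to_dataset, run_test, and DATASET_MODULE_ENABLED below are hypothetical stand-ins, and the default of preferring the new code path is an assumption inferred from the commit message, not taken from the library source.

```python
# Hypothetical stand-ins for illustration only; not pyarrow's implementation.

# Simulates a build with PARQUET enabled but without DATASET.
DATASET_MODULE_ENABLED = False


def write_to_dataset(rows, use_legacy_dataset=None):
    """Choose a code path based on the flag and report which one was used."""
    if use_legacy_dataset is None:
        # Assumed default: prefer the new datasets-based path.
        use_legacy_dataset = False
    if use_legacy_dataset:
        return "legacy"
    if not DATASET_MODULE_ENABLED:
        raise ImportError("datasets module is not available in this build")
    return "new"


def run_test(use_legacy_dataset, forward_flag):
    """Mimic the test body before (forward_flag=False) and after the fix."""
    rows = [{"one": 1, "two": 2}]
    if forward_flag:
        # After the fix: the parametrized value reaches the writer.
        return write_to_dataset(rows, use_legacy_dataset=use_legacy_dataset)
    # Before the fix: the flag is dropped, so the default applies.
    return write_to_dataset(rows)
```

In this sketch, run_test(True, forward_flag=True) takes the legacy path, while run_test(True, forward_flag=False) raises ImportError because the writer falls back to its default even though the test asked for the legacy implementation.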