You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "Vladimir (Jira)" <ji...@apache.org> on 2022/09/15 14:15:00 UTC
[jira] [Created] (ARROW-17739) [Python] write_to_parquet with s3fs will fails on no permissions for CreateBucket even when bucket exists (regression in 8.0.0)
Vladimir created ARROW-17739:
--------------------------------
Summary: [Python] write_to_parquet with s3fs will fails on no permissions for CreateBucket even when bucket exists (regression in 8.0.0)
Key: ARROW-17739
URL: https://issues.apache.org/jira/browse/ARROW-17739
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 8.0.1, 9.0.0, 8.0.0
Reporter: Vladimir
Hello
in version 8.0.0 and later the following bug was introduced: `write_to_dataset` will fail with the error "An error occurred (AccessDenied) when calling the CreateBucket operation: Access Denied" on writing to existing S3 bucket.
{code:java}
import s3fs
import pyarrow as pa
import pyarrow.parquet as pq
print(pa.__version__)
dff = pd.DataFrame({'Par1': ['a','b','c']*8, 'Par2': ['x','y']*12, 'Par3': range(24)})
path = 's3://bucket/path'
s3 = s3fs.S3FileSystem()
table = pa.Table.from_pandas(dff)
# This will pass
pq.write_table(table, path + '/table.parquet', filesystem=s3)
# This will fail: An error occurred (AccessDenied) when calling the CreateBucket operation: Access Denied
pq.write_to_dataset(table, path + '/table_partitioned.parquet', filesystem=s3, partition_cols=['Par1', 'Par2']) {code}
In 7.0.0 the code runs without errors.
Best regards
Vladimir
--
This message was sent by Atlassian Jira
(v8.20.10#820010)