You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "David Li (Jira)" <ji...@apache.org> on 2022/01/07 20:05:00 UTC

[jira] [Created] (ARROW-15285) [C++] write_dataset with delete_matching occasionally fails with "Path does not exist"

David Li created ARROW-15285:
--------------------------------

             Summary: [C++] write_dataset with delete_matching occasionally fails with "Path does not exist"
                 Key: ARROW-15285
                 URL: https://issues.apache.org/jira/browse/ARROW-15285
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
            Reporter: David Li


The reproducer in ARROW-15265, once the bug there is fixed, now occasionally fails with this:
{noformat}
Traceback (most recent call last):
  File "/home/lidavidm/Code/upstream/arrow-15265/python/test.py", line 37, in <module>
    ds.write_dataset(
  File "/home/lidavidm/Code/upstream/arrow-15265/python/pyarrow/dataset.py", line 931, in write_dataset
    _filesystemdataset_write(
  File "pyarrow/_dataset.pyx", line 2658, in pyarrow._dataset._filesystemdataset_write
    check_status(CFileSystemDataset.Write(c_options, c_scanner))
  File "pyarrow/error.pxi", line 114, in pyarrow.lib.check_status
    raise IOError(message)
OSError: Path does not exist 'my-bucket/test8.parquet/col1=c' {noformat}
The path is different each time it fails (i.e. it's not deterministic). This is relatively rare (2 out of 100 runs when I checked just now)



--
This message was sent by Atlassian Jira
(v8.20.1#820001)