You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "N Gautam Animesh (Jira)" <ji...@apache.org> on 2022/11/11 09:21:00 UTC

[jira] [Created] (ARROW-18313) Issues with open_dataset()

N Gautam Animesh created ARROW-18313:
----------------------------------------

             Summary: Issues with open_dataset()
                 Key: ARROW-18313
                 URL: https://issues.apache.org/jira/browse/ARROW-18313
             Project: Apache Arrow
          Issue Type: Bug
            Reporter: N Gautam Animesh
         Attachments: image-2022-11-11-09-19-16-065.png

On using open_dataset, it creates a connection due to which the files in the directory get blocked and we cannot perform other operations on the file like replace!

Actual issue:
 # We are running an atomic operation on a bunch of files, which replaces the temp file names to the target file names.
 # But while this is happening, if we try to run open_dataset() on that particular directory, the atomic operation is failing and there are both target files and temp files in the directory.
 # It is blocking the files that have been read through open_dataset().
 # Please, provide me with more about how we can handle such problems.
 # Snapshot: !image-2022-11-11-09-19-16-065.png!



--
This message was sent by Atlassian Jira
(v8.20.10#820010)