You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by "Fokko (via GitHub)" <gi...@apache.org> on 2023/02/02 14:52:33 UTC

[GitHub] [arrow] Fokko commented on pull request #33974: GH-33973: [Python][Docs] Update documentation for Parquet filter

Fokko commented on PR #33974:
URL: https://github.com/apache/arrow/pull/33974#issuecomment-1413867923

   @westonpace 
   
   > The docs you probably want are [pyarrow.dataset.dataset](https://arrow.apache.org/docs/python/generated/pyarrow.dataset.dataset.html#pyarrow.dataset.dataset) and [pyarrow.dataset.Dataset](https://arrow.apache.org/docs/python/generated/pyarrow.dataset.Dataset.html#pyarrow.dataset.Dataset)
   
   I would agree with you, but I went for a lower API because I want to re-use the connection. More background is provided [here](https://github.com/apache/arrow/issues/33972), and the dataset only accepts paths. If you think it is worthwhile to accept `NativeFile` there as well, let me know, and happy to raise a PR.
   
   > which unfortunately redirects its documentation to [pyarrow.dataset.Scanner.from_dataset](https://arrow.apache.org/docs/python/generated/pyarrow.dataset.Scanner.html#pyarrow.dataset.Scanner.from_dataset) 😰 )
   
   Do you want me to create a PR to copy those docs? Because of the redirect, the arguments are also not showing up in PyCharm. Let me know and I'll create a PR.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org