You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@arrow.apache.org by "okartal (via GitHub)" <gi...@apache.org> on 2023/05/04 08:18:47 UTC

[GitHub] [arrow-julia] okartal opened a new issue, #434: Writing and Reading Random Access Files

okartal opened a new issue, #434:
URL: https://github.com/apache/arrow-julia/issues/434

   Maybe related to #353
   
   It is already possible to use Tables.partitioner to write record batches to a single Arrow file. However, when I read that file with Arrow.Table I do not know how to access a specific record batch like here: https://arrow.apache.org/docs/java/ipc.html#writing-and-reading-random-access-files
   
   According to the docs, this should be possible but I am not sure if that is not implemented yet or simply not documented.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@arrow.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-julia] Moelf commented on issue #434: Writing and Reading Random Access Files

Posted by "Moelf (via GitHub)" <gi...@apache.org>.
Moelf commented on issue #434:
URL: https://github.com/apache/arrow-julia/issues/434#issuecomment-1567543626

   we don't have to do any Python implementation says, that's specifically for Python.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-julia] quinnj commented on issue #434: Writing and Reading Random Access Files

Posted by "quinnj (via GitHub)" <gi...@apache.org>.
quinnj commented on issue #434:
URL: https://github.com/apache/arrow-julia/issues/434#issuecomment-1558216906

   You're right that we don't expose this very well (i.e at all) via `Arrow.Table` right now; but using `Arrow.Stream` gives you back an _iterator_ of `Arrow.Table` for each record batch. But we could probably also expose a way via `Arrow.Table` to let you get the individual tables. Something to think about, or at least improve in the docs mentioning `Arrow.Stream`.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


[GitHub] [arrow-julia] okartal commented on issue #434: Writing and Reading Random Access Files

Posted by "okartal (via GitHub)" <gi...@apache.org>.
okartal commented on issue #434:
URL: https://github.com/apache/arrow-julia/issues/434#issuecomment-1567540423

   According to https://arrow.apache.org/docs/python/ipc.html#writing-and-reading-random-access-files we need to use a seek method to implement random access to a batch 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org