You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@beam.apache.org by GitBox <gi...@apache.org> on 2022/04/21 19:41:00 UTC

[GitHub] [beam] lostluck commented on pull request #17347: [BEAM-14304] implement parquetio to read/write parquet files

lostluck commented on PR #17347:
URL: https://github.com/apache/beam/pull/17347#issuecomment-1105681959

   Apologies for the delay, I do have some comments to make, but I'm rather swamped. I should have time to finish my review tomorrow.
   
   > Yes. I use a quite naive approach that requires reading the whole parquet file into memory. Unlike Java, the `filesystem.OpenRead()` doesn't support `io.Seeker`, which is a required feature for processing Parquet files. I am trying to implement `OpenRead()` to return `io.ReadSeekCloser`
   
   Otherwise, just to be clear, are you working on making this an SDF in this PR? Is it the openread on the filesystem that's the issue? Do you need specific advice or assistance?
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@beam.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org