You are viewing a plain text version of this content. The canonical link for it is here.
Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/05/20 12:35:53 UTC

[GitHub] [arrow-rs] alamb commented on issue #1715: Why `Parquet` is a part of `Arrow`?

alamb commented on issue #1715:
URL: https://github.com/apache/arrow-rs/issues/1715#issuecomment-1132851458

   > Why parquet is a part of Arrow? They should be independent, aren't they?
   
   My understanding is that the parquet project is a separate top level ASF project. 
   
   https://projects.apache.org/committee.html?arrow
   
   https://projects.apache.org/committee.html?parquet
   
   > Arrow2 and Parquet2 are independent crates. Could we move parquet to a top level repo, for example Apache/parquet-rs?
   
   Yes that would be fine -- right now they are in the same repo as the same people maintain them and it lowers the maintenance burden to have them in the same repo. I would personally not be opposed to separating them
   
   I think it is a similar setup to the C++ implementation https://github.com/apache/arrow/tree/master/cpp which has arrow and parquet in the same foramt
   
   > Parquet should be maintained by Apache Parquet committee. Is it a little weird to let Apache Arrow contributors to maintain this crate ?
   
   I don't disagree -- the reason I am helping with both is that we need both in our project. 
   
   Also, I think fast conversion between `arrow` <--> `parquet` is important and having them in the same repo may help with that. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org