You are viewing a plain text version of this content. The canonical link for it is here.

Posted to github@arrow.apache.org by "dadepo (via GitHub)" <gi...@apache.org> on 2023/02/26 12:33:56 UTC

[GitHub] [arrow-datafusion] dadepo commented on issue #5383: The output of write_csv and write_json methods is confusing.

dadepo commented on issue #5383:
URL: https://github.com/apache/arrow-datafusion/issues/5383#issuecomment-1445350173

   As a end user who does not know much about the internal details/limitation of the underlying data formats I'll like to ask:
   
   1. Is it required for the right usage of the library to be exposed to the facts that data could exist in "parts"?
   
   I ask, because if not, then for a user who wants to take a dataframe and produce a csv file out of that, an implementation that produces just one file will be the most user friendly approach that should be default.
   
   There should still be the option of writing the data out in parts, and an advance user, who knows more about the underlying data format can decided to go for this approach either by passing the appropriate flag or calling the appropriate method


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscribe@arrow.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org