Posted to github@arrow.apache.org by GitBox <gi...@apache.org> on 2022/03/01 21:21:50 UTC

[GitHub] [arrow-cookbook] thisisnic commented on issue #92: [R] Add content on Tables vs. Datasets

thisisnic commented on issue #92:
URL: https://github.com/apache/arrow-cookbook/issues/92#issuecomment-1055869736


   Topics:
   
   - different formats (csv/feather/parquet)
   - partitioning (via group_by or by supplying column names directly)
   - customising filenames via basename_template param
   - hive_style vs bare values
   - overwrite existing data via existing_data_behavior param
   - fine control over file structure via max_partitions/max_open_files/max_rows_per_file/min_rows_per_group/max_rows_per_group
   - CSV datasets - how to read them in with or without headers
   - CSV datasets - similarities and differences compared to read_csv_arrow
   - CSV datasets - working with schemas
   - converting datasets from one format to another without loading them entirely into memory
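
   To make a few of these concrete, here is a minimal sketch (not a finished recipe) that touches partitioning via group_by, basename_template, hive_style, existing_data_behavior, and format conversion. It assumes a reasonably recent arrow release with CSV dataset writing, uses the built-in airquality data purely for illustration, and the directory names and the "chunk-{i}.csv" template are hypothetical choices of mine:

   ```r
   library(arrow)
   library(dplyr)

   # Hypothetical scratch directories, used only for this sketch
   csv_dir <- file.path(tempdir(), "airquality_csv")
   parquet_dir <- file.path(tempdir(), "airquality_parquet")

   # Partitioning via group_by: write_dataset() uses the grouping
   # variables as partition columns when `partitioning` is not supplied
   airquality %>%
     group_by(Month) %>%
     write_dataset(
       path = csv_dir,
       format = "csv",
       basename_template = "chunk-{i}.csv",   # must contain "{i}"
       hive_style = TRUE,                     # directories like Month=5/
       existing_data_behavior = "overwrite"
     )

   # Converting CSV to Parquet: the CSV files are opened lazily as a
   # Dataset and passed on to write_dataset(), rather than being read
   # into a data frame first. Here the partition column is supplied
   # by name instead of via group_by.
   open_dataset(csv_dir, format = "csv") %>%
     write_dataset(
       path = parquet_dir,
       format = "parquet",
       partitioning = "Month"
     )

   converted <- open_dataset(parquet_dir)
   ```

   Setting hive_style = FALSE in the first call should instead give bare directory names such as 5/ rather than Month=5/, in which case the partition column name would need to be supplied again when reading the data back.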

