You are viewing a plain text version of this content. The canonical link for it is here.
Posted to jira@arrow.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2022/03/01 14:01:00 UTC
[jira] [Updated] (ARROW-14612) [C++] Support for filename-based partitioning
[ https://issues.apache.org/jira/browse/ARROW-14612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated ARROW-14612:
-----------------------------------
Labels: dataset good-second-issue pull-request-available (was: dataset good-second-issue)
> [C++] Support for filename-based partitioning
> ---------------------------------------------
>
> Key: ARROW-14612
> URL: https://issues.apache.org/jira/browse/ARROW-14612
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Nicola Crane
> Assignee: Sanjiban Sengupta
> Priority: Major
> Labels: dataset, good-second-issue, pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Directory-based partitioning is a feature of Arrow, but could we support filename-based partitioning?
> e.g. I have a series of CSV files here all called something like {{foo_month_year.csv}} and it'd be nice to be able to read them in and then the month/year bits of the filenames then appear as fields I can filter on etc.
>
--
This message was sent by Atlassian Jira
(v8.20.1#820001)