You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Jacques Nadeau (JIRA)" <ji...@apache.org> on 2014/04/23 05:21:18 UTC
[jira] [Resolved] (DRILL-468) Support for FileSystem partitions
[ https://issues.apache.org/jira/browse/DRILL-468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jacques Nadeau resolved DRILL-468.
----------------------------------
Resolution: Fixed
resolved in 69c571c
> Support for FileSystem partitions
> ---------------------------------
>
> Key: DRILL-468
> URL: https://issues.apache.org/jira/browse/DRILL-468
> Project: Apache Drill
> Issue Type: Bug
> Reporter: Steven Phillips
> Assignee: Steven Phillips
> Attachments: DRILL-468.patch
>
>
> For filesystem partitioning, we want to use the existing directory structure of the data. So, if a selection is a directory that contains subdirectories, the name of the directory a given record was stored in can be included as a field in that record. For example, given this structure:
> /data
> /a
> file.csv
> /b
> file.csv
> select * from dfs.`/data`
> will include a column named dir0, with possible values a and b. This can be extended to a hierarchy of partitions. For example,
> /data
> /a
> /1
> file.csv
> /2
> file.csv
> /b
> file.csv
> would have columns dir0 (with possible values a and b) and dir1 (with possible values 1, 2 and null).
> The data type will always be VARCHAR for the partition columns.
--
This message was sent by Atlassian JIRA
(v6.2#6252)