You are viewing a plain text version of this content. The canonical link for it is here.

Posted to dev@drill.apache.org by "Jacques Nadeau (JIRA)" <ji...@apache.org> on 2014/04/23 05:21:18 UTC

[jira] [Resolved] (DRILL-468) Support for FileSystem partitions

     [ https://issues.apache.org/jira/browse/DRILL-468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jacques Nadeau resolved DRILL-468.
----------------------------------

    Resolution: Fixed

resolved in 69c571c

> Support for FileSystem partitions
> ---------------------------------
>
>                 Key: DRILL-468
>                 URL: https://issues.apache.org/jira/browse/DRILL-468
>             Project: Apache Drill
>          Issue Type: Bug
>            Reporter: Steven Phillips
>            Assignee: Steven Phillips
>         Attachments: DRILL-468.patch
>
>
> For filesystem partitioning, we want to use the existing directory structure of the data. So, if a selection is a directory that contains subdirectories, the name of the directory a given record was stored in can be included as a field in that record. For example, given this structure:
> /data
>   /a
>     file.csv
>   /b
>     file.csv
> select * from dfs.`/data`
> will include a column named dir0, with possible values a and b. This can be extended to a hierarchy of partitions. For example,
> /data
>   /a
>     /1
>       file.csv
>     /2
>       file.csv
>   /b
>     file.csv
> would have columns dir0 (with possible values a and b) and dir1 (with possible values 1, 2 and null).
> The data type will always be VARCHAR for the partition columns.



--
This message was sent by Atlassian JIRA
(v6.2#6252)