Posted to dev@tajo.apache.org by "Min Zhou (JIRA)" <ji...@apache.org> on 2013/12/16 02:38:10 UTC

[jira] [Commented] (TAJO-283) Add Table Partitioning

    [ https://issues.apache.org/jira/browse/TAJO-283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13848744#comment-13848744 ] 

Min Zhou commented on TAJO-283:
-------------------------------

How do you implement partitioned tables? One HDFS file per partition, or one HDFS directory per partition? Is there any shuffle when generating a partitioned table?
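
To make the directory-per-partition option concrete, here is a minimal sketch. It assumes the Hive-style `column=value` directory naming and is not Tajo's actual implementation; the table location, column names, and helper functions are all hypothetical. It also shows how such a layout lets a planner prune partitions by inspecting paths only, without reading any data.

```python
from pathlib import PurePosixPath


def partition_dir(table_root, partition_values):
    """Build a Hive-style partition directory: root/col1=v1/col2=v2."""
    path = PurePosixPath(table_root)
    for column, value in partition_values:
        path = path / f"{column}={value}"
    return str(path)


def parse_partition(table_root, directory):
    """Recover {column: value} from a partition directory path."""
    relative = PurePosixPath(directory).relative_to(table_root)
    return dict(part.split("=", 1) for part in relative.parts)


def prune(table_root, directories, predicate):
    """Keep only the directories whose partition values satisfy the predicate."""
    return [d for d in directories
            if predicate(parse_partition(table_root, d))]


# Hypothetical table location and partition columns, for illustration only.
root = "/warehouse/orders"
dirs = [partition_dir(root, [("country", c), ("year", y)])
        for c in ("KR", "US") for y in ("2012", "2013")]

# Only directories matching the filter would need to be scanned.
kept = prune(root, dirs,
             lambda p: p["country"] == "KR" and p["year"] == "2013")
print(kept)
```

With one directory per partition, each partition can hold any number of files, so writers do not have to funnel all rows of a partition into a single file. Whether that avoids a shuffle depends on how the writers are organized, which is the open question above.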

> Add Table Partitioning
> ----------------------
>
>                 Key: TAJO-283
>                 URL: https://issues.apache.org/jira/browse/TAJO-283
>             Project: Tajo
>          Issue Type: New Feature
>          Components: catalog, physical operator, planner/optimizer
>            Reporter: Hyunsik Choi
>            Assignee: Hyunsik Choi
>             Fix For: 0.8-incubating
>
>
> Table partitioning gives many facilities to maintain large tables. First of all, it enables the data management system to prune input data that is not actually needed. In addition, it gives the system more optimization opportunities that exploit the physical layout.
> Basically, Tajo should follow the RDBMS-style partitioning system, including range, list, hash, and so on. In order to keep Hive compatibility, we need to add a Hive partition type that does not exist in existing DBMS systems.



--
This message was sent by Atlassian JIRA
(v6.1.4#6159)