Posted to dev@tajo.apache.org by "Henry Saputra (JIRA)" <ji...@apache.org> on 2013/11/29 07:43:35 UTC

[jira] [Commented] (TAJO-337) Generic StorageHandler to provide common storage methods

    [ https://issues.apache.org/jira/browse/TAJO-337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13835207#comment-13835207 ] 

Henry Saputra commented on TAJO-337:
------------------------------------

For this one, were you thinking of building a Tajo-specific storage API, or of leveraging an existing library such as Apache Gora?

> Generic StorageHandler to provide common storage methods
> --------------------------------------------------------
>
>                 Key: TAJO-337
>                 URL: https://issues.apache.org/jira/browse/TAJO-337
>             Project: Tajo
>          Issue Type: Improvement
>          Components: catalog, storage
>            Reporter: Hyunsik Choi
>            Assignee: Hyunsik Choi
>             Fix For: 1.0-incubating
>
>
> Currently, Tajo uses HDFS as its primary storage. However, as a data warehouse system, Tajo should be able to support various data sources easily.
> For this, I propose a generic storage handler interface that provides common storage methods:
> * splitting input data
> * finding the cluster node nearest to the data
> * accessing catalog
> * creating a table
> * removing a table
> The above methods are derived from the query processing mechanism for data sets stored in HDFS.
> Later, we can easily add storage handlers for HBase or other data sources.
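
As a rough illustration of the proposal quoted above, the five listed operations could be grouped into a single Java interface. This is only a sketch under stated assumptions: StorageHandler's method names, the Split class, and InMemoryStorageHandler are all hypothetical and are not taken from the Tajo code base, and the in-memory handler merely stands in for a real backend such as HDFS or HBase.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical: one unit of input data plus the hosts that store it.
final class Split {
    final String table;
    final long offset;
    final long length;
    final List<String> hosts;

    Split(String table, long offset, long length, List<String> hosts) {
        this.table = table;
        this.offset = offset;
        this.length = length;
        this.hosts = hosts;
    }
}

// Hypothetical interface covering the five operations listed in the issue.
interface StorageHandler {
    List<Split> getSplits(String table, long splitSize);             // splitting input data
    String findNearestNode(Split split, List<String> clusterNodes);  // data locality
    boolean tableExists(String table);                               // accessing the catalog
    void createTable(String table);                                  // creating a table
    void dropTable(String table);                                    // removing a table
}

// Toy backend (an assumption for illustration): tracks only table sizes in memory.
final class InMemoryStorageHandler implements StorageHandler {
    private final Map<String, Long> tableSizes = new LinkedHashMap<String, Long>();
    private final List<String> dataHosts;

    InMemoryStorageHandler(List<String> dataHosts) {
        this.dataHosts = dataHosts;
    }

    // Helper specific to this toy backend: pretend to write some bytes.
    void append(String table, long bytes) {
        if (!tableSizes.containsKey(table)) {
            throw new IllegalArgumentException("No such table: " + table);
        }
        tableSizes.put(table, tableSizes.get(table) + bytes);
    }

    public List<Split> getSplits(String table, long splitSize) {
        if (splitSize <= 0 || !tableSizes.containsKey(table)) {
            throw new IllegalArgumentException("Bad split size or unknown table");
        }
        long size = tableSizes.get(table);
        List<Split> splits = new ArrayList<Split>();
        for (long offset = 0; offset < size; offset += splitSize) {
            long length = Math.min(splitSize, size - offset);
            splits.add(new Split(table, offset, length, dataHosts));
        }
        return splits;
    }

    public String findNearestNode(Split split, List<String> clusterNodes) {
        // Prefer a node that already holds the data; otherwise fall back to any node.
        for (String node : clusterNodes) {
            if (split.hosts.contains(node)) {
                return node;
            }
        }
        return clusterNodes.isEmpty() ? null : clusterNodes.get(0);
    }

    public boolean tableExists(String table) {
        return tableSizes.containsKey(table);
    }

    public void createTable(String table) {
        tableSizes.put(table, 0L);
    }

    public void dropTable(String table) {
        tableSizes.remove(table);
    }
}

final class StorageHandlerDemo {
    public static void main(String[] args) {
        InMemoryStorageHandler handler = new InMemoryStorageHandler(Arrays.asList("node2"));
        handler.createTable("t1");
        handler.append("t1", 250L);
        for (Split s : handler.getSplits("t1", 100L)) {
            String node = handler.findNearestNode(s, Arrays.asList("node1", "node2"));
            System.out.println(s.table + "@" + s.offset + "+" + s.length + " -> " + node);
        }
        handler.dropTable("t1");
    }
}
```

With an interface of this shape, the query engine would depend only on StorageHandler, so a handler for HBase or another data source could be added later without changing the planner. Whether such an interface should be Tajo-specific or a thin layer over an existing library is the open question raised in the comment.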


