Posted to dev@pig.apache.org by "Harsh J (JIRA)" <ji...@apache.org> on 2014/01/16 02:10:22 UTC

[jira] [Assigned] (PIG-2921) Provide a bulkloadable option in HBaseStorage

     [ https://issues.apache.org/jira/browse/PIG-2921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Harsh J reassigned PIG-2921:
----------------------------

    Assignee:     (was: Harsh J)

> Provide a bulkloadable option in HBaseStorage
> ---------------------------------------------
>
>                 Key: PIG-2921
>                 URL: https://issues.apache.org/jira/browse/PIG-2921
>             Project: Pig
>          Issue Type: New Feature
>          Components: data
>    Affects Versions: 0.9.2
>            Reporter: Harsh J
>
> Right now, Pig's HBaseStorage writes Puts directly into HBase. This is slow for bulk operations, which are exactly the kind of work Pig does. Puts and Deletes are meant more for realtime operations, so it would be nice if Pig had an automatic mechanism to prepare bulk-loadable HFiles for the target table and bulk load them at the end of the job.
> For compatibility reasons, this can be optional and turned off by default until it is agreed that it should become the default, at which point an option to turn it off can still be provided.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)