You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hbase.apache.org by "Vasu Mariyala (JIRA)" <ji...@apache.org> on 2014/01/24 23:07:38 UTC

[jira] [Created] (HBASE-10416) Improvements to the import flow

Vasu Mariyala created HBASE-10416:
-------------------------------------

             Summary: Improvements to the import flow
                 Key: HBASE-10416
                 URL: https://issues.apache.org/jira/browse/HBASE-10416
             Project: HBase
          Issue Type: New Feature
          Components: mapreduce
            Reporter: Vasu Mariyala


Following improvements can be made to the Import logic

a) Make the import extensible (i.e., remove the filter from being a static member of Import and make it an instance variable of the mapper, make the mappers or variables of interest protected. )

b) Make sure that the Import calls filterRowKey method of the filter (Useful if we want to filter the data of an organization based on the row key or using filters like PrefixFilter). The existing test case in TestImportExport#testWithFilter works with this assumption but is so far successful because there is only one row inserted into the table.

c) Provide an option to specify the durability during the import (Specifying the Durability as SKIP_WAL would improve the performance of restore considerably.) [~lhofhansl] suggested that this should be a parameter to the import.

d) Some minor refactoring to avoid building a comma separated string for the filter args.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)