You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@hbase.apache.org by "Doug Meil (JIRA)" <ji...@apache.org> on 2011/07/07 21:54:16 UTC

[jira] [Created] (HBASE-4079) HTableUtil - helper class for loading data

HTableUtil - helper class for loading data 
-------------------------------------------

                 Key: HBASE-4079
                 URL: https://issues.apache.org/jira/browse/HBASE-4079
             Project: HBase
          Issue Type: Improvement
            Reporter: Doug Meil
            Assignee: Doug Meil
            Priority: Minor
         Attachments: HTableUtil.java

A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.

I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.



--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Ted Yu (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13061566#comment-13061566 ] 

Ted Yu commented on HBASE-4079:
-------------------------------

Not changing HTableInterface is a good decision.

Please reformat the source code with 2 spaces for tab.

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: HTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Doug Meil (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Meil updated HBASE-4079:
-----------------------------

    Attachment:     (was: HTableUtil.java)

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: 4079.txt
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Ted Yu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ted Yu updated HBASE-4079:
--------------------------

    Attachment: 4079.txt

This is the patch I am going to commit.

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: 4079.txt, HTableUtil.java, TestHTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Doug Meil (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13061575#comment-13061575 ] 

Doug Meil commented on HBASE-4079:
----------------------------------

Thanks Ted!  Will do on the reformatting.

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: HTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Doug Meil (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Meil updated HBASE-4079:
-----------------------------

    Attachment:     (was: HTableUtil.java)

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: HTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Ted Yu (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062303#comment-13062303 ] 

Ted Yu commented on HBASE-4079:
-------------------------------

Here are the commands used to generate patch:
{code}
svn add src/main/java/org/apache/hadoop/hbase/client/HTableUtil.java
svn add src/test//java/org/apache/hadoop/hbase/client/TestHTableUtil.java
svn diff src/main/java/org/apache/hadoop/hbase/client/HTableUtil.java > 4079.txt
svn diff src/test//java/org/apache/hadoop/hbase/client/TestHTableUtil.java >> 4079.txt
{code}

Along the way, I am reformatting line 65:
{code}
    Map<String, List<Put>> putMap = createRsPutMap(htable, puts );
{code}
I also removed unused imports in the test.
Also added short description for TestHTableUtil.

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: 4079.txt, HTableUtil.java, TestHTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Ted Yu (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062305#comment-13062305 ] 

Ted Yu commented on HBASE-4079:
-------------------------------

Committed to TRUNK.

Thanks for the patch Doug.

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: 4079.txt, HTableUtil.java, TestHTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Hudson (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062316#comment-13062316 ] 

Hudson commented on HBASE-4079:
-------------------------------

Integrated in HBase-TRUNK #2015 (See [https://builds.apache.org/job/HBase-TRUNK/2015/])
    HBASE-4079 HTableUtil - helper class for loading data (Doug Meil via Ted Yu)

tedyu : 
Files : 
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/client/HTableUtil.java
* /hbase/trunk/CHANGES.txt
* /hbase/trunk/src/test/java/org/apache/hadoop/hbase/client/TestHTableUtil.java


> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>             Fix For: 0.92.0
>
>         Attachments: 4079.txt, HTableUtil.java, TestHTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Ted Yu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ted Yu updated HBASE-4079:
--------------------------

    Attachment:     (was: 4079.txt)

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: 4079.txt, HTableUtil.java, TestHTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Ted Yu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ted Yu updated HBASE-4079:
--------------------------

      Resolution: Fixed
    Hadoop Flags: [Reviewed]
          Status: Resolved  (was: Patch Available)

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: 4079.txt, HTableUtil.java, TestHTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Doug Meil (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13061585#comment-13061585 ] 

Doug Meil commented on HBASE-4079:
----------------------------------

I think should pass the sniff test on the reformatting.  Please advise if I honked it up.

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: HTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Ted Yu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ted Yu updated HBASE-4079:
--------------------------

    Fix Version/s: 0.92.0
     Release Note: 
HTableUtil provides the following helper methods:
{code}
public static void bucketRsPut(HTable htable, List<Put> puts) throws IOException
{code}
It processes a List of Puts and writes them to an HTable instance in RegionServer buckets via the htable.put method.
{code}
public static void bucketRsBatch(HTable htable, List<Row> rows) throws IOException
{code}
Processes a List of Rows (Put, Delete) and writes them to an HTable instance in RegionServer buckets via the htable.batch method.

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>             Fix For: 0.92.0
>
>         Attachments: 4079.txt, HTableUtil.java, TestHTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Doug Meil (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Meil updated HBASE-4079:
-----------------------------

    Attachment: HTableUtil.java

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: HTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Doug Meil (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Meil updated HBASE-4079:
-----------------------------

    Attachment: HTableUtil.java

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: HTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Doug Meil (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Meil updated HBASE-4079:
-----------------------------

    Status: Patch Available  (was: Open)

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: HTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Doug Meil (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13061936#comment-13061936 ] 

Doug Meil commented on HBASE-4079:
----------------------------------

Hi Ted-

I'll add a test. 

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: 4079.txt, HTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Commented] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Ted Yu (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13061722#comment-13061722 ] 

Ted Yu commented on HBASE-4079:
-------------------------------

@Doug:
It would be nice if you can come up with some unit test. You can refer to HTable unit test(s).
Since this code has been verified in production, the above is desirable but not required.

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: 4079.txt, HTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Ted Yu (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ted Yu updated HBASE-4079:
--------------------------

    Attachment: 4079.txt

Some minor formatting.
Changing copyright to 2011.

To generate patch, do the following:
svn add `find . -name HTableUtil.java`
svn diff src/main/java/org/apache/hadoop/hbase/client/HTableUtil.java > 4079.txt

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: 4079.txt, HTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

[jira] [Updated] (HBASE-4079) HTableUtil - helper class for loading data

Posted by "Doug Meil (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/HBASE-4079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Doug Meil updated HBASE-4079:
-----------------------------

    Attachment: TestHTableUtil.java
                HTableUtil.java

Replacing HTableUtil - cleaned up some javadoc warnings (adding @param, etc.)  also cleaned up a warning on a deprecated method.  changed copyright to 2011 and fixed an indent formatting issue.

Also adding TestHTableUtil unit test. 

> HTableUtil - helper class for loading data 
> -------------------------------------------
>
>                 Key: HBASE-4079
>                 URL: https://issues.apache.org/jira/browse/HBASE-4079
>             Project: HBase
>          Issue Type: Improvement
>            Reporter: Doug Meil
>            Assignee: Doug Meil
>            Priority: Minor
>         Attachments: 4079.txt, HTableUtil.java, TestHTableUtil.java
>
>
> A pattern that we use at Explorys is to chunk up Puts, and then bucket Puts by RegionServer.  This reduces the number of RPC calls per writeBuffer flush, because the flushes will typically be going to one region with this approach.
> I didn't think adding such utility methods to HTable was the right approach, so I created an HTableUtil (in the .client package) that contained such functionality.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira