You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@pig.apache.org by "leiwangouc@gmail.com" <le...@gmail.com> on 2014/07/29 11:46:47 UTC

How to batched gets Hbase records in pig UDF

Hi, 

    I have an input which has  about  10M records.  Each record contains a rowkey of  an Hbase table.
    I can do a batched get as described http://stackoverflow.com/questions/13310434/hbase-api-get-data-rows-information-by-list-of-row-ids, but it is slowly because of the big input size. 
    I want to do it with pig script. 
    But how can i use batched gets in the pig UDF?
    Any insight about this?

Thanks,
Lei




leiwangouc@gmail.com