You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@pig.apache.org by "leiwangouc@gmail.com" <le...@gmail.com> on 2014/07/29 11:46:47 UTC
How to batched gets Hbase records in pig UDF
Hi,
I have an input which has about 10M records. Each record contains a rowkey of an Hbase table.
I can do a batched get as described http://stackoverflow.com/questions/13310434/hbase-api-get-data-rows-information-by-list-of-row-ids, but it is slowly because of the big input size.
I want to do it with pig script.
But how can i use batched gets in the pig UDF?
Any insight about this?
Thanks,
Lei
leiwangouc@gmail.com