You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@hadoop.apache.org by "laozhao0@sina.cn" <la...@sina.cn> on 2014/08/17 05:05:41 UTC

回复: Re:How to sort in a WordCount






Hi,?周杰Thank you for your reply.partition can get the top-k of the wordcount result ?In this problem,I have a lot of search keywords , and i want to know the top-k words .So for those words which only occures one or two times, i can ignore them.


laozhao0@sina.cn
?发件人:?周杰发送时间:?2014-08-17?10:54收件人:?user主题:?Re:How to sort in a WordCountHi,laozhao
???? are you gonna sort all the keys?Maybe you can sort all the keys by implentmenting partition function, something like range-partition.for example, there are 100 digits. you can design 5 partitions [1-20],[20,40],[40,60],[60,80],[80,100], deviding the different digits into different partitions.? 

At 2014-08-17 10:35:34, "laozhao0@sina.cn" <la...@sina.cn> wrote:
 
Hello,I am using mapreduce to get the frequency of words in a corpus .?And I want to get a descnding sorted result.Now ?I can use Hive to sort them ,but If i use mapreduce,how can i do this ?
Thakns.


laozhao0@sina.cn