You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@kylin.apache.org by "hongbin ma (JIRA)" <ji...@apache.org> on 2015/11/16 09:28:10 UTC

[jira] [Resolved] (KYLIN-943) Approximate TopN supported by Cube

     [ https://issues.apache.org/jira/browse/KYLIN-943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

hongbin ma resolved KYLIN-943.
------------------------------
    Resolution: Fixed

> Approximate TopN supported by Cube
> ----------------------------------
>
>                 Key: KYLIN-943
>                 URL: https://issues.apache.org/jira/browse/KYLIN-943
>             Project: Kylin
>          Issue Type: New Feature
>    Affects Versions: v2.0
>            Reporter: hongbin ma
>            Assignee: Shaofeng SHI
>             Fix For: v2.0
>
>
> SpaceSaving (TopN algorithm) code could copy from https://github.com/addthis/stream-lib/blob/master/src/main/java/com/clearspring/analytics/stream/StreamSummary.java
> We don’t need the whole stream-lib, but just one (or two) classes is enough. Make sure you give credit to stream-lib in class comment.
>  
> In order to run SpaceSaving in parallel, the TopN has to be merged using http://arxiv.org/pdf/1401.0702.pdf.  No existing impl as I searched, we have to implement ourselves.
>  
> Cheers
> Yang
>  
> From: Li, Yang 
> Sent: 2015年8月7日 12:43
> To: DL-eBay-Kylin
> Subject: Distributed TopN papers
>  
> The basic algorithm
> [1] https://icmi.cs.ucsb.edu/research/tech_reports/reports/2005-23.pdf
>  
> Its application in distributed system
> [2] http://www.cs.utah.edu/~jeffp/papers/merge-summ-TODS.pdf
> [3] http://www.crm.umontreal.ca/pub/Rapports/3300-3399/3322.pdf
>  
>  
> Cheers
> Yang



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)