Posted to users@datasketches.apache.org by Will Lauer <wl...@verizonmedia.com> on 2020/12/03 15:30:28 UTC

Tuple sketch question

I'm using tuple sketches (specifically the ArrayOfDoublesTupleSketch) to do
some computations, and as part of that, I need to do some set operations. I
need to intersect one tuple sketch with a filtered version (filtered by
tuple value) of another tuple sketch. The intersect operation support is
obvious, and I can use a SketchIterator to step through the entries in a
sketch to find the entries that match my filter condition. My question is:
is there a filter operation already existing in the tuple sketch code that
I'm missing, or do I need to build this "filtered" intersection myself
(probably by modifying the existing sketch intersection operation to pass
in a separate "filter" function similar to the current combiner function)?

Will


<http://www.verizonmedia.com>

Will Lauer

Senior Principal Architect, Audience & Advertising Reporting
Data Platforms & Systems Engineering

M 508 561 6427
1908 S. First St
Champaign, IL 61822

<http://www.facebook.com/verizonmedia>   <http://twitter.com/verizonmedia>
<https://www.linkedin.com/company/verizon-media/>
<http://www.instagram.com/verizonmedia>

Re: Tuple sketch question

Posted by Will Lauer <wl...@verizonmedia.com>.
That looks like just what I need. And it doesn't look too hard to port to
ArrayOfDoubles or to my custom sketch.

Will


On Thu, Dec 3, 2020 at 2:34 PM Alexander Saydakov <sa...@verizonmedia.com>
wrote:

> Someone contributed a class called Filter for generic tuple sketches, but
> I don't think there is an equivalent for ArrayOfDoubles yet.

Re: Tuple sketch question

Posted by Alexander Saydakov <sa...@verizonmedia.com>.
Someone contributed a class called Filter for generic tuple sketches, but I
don't think there is an equivalent for ArrayOfDoubles yet.
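
For readers following the thread, the filter-then-intersect pattern asked
about above can be modeled in plain Java. This is only an illustrative
sketch, not the DataSketches API: the class FilteredIntersection and its
methods filter and intersect are hypothetical names, and a Map from hashed
key to double[] stands in for the retained entries of an ArrayOfDoubles
sketch. It assumes both inputs share the same theta, so the sampling logic
of a real intersection is left out, and it keeps the values from the first
input instead of applying a combiner.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Predicate;

public class FilteredIntersection {

    // Keep only the entries whose value array passes the predicate.
    // In a real sketch this would be the loop over the sketch iterator.
    public static Map<Long, double[]> filter(Map<Long, double[]> entries,
                                             Predicate<double[]> predicate) {
        Map<Long, double[]> kept = new HashMap<>();
        for (Map.Entry<Long, double[]> e : entries.entrySet()) {
            if (predicate.test(e.getValue())) {
                kept.put(e.getKey(), e.getValue());
            }
        }
        return kept;
    }

    // Intersect by key. Values are taken from the first input; a real
    // intersection would merge both value arrays with a combiner.
    public static Map<Long, double[]> intersect(Map<Long, double[]> a,
                                                Map<Long, double[]> b) {
        Map<Long, double[]> result = new HashMap<>();
        for (Map.Entry<Long, double[]> e : a.entrySet()) {
            if (b.containsKey(e.getKey())) {
                result.put(e.getKey(), e.getValue());
            }
        }
        return result;
    }

    public static void main(String[] args) {
        Map<Long, double[]> a = new HashMap<>();
        a.put(1L, new double[] {5.0});
        a.put(2L, new double[] {0.5});
        a.put(3L, new double[] {7.0});

        Map<Long, double[]> b = new HashMap<>();
        b.put(1L, new double[] {1.0});
        b.put(3L, new double[] {0.1});
        b.put(4L, new double[] {9.0});

        // Filter b by tuple value first, then intersect a with the result.
        Map<Long, double[]> filteredB = filter(b, values -> values[0] >= 1.0);
        Map<Long, double[]> result = intersect(a, filteredB);

        System.out.println(result.keySet()); // prints [1]
    }
}
```

Filtering before intersecting, as opposed to passing a filter into the
intersection itself, keeps the two steps independent, which matches how
the Filter class mentioned above is described: a separate operation whose
output can then be fed to the existing intersection.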
