You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@flink.apache.org by Flavio Pompermaier <po...@okkam.it> on 2019/02/08 09:35:53 UTC
Dataset statistics
Hi to all,
is there any effort to standardize descriptive statistics in Apache Flink?
Is there any suggested way to achieve this?
Best,
Flavio
Re: Dataset statistics
Posted by Flavio Pompermaier <po...@okkam.it>.
We've just published a first attempt (on Flink 1.6.2) that extract some
descriptive statistics from a batch dataset[1].
Any feedback is welcome.
Best,
Flavio
[1] https://github.com/okkam-it/flink-descriptive-stats
On Thu, Feb 14, 2019 at 11:19 AM Flavio Pompermaier <po...@okkam.it>
wrote:
> No effort in this direction, then?
> I had a try using SQL on Table API but I fear that the generated plan is
> not the optimal one..I'm looking for an efficient way to implement
> describe() method on a table or dataset/datasource
>
> On Fri, Feb 8, 2019 at 10:35 AM Flavio Pompermaier <po...@okkam.it>
> wrote:
>
>> Hi to all,
>> is there any effort to standardize descriptive statistics in Apache Flink?
>> Is there any suggested way to achieve this?
>>
>> Best,
>> Flavio
>>
>
>
--
Flavio Pompermaier
Development Department
OKKAM S.r.l.
Tel. +(39) 0461 041809
Re: Dataset statistics
Posted by Flavio Pompermaier <po...@okkam.it>.
No effort in this direction, then?
I had a try using SQL on Table API but I fear that the generated plan is
not the optimal one..I'm looking for an efficient way to implement
describe() method on a table or dataset/datasource
On Fri, Feb 8, 2019 at 10:35 AM Flavio Pompermaier <po...@okkam.it>
wrote:
> Hi to all,
> is there any effort to standardize descriptive statistics in Apache Flink?
> Is there any suggested way to achieve this?
>
> Best,
> Flavio
>