You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@flink.apache.org by Yuta Morisawa <yu...@kddi-research.jp> on 2018/07/11 07:24:12 UTC

Some question about document

Hi all

Now, I'm reading Flink document and I have some points to feel difficult 
to get an idea.
I'd appreciate if you tell it me.

1,TypeInformation
  I understand TypeInformation is used for selecting relevant serializer 
and comparator.
  But, the ducument doesn't specify if it has another way to be used.

  So, what I want to know is that what kinds of process gets benefit 
from TypeInformation other than serializer and comparator.

2, Managed Memory
  The word "Managed memory" is appeared several time in the document 
but I can't find any detail description.
  This is the only document I found 
(https://www.slideshare.net/sbaltagi/overview-of-apacheflinkbyslimbaltagi)

  If anyone has document that explains managed memory, please let me know.

3, Serializer
  What do the words in the document  "serializers we ship with Flink" 
mean? I know Flink uses avro for POJOs, is it the same thing?
https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html


Regards,
Yuta

-- 

--------------------------------------------
  Business Vision :"Challenge for the future"
---------------------------------------------

   〒356-8502
 埼玉県ふじみ野市大原2丁目1番15号
 株式会社 KDDI総合研究所(KDDI Research, Inc.)
 コネクティッドカー1G
 森澤 雄太
 mail yu-morisawa@kddi-research.jp
 tel  070-3871-8883
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 この電子メールおよび添付書類は、名宛人のための
 特別な秘密情報を含んでおります。
 そのため、名宛人以外の方による利用は認められて
 おりません。
 名宛人以外の方による通信内容公表、複写、転用等
 は厳禁であり、違法となることがあります。
 万が一、何らかの誤りによりこの電子メールを名宛
 人以外の方が受信された場合は、お手数でも、直ち
 に発信人にお知らせ頂くと同時に、当メールを削除
 下さいますようお願い申し上げます。


Re: Some question about document

Posted by Yuta Morisawa <yu...@kddi-research.jp>.
Hi yang

Thank you for your comment.
I read document and got an idea.

Flink uses custom serializers on known types, and the fallback is kryo.
The other arbitary objects is serialized by kryo.



On 2018/07/12 12:14, vino yang wrote:
> Hi Yuta,
> 
> It seems Chesnay is right. The "fallback" in flink's documentation is in 
> terms of the types flink supported. But for all the other arbitrary 
> types kryo is the first choice.
> 
> 2018-07-12 9:55 GMT+08:00 Yuta Morisawa <yu-morisawa@kddi-research.jp 
> <ma...@kddi-research.jp>>:
> 
>     Thank you for your answer.
> 
>     > For POJOs Flink has a custom serializer. For arbitrary objects we use
>     > kryo, and can use Avro as a fallback.
>     https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html#serialization-of-pojo-types
>     <https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html#serialization-of-pojo-types>
> 
>     It may be the reverse.
>     Kryo is for fallback, right?
> 
> 
>     On 2018/07/11 19:00, Chesnay Schepler wrote:
> 
>         1) TypeInformation are used to create serializers, comparators
>         and to verify correctness of certain operations (like
>         projections on tuple datasets).
> 
>         2) see
>         https://flink.apache.org/news/2015/05/11/Juggling-with-Bits-and-Bytes.html
>         <https://flink.apache.org/news/2015/05/11/Juggling-with-Bits-and-Bytes.html>
> 
>         3) Flink comes with a number of serializers for varying types as
>         outlined here
>         <https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html#flinks-typeinformation-class
>         <https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html#flinks-typeinformation-class>>.
>         For POJOs Flink has a custom serializer. For arbitrary objects
>         we use kryo, and can use Avro as a fallback.
> 
>         On 11.07.2018 09:24, Yuta Morisawa wrote:
> 
>             Hi all
> 
>             Now, I'm reading Flink document and I have some points to
>             feel difficult to get an idea.
>             I'd appreciate if you tell it me.
> 
>             1,TypeInformation
>               I understand TypeInformation is used for selecting
>             relevant serializer and comparator.
>               But, the ducument doesn't specify if it has another way to
>             be used.
> 
>               So, what I want to know is that what kinds of process gets
>             benefit from TypeInformation other than serializer and
>             comparator.
> 
>             2, Managed Memory
>               The word "Managed memory" is appeared several time in the
>             document but I can't find any detail description.
>               This is the only document I found
>             (https://www.slideshare.net/sbaltagi/overview-of-apacheflinkbyslimbaltagi
>             <https://www.slideshare.net/sbaltagi/overview-of-apacheflinkbyslimbaltagi>)
> 
>               If anyone has document that explains managed memory,
>             please let me know.
> 
>             3, Serializer
>               What do the words in the document  "serializers we ship
>             with Flink" mean? I know Flink uses avro for POJOs, is it
>             the same thing?
>             https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html
>             <https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html>
> 
> 
> 
>             Regards,
>             Yuta
> 
> 
> 
> 


Re: Some question about document

Posted by vino yang <ya...@gmail.com>.
Hi Yuta,

It seems Chesnay is right. The "fallback" in flink's documentation is in
terms of the types flink supported. But for all the other arbitrary types
kryo is the first choice.

2018-07-12 9:55 GMT+08:00 Yuta Morisawa <yu...@kddi-research.jp>:

> Thank you for your answer.
>
> > For POJOs Flink has a custom serializer. For arbitrary objects we use
> > kryo, and can use Avro as a fallback.
> https://ci.apache.org/projects/flink/flink-docs-release-1.5/
> dev/types_serialization.html#serialization-of-pojo-types
>
> It may be the reverse.
> Kryo is for fallback, right?
>
>
> On 2018/07/11 19:00, Chesnay Schepler wrote:
>
>> 1) TypeInformation are used to create serializers, comparators and to
>> verify correctness of certain operations (like projections on tuple
>> datasets).
>>
>> 2) see https://flink.apache.org/news/2015/05/11/Juggling-with-Bits-
>> and-Bytes.html
>>
>> 3) Flink comes with a number of serializers for varying types as outlined
>> here <https://ci.apache.org/projects/flink/flink-docs-release-1.
>> 5/dev/types_serialization.html#flinks-typeinformation-class>.
>> For POJOs Flink has a custom serializer. For arbitrary objects we use
>> kryo, and can use Avro as a fallback.
>>
>> On 11.07.2018 09:24, Yuta Morisawa wrote:
>>
>>> Hi all
>>>
>>> Now, I'm reading Flink document and I have some points to feel difficult
>>> to get an idea.
>>> I'd appreciate if you tell it me.
>>>
>>> 1,TypeInformation
>>>  I understand TypeInformation is used for selecting relevant serializer
>>> and comparator.
>>>  But, the ducument doesn't specify if it has another way to be used.
>>>
>>>  So, what I want to know is that what kinds of process gets benefit from
>>> TypeInformation other than serializer and comparator.
>>>
>>> 2, Managed Memory
>>>  The word "Managed memory" is appeared several time in the document but
>>> I can't find any detail description.
>>>  This is the only document I found (https://www.slideshare.net/sb
>>> altagi/overview-of-apacheflinkbyslimbaltagi)
>>>
>>>  If anyone has document that explains managed memory, please let me know.
>>>
>>> 3, Serializer
>>>  What do the words in the document  "serializers we ship with Flink"
>>> mean? I know Flink uses avro for POJOs, is it the same thing?
>>> https://ci.apache.org/projects/flink/flink-docs-release-1.5/
>>> dev/types_serialization.html
>>>
>>>
>>> Regards,
>>> Yuta
>>>
>>>
>>
>

Re: Some question about document

Posted by Yuta Morisawa <yu...@kddi-research.jp>.
Thank you for your answer.

 > For POJOs Flink has a custom serializer. For arbitrary objects we use
 > kryo, and can use Avro as a fallback.
https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html#serialization-of-pojo-types

It may be the reverse.
Kryo is for fallback, right?


On 2018/07/11 19:00, Chesnay Schepler wrote:
> 1) TypeInformation are used to create serializers, comparators and to 
> verify correctness of certain operations (like projections on tuple 
> datasets).
> 
> 2) see 
> https://flink.apache.org/news/2015/05/11/Juggling-with-Bits-and-Bytes.html
> 
> 3) Flink comes with a number of serializers for varying types as 
> outlined here 
> <https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html#flinks-typeinformation-class>.
> For POJOs Flink has a custom serializer. For arbitrary objects we use 
> kryo, and can use Avro as a fallback.
> 
> On 11.07.2018 09:24, Yuta Morisawa wrote:
>> Hi all
>>
>> Now, I'm reading Flink document and I have some points to feel 
>> difficult to get an idea.
>> I'd appreciate if you tell it me.
>>
>> 1,TypeInformation
>>  I understand TypeInformation is used for selecting relevant 
>> serializer and comparator.
>>  But, the ducument doesn't specify if it has another way to be used.
>>
>>  So, what I want to know is that what kinds of process gets benefit 
>> from TypeInformation other than serializer and comparator.
>>
>> 2, Managed Memory
>>  The word "Managed memory" is appeared several time in the document 
>> but I can't find any detail description.
>>  This is the only document I found 
>> (https://www.slideshare.net/sbaltagi/overview-of-apacheflinkbyslimbaltagi)
>>
>>  If anyone has document that explains managed memory, please let me know.
>>
>> 3, Serializer
>>  What do the words in the document  "serializers we ship with Flink" 
>> mean? I know Flink uses avro for POJOs, is it the same thing?
>> https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html 
>>
>>
>>
>> Regards,
>> Yuta
>>
> 


Re: Some question about document

Posted by Chesnay Schepler <ch...@apache.org>.
1) TypeInformation are used to create serializers, comparators and to 
verify correctness of certain operations (like projections on tuple 
datasets).

2) see 
https://flink.apache.org/news/2015/05/11/Juggling-with-Bits-and-Bytes.html

3) Flink comes with a number of serializers for varying types as 
outlined here 
<https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html#flinks-typeinformation-class>.
For POJOs Flink has a custom serializer. For arbitrary objects we use 
kryo, and can use Avro as a fallback.

On 11.07.2018 09:24, Yuta Morisawa wrote:
> Hi all
>
> Now, I'm reading Flink document and I have some points to feel 
> difficult to get an idea.
> I'd appreciate if you tell it me.
>
> 1,TypeInformation
>  I understand TypeInformation is used for selecting relevant 
> serializer and comparator.
>  But, the ducument doesn't specify if it has another way to be used.
>
>  So, what I want to know is that what kinds of process gets benefit 
> from TypeInformation other than serializer and comparator.
>
> 2, Managed Memory
>  The word "Managed memory" is appeared several time in the document 
> but I can't find any detail description.
>  This is the only document I found 
> (https://www.slideshare.net/sbaltagi/overview-of-apacheflinkbyslimbaltagi)
>
>  If anyone has document that explains managed memory, please let me know.
>
> 3, Serializer
>  What do the words in the document  "serializers we ship with Flink" 
> mean? I know Flink uses avro for POJOs, is it the same thing?
> https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/types_serialization.html 
>
>
>
> Regards,
> Yuta
>