You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@flink.apache.org by Jan Lukavský <je...@seznam.cz> on 2019/08/14 14:35:47 UTC

Watermarks not propagated to WebUI?

Hi,

is it possible, that watermarks are sometimes not propagated to WebUI, 
although they are internally moving as normal? I see in WebUI every 
operator showing "No Watermark", but outputs seem to be propagated to 
sink (and there are watermark sensitive operations involved - e.g. 
reductions on fixed windows without early emitting). More strangely, 
this happens when I increase parallelism above some threshold. If I use 
parallelism of N, watermarks are shown, when I increase it above some 
number (seems not to be exactly deterministic), watermarks seems to 
disappear.

I'm using Flink 1.8.1.

Did anyone experience something like this before?

Jan


Re: Watermarks not propagated to WebUI?

Posted by Thomas Weise <th...@apache.org>.
The issue persists with 1.9.1:

https://issues.apache.org/jira/browse/FLINK-14470


On Mon, Aug 26, 2019 at 1:47 AM Jan Lukavský <je...@seznam.cz> wrote:

> Hi Robert,
>
> I'd very much love to, but because I run my pipeline with Beam, I'm
> afraid I will have to wait a little longer, before Beam has runner for
> 1.9 [1]. I'm pretty sure that the watermarks disappeared with overall
> parallelism (over all operators) something above 2000. There was quite a
> lot of operators (shuffling), so the individual parallelism of each
> operator was about 200. The pipeline was spread over 50 taskmanager
> (each having 4 slots).
>
> Jan
>
> [1] https://github.com/apache/beam/pull/9296/
>
> On 8/26/19 10:23 AM, Robert Metzger wrote:
> > Jan, will you be able to test this issue on the now-released Flink 1.9
> > with the new UI?
> >
> > What parallelism is needed to reproduce the issue?
> >
> >
> > On Thu, Aug 15, 2019 at 1:59 PM Chesnay Schepler <chesnay@apache.org
> > <ma...@apache.org>> wrote:
> >
> >     I remember an issue regarding the watermark fetch request from the
> >     WebUI
> >     exceeding some HTTP size limit, since it tries to fetch all
> >     watermarks
> >     at once, and the format of this request isn't exactly efficient.
> >
> >     Querying metrics for individual operators still works since the
> >     request
> >     is small enough.
> >
> >     Not sure whether we ever fixed that.
> >
> >     On 15/08/2019 12:01, Jan Lukavský wrote:
> >     > Hi,
> >     >
> >     > Thomas, thanks for confirming this. I have noticed, that in 1.9 the
> >     > WebUI has been reworked a lot, does anyone know if this is still an
> >     > issue? I currently cannot easily try 1.9, so I cannot confirm or
> >     > disprove that.
> >     >
> >     > Jan
> >     >
> >     > On 8/14/19 6:25 PM, Thomas Weise wrote:
> >     >> I have also noticed this issue (Flink 1.5, Flink 1.8), and it
> >     appears
> >     >> with
> >     >> higher parallelism.
> >     >>
> >     >> This can be confusing to the user when watermarks actually work
> >     and
> >     >> can be
> >     >> observed using the metrics.
> >     >>
> >     >> On Wed, Aug 14, 2019 at 7:36 AM Jan Lukavský <je.ik@seznam.cz
> >     <ma...@seznam.cz>> wrote:
> >     >>
> >     >>> Hi,
> >     >>>
> >     >>> is it possible, that watermarks are sometimes not propagated
> >     to WebUI,
> >     >>> although they are internally moving as normal? I see in WebUI
> >     every
> >     >>> operator showing "No Watermark", but outputs seem to be
> >     propagated to
> >     >>> sink (and there are watermark sensitive operations involved -
> e.g.
> >     >>> reductions on fixed windows without early emitting). More
> >     strangely,
> >     >>> this happens when I increase parallelism above some threshold.
> >     If I use
> >     >>> parallelism of N, watermarks are shown, when I increase it
> >     above some
> >     >>> number (seems not to be exactly deterministic), watermarks
> >     seems to
> >     >>> disappear.
> >     >>>
> >     >>> I'm using Flink 1.8.1.
> >     >>>
> >     >>> Did anyone experience something like this before?
> >     >>>
> >     >>> Jan
> >     >>>
> >     >>>
> >     >
> >
>

Re: Watermarks not propagated to WebUI?

Posted by Jan Lukavský <je...@seznam.cz>.
Hi Robert,

I'd very much love to, but because I run my pipeline with Beam, I'm 
afraid I will have to wait a little longer, before Beam has runner for 
1.9 [1]. I'm pretty sure that the watermarks disappeared with overall 
parallelism (over all operators) something above 2000. There was quite a 
lot of operators (shuffling), so the individual parallelism of each 
operator was about 200. The pipeline was spread over 50 taskmanager 
(each having 4 slots).

Jan

[1] https://github.com/apache/beam/pull/9296/

On 8/26/19 10:23 AM, Robert Metzger wrote:
> Jan, will you be able to test this issue on the now-released Flink 1.9 
> with the new UI?
>
> What parallelism is needed to reproduce the issue?
>
>
> On Thu, Aug 15, 2019 at 1:59 PM Chesnay Schepler <chesnay@apache.org 
> <ma...@apache.org>> wrote:
>
>     I remember an issue regarding the watermark fetch request from the
>     WebUI
>     exceeding some HTTP size limit, since it tries to fetch all
>     watermarks
>     at once, and the format of this request isn't exactly efficient.
>
>     Querying metrics for individual operators still works since the
>     request
>     is small enough.
>
>     Not sure whether we ever fixed that.
>
>     On 15/08/2019 12:01, Jan Lukavský wrote:
>     > Hi,
>     >
>     > Thomas, thanks for confirming this. I have noticed, that in 1.9 the
>     > WebUI has been reworked a lot, does anyone know if this is still an
>     > issue? I currently cannot easily try 1.9, so I cannot confirm or
>     > disprove that.
>     >
>     > Jan
>     >
>     > On 8/14/19 6:25 PM, Thomas Weise wrote:
>     >> I have also noticed this issue (Flink 1.5, Flink 1.8), and it
>     appears
>     >> with
>     >> higher parallelism.
>     >>
>     >> This can be confusing to the user when watermarks actually work
>     and
>     >> can be
>     >> observed using the metrics.
>     >>
>     >> On Wed, Aug 14, 2019 at 7:36 AM Jan Lukavský <je.ik@seznam.cz
>     <ma...@seznam.cz>> wrote:
>     >>
>     >>> Hi,
>     >>>
>     >>> is it possible, that watermarks are sometimes not propagated
>     to WebUI,
>     >>> although they are internally moving as normal? I see in WebUI
>     every
>     >>> operator showing "No Watermark", but outputs seem to be
>     propagated to
>     >>> sink (and there are watermark sensitive operations involved - e.g.
>     >>> reductions on fixed windows without early emitting). More
>     strangely,
>     >>> this happens when I increase parallelism above some threshold.
>     If I use
>     >>> parallelism of N, watermarks are shown, when I increase it
>     above some
>     >>> number (seems not to be exactly deterministic), watermarks
>     seems to
>     >>> disappear.
>     >>>
>     >>> I'm using Flink 1.8.1.
>     >>>
>     >>> Did anyone experience something like this before?
>     >>>
>     >>> Jan
>     >>>
>     >>>
>     >
>

Re: Watermarks not propagated to WebUI?

Posted by Robert Metzger <rm...@apache.org>.
Jan, will you be able to test this issue on the now-released Flink 1.9 with
the new UI?

What parallelism is needed to reproduce the issue?


On Thu, Aug 15, 2019 at 1:59 PM Chesnay Schepler <ch...@apache.org> wrote:

> I remember an issue regarding the watermark fetch request from the WebUI
> exceeding some HTTP size limit, since it tries to fetch all watermarks
> at once, and the format of this request isn't exactly efficient.
>
> Querying metrics for individual operators still works since the request
> is small enough.
>
> Not sure whether we ever fixed that.
>
> On 15/08/2019 12:01, Jan Lukavský wrote:
> > Hi,
> >
> > Thomas, thanks for confirming this. I have noticed, that in 1.9 the
> > WebUI has been reworked a lot, does anyone know if this is still an
> > issue? I currently cannot easily try 1.9, so I cannot confirm or
> > disprove that.
> >
> > Jan
> >
> > On 8/14/19 6:25 PM, Thomas Weise wrote:
> >> I have also noticed this issue (Flink 1.5, Flink 1.8), and it appears
> >> with
> >> higher parallelism.
> >>
> >> This can be confusing to the user when watermarks actually work and
> >> can be
> >> observed using the metrics.
> >>
> >> On Wed, Aug 14, 2019 at 7:36 AM Jan Lukavský <je...@seznam.cz> wrote:
> >>
> >>> Hi,
> >>>
> >>> is it possible, that watermarks are sometimes not propagated to WebUI,
> >>> although they are internally moving as normal? I see in WebUI every
> >>> operator showing "No Watermark", but outputs seem to be propagated to
> >>> sink (and there are watermark sensitive operations involved - e.g.
> >>> reductions on fixed windows without early emitting). More strangely,
> >>> this happens when I increase parallelism above some threshold. If I use
> >>> parallelism of N, watermarks are shown, when I increase it above some
> >>> number (seems not to be exactly deterministic), watermarks seems to
> >>> disappear.
> >>>
> >>> I'm using Flink 1.8.1.
> >>>
> >>> Did anyone experience something like this before?
> >>>
> >>> Jan
> >>>
> >>>
> >
>
>

Re: Watermarks not propagated to WebUI?

Posted by Chesnay Schepler <ch...@apache.org>.
I remember an issue regarding the watermark fetch request from the WebUI 
exceeding some HTTP size limit, since it tries to fetch all watermarks 
at once, and the format of this request isn't exactly efficient.

Querying metrics for individual operators still works since the request 
is small enough.

Not sure whether we ever fixed that.

On 15/08/2019 12:01, Jan Lukavský wrote:
> Hi,
>
> Thomas, thanks for confirming this. I have noticed, that in 1.9 the 
> WebUI has been reworked a lot, does anyone know if this is still an 
> issue? I currently cannot easily try 1.9, so I cannot confirm or 
> disprove that.
>
> Jan
>
> On 8/14/19 6:25 PM, Thomas Weise wrote:
>> I have also noticed this issue (Flink 1.5, Flink 1.8), and it appears 
>> with
>> higher parallelism.
>>
>> This can be confusing to the user when watermarks actually work and 
>> can be
>> observed using the metrics.
>>
>> On Wed, Aug 14, 2019 at 7:36 AM Jan Lukavský <je...@seznam.cz> wrote:
>>
>>> Hi,
>>>
>>> is it possible, that watermarks are sometimes not propagated to WebUI,
>>> although they are internally moving as normal? I see in WebUI every
>>> operator showing "No Watermark", but outputs seem to be propagated to
>>> sink (and there are watermark sensitive operations involved - e.g.
>>> reductions on fixed windows without early emitting). More strangely,
>>> this happens when I increase parallelism above some threshold. If I use
>>> parallelism of N, watermarks are shown, when I increase it above some
>>> number (seems not to be exactly deterministic), watermarks seems to
>>> disappear.
>>>
>>> I'm using Flink 1.8.1.
>>>
>>> Did anyone experience something like this before?
>>>
>>> Jan
>>>
>>>
>


Re: Watermarks not propagated to WebUI?

Posted by Jan Lukavský <je...@seznam.cz>.
Hi,

Thomas, thanks for confirming this. I have noticed, that in 1.9 the 
WebUI has been reworked a lot, does anyone know if this is still an 
issue? I currently cannot easily try 1.9, so I cannot confirm or 
disprove that.

Jan

On 8/14/19 6:25 PM, Thomas Weise wrote:
> I have also noticed this issue (Flink 1.5, Flink 1.8), and it appears with
> higher parallelism.
>
> This can be confusing to the user when watermarks actually work and can be
> observed using the metrics.
>
> On Wed, Aug 14, 2019 at 7:36 AM Jan Lukavský <je...@seznam.cz> wrote:
>
>> Hi,
>>
>> is it possible, that watermarks are sometimes not propagated to WebUI,
>> although they are internally moving as normal? I see in WebUI every
>> operator showing "No Watermark", but outputs seem to be propagated to
>> sink (and there are watermark sensitive operations involved - e.g.
>> reductions on fixed windows without early emitting). More strangely,
>> this happens when I increase parallelism above some threshold. If I use
>> parallelism of N, watermarks are shown, when I increase it above some
>> number (seems not to be exactly deterministic), watermarks seems to
>> disappear.
>>
>> I'm using Flink 1.8.1.
>>
>> Did anyone experience something like this before?
>>
>> Jan
>>
>>

Re: Watermarks not propagated to WebUI?

Posted by Thomas Weise <th...@apache.org>.
I have also noticed this issue (Flink 1.5, Flink 1.8), and it appears with
higher parallelism.

This can be confusing to the user when watermarks actually work and can be
observed using the metrics.

On Wed, Aug 14, 2019 at 7:36 AM Jan Lukavský <je...@seznam.cz> wrote:

> Hi,
>
> is it possible, that watermarks are sometimes not propagated to WebUI,
> although they are internally moving as normal? I see in WebUI every
> operator showing "No Watermark", but outputs seem to be propagated to
> sink (and there are watermark sensitive operations involved - e.g.
> reductions on fixed windows without early emitting). More strangely,
> this happens when I increase parallelism above some threshold. If I use
> parallelism of N, watermarks are shown, when I increase it above some
> number (seems not to be exactly deterministic), watermarks seems to
> disappear.
>
> I'm using Flink 1.8.1.
>
> Did anyone experience something like this before?
>
> Jan
>
>