You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@flink.apache.org by "João Boto (Jira)" <ji...@apache.org> on 2022/05/09 06:51:00 UTC

[jira] [Updated] (FLINK-27552) Prometheus metrics

     [ https://issues.apache.org/jira/browse/FLINK-27552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

João Boto updated FLINK-27552:
------------------------------
    Description: 
I have a Standalone cluster (with jobmanager and taskmanager on same machine) on 1.14.4 and I'm testing the migration to 1.15.0

But I keep losing the taskmanager metrics when I start a job on the 1.15 cluster

I use the same configuration as in the previous cluster

{{  }}
{code:java}
metrics.reporters: prom 
metrics.reporter.prom.factory.class: org.apache.flink.metrics.prometheus.PrometheusReporterFactory 
metrics.reporter.prom.port: 9250-9251{code}
{{ }}
If the cluster is running without jobs I can see the metrics on port 9250 for jobmanager and on port 9251 for taskmanager

If I start a job, the metrics from taskmanager disappear and if I stop the job the metrics come live again

What am I missing?
 

  was:
I have a Standalone cluster (with jobmanager and taskmanager on same machine) on 1.14.4 and I'm testing the migration to 1.15.0

But I keep losing the taskmanager metrics when I start a job on the 1.15 cluster

I use the same configuration as in the previous cluster

{{  }}
{code:java}
metrics.reporters: prom metrics.reporter.prom.factory.class: org.apache.flink.metrics.prometheus.PrometheusReporterFactory metrics.reporter.prom.port: 9250-9251{code}
{{ }}
If the cluster is running without jobs I can see the metrics on port 9250 for jobmanager and on port 9251 for taskmanager

If I start a job, the metrics from taskmanager disappear and if I stop the job the metrics come live again

What am I missing?
 


> Prometheus metrics
> ------------------
>
>                 Key: FLINK-27552
>                 URL: https://issues.apache.org/jira/browse/FLINK-27552
>             Project: Flink
>          Issue Type: Bug
>          Components: Runtime / Metrics
>    Affects Versions: 1.15.0
>            Reporter: João Boto
>            Priority: Major
>
> I have a Standalone cluster (with jobmanager and taskmanager on same machine) on 1.14.4 and I'm testing the migration to 1.15.0
> But I keep losing the taskmanager metrics when I start a job on the 1.15 cluster
> I use the same configuration as in the previous cluster
> {{  }}
> {code:java}
> metrics.reporters: prom 
> metrics.reporter.prom.factory.class: org.apache.flink.metrics.prometheus.PrometheusReporterFactory 
> metrics.reporter.prom.port: 9250-9251{code}
> {{ }}
> If the cluster is running without jobs I can see the metrics on port 9250 for jobmanager and on port 9251 for taskmanager
> If I start a job, the metrics from taskmanager disappear and if I stop the job the metrics come live again
> What am I missing?
>  



--
This message was sent by Atlassian Jira
(v8.20.7#820007)