Posted to notifications@skywalking.apache.org by GitBox <gi...@apache.org> on 2021/03/12 01:41:40 UTC

[GitHub] [skywalking] ppliuppliu opened a new issue #6541: A little confused about the alarm rules

ppliuppliu opened a new issue #6541:
URL: https://github.com/apache/skywalking/issues/6541


   Please answer these questions before submitting your issue.
   
   - Why do you submit this issue?
   - [X] Question or discussion
   - [ ] Bug
   - [ ] Requirement
   - [ ] Feature or performance improvement
   
   ___
   ### Question
   - What do you want to know?
   I am a little confused about the concept of the alarm rule, even after reading the document many times.
   
   - How often will the data be evaluated? If I set the _Period_ to 10, will it evaluate the data of the past 10 minutes every 1 minute? Or can I set the interval somewhere?
   - What exactly does _Count_ mean? If I set the _Count_ to 2 and the _Period_ to 10, does it mean the metric must meet the alarm condition for 20 minutes? Or for two evaluation intervals?
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [skywalking] ppliuppliu removed a comment on issue #6541: A little confused about the alarm rules

Posted by GitBox <gi...@apache.org>.
ppliuppliu removed a comment on issue #6541:
URL: https://github.com/apache/skywalking/issues/6541#issuecomment-797316658


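   I checked the code: alarms are checked every 10s, but that leaves me even less able to understand what count means here.
   Take the default alarm configuration as an example:
   `  service_resp_time_rule:
       metrics-name: service_resp_time
       op: ">"
       threshold: 1000
       period: 10
       count: 3
       silence-period: 5
       message: Response time of service {name} is more than 1000ms in 3 minutes of last 10 minutes.`
   
   Why does count: 3 here mean that the response time exceeded 1000ms in 3 of the last 10 minutes? When the service's throughput is relatively high, there are dozens of requests per second and each request has its own response time, so how does this 3 map to 3 minutes? If it refers to the check interval, shouldn't it be 30s?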





[GitHub] [skywalking] wu-sheng commented on issue #6541: A little confused about the alarm rules

Posted by GitBox <gi...@apache.org>.
wu-sheng commented on issue #6541:
URL: https://github.com/apache/skywalking/issues/6541#issuecomment-797504838


   In storage, all metrics have been downsampled to minute granularity. For example, CPM at 21:54 is one slot, and the next slot is 21:55. Your traffic load doesn't matter.
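
To illustrate the point about downsampling, here is a minimal Python sketch, not SkyWalking's actual code. It assumes that per-request response times are averaged into one value per minute; the function name `downsample_to_minutes` and the sample data are hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def downsample_to_minutes(requests):
    """Group (timestamp, latency_ms) pairs into one averaged value per minute.

    Hypothetical helper: averaging is an assumption made for illustration.
    """
    buckets = defaultdict(list)
    for ts, latency_ms in requests:
        # Truncate the timestamp to the minute; that minute is the slot.
        slot = ts.replace(second=0, microsecond=0)
        buckets[slot].append(latency_ms)
    return {
        slot.strftime("%H:%M"): sum(values) / len(values)
        for slot, values in sorted(buckets.items())
    }


# Made-up requests: three in the 21:54 slot, one in the 21:55 slot.
requests = [
    (datetime(2021, 3, 12, 21, 54, 1), 800),
    (datetime(2021, 3, 12, 21, 54, 30), 1200),
    (datetime(2021, 3, 12, 21, 54, 59), 1600),
    (datetime(2021, 3, 12, 21, 55, 10), 500),
]
slots = downsample_to_minutes(requests)
print(slots)  # {'21:54': 1200.0, '21:55': 500.0}
```

However many requests arrive within a minute, they collapse into a single slot value, so the alarm rule only ever sees one number per minute.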





[GitHub] [skywalking] wu-sheng closed issue #6541: A little confused about the alarm rules

Posted by GitBox <gi...@apache.org>.
wu-sheng closed issue #6541:
URL: https://github.com/apache/skywalking/issues/6541


   





[GitHub] [skywalking] ppliuppliu commented on issue #6541: A little confused about the alarm rules

Posted by GitBox <gi...@apache.org>.
ppliuppliu commented on issue #6541:
URL: https://github.com/apache/skywalking/issues/6541#issuecomment-797326845


   How should I understand _how many slots_? Take response time as an example. When the application throughput is relatively high, there are hundreds of requests per second, and each request has a response time. Does each request count toward the _Count_?





[GitHub] [skywalking] wu-sheng commented on issue #6541: A little confused about the alarm rules

Posted by GitBox <gi...@apache.org>.
wu-sheng commented on issue #6541:
URL: https://github.com/apache/skywalking/issues/6541#issuecomment-797313985


   Think of it this way: period is the window of metric slots. Count means how many slots are over the threshold in the given window.
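
The window-and-count idea can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not SkyWalking's implementation: the function `should_alarm` is hypothetical, the comparison is fixed to `>`, "count" is assumed to mean "at least this many matching slots", and `silence-period` is ignored.

```python
def should_alarm(slots, threshold, period, count):
    """Return True if at least `count` of the last `period` slots exceed `threshold`.

    Hypothetical helper: each element of `slots` is one per-minute metric value,
    oldest first. The ">= count" comparison is an assumption.
    """
    window = slots[-period:]  # the most recent `period` minute slots
    matched = sum(1 for value in window if value > threshold)
    return matched >= count


# Made-up response-time values (ms), one per minute, for the last 10 minutes.
slots = [900, 950, 1100, 980, 1200, 990, 940, 1300, 970, 960]

# With threshold=1000, period=10, count=3: three slots (1100, 1200, 1300) match.
print(should_alarm(slots, threshold=1000, period=10, count=3))  # True
print(should_alarm(slots, threshold=1000, period=10, count=4))  # False
```

Under these assumptions, `period: 10` with `count: 3` reads as "3 of the last 10 one-minute slots were over the threshold", and the matching slots do not need to be consecutive.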





[GitHub] [skywalking] ppliuppliu commented on issue #6541: A little confused about the alarm rules

Posted by GitBox <gi...@apache.org>.
ppliuppliu commented on issue #6541:
URL: https://github.com/apache/skywalking/issues/6541#issuecomment-797316658


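   I checked the code: alarms are checked every 10s, but that leaves me even less able to understand what count means here.
   Take the default alarm configuration as an example:
   `  service_resp_time_rule:
       metrics-name: service_resp_time
       op: ">"
       threshold: 1000
       period: 10
       count: 3
       silence-period: 5
       message: Response time of service {name} is more than 1000ms in 3 minutes of last 10 minutes.`
   
   Why does count: 3 here mean that the response time exceeded 1000ms in 3 of the last 10 minutes? When the service's throughput is relatively high, there are dozens of requests per second and each request has its own response time, so how does this 3 map to 3 minutes? If it refers to the check interval, shouldn't it be 30s?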

