Posted to issues@trafficcontrol.apache.org by GitBox <gi...@apache.org> on 2021/08/19 23:10:27 UTC

[GitHub] [trafficcontrol] ocket8888 commented on a change in pull request #6126: Distributed Traffic Monitor blueprint

ocket8888 commented on a change in pull request #6126:
URL: https://github.com/apache/trafficcontrol/pull/6126#discussion_r692543391



##########
File path: blueprints/distributed-traffic-monitor.md
##########
@@ -0,0 +1,180 @@
+<!--
+Licensed to the Apache Software Foundation (ASF) under one
+or more contributor license agreements.  See the NOTICE file
+distributed with this work for additional information
+regarding copyright ownership.  The ASF licenses this file
+to you under the Apache License, Version 2.0 (the
+"License"); you may not use this file except in compliance
+with the License.  You may obtain a copy of the License at
+
+    http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing,
+software distributed under the License is distributed on an
+"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+KIND, either express or implied.  See the License for the
+specific language governing permissions and limitations
+under the License.
+-->
+# Distributed Traffic Monitor
+
+## Problem Description
+Currently, TM polls all caches in a CDN. As CDNs grow, this becomes a major
+pain point as TM is limited by the amount of bandwidth and CPU it requires to
+receive and process data from every cache on the CDN, and scaling vertically by
+running it on better hardware is only feasible up to a certain point. Also, the
+performance of a cache observed by a TM which is very far away from it does not
+always reflect the performance observed by clients that are actually using the
+cache (because the clients are typically much closer to it).
+
+## Proposed Change
+TM should have the ability to poll only a subset of caches in a CDN and peer
+with other TMs which are monitoring other subsets in order to get a full view
+of the CDN's health. This would allow us to run TM in a more distributed manner
+across the CDN, giving us a view of cache health that is closer to what clients
+actually observe and enabling us to scale TM horizontally. Additionally, we
+would like to have the option to disable _stat polling_ in order for these
+distributed TMs to focus on _health polling_.
+
+### Traffic Portal Impact
+This proposal does not require any TP changes.
+
+### Traffic Ops Impact
+This proposal should have no impact on TO. The existing TO API endpoints
+already provide the data that TM will need to run in a distributed manner.

Review comment:
       Will it affect the TO endpoints that call into TM, though? For example, `cdns/{{name}}/health` just picks a random ONLINE TM and fetches its `/publish/CrStates`, but with distributed TMs that response may no longer cover the whole CDN.



