You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@atlas.apache.org by "Jasper Knulst (Jira)" <ji...@apache.org> on 2022/05/11 15:47:00 UTC

[jira] [Created] (ATLAS-4602) Report Lag for consuming from ATLAS_HOOK

Jasper Knulst created ATLAS-4602:
------------------------------------

             Summary: Report Lag for consuming from ATLAS_HOOK
                 Key: ATLAS-4602
                 URL: https://issues.apache.org/jira/browse/ATLAS-4602
             Project: Atlas
          Issue Type: Improvement
          Components:  atlas-core
    Affects Versions: 2.2.0
            Reporter: Jasper Knulst
             Fix For: trunk
         Attachments: image-2022-05-11-17-42-12-250.png

Currently the 'Stats' webUI function shows some details about the consumption from the ATLAS_HOOK Kafka topic where changes from Hive Metastore arrive.

 !image-2022-05-11-17-42-12-250.png! 

By far the most important metric is not available though; the lag the atlas server consumer-group has in consuming Hive updates.

Monitoring the lag is very important as trust in Atlas is greatly undermined when changes are not reflected in Atlas within seconds. I have had numerous occasions where ATLAS_HOOK consumption was slowing down silently and atlas was behind tens of thousands (or 2 days) worth of messages.

There should be a new metric for the lag on the stats page to quickly identify a possible reason for slow Atlas updates




--
This message was sent by Atlassian Jira
(v8.20.7#820007)