You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "KulykDmytro (via GitHub)" <gi...@apache.org> on 2023/02/24 09:22:41 UTC
[GitHub] [airflow] KulykDmytro opened a new issue, #29737: Metric dag_processing.total_parse_time is exposed incorrectly
KulykDmytro opened a new issue, #29737:
URL: https://github.com/apache/airflow/issues/29737
### Apache Airflow version
2.5.1
### What happened
Following metric `dag_processing.total_parse_time` exposing smth not relevant to overall processing time
![image](https://user-images.githubusercontent.com/34435869/221139873-03f9623f-76eb-40bb-98f8-10e5950c1cd5.png)
```
DAG File Processing Stats
File Path PID Runtime # DAGs # Errors Last Runtime Last Run
------------------------------------------------- ----- --------- -------- ---------- -------------- -------------------
/opt/airflow/dags/player_time_features.py 1 0 0.89s 2023-02-24T09:18:59
/opt/airflow/dags/maxmind_uploader.py 1 0 0.81s 2023-02-24T09:19:29
/opt/airflow/dags/player_team_similarities.py 1 0 1.34s 2023-02-24T09:18:31
/opt/airflow/dags/aws_sanitize.py 1 0 3.58s 2023-02-24T09:18:58
/opt/airflow/dags/ad_backoffice.py 1 0 1.05s 2023-02-24T09:19:28
/opt/airflow/dags/gdpr.py 1 0 0.46s 2023-02-24T09:18:39
/opt/airflow/dags/aws_regional_split_backfill.py 1 0 2.55s 2023-02-24T09:19:12
/opt/airflow/dags/reconciliation_daily.py 1 0 1.04s 2023-02-24T09:19:16
/opt/airflow/dags/maintenance.py 1 0 0.24s 2023-02-24T09:17:51
/opt/airflow/dags/athena_repair_partitions.py 1 0 0.81s 2023-02-24T09:19:28
/opt/airflow/dags/trending_events.py 1 0 1.74s 2023-02-24T09:19:07
/opt/airflow/dags/casino_games_similarities.py 1 0 1.17s 2023-02-24T09:18:18
/opt/airflow/dags/player_segmentation.py 1 0 1.44s 2023-02-24T09:17:41
/opt/airflow/dags/test_copy.py 1 0 0.71s 2023-02-24T09:19:01
/opt/airflow/dags/datahub_ingest.py 1 0 0.90s 2023-02-24T09:19:05
/opt/airflow/dags/gs_exporter.py 1 0 2.65s 2023-02-24T09:17:37
/opt/airflow/dags/canary.py 1 0 1.75s 2023-02-24T09:19:04
/opt/airflow/dags/sfsc_load.py 1 0 1.06s 2023-02-24T09:19:15
/opt/airflow/dags/led_recognition_processed.py 1 0 2.28s 2023-02-24T09:19:09
/opt/airflow/dags/casino_games_recommendations.py 1 0 1.77s 2023-02-24T09:18:59
/opt/airflow/dags/vip_export.py 1 0 1.29s 2023-02-24T09:18:29
/opt/airflow/dags/autotest.py 2 0 0.63s 2023-02-24T09:19:30
/opt/airflow/dags/anomaly_detection.py 2 0 3.67s 2023-02-24T09:17:37
/opt/airflow/dags/native_rls.py 3 0 1.14s 2023-02-24T09:19:26
/opt/airflow/dags/dbt.py 4 0 1.23s 2023-02-24T09:18:14
/opt/airflow/dags/front_data.py 4 0 8.55s 2023-02-24T09:18:10
/opt/airflow/dags/gs_loader.py 4 0 1.37s 2023-02-24T09:19:25
/opt/airflow/dags/backfill_lambda.py 6 0 1.91s 2023-02-24T09:19:14
/opt/airflow/dags/tableau_refresher.py 7 0 1.85s 2023-02-24T09:17:39
/opt/airflow/dags/extract_load.py 8 0 1.30s 2023-02-24T09:18:49
/opt/airflow/dags/validator.py 30 0 1.91s 2023-02-24T09:19:24
/opt/airflow/dags/aws_transforms.py 1768 34.15s 56 0 151.91s 2023-02-24T09:17:01
```
### What you think should happen instead
_No response_
### How to reproduce
process dags by dag-processor
observe exposed metric: dag_processing.total_parse_time
### Operating System
n/a
### Versions of Apache Airflow Providers
_No response_
### Deployment
Official Apache Airflow Helm Chart
### Deployment details
AWS EKS
### Anything else
_No response_
### Are you willing to submit PR?
- [ ] Yes I am willing to submit a PR!
### Code of Conduct
- [X] I agree to follow this project's [Code of Conduct](https://github.com/apache/airflow/blob/main/CODE_OF_CONDUCT.md)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscribe@airflow.apache.org.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [airflow] potiuk commented on issue #29737: Metric dag_processing.total_parse_time is exposed incorrectly
Posted by "potiuk (via GitHub)" <gi...@apache.org>.
potiuk commented on issue #29737:
URL: https://github.com/apache/airflow/issues/29737#issuecomment-1513963179
I think this is something to be addressed with the Open-Telemetry implementation @feruzzi @howardyoo - possibly dropping or replacing this metric with some aggregated one.
I think we should not keep it as an issue, now when it is documented, I doubt we will do anything with it but it should be looked at with the OTEL work.
converting to a discussion.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscribe@airflow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [airflow] potiuk closed issue #29737: Metric dag_processing.total_parse_time is exposed incorrectly
Posted by "potiuk (via GitHub)" <gi...@apache.org>.
potiuk closed issue #29737: Metric dag_processing.total_parse_time is exposed incorrectly
URL: https://github.com/apache/airflow/issues/29737
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscribe@airflow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
[GitHub] [airflow] vandonr-amz commented on issue #29737: Metric dag_processing.total_parse_time is exposed incorrectly
Posted by "vandonr-amz (via GitHub)" <gi...@apache.org>.
vandonr-amz commented on issue #29737:
URL: https://github.com/apache/airflow/issues/29737#issuecomment-1499650247
I made a change recently (#30290) to edit the doc about this metric. The number of dag (files) parsed can vary from one run to the next, explaining (some of) the volatility.
You could try dividing one metric by the other, but since they are not emitted at the same time, it can still yield to bad results.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: commits-unsubscribe@airflow.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org