Posted to commits@airflow.apache.org by "ASF GitHub Bot (Jira)" <ji...@apache.org> on 2020/02/03 21:55:01 UTC

[jira] [Commented] (AIRFLOW-6685) Add Data Quality Operators

    [ https://issues.apache.org/jira/browse/AIRFLOW-6685?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17029324#comment-17029324 ] 

ASF GitHub Bot commented on AIRFLOW-6685:
-----------------------------------------

alexzlue commented on pull request #7353: [AIRFLOW-6685] Data Quality Check operators
URL: https://github.com/apache/airflow/pull/7353
 
 
   This PR includes 3 operators:
   `BaseDataQualityOperator`
   - contains shared attributes and methods that data quality check operators use
   - a base class that can be used to create other data quality (dq) operators
   
   `DataQualityThresholdCheckOperator`
   - checks a single-value SQL result against a threshold range and fails the task if the value falls outside that range.
   
   `DataQualityThresholdSQLCheckOperator`
   - similar to `DataQualityThresholdCheckOperator`, but the thresholds are themselves SQL-evaluated values, so the range can be computed dynamically.
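
The threshold check described for `DataQualityThresholdCheckOperator` can be sketched roughly as follows. This is not the code from the PR: it uses the standard-library `sqlite3` module instead of an Airflow hook, the function names `get_single_value` and `check_threshold` are hypothetical, and treating both bounds as inclusive is an assumption.

```python
import sqlite3


def get_single_value(conn, sql):
    """Run `sql` and return its single scalar result (hypothetical helper)."""
    rows = conn.execute(sql).fetchall()
    if len(rows) != 1 or len(rows[0]) != 1:
        raise ValueError("query must return exactly one row and one column")
    return rows[0][0]


def check_threshold(conn, sql, min_threshold, max_threshold):
    """Fail (raise) if the query result is outside the range.

    Assumption: both bounds are inclusive.
    """
    value = get_single_value(conn, sql)
    if not (min_threshold <= value <= max_threshold):
        raise ValueError(
            f"result {value} is outside [{min_threshold}, {max_threshold}]"
        )
    return value


# Toy data standing in for a real warehouse table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (amount REAL)")
conn.executemany("INSERT INTO orders VALUES (?)", [(10.0,), (20.0,), (30.0,)])

# The average order amount must fall between 15 and 25.
avg_amount = check_threshold(conn, "SELECT AVG(amount) FROM orders", 15, 25)
```

In an operator, the raised exception would be what marks the task as failed; here it simply propagates to the caller.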
   ---
   Issue link: WILL BE INSERTED BY [boring-cyborg](https://github.com/kaxil/boring-cyborg)
   
   Make sure to mark the boxes below before creating PR: [x]
   
   - [ ] Description above provides context of the change
   - [ ] Commit message/PR title starts with `[AIRFLOW-NNNN]`. AIRFLOW-NNNN = JIRA ID<sup>*</sup>
   - [ ] Unit tests coverage for changes (not needed for documentation changes)
   - [ ] Commits follow "[How to write a good git commit message](http://chris.beams.io/posts/git-commit/)"
   - [ ] Relevant documentation is updated including usage instructions.
   - [ ] I will engage committers as explained in [Contribution Workflow Example](https://github.com/apache/airflow/blob/master/CONTRIBUTING.rst#contribution-workflow-example).
   
   <sup>*</sup> For document-only changes commit message can start with `[AIRFLOW-XXXX]`.
   
   ---
   In case of fundamental code change, Airflow Improvement Proposal ([AIP](https://cwiki.apache.org/confluence/display/AIRFLOW/Airflow+Improvements+Proposals)) is needed.
   In case of a new dependency, check compliance with the [ASF 3rd Party License Policy](https://www.apache.org/legal/resolved.html#category-x).
   In case of backwards incompatible changes please leave a note in [UPDATING.md](https://github.com/apache/airflow/blob/master/UPDATING.md).
   Read the [Pull Request Guidelines](https://github.com/apache/airflow/blob/master/CONTRIBUTING.rst#pull-request-guidelines) for more information.
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
users@infra.apache.org


> Add Data Quality Operators 
> ---------------------------
>
>                 Key: AIRFLOW-6685
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-6685
>             Project: Apache Airflow
>          Issue Type: New Feature
>          Components: operators
>    Affects Versions: 2.0.0
>            Reporter: alex l
>            Assignee: alex l
>            Priority: Major
>
> Add Data Quality Operators to improve data quality testing on data workflows/pipelines. This includes 3 operators:
>  * BaseDataQualityOperator
>  ** contains shared attributes and methods that data quality check operators utilize
>  ** a base class that can be used to create other dq operators
>  * DataQualityThresholdCheckOperator
>  ** checks a single-value SQL result against a threshold range and fails the task if the value falls outside that range.
>  * DataQualityThresholdSQLCheckOperator
>  ** similar to DataQualityThresholdCheckOperator, but the thresholds are themselves SQL-evaluated values, so the range can be computed dynamically.
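
As a rough illustration of the SQL-evaluated thresholds mentioned in the issue, the sketch below computes both bounds with queries before comparing the checked value against them. It is an assumption-laden example, not the proposed operator: it relies on the standard-library `sqlite3` module, the names `scalar` and `check_sql_threshold` and the `daily_counts` table are hypothetical, and the bounds are assumed to be inclusive.

```python
import sqlite3


def scalar(conn, sql):
    """Run `sql` and return its single scalar result (hypothetical helper)."""
    rows = conn.execute(sql).fetchall()
    if len(rows) != 1 or len(rows[0]) != 1:
        raise ValueError("query must return exactly one row and one column")
    return rows[0][0]


def check_sql_threshold(conn, sql, min_threshold_sql, max_threshold_sql):
    """Evaluate both thresholds with SQL, then check the value against them.

    Assumption: both bounds are inclusive.
    """
    lower = scalar(conn, min_threshold_sql)
    upper = scalar(conn, max_threshold_sql)
    value = scalar(conn, sql)
    if not (lower <= value <= upper):
        raise ValueError(f"result {value} is outside [{lower}, {upper}]")
    return value, lower, upper


# Toy data: three historical daily row counts plus the day being checked.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE daily_counts (day TEXT, n INTEGER)")
conn.executemany(
    "INSERT INTO daily_counts VALUES (?, ?)",
    [
        ("2020-02-01", 100),
        ("2020-02-02", 110),
        ("2020-02-03", 90),
        ("2020-02-04", 120),
    ],
)

# The latest count must lie within 50%..150% of the historical average.
history = "FROM daily_counts WHERE day < '2020-02-04'"
result = check_sql_threshold(
    conn,
    "SELECT n FROM daily_counts WHERE day = '2020-02-04'",
    f"SELECT 0.5 * AVG(n) {history}",
    f"SELECT 1.5 * AVG(n) {history}",
)
```

Because the bounds come from queries, they follow the data as the history grows, which is the point of the dynamic range.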



--
This message was sent by Atlassian Jira
(v8.3.4#803005)