You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "Diogo Franco (JIRA)" <ji...@apache.org> on 2018/03/16 10:48:00 UTC

[jira] [Created] (AIRFLOW-2221) Fill up DagBad from remote locations

Diogo Franco created AIRFLOW-2221:
-------------------------------------

             Summary: Fill up DagBad from remote locations
                 Key: AIRFLOW-2221
                 URL: https://issues.apache.org/jira/browse/AIRFLOW-2221
             Project: Apache Airflow
          Issue Type: New Feature
          Components: configuration, core
    Affects Versions: Airflow 2.0
            Reporter: Diogo Franco
            Assignee: Diogo Franco
             Fix For: Airflow 2.0


The ability to fill up the DagBag from remote locations (HDFS, S3...) seems to be deemed useful, e.g. facilitating deployment processes.

This JIRA is to propose an implementation of a *DagFetcher* abstraction on the DagBag, where the collect_dags method can delegate the walking to a *FileSystemDagFetcher*, *GitRepoDagFetcher*, *S3DagFetcher*, *HDFSDagFetcher*, *GCSDagFetcher*, *ArtifactoryDagFetcher* or even *TarballInS3DagFetcher*.

This was briefly discussed in [this mailing list thread|https://lists.apache.org/thread.html/03ddcd3a42b7fd6e3dad9711e8adea37fc00391f6053762f73af5b6a@%3Cdev.airflow.apache.org%3E]

I'm happy to start work on this and provide an initial implementation for review.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)