Posted to commits@airflow.apache.org by "Diogo Franco (JIRA)" <ji...@apache.org> on 2018/03/16 10:48:00 UTC
[jira] [Updated] (AIRFLOW-2221) Fill up DagBag from remote locations
[ https://issues.apache.org/jira/browse/AIRFLOW-2221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Diogo Franco updated AIRFLOW-2221:
----------------------------------
Summary: Fill up DagBag from remote locations (was: Fill up DagBad from remote locations)
> Fill up DagBag from remote locations
> ------------------------------------
>
> Key: AIRFLOW-2221
> URL: https://issues.apache.org/jira/browse/AIRFLOW-2221
> Project: Apache Airflow
> Issue Type: New Feature
> Components: configuration, core
> Affects Versions: Airflow 2.0
> Reporter: Diogo Franco
> Assignee: Diogo Franco
> Priority: Major
> Fix For: Airflow 2.0
>
>
> The ability to fill up the DagBag from remote locations (HDFS, S3, ...) would be useful, e.g. for simplifying deployment processes.
> This JIRA is to propose an implementation of a *DagFetcher* abstraction on the DagBag, where the collect_dags method can delegate the walking to a *FileSystemDagFetcher*, *GitRepoDagFetcher*, *S3DagFetcher*, *HDFSDagFetcher*, *GCSDagFetcher*, *ArtifactoryDagFetcher* or even *TarballInS3DagFetcher*.
> This was briefly discussed in [this mailing list thread|https://lists.apache.org/thread.html/03ddcd3a42b7fd6e3dad9711e8adea37fc00391f6053762f73af5b6a@%3Cdev.airflow.apache.org%3E].
> I'm happy to start work on this and provide an initial implementation for review.
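
For illustration only, the proposed abstraction could be sketched roughly as below. This is not the implementation attached to this issue: the base class interface, the `fetch` method, the `FETCHERS` registry and the `get_dag_fetcher` helper are all hypothetical names, and only the local filesystem case is shown. A remote fetcher (S3, HDFS, Git, ...) would presumably download files to a local directory first and then yield paths in the same way.

```python
import os
from abc import ABC, abstractmethod
from typing import Iterator
from urllib.parse import urlparse


class DagFetcher(ABC):
    """Hypothetical base class: yields local paths of candidate DAG files."""

    def __init__(self, location: str) -> None:
        self.location = location

    @abstractmethod
    def fetch(self) -> Iterator[str]:
        """Yield local filesystem paths to Python files that may define DAGs."""


class FileSystemDagFetcher(DagFetcher):
    """Walks a local directory tree and yields every .py file found."""

    def fetch(self) -> Iterator[str]:
        for root, dirs, files in os.walk(self.location):
            dirs.sort()  # visit subdirectories in a deterministic order
            for name in sorted(files):
                if name.endswith(".py"):
                    yield os.path.join(root, name)


# Hypothetical registry mapping URI schemes to fetcher classes.
# Remote fetchers (e.g. for "s3" or "hdfs") would be registered here.
FETCHERS = {"": FileSystemDagFetcher, "file": FileSystemDagFetcher}


def get_dag_fetcher(uri: str) -> DagFetcher:
    """Pick a fetcher from the URI scheme (assumes POSIX-style paths)."""
    parsed = urlparse(uri)
    fetcher_cls = FETCHERS.get(parsed.scheme)
    if fetcher_cls is None:
        raise ValueError(f"No DagFetcher registered for scheme {parsed.scheme!r}")
    # A bare path has no scheme; pass it through unchanged.
    return fetcher_cls(parsed.path if parsed.scheme else uri)
```

Under these assumptions, collect_dags would ask `get_dag_fetcher(dag_folder).fetch()` for file paths instead of walking the folder itself, so supporting a new storage backend would only require registering another fetcher class.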
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)