Posted to commits@airflow.apache.org by "Bob De Schutter (JIRA)" <ji...@apache.org> on 2017/02/02 12:33:51 UTC

[jira] [Work started] (AIRFLOW-827) Add scrapyd operator

     [ https://issues.apache.org/jira/browse/AIRFLOW-827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on AIRFLOW-827 started by Bob De Schutter.
-----------------------------------------------
> Add scrapyd operator
> --------------------
>
>                 Key: AIRFLOW-827
>                 URL: https://issues.apache.org/jira/browse/AIRFLOW-827
>             Project: Apache Airflow
>          Issue Type: New Feature
>          Components: contrib
>            Reporter: Bob De Schutter
>            Assignee: Bob De Schutter
>              Labels: operator
>
> This operator schedules a spider run on a scrapyd server.
> Optionally, the operator can wait for the crawl process to finish,
> which allows downstream tasks to use the scraped data.
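
The behaviour described above could be sketched roughly as follows. This is not the operator proposed in the issue, only a minimal illustration that assumes scrapyd's JSON API (`schedule.json` to start a run, `listjobs.json` to check job state). The function name `schedule_spider` and its parameters (`wait`, `poll_interval`, `call`, `sleep`) are hypothetical and chosen for this example.

```python
import json
import time
from urllib.parse import urlencode
from urllib.request import urlopen


def _call(url, data=None):
    """POST form data if given, otherwise GET; return the decoded JSON reply."""
    body = urlencode(data).encode() if data else None
    with urlopen(url, data=body) as resp:
        return json.loads(resp.read().decode())


def schedule_spider(base_url, project, spider, wait=False,
                    poll_interval=5.0, call=_call, sleep=time.sleep):
    """Schedule a spider run on scrapyd and optionally block until it finishes.

    Hypothetical helper: `call` and `sleep` are injectable so the logic can be
    exercised without a running scrapyd server.
    """
    reply = call(base_url + "/schedule.json",
                 {"project": project, "spider": spider})
    if reply.get("status") != "ok":
        raise RuntimeError("scrapyd refused the job: %r" % (reply,))
    job_id = reply["jobid"]

    # When wait is True, poll the job list until our job shows up as finished.
    while wait:
        jobs = call(base_url + "/listjobs.json?" + urlencode({"project": project}))
        if any(job["id"] == job_id for job in jobs.get("finished", [])):
            break
        sleep(poll_interval)
    return job_id
```

Inside an Airflow operator, a call such as `schedule_spider("http://localhost:6800", "myproject", "myspider", wait=True)` would sit in `execute()`, so that downstream tasks only start once the scraped data is available. The URL, project, and spider names here are placeholders.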



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)