You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@airflow.apache.org by "Bob De Schutter (JIRA)" <ji...@apache.org> on 2017/02/02 12:33:51 UTC
[jira] [Work started] (AIRFLOW-827) Add scrapyd operator
[ https://issues.apache.org/jira/browse/AIRFLOW-827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Work on AIRFLOW-827 started by Bob De Schutter.
-----------------------------------------------
> Add scrapyd operator
> --------------------
>
> Key: AIRFLOW-827
> URL: https://issues.apache.org/jira/browse/AIRFLOW-827
> Project: Apache Airflow
> Issue Type: New Feature
> Components: contrib
> Reporter: Bob De Schutter
> Assignee: Bob De Schutter
> Labels: operator
>
> This operator allows to schedule a spider run on a scrapyd server.
> Optionally, the operator can wait for the crawl process to finish
> which allows for downstream tasks to use the scraped data.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)