You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Sujen Shah (JIRA)" <ji...@apache.org> on 2016/10/20 22:24:58 UTC

[jira] [Work started] (NUTCH-2327) Seeds injected in REST workflow must be ingested into HDFS

     [ https://issues.apache.org/jira/browse/NUTCH-2327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Work on NUTCH-2327 started by Sujen Shah.
-----------------------------------------
> Seeds injected in REST workflow must be ingested into HDFS
> ----------------------------------------------------------
>
>                 Key: NUTCH-2327
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2327
>             Project: Nutch
>          Issue Type: Improvement
>          Components: injector, REST_api
>    Affects Versions: 1.12
>            Reporter: Lewis John McGibbney
>            Assignee: Sujen Shah
>             Fix For: 1.13
>
>
> Right now when one uses the REST POST /seed/create API, a directory is created within /var/some/path/here which is create if you are working locally with the Nutch server e.g. on one machine. It is however not suitable for using the REST API in distributed deployments where seeds needs to be present within HDFS. More documentation on this topic is available at 
> https://wiki.apache.org/nutch/Nutch_1.X_RESTAPI#Seed_List_creation
> There are also various mailing list threads regarding use of the REST and this injector url issue described above needs to be addressed.
> [~sujenshah] CC for context.
> http://www.mail-archive.com/user%40nutch.apache.org/msg14922.html
> http://www.mail-archive.com/user%40nutch.apache.org/msg14921.html



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)