You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@nutch.apache.org by "Gajanan Watkar (JIRA)" <ji...@apache.org> on 2018/10/20 12:28:00 UTC

[jira] [Updated] (NUTCH-2664) WebApp for Nutch running in deploy Mode Creates Seed Directory in local FileSystem

     [ https://issues.apache.org/jira/browse/NUTCH-2664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Gajanan Watkar updated NUTCH-2664:
----------------------------------
    Environment: 
Nutch-2.3.1

Hbase-1.2.3

Hadoo- 2.5.2

 

> WebApp for Nutch running in deploy Mode Creates Seed Directory in local FileSystem
> ----------------------------------------------------------------------------------
>
>                 Key: NUTCH-2664
>                 URL: https://issues.apache.org/jira/browse/NUTCH-2664
>             Project: Nutch
>          Issue Type: Bug
>          Components: REST_api, web gui
>    Affects Versions: 2.3.1
>         Environment: Nutch-2.3.1
> Hbase-1.2.3
> Hadoo- 2.5.2
>  
>            Reporter: Gajanan Watkar
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 2.3.1
>
>
> When creating crawl jobs using nutch webapp, seed directory gets created in temp (/tmp on linux) directory in local filesystem. This prevents crawl job to inject urls. As injection of url fails, no further phases of crawl can be executed. Seed Directory needs to be created on HDFS in case of Nutch running in deploy mode.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)