Posted to dev@mahout.apache.org by "Hudson (JIRA)" <ji...@apache.org> on 2016/04/11 10:42:28 UTC

[jira] [Commented] (MAHOUT-1775) FileNotFoundException caused by aborting the process of downloading Wikipedia dataset

    [ https://issues.apache.org/jira/browse/MAHOUT-1775?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15234739#comment-15234739 ] 

Hudson commented on MAHOUT-1775:
--------------------------------

FAILURE: Integrated in Mahout-Quality #3324 (See [https://builds.apache.org/job/Mahout-Quality/3324/])
MAHOUT-1775 FileNotFoundException caused by aborting the process of (smarthi: rev d53f0a5d78000045bb12e90e3a6808cc2c369450)
* examples/bin/classify-wikipedia.sh
MAHOUT-1775 FileNotFoundException caused by aborting the process of (smarthi: rev 0c6351fa5459920ccfd752f54f8fd41176b0afe8)
* CHANGELOG


> FileNotFoundException caused by aborting the process of downloading Wikipedia dataset
> -------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-1775
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1775
>             Project: Mahout
>          Issue Type: Bug
>          Components: Examples
>            Reporter: Bowei Zhang
>            Assignee: Suneel Marthi
>            Priority: Trivial
>             Fix For: 0.11.1
>
>
> When the script examples/bin/classify-wikipedia.sh is run for the first time, it creates a wikixml folder and starts fetching data via curl. If this download is aborted, then on any later run the script won't extract the .bz2 file (since extraction is guarded by the condition that the wikixml folder doesn't exist) and goes straight on to run Mahout, which will definitely end up throwing a FileNotFoundException.
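
For illustration only, here is a minimal sketch of one way to avoid the failure described above: guard the extraction on the extracted file itself rather than on the existence of the wikixml folder. It is not the committed fix. The function names and the paths in the usage comment are hypothetical and are not taken from classify-wikipedia.sh.

```shell
#!/bin/sh
# Sketch only: needs_extract and ensure_extracted are hypothetical helpers,
# not part of examples/bin/classify-wikipedia.sh.

# Succeeds when the archive exists but the extracted file is missing or empty.
# $1 = path to the .bz2 archive, $2 = path to the expected extracted file
needs_extract() {
  [ -f "$1" ] && [ ! -s "$2" ]
}

# Extracts the archive only when needed, keeping the archive in place (-k).
# $1 = path to the .bz2 archive, $2 = path to the expected extracted file
ensure_extracted() {
  if needs_extract "$1" "$2"; then
    bunzip2 -k "$1"
  fi
}

# Hypothetical usage (paths are assumptions):
#   mkdir -p "$WORK_DIR/wikixml"
#   ensure_extracted "$WORK_DIR/wikixml/dump.xml.bz2" "$WORK_DIR/wikixml/dump.xml"
```

Because the check no longer depends on whether the folder exists, a run that follows an aborted one still attempts the extraction. A truncated archive would still make bunzip2 fail, so a fuller approach would presumably also download to a temporary name and rename the file only once curl succeeds.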



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)