You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@lucene.apache.org by "Grant Ingersoll (JIRA)" <ji...@apache.org> on 2011/05/20 17:44:47 UTC

[jira] [Reopened] (LUCENE-929) contrib/benchmark build doesn't handle checking if content is properly extracted

     [ https://issues.apache.org/jira/browse/LUCENE-929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Grant Ingersoll reopened LUCENE-929:
------------------------------------

         Assignee: Grant Ingersoll
    Lucene Fields:   (was: [New])

Note, this fix this doesn't work if the output dir has a trailing slash.  See MAHOUT-694.

> contrib/benchmark build doesn't handle checking if content is properly extracted
> --------------------------------------------------------------------------------
>
>                 Key: LUCENE-929
>                 URL: https://issues.apache.org/jira/browse/LUCENE-929
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: modules/benchmark
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>
> The contrib/benchmark build does not properly handle checking to see if the content (such as Reuters coll.) is properly extracted.  It only checks to see if the directory exists.  Thus, it is possible that the directory gets created and the extraction fails.  Then, the next time it is run, it skips the extraction part and tries to continue on running the benchmark.
> The workaround is to manually delete the extraction directory.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@lucene.apache.org
For additional commands, e-mail: dev-help@lucene.apache.org