You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2015/09/30 18:01:35 UTC

[jira] [Resolved] (TIKA-1754) tika-batch's FileListCrawler truncates the first character of the fileList if the root is e.g. X:

     [ https://issues.apache.org/jira/browse/TIKA-1754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Allison resolved TIKA-1754.
-------------------------------
    Resolution: Fixed

Fixed with TIKA-1747.

> tika-batch's FileListCrawler truncates the first character of the fileList if the root is e.g. X:
> -------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-1754
>                 URL: https://issues.apache.org/jira/browse/TIKA-1754
>             Project: Tika
>          Issue Type: Bug
>          Components: batch
>    Affects Versions: 1.10
>            Reporter: Tim Allison
>            Priority: Trivial
>              Labels: java7
>
> The FileListCrawler takes a root directory and a list of relative file paths and "crawls" that list as if it were a directory crawler.  If the root is specified as, e.g. "X:" on a Windows system, the call to substring on root's absolute path and the subtraction of one character is incorrect.
> With a root of X: and a relative file of "dir1/dir2/file.doc", the output file is: "X:/ir/dir2/file.doc.txt"
> Let's get rid of the substring calculations and move to Java 7! :)
> See TIKA-1747.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)