You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Hudson (Jira)" <ji...@apache.org> on 2022/04/25 23:10:00 UTC

[jira] [Commented] (TIKA-3730) New ExternalParser doesn't work on Windows

    [ https://issues.apache.org/jira/browse/TIKA-3730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17527798#comment-17527798 ] 

Hudson commented on TIKA-3730:
------------------------------

SUCCESS: Integrated in Jenkins build Tika » tika-main-jdk8 #524 (See [https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/524/])
TIKA-3730 (tallison: [https://github.com/apache/tika/commit/4639e8d3712fa015bcecdb1e6b89e8bd9e5e67fa])
* (edit) tika-core/src/main/java/org/apache/tika/parser/external2/ExternalParser.java
* (edit) tika-core/src/test/java/org/apache/tika/parser/external2/ExternalParserTest.java
TIKA-3730 -- fix checkstyle; hang head in shame. (tallison: [https://github.com/apache/tika/commit/90c7e4c2d0f1ae1b5a8e559b2955820a5d743046])
* (edit) tika-core/src/test/java/org/apache/tika/parser/external2/ExternalParserTest.java


> New ExternalParser doesn't work on Windows
> ------------------------------------------
>
>                 Key: TIKA-3730
>                 URL: https://issues.apache.org/jira/browse/TIKA-3730
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Trivial
>             Fix For: 2.4.0
>
>
> [~tilman] noted that the external2.ExternalParser uses "replaceAll" on a regex where the replacement is a file path does not work on Windows.  The replaceAll strips the file separators.  I admit that I cannot figure out why this is is happening.  I've tried a couple of combinations of backslashing etc, but nothing is working.  I even tried Pattern.quote() and that doesn't work on Windows. 
> If we back off to use "replace" with a string, everything seems to work.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)