You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2020/11/24 15:57:00 UTC
[jira] [Comment Edited] (TIKA-3235) Build failure caused by
timeouts in XMLReaderUtils
[ https://issues.apache.org/jira/browse/TIKA-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17238231#comment-17238231 ]
Tim Allison edited comment on TIKA-3235 at 11/24/20, 3:56 PM:
--------------------------------------------------------------
The impetus was Nutch/Sebastian Nagel (TIKA-2645/NUTCH-2578) who diagnosed a substantial thread lock issue when creating new SAXParsers (outside of Tika's code). I'd be happy to get rid of the complexity we have in Tika, but I don't want users to have add their own providers to avoid this surprising thread lock issue.
If we can simplify our code, I'm all for it.
was (Author: tallison@mitre.org):
The impetus was Nutch/Sebastian Nagel (TIKA-2645/NUTCH-2578) who diagnosed a substantial thread lock issue when creating new SAXParsers (outside of Tika's code). I'd be happy to get rid of the complexity we have in Tika, but I don't want users to have add there own providers to avoid this surprising threading issue.
If we can simplify our code, I'm all for it.
> Build failure caused by timeouts in XMLReaderUtils
> --------------------------------------------------
>
> Key: TIKA-3235
> URL: https://issues.apache.org/jira/browse/TIKA-3235
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Priority: Major
>
> [~kkrugler] was not able to build 1.25-rc1 because of timeouts from XMLReaderUtils. Let's use this issue to figure out what's going wrong.
> {noformat}
> >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> >> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> >> SAXParser; This could indicate that a parser has not correctly released its
> >> SAXParser. Please report this to the Tika team: dev@tika.apache.org
> >> <ma...@tika.apache.org>
> >> at
> >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> >> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> >> minutes for a SAXParser; This could indicate that a parser has not
> >> correctly released its SAXParser. Please report this to the Tika team:
> >> dev@tika.apache.org <ma...@tika.apache.org>
> >> at
> {noformat}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)