You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2020/11/24 15:57:00 UTC

[jira] [Comment Edited] (TIKA-3235) Build failure caused by timeouts in XMLReaderUtils

    [ https://issues.apache.org/jira/browse/TIKA-3235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17238231#comment-17238231 ] 

Tim Allison edited comment on TIKA-3235 at 11/24/20, 3:56 PM:
--------------------------------------------------------------

The impetus was Nutch/Sebastian Nagel (TIKA-2645/NUTCH-2578) who diagnosed a substantial thread lock issue when creating new SAXParsers (outside of Tika's code).  I'd be happy to get rid of the complexity we have in Tika, but I don't want users to have add their own providers to avoid this surprising thread lock issue.

If we can simplify our code, I'm all for it.


was (Author: tallison@mitre.org):
The impetus was Nutch/Sebastian Nagel (TIKA-2645/NUTCH-2578) who diagnosed a substantial thread lock issue when creating new SAXParsers (outside of Tika's code).  I'd be happy to get rid of the complexity we have in Tika, but I don't want users to have add there own providers to avoid this surprising threading issue.

If we can simplify our code, I'm all for it.

> Build failure caused by timeouts in XMLReaderUtils
> --------------------------------------------------
>
>                 Key: TIKA-3235
>                 URL: https://issues.apache.org/jira/browse/TIKA-3235
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>
> [~kkrugler] was not able to build 1.25-rc1 because of timeouts from XMLReaderUtils.  Let's use this issue to figure out what's going wrong.
> {noformat}
> >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> >> Caused by: org.xml.sax.SAXException: Waited more than 5 minutes for a
> >> SAXParser; This could indicate that a parser has not correctly released its
> >> SAXParser. Please report this to the Tika team: dev@tika.apache.org
> >> <ma...@tika.apache.org>
> >>        at
> >> org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testUnsupportedPowerPoint(OOXMLParserTest.java:341)
> >> Caused by: org.apache.tika.exception.TikaException: Waited more than 5
> >> minutes for a SAXParser; This could indicate that a parser has not
> >> correctly released its SAXParser. Please report this to the Tika team:
> >> dev@tika.apache.org <ma...@tika.apache.org>
> >>        at
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)