You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Jawahar (Jira)" <ji...@apache.org> on 2020/09/07 14:03:00 UTC

[jira] [Commented] (TIKA-2789) Apache tika - java.lang.NoClassDefFoundError

    [ https://issues.apache.org/jira/browse/TIKA-2789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17191723#comment-17191723 ] 

Jawahar commented on TIKA-2789:
-------------------------------

This issue fixed by loading proper TypeSystemHolder class file into current thread class loader. (class file loaded before calling newTika().parseToString(input stream) method)

> Apache tika - java.lang.NoClassDefFoundError
> --------------------------------------------
>
>                 Key: TIKA-2789
>                 URL: https://issues.apache.org/jira/browse/TIKA-2789
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.19.1
>            Reporter: Jawahar
>            Priority: Minor
>
> I am running tomcat based server for attachment extraction using apache tika-app-jar-1.19.1.(https://www.apache.org/dyn/closer.cgi/tika/tika-app-1.19.1.jar) . while trying to extract docx files(not for all docx files,only some files), tika throws following exception. JDK version is 1.8.0_162.
> java.lang.NoClassDefFoundError: Could not initialize class org.openxmlformats.schemas.wordprocessingml.x2006.main.DocumentDocument at org.openxmlformats.schemas.wordprocessingml.x2006.main.DocumentDocument$Factory.parse(Unknown Source) at org.apache.poi.xwpf.usermodel.XWPFDocument.onDocumentRead(XWPFDocument.java:178) at org.apache.poi.ooxml.POIXMLDocument.load(POIXMLDocument.java:184) at org.apache.poi.xwpf.usermodel.XWPFDocument.(XWPFDocument.java:138) at org.apache.poi.xwpf.extractor.XWPFWordExtractor.(XWPFWordExtractor.java:60) at org.apache.poi.ooxml.extractor.ExtractorFactory.createExtractor(ExtractorFactory.java:228) at org.apache.tika.parser.microsoft.ooxml.OOXMLExtractorFactory.parse(OOXMLExtractorFactory.java:116) at org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:110) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280) at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143) at org.apache.tika.Tika.parseToString(Tika.java:527) 
> at org.apache.tika.Tika.parseToString(Tika.java:602)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)