You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@lucene.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2019/12/12 18:08:00 UTC

[jira] [Comment Edited] (SOLR-14054) Upgrade Tika to 1.23

    [ https://issues.apache.org/jira/browse/SOLR-14054?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16994927#comment-16994927 ] 

Tim Allison edited comment on SOLR-14054 at 12/12/19 6:07 PM:
--------------------------------------------------------------

The tests all pass, and I can get a successful build locally, however when I try a full  integration test (package, unzip, deploy), I'm getting a NoClassDefFoundError:

{noformat}
Exception in thread "Thread-15" java.lang.NoClassDefFoundError: org/tukaani/xz/FilterOptions
	at org.apache.commons.compress.archivers.sevenz.Coders.<clinit>(Coders.java:47)
	at org.apache.commons.compress.archivers.sevenz.SevenZFile.buildDecoderStack(SevenZFile.java:1153)
	at org.apache.commons.compress.archivers.sevenz.SevenZFile.buildDecodingStream(SevenZFile.java:1106)
	at org.apache.commons.compress.archivers.sevenz.SevenZFile.getNextEntry(SevenZFile.java:405)
	at org.apache.tika.parser.pkg.PackageParser$SevenZWrapper.getNextEntry(PackageParser.java:424)
	at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:285)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
	at org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:165)
	at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:267)
	at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:476)
	at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:517)
	at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:415)
	at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:330)
	at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:233)
	at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:424)
	at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:483)
	at org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:466)
	at java.base/java.lang.Thread.run(Thread.java:834)
Caused by: java.lang.ClassNotFoundException: org.tukaani.xz.FilterOptions
{noformat}

I can tell from the logs that the jars in contrib/extraction/lib are loading:
{noformat}
Added 44 libs to classloader, from paths: [/home/tim/work/solr-9.0.0-SNAPSHOT/contrib/extraction/lib, /home/tim/work/solr-9.0.0-SNAPSHOT/dist]
{noformat}

The xz.jar is where it belongs and it is the right version, and when I unzip that jar, the class is there.

Any idea what might be going on?



was (Author: tallison@mitre.org):
The tests all pass, and I can get a successful build locally, however when I try a full  integration test, I'm getting a NoClassDeffFoundError:

{noformat}
Exception in thread "Thread-15" java.lang.NoClassDefFoundError: org/tukaani/xz/FilterOptions
	at org.apache.commons.compress.archivers.sevenz.Coders.<clinit>(Coders.java:47)
	at org.apache.commons.compress.archivers.sevenz.SevenZFile.buildDecoderStack(SevenZFile.java:1153)
	at org.apache.commons.compress.archivers.sevenz.SevenZFile.buildDecodingStream(SevenZFile.java:1106)
	at org.apache.commons.compress.archivers.sevenz.SevenZFile.getNextEntry(SevenZFile.java:405)
	at org.apache.tika.parser.pkg.PackageParser$SevenZWrapper.getNextEntry(PackageParser.java:424)
	at org.apache.tika.parser.pkg.PackageParser.parse(PackageParser.java:285)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
	at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:280)
	at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:143)
	at org.apache.solr.handler.dataimport.TikaEntityProcessor.nextRow(TikaEntityProcessor.java:165)
	at org.apache.solr.handler.dataimport.EntityProcessorWrapper.nextRow(EntityProcessorWrapper.java:267)
	at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:476)
	at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:517)
	at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:415)
	at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:330)
	at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:233)
	at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:424)
	at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:483)
	at org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:466)
	at java.base/java.lang.Thread.run(Thread.java:834)
Caused by: java.lang.ClassNotFoundException: org.tukaani.xz.FilterOptions
{noformat}

I can tell from the logs that the jars in contrib/extraction/lib are loading:
{noformat}
Added 44 libs to classloader, from paths: [/home/tim/work/solr-9.0.0-SNAPSHOT/contrib/extraction/lib, /home/tim/work/solr-9.0.0-SNAPSHOT/dist]
{noformat}

Any idea what might be going on?


> Upgrade Tika to 1.23
> --------------------
>
>                 Key: SOLR-14054
>                 URL: https://issues.apache.org/jira/browse/SOLR-14054
>             Project: Solr
>          Issue Type: Task
>      Security Level: Public(Default Security Level. Issues are Public) 
>          Components: contrib - DataImportHandler
>            Reporter: Tim Allison
>            Assignee: Tim Allison
>            Priority: Minor
>
> We just released 1.23.  Let's upgrade Tika.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@lucene.apache.org
For additional commands, e-mail: issues-help@lucene.apache.org