You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Derrick Johnson (JIRA)" <ji...@apache.org> on 2013/09/12 17:06:54 UTC

[jira] [Commented] (TIKA-1163) NPE thrown by TikaConfig.getDefaultConfig()

    [ https://issues.apache.org/jira/browse/TIKA-1163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13765508#comment-13765508 ] 

Derrick Johnson commented on TIKA-1163:
---------------------------------------

I just tried out your test case, and it works. So then, what is different for my project? Why does new Tika() work for 1.0, but not for 1.4? I'd greatly appreciate any assistance. Here is the output of `mvn dependency:tree -verbose` :


{code}
[INFO] |  +- org.apache.tika:tika-core:jar:1.4:compile
[INFO] |  +- org.apache.tika:tika-parsers:jar:1.4:compile
[INFO] |  |  +- (org.apache.tika:tika-core:jar:1.4:compile - omitted for duplicate)
[INFO] |  |  +- org.gagravarr:vorbis-java-tika:jar:0.1:compile
[INFO] |  |  |  +- org.gagravarr:vorbis-java-core:jar:tests:0.1:runtime
[INFO] |  |  |  \- (org.apache.tika:tika-core:jar:1.4:compile - version managed from 1.0; omitted for duplicate)
[INFO] |  |  +- edu.ucar:netcdf:jar:4.2-min:compile
[INFO] |  |  |  \- (org.slf4j:slf4j-api:jar:1.6.4:compile - version managed from 1.5.6; omitted for duplicate)
[INFO] |  |  +- (org.apache.james:apache-mime4j-core:jar:0.7:compile - version managed from 0.7.2; omitted for duplicate)
[INFO] |  |  +- org.apache.james:apache-mime4j-dom:jar:0.7:compile (version managed from 0.7.2)
[INFO] |  |  |  \- (org.apache.james:apache-mime4j-core:jar:0.7:compile - version managed from 0.7.2; omitted for duplicate)
[INFO] |  |  +- (org.apache.commons:commons-compress:jar:1.4.1:compile - version managed from 1.5; omitted for duplicate)
[INFO] |  |  +- (commons-codec:commons-codec:jar:1.5:compile - omitted for conflict with 1.4)
[INFO] |  |  +- org.apache.pdfbox:pdfbox:jar:1.8.1:compile
[INFO] |  |  |  +- org.apache.pdfbox:fontbox:jar:1.8.1:compile
[INFO] |  |  |  |  \- (commons-logging:commons-logging:jar:1.1.1:compile - omitted for duplicate)
[INFO] |  |  |  +- org.apache.pdfbox:jempbox:jar:1.8.1:compile
[INFO] |  |  |  \- (commons-logging:commons-logging:jar:1.1.1:compile - omitted for duplicate)
[INFO] |  |  +- org.bouncycastle:bcmail-jdk15:jar:1.45:compile
[INFO] |  |  |  \- (org.bouncycastle:bcprov-jdk15:jar:1.45:compile - omitted for duplicate)
[INFO] |  |  +- org.bouncycastle:bcprov-jdk15:jar:1.45:compile
[INFO] |  |  +- org.apache.poi:poi:jar:3.9:compile
[INFO] |  |  |  \- (commons-codec:commons-codec:jar:1.5:compile - omitted for conflict with 1.4)
[INFO] |  |  +- org.apache.poi:poi-scratchpad:jar:3.9:compile
[INFO] |  |  |  \- (org.apache.poi:poi:jar:3.9:compile - omitted for duplicate)
[INFO] |  |  +- org.apache.poi:poi-ooxml:jar:3.9:compile
[INFO] |  |  |  +- (org.apache.poi:poi:jar:3.9:compile - omitted for duplicate)
[INFO] |  |  |  +- org.apache.poi:poi-ooxml-schemas:jar:3.9:compile
[INFO] |  |  |  |  \- org.apache.xmlbeans:xmlbeans:jar:2.3.0:compile
[INFO] |  |  |  \- (dom4j:dom4j:jar:1.6.1:compile - omitted for duplicate)
[INFO] |  |  +- org.apache.geronimo.specs:geronimo-stax-api_1.0_spec:jar:1.0.1:compile
[INFO] |  |  +- org.ccil.cowan.tagsoup:tagsoup:jar:1.2.1:compile
[INFO] |  |  +- org.ow2.asm:asm-debug-all:jar:4.1:compile
[INFO] |  |  +- com.googlecode.mp4parser:isoparser:jar:1.0-RC-1:compile
[INFO] |  |  |  \- (org.aspectj:aspectjrt:jar:1.6.11:compile - omitted for conflict with 1.6.12)
[INFO] |  |  +- com.drewnoakes:metadata-extractor:jar:2.6.2:compile
[INFO] |  |  |  +- com.adobe.xmp:xmpcore:jar:5.1.2:compile
[INFO] |  |  |  \- xerces:xercesImpl:jar:2.8.1:compile
[INFO] |  |  |     \- xml-apis:xml-apis:jar:1.3.03:compile
[INFO] |  |  +- de.l3s.boilerpipe:boilerpipe:jar:1.1.0:compile
[INFO] |  |  +- rome:rome:jar:0.9:compile
[INFO] |  |  |  \- jdom:jdom:jar:1.0:compile
[INFO] |  |  +- org.gagravarr:vorbis-java-core:jar:0.1:compile
[INFO] |  |  \- com.googlecode.juniversalchardet:juniversalchardet:jar:1.0.3:compile
{code}
                
> NPE thrown by TikaConfig.getDefaultConfig() 
> --------------------------------------------
>
>                 Key: TIKA-1163
>                 URL: https://issues.apache.org/jira/browse/TIKA-1163
>             Project: Tika
>          Issue Type: Bug
>    Affects Versions: 1.4
>         Environment: OS-X, JDK 1.7
>            Reporter: Derrick Johnson
>         Attachments: TCT.java
>
>
> The below exception gets thrown every time I execute TikaConfig.getDefaultConfig. Similary, invoking `Tika t = new Tika()` throws the same exception, since code inside this constructor invokes TikaConfig.getDefaultConfig().
> This problem is non-existent when I use Tika-core and Tika-parsers 1.0. But when bump the version numbers to 1.2 (in order to get around a bug in PDFBOX), the problem shows up. I'm using maven. I've carefully ensured that I'm not pulling in the wrong version of Tika, using things like `mvn dependency:tree` and making there were no sneaky problematic transitive dependencies. 
> java.lang.NullPointerException
> 	at org.apache.tika.mime.MimeTypesReader$ClauseRecord.stop(MimeTypesReader.java:245)
> 	at org.apache.tika.mime.MimeTypesReader.endElement(MimeTypesReader.java:203)
> 	at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.endElement(AbstractSAXParser.java:606)
> 	at com.sun.org.apache.xerces.internal.parsers.AbstractXMLDocumentParser.emptyElement(AbstractXMLDocumentParser.java:183)
> 	at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanStartElement(XMLDocumentFragmentScannerImpl.java:1303)
> 	at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl$FragmentContentDriver.next(XMLDocumentFragmentScannerImpl.java:2717)
> 	at com.sun.org.apache.xerces.internal.impl.XMLDocumentScannerImpl.next(XMLDocumentScannerImpl.java:607)
> 	at com.sun.org.apache.xerces.internal.impl.XMLDocumentFragmentScannerImpl.scanDocument(XMLDocumentFragmentScannerImpl.java:489)
> 	at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:835)
> 	at com.sun.org.apache.xerces.internal.parsers.XML11Configuration.parse(XML11Configuration.java:764)
> 	at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(XMLParser.java:123)
> 	at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1210)
> 	at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(SAXParserImpl.java:568)
> 	at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl.parse(SAXParserImpl.java:302)
> 	at javax.xml.parsers.SAXParser.parse(SAXParser.java:195)
> 	at org.apache.tika.mime.MimeTypesReader.read(MimeTypesReader.java:115)
> 	at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:64)
> 	at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:93)
> 	at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:149)
> 	at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:479)
> 	at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:60)
> 	at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:169)
> 	at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:268)
>         at 
> my.method.which.invokes `new Tika()`

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira