Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2021/05/11 15:04:00 UTC

[jira] [Commented] (TIKA-3392) Apache Tika V1.26 doesn't work on Android anymore. Issue with org.xml dependencies.

    [ https://issues.apache.org/jira/browse/TIKA-3392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17342625#comment-17342625 ] 

Tim Allison commented on TIKA-3392:
-----------------------------------

Is that the only XML parser available on Android? If you can show us how to prevent XXE and XML bombs with Harmony, we can add that...maybe?
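
For discussion, here is a minimal sketch of one possible direction: attempt each hardening feature and record the ones the factory rejects, instead of failing in a static initializer. The class and method names (LenientSaxHardening, trySetFeature, harden) are hypothetical and are not Tika code. The two external-entity feature names are the standard SAX2 ones. Whether Harmony accepts them, and whether it is safe against XXE and XML bombs without them, is an assumption that still needs to be verified on a device.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import javax.xml.XMLConstants;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.SAXNotRecognizedException;
import org.xml.sax.SAXNotSupportedException;

// Hypothetical helper, not part of Tika.
final class LenientSaxHardening {

    private LenientSaxHardening() {}

    // Returns true if the factory accepted the feature, false if it rejected it.
    static boolean trySetFeature(SAXParserFactory factory, String feature, boolean value) {
        try {
            factory.setFeature(feature, value);
            return true;
        } catch (ParserConfigurationException
                | SAXNotRecognizedException
                | SAXNotSupportedException e) {
            return false;
        }
    }

    // Tries every hardening feature and returns the names the factory rejected.
    static List<String> harden(SAXParserFactory factory) {
        Map<String, Boolean> wanted = new LinkedHashMap<>();
        wanted.put(XMLConstants.FEATURE_SECURE_PROCESSING, true);
        // Standard SAX2 feature names; support on Harmony is unverified.
        wanted.put("http://xml.org/sax/features/external-general-entities", false);
        wanted.put("http://xml.org/sax/features/external-parameter-entities", false);

        List<String> rejected = new ArrayList<>();
        for (Map.Entry<String, Boolean> entry : wanted.entrySet()) {
            if (!trySetFeature(factory, entry.getKey(), entry.getValue())) {
                rejected.add(entry.getKey());
            }
        }
        return rejected;
    }

    public static void main(String[] args) {
        SAXParserFactory factory = SAXParserFactory.newInstance();
        System.out.println("Rejected features: " + harden(factory));
    }
}
```

The caller could log the rejected list and decide whether to continue. Continuing with an unhardened parser is only acceptable if the parser is shown to be safe by other means, which is exactly the open question above.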

> Apache Tika V1.26 doesn't work on Android anymore. Issue with org.xml dependencies.
> -----------------------------------------------------------------------------------
>
>                 Key: TIKA-3392
>                 URL: https://issues.apache.org/jira/browse/TIKA-3392
>             Project: Tika
>          Issue Type: Bug
>          Components: core
>    Affects Versions: 1.26
>         Environment: Android 11
>            Reporter: Andrei Dobrescu
>            Priority: Major
>              Labels: android
>         Attachments: image-2021-05-11-17-53-58-291.png
>
>
> I use Apache Tika on Android to detect the MIME type of various files.
> Apache Tika V1.10 works fine on Android:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.10'
> {code}
> {code:java}
> val mimeType = file.inputStream().buffered().use { inputStream ->
>     AutoDetectParser().detector.detect(inputStream, Metadata()).toString()
> }
> {code}
> However, Tika V1.26 crashes when trying to detect the MIME type:
> {code:java}
> implementation 'org.apache.tika:tika-core:1.26'
> {code}
> {noformat}
> java.lang.ExceptionInInitializerError
>     at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>     at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>     at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>     at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>     at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>     at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:257)
>     at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>     at org.apache.tika.parser.AutoDetectParser.<init>(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE:
> java.lang.RuntimeException: problem initializing SAXParser pool
>         at org.apache.tika.mime.MimeTypesReader.<clinit>(MimeTypesReader.java:119)
>         at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>         at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>         at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>         at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>         at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>         at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:257)
>         at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>         at org.apache.tika.parser.AutoDetectParser.<init>(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE:
>  org.apache.tika.exception.TikaException: problem creating SAX parser factory
>      at org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:433)
>      at org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>      at org.apache.tika.mime.MimeTypesReader.<clinit>(MimeTypesReader.java:117)
>      at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>      at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>      at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>      at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>      at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>      at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:257)
>      at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>      at org.apache.tika.parser.AutoDetectParser.<init>(AutoDetectParser.java:55){noformat}
> {noformat}
> CAUSE OF CAUSE OF CAUSE:
> org.xml.sax.SAXNotRecognizedException: http://javax.xml.XMLConstants/feature/secure-processing
>      at org.apache.harmony.xml.parsers.SAXParserFactoryImpl.setFeature(SAXParserFactoryImpl.java:93)
>      at org.apache.tika.mime.MimeTypesReader.newSAXParser(MimeTypesReader.java:429)
>      at org.apache.tika.mime.MimeTypesReader.setPoolSize(MimeTypesReader.java:417)
>      at org.apache.tika.mime.MimeTypesReader.<clinit>(MimeTypesReader.java:117)
>      at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:69)
>      at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:100)
>      at org.apache.tika.mime.MimeTypesFactory.create(MimeTypesFactory.java:189)
>      at org.apache.tika.mime.MimeTypes.getDefaultMimeTypes(MimeTypes.java:604)
>      at org.apache.tika.config.TikaConfig.getDefaultMimeTypes(TikaConfig.java:83)
>      at org.apache.tika.config.TikaConfig.<init>(TikaConfig.java:257)
>      at org.apache.tika.config.TikaConfig.getDefaultConfig(TikaConfig.java:422)
>      at org.apache.tika.parser.AutoDetectParser.<init>(AutoDetectParser.java:55)
> {noformat}
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)