Posted to dev@tika.apache.org by "Chris Carman (Jira)" <ji...@apache.org> on 2020/09/28 04:33:00 UTC

[jira] [Commented] (TIKA-2518) tika app outputs warnings by default

    [ https://issues.apache.org/jira/browse/TIKA-2518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17202994#comment-17202994 ] 

Chris Carman commented on TIKA-2518:
------------------------------------

And 3 years later, still broken.

 

> tika app outputs warnings by default
> ------------------------------------
>
>                 Key: TIKA-2518
>                 URL: https://issues.apache.org/jira/browse/TIKA-2518
>             Project: Tika
>          Issue Type: Bug
>          Components: app
>    Affects Versions: 1.16
>            Reporter: Ryan Brueske
>            Priority: Major
>
> After downloading the latest Tika and running basic commands, the app prints unwanted warnings, so the output has to be parsed to separate them from the requested information.
> Example 1:
> {code}
> java -jar tika-app-1.16.jar --list-detectors
> Dec 05, 2017 3:16:13 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
> WARNING: JBIG2ImageReader not loaded. jbig2 files will be ignored
> See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io
> for optional dependencies.
> TIFFImageWriter not loaded. tiff files will not be processed
> See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io
> for optional dependencies.
> J2KImageReader not loaded. JPEG2000 files will not be processed.
> See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io
> for optional dependencies.
> Dec 05, 2017 3:16:13 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
> WARNING: org.xerial's sqlite-jdbc is not loaded.
> Please provide the jar on your classpath to parse sqlite files.
> See tika-parsers/pom.xml for the correct version.
> org.apache.tika.detect.DefaultDetector (Composite Detector):
>   org.apache.tika.parser.microsoft.POIFSContainerDetector
>   org.apache.tika.parser.pkg.ZipContainerDetector
>   org.gagravarr.tika.OggDetector
>   org.apache.tika.mime.MimeTypes
> {code}
> Example 2:
> {code}
> java -jar tika-app-1.16.jar --text my.xlsx
> Dec 05, 2017 3:00:22 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
> WARNING: JBIG2ImageReader not loaded. jbig2 files will be ignored
> See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io
> for optional dependencies.
> TIFFImageWriter not loaded. tiff files will not be processed
> See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io
> for optional dependencies.
> J2KImageReader not loaded. JPEG2000 files will not be processed.
> See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io
> for optional dependencies.
> Dec 05, 2017 3:00:22 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
> WARNING: org.xerial's sqlite-jdbc is not loaded.
> Please provide the jar on your classpath to parse sqlite files.
> See tika-parsers/pom.xml for the correct version.
> INFO  As a convenience, TikaCLI has turned on extraction of
> inline images for the PDFParser (TIKA-2374).
> This is not the default option in Tika generally or in tika-server.
> As a convenience, TikaCLI has turned on extraction of
> inline images for the PDFParser (TIKA-2374).
> This is not the default option in Tika generally or in tika-server.
> {code}
> The expected behavior is to return only the requested information. I do not see a switch to turn off or control unrequested warnings. 
> I can't imagine this is the correct behavior. It is not documented, nor could I find why such output exists.
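
For anyone hitting this while embedding Tika rather than using the app: the timestamp/class/method header followed by "WARNING:" in the output above looks like the default java.util.logging format, so one possible workaround is to raise the root logger threshold before Tika initializes. This is only a sketch under that assumption, not a confirmed fix for this issue. The class name, method name, and the demo logger name are hypothetical, and it would not cover the "INFO  As a convenience..." lines if those come from a different logging backend.

```java
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical helper, not part of Tika. Assumes the warnings are emitted
// through java.util.logging, which the output format suggests but the
// issue does not confirm.
public class QuietJulSketch {

    /** Sets the root logger level so records below {@code minimum} are dropped. */
    public static void raiseRootThreshold(Level minimum) {
        // The logger named "" is the root; loggers without their own level inherit from it.
        Logger root = Logger.getLogger("");
        root.setLevel(minimum);
    }

    public static void main(String[] args) {
        raiseRootThreshold(Level.SEVERE);

        // Hypothetical logger name, used only to show the effect.
        Logger demo = Logger.getLogger("org.example.demo");
        demo.warning("dropped: below the SEVERE threshold");
        demo.severe("still written to stderr by the default handler");
    }
}
```

For the command-line app, the standard JVM property -Djava.util.logging.config.file pointing at a properties file containing ".level=SEVERE" should have a similar effect under the same assumption. Since the default java.util.logging console handler writes to stderr, redirecting stderr in the shell is another option, at the cost of hiding real errors too.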



--
This message was sent by Atlassian Jira
(v8.3.4#803005)