You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2022/02/22 19:31:00 UTC
[jira] [Commented] (TIKA-3683) Documentation of native dependencies per module
[ https://issues.apache.org/jira/browse/TIKA-3683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17496285#comment-17496285 ]
Tilman Hausherr commented on TIKA-3683:
---------------------------------------
I don't know about the first and the last, but the other three are font packages so PDFs will look better when rendering when fonts are not embedded in a PDF.
> Documentation of native dependencies per module
> -----------------------------------------------
>
> Key: TIKA-3683
> URL: https://issues.apache.org/jira/browse/TIKA-3683
> Project: Tika
> Issue Type: Wish
> Components: tika-docker, tika-server
> Reporter: dataminer.accolade
> Priority: Minor
>
> I created a custom Docker image using the latest Tesseract release. I came across the tika [Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile] file which installs the following dependencies:
> xfonts-utils
> fonts-freefont-ttf
> fonts-liberation
> ttf-mscorefonts-installer
> cabextract
> I have not found any documetation yet about those dependencies in [https://cwiki.apache.org/confluence/display/tika] and [https://github.com/apache/tika]. I can only guess that those dependencies might impact PDF content handling.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)