You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tilman Hausherr (Jira)" <ji...@apache.org> on 2022/02/22 19:31:00 UTC

[jira] [Commented] (TIKA-3683) Documentation of native dependencies per module

    [ https://issues.apache.org/jira/browse/TIKA-3683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17496285#comment-17496285 ] 

Tilman Hausherr commented on TIKA-3683:
---------------------------------------

I don't know about the first and the last, but the other three are font packages so PDFs will look better when rendering when fonts are not embedded in a PDF.

> Documentation of native dependencies per module
> -----------------------------------------------
>
>                 Key: TIKA-3683
>                 URL: https://issues.apache.org/jira/browse/TIKA-3683
>             Project: Tika
>          Issue Type: Wish
>          Components: tika-docker, tika-server
>            Reporter: dataminer.accolade
>            Priority: Minor
>
> I created a custom Docker image using the latest Tesseract release. I came across the tika [Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile] file which installs the following dependencies:
> xfonts-utils
> fonts-freefont-ttf
> fonts-liberation
> ttf-mscorefonts-installer
> cabextract
> I have not found any documetation yet about those dependencies in [https://cwiki.apache.org/confluence/display/tika] and [https://github.com/apache/tika]. I can only guess that those dependencies might impact PDF content handling.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)