Posted to dev@tika.apache.org by "dataminer.accolade (Jira)" <ji...@apache.org> on 2022/02/22 14:55:00 UTC

[jira] [Updated] (TIKA-3683) Documentation of native dependencies per module

     [ https://issues.apache.org/jira/browse/TIKA-3683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

dataminer.accolade updated TIKA-3683:
-------------------------------------
    Description: 
I created a custom Docker image using the latest Tesseract release. While doing so, I came across the Tika [Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile], which installs the following dependencies:

xfonts-utils
fonts-freefont-ttf
fonts-liberation
ttf-mscorefonts-installer
cabextract

I have not yet found any documentation about these dependencies in [https://cwiki.apache.org/confluence/display/tika] or [https://github.com/apache/tika]. I can only guess that they might affect PDF content handling.

  was:
I created a custom Docker image using the latest Tesseract version. While doing so, I came across the Tika [Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile], which installs the following dependencies:

xfonts-utils
fonts-freefont-ttf
fonts-liberation
ttf-mscorefonts-installer
cabextract

I have not yet found any documentation about these dependencies in [https://cwiki.apache.org/confluence/display/tika] or [https://github.com/apache/tika]. I can only guess that they might affect PDF content handling.


> Documentation of native dependencies per module
> -----------------------------------------------
>
>                 Key: TIKA-3683
>                 URL: https://issues.apache.org/jira/browse/TIKA-3683
>             Project: Tika
>          Issue Type: Wish
>          Components: tika-docker, tika-server
>            Reporter: dataminer.accolade
>            Priority: Minor
>
> I created a custom Docker image using the latest Tesseract release. While doing so, I came across the Tika [Dockerfile|https://github.com/apache/tika-docker/blob/master/full/Dockerfile], which installs the following dependencies:
> xfonts-utils
> fonts-freefont-ttf
> fonts-liberation
> ttf-mscorefonts-installer
> cabextract
> I have not yet found any documentation about these dependencies in [https://cwiki.apache.org/confluence/display/tika] or [https://github.com/apache/tika]. I can only guess that they might affect PDF content handling.


