You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@tika.apache.org by dm...@apache.org on 2022/09/30 13:28:18 UTC

[tika-docker] branch master updated: TIKA-3860 -- pull in tesseract 5

This is an automated email from the ASF dual-hosted git repository.

dmeikle pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/tika-docker.git


The following commit(s) were added to refs/heads/master by this push:
     new c9d6cfc  TIKA-3860 -- pull in tesseract 5
     new 11a4f49  Merge pull request #7 from tballison/TIKA-3860
c9d6cfc is described below

commit c9d6cfced144d37bf08379bd8bbf854236e5ce2a
Author: tballison <ta...@apache.org>
AuthorDate: Tue Sep 27 09:59:54 2022 -0400

    TIKA-3860 -- pull in tesseract 5
---
 full/Dockerfile | 4 ++++
 1 file changed, 4 insertions(+)

diff --git a/full/Dockerfile b/full/Dockerfile
index cb2290b..13d0c56 100644
--- a/full/Dockerfile
+++ b/full/Dockerfile
@@ -11,6 +11,10 @@
 # the License.
 FROM ubuntu:focal as base
 RUN apt-get update
+# tesseract 5 is not yet in the base repo
+RUN apt-get -y install software-properties-common
+RUN add-apt-repository -y ppa:alex-p/tesseract-ocr5
+
 
 FROM base as dependencies