You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@tika.apache.org by aravinth thangasami <ar...@gmail.com> on 2020/05/15 15:04:50 UTC

TesseractOCRParser - As separate process - Clarification

Hi all,

Is there any specific reason to call tesseract via ProcessBuilder as an
external process?
Why tika didn't use JNA libraries like tess4j for OCR?

Kind regards
Aravinth

Re: TesseractOCRParser - As separate process - Clarification

Posted by Tim Allison <ta...@apache.org>.
We tried.

tl;dr native libraries are complicated.  IIRC.

https://issues.apache.org/jira/browse/TIKA-2293

On Fri, May 15, 2020 at 11:05 AM aravinth thangasami <
aravinththangasami@gmail.com> wrote:

> Hi all,
>
> Is there any specific reason to call tesseract via ProcessBuilder as an
> external process?
> Why tika didn't use JNA libraries like tess4j for OCR?
>
> Kind regards
> Aravinth
>
>