You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@tika.apache.org by Michael Wechner <mi...@wyona.com> on 2022/11/02 22:03:18 UTC

factory.newDocumentBuilder() takes much longer since my project is using Tika

Hi

I recently added Tika 2.5.0 (tika-core, tika-parsers-standard-package) 
to my project and updated commons-io to 2.11.0 and Tika itself works 
very fine, but since I included Tika I have trouble with

DocumentBuilder builder = factory.newDocumentBuilder();

creating the DocumentBuilder takes much longer than before, but only the 
first instance per thread.

And only when I deploy it inside Docker container with Java 11

On my Mac laptop with Java 11 it works fine otherwise

Any idea what might be causing this?

Thanks

Michael

Re: factory.newDocumentBuilder() takes much longer since my project is using Tika

Posted by Michael Wechner <mi...@wyona.com>.
Thanks for this pointer!

The behaviour feels like there is a lock somehow, but unfortunately I do 
not see anything within the logs.

Is it possible to turn on "log debug" for Tika?

Thanks

Michael



Am 03.11.22 um 00:26 schrieb Tim Allison:
> This is the only thing that I can think of... 
> https://lists.apache.org/thread/6ndctttvn44v1zpmnqfmwkv4sf72kngb
>
> On Wed, Nov 2, 2022 at 6:03 PM Michael Wechner 
> <mi...@wyona.com> wrote:
>
>     Hi
>
>     I recently added Tika 2.5.0 (tika-core,
>     tika-parsers-standard-package) to my project and updated
>     commons-io to 2.11.0 and Tika itself works very fine, but since I
>     included Tika I have trouble with
>
>     DocumentBuilder builder = factory.newDocumentBuilder();
>
>     creating the DocumentBuilder takes much longer than before, but
>     only the first instance per thread.
>
>     And only when I deploy it inside Docker container with Java 11
>
>     On my Mac laptop with Java 11 it works fine otherwise
>
>     Any idea what might be causing this?
>
>     Thanks
>
>     Michael
>

Re: factory.newDocumentBuilder() takes much longer since my project is using Tika

Posted by Tim Allison <ta...@apache.org>.
This is the only thing that I can think of...
https://lists.apache.org/thread/6ndctttvn44v1zpmnqfmwkv4sf72kngb

On Wed, Nov 2, 2022 at 6:03 PM Michael Wechner <mi...@wyona.com>
wrote:

> Hi
>
> I recently added Tika 2.5.0 (tika-core, tika-parsers-standard-package) to
> my project and updated commons-io to 2.11.0 and Tika itself works very
> fine, but since I included Tika I have trouble with
>
> DocumentBuilder builder = factory.newDocumentBuilder();
>
> creating the DocumentBuilder takes much longer than before, but only the
> first instance per thread.
>
> And only when I deploy it inside Docker container with Java 11
>
> On my Mac laptop with Java 11 it works fine otherwise
>
> Any idea what might be causing this?
>
> Thanks
>
> Michael
>