Posted to user@tika.apache.org by Michael Wechner <mi...@wyona.com> on 2022/11/02 22:03:18 UTC
factory.newDocumentBuilder() takes much longer since my project is using Tika
Hi
I recently added Tika 2.5.0 (tika-core, tika-parsers-standard-package)
to my project and updated commons-io to 2.11.0. Tika itself works
fine, but since I included it I have trouble with
DocumentBuilder builder = factory.newDocumentBuilder();
Creating the DocumentBuilder takes much longer than before, but only for the
first instance per thread.
This only happens when I deploy inside a Docker container with Java 11.
On my Mac laptop with Java 11 it works fine.
Any idea what might be causing this?
Thanks
Michael
Re: factory.newDocumentBuilder() takes much longer since my project is using Tika
Posted by Michael Wechner <mi...@wyona.com>.
Thanks for this pointer!
The behaviour feels as if a lock is involved somewhere, but unfortunately I do
not see anything in the logs.
Is it possible to turn on debug logging for Tika?
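In the meantime, here is a minimal sketch of how the delay could be measured outside of Tika. It prints which DocumentBuilderFactory implementation gets resolved and times the first and second newDocumentBuilder() call on a fresh thread. The class and method names (DocumentBuilderTiming, factoryImplementationName, timeNewDocumentBuilder) are made up for illustration; nothing here is Tika API, it only uses the standard javax.xml.parsers classes.

```java
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;

// Hypothetical helper class for illustration; not part of Tika or the JDK.
class DocumentBuilderTiming {

    // Name of the DocumentBuilderFactory implementation the JDK resolved.
    static String factoryImplementationName() {
        return DocumentBuilderFactory.newInstance().getClass().getName();
    }

    // Nanoseconds spent in a single factory.newDocumentBuilder() call.
    static long timeNewDocumentBuilder(DocumentBuilderFactory factory) {
        long start = System.nanoTime();
        try {
            factory.newDocumentBuilder();
        } catch (ParserConfigurationException e) {
            throw new IllegalStateException("Could not create DocumentBuilder", e);
        }
        return System.nanoTime() - start;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println("Factory implementation: " + factoryImplementationName());

        // The slowdown is reported for the first instance per thread,
        // so measure two consecutive calls on a fresh thread.
        Thread worker = new Thread(() -> {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            long first = timeNewDocumentBuilder(factory);
            long second = timeNewDocumentBuilder(factory);
            System.out.printf("%s: first=%d ms, second=%d ms%n",
                    Thread.currentThread().getName(),
                    first / 1_000_000, second / 1_000_000);
        }, "timing-thread");
        worker.start();
        worker.join();
    }
}
```

Running this inside the Docker container, once with and once without the Tika jars on the classpath, should show whether the resolved implementation or the first-call time changes. As far as I know, starting the JVM with -Djaxp.debug=1 also makes the JDK print how it looks up the factory, which might help while the Tika logs stay silent.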
Thanks
Michael
On 03.11.22 at 00:26, Tim Allison wrote:
> This is the only thing that I can think of...
> https://lists.apache.org/thread/6ndctttvn44v1zpmnqfmwkv4sf72kngb
Re: factory.newDocumentBuilder() takes much longer since my project is using Tika
Posted by Tim Allison <ta...@apache.org>.
This is the only thing that I can think of...
https://lists.apache.org/thread/6ndctttvn44v1zpmnqfmwkv4sf72kngb