You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (Jira)" <ji...@apache.org> on 2022/01/24 19:36:00 UTC

[jira] [Resolved] (TIKA-3662) Solr emitter when used by tika-app is not successfully emitting all files

     [ https://issues.apache.org/jira/browse/TIKA-3662?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Tim Allison resolved TIKA-3662.
-------------------------------
    Fix Version/s: 2.3.0
       Resolution: Not A Problem

This turned out to be "not a problem".  The tika-config path in the config file pointed to a different config file.  The issue was that the emitter that had been loaded by the AsyncProcessor was configured by the primary tika-config, and the emitter in the pipes-server was configured by a different tika-config.  This mean that the docs were being indexed, but in a different index. :( :P

I improved logging and added a check for the return status.

> Solr emitter when used by tika-app is not successfully emitting all files
> -------------------------------------------------------------------------
>
>                 Key: TIKA-3662
>                 URL: https://issues.apache.org/jira/browse/TIKA-3662
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Blocker
>             Fix For: 2.3.0
>
>
> In the demo today, I noticed that some files were reported to be emitted successfully, but then they weren't available in Solr.
>  
> I was using tika-app in async.  This may be a race condition during shutdown.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)