Posted to dev@manifoldcf.apache.org by "Subasini Rath (JIRA)" <ji...@apache.org> on 2019/01/15 09:13:00 UTC

[jira] [Comment Edited] (CONNECTORS-1563) SolrException: org.apache.tika.exception.ZeroByteFileException: InputStream must have > 0 bytes

    [ https://issues.apache.org/jira/browse/CONNECTORS-1563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16742883#comment-16742883 ] 

Subasini Rath edited comment on CONNECTORS-1563 at 1/15/19 9:12 AM:
--------------------------------------------------------------------

Hi Karl,

  I tried the suggestions from your email below, but with no luck.

Please find attached the screenshots of my ManifoldCF settings.

Could you please take another look and let me know if I am missing something?

Also, as per your suggestion, I made these changes in the Solr output connection:

 In the [Paths] tab ---> I changed [Update handler] to /update instead of /update/extract.

 In the [Schema] tab ---> I deselected [Use the Extract Update Handler:].

With these changes, what I observe is that no documents get indexed in Solr.
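
One way to confirm whether anything reached Solr after a change like this is to ask the core for its document count directly, independent of the ManifoldCF job status. The following is only a minimal sketch: the base URL `http://localhost:8983/solr` and the core name `mycore` are placeholders that do not come from this thread, and the helper names are hypothetical. It uses Solr's standard select handler with `q=*:*` and `rows=0`, and reads `response.numFound` from the JSON reply.

```python
import json
import urllib.parse
import urllib.request


def build_count_url(base_url: str, core: str) -> str:
    """Build a select URL that matches every document but returns no rows."""
    params = urllib.parse.urlencode({"q": "*:*", "rows": 0, "wt": "json"})
    return f"{base_url.rstrip('/')}/{core}/select?{params}"


def parse_num_found(body: str) -> int:
    """Read response.numFound from a Solr JSON select response."""
    return int(json.loads(body)["response"]["numFound"])


def count_documents(base_url: str, core: str, timeout: float = 10.0) -> int:
    """Query Solr and return the number of indexed documents in the core."""
    url = build_count_url(base_url, core)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_num_found(resp.read().decode("utf-8"))


# Example call (placeholder URL and core name, adjust to your setup):
# print(count_documents("http://localhost:8983/solr", "mycore"))
```

If the count stays at 0 after the job has run, the documents are probably not reaching the /update handler at all; if it increases but the content field is empty, the problem is more likely in text extraction than in delivery.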



> SolrException: org.apache.tika.exception.ZeroByteFileException: InputStream must have > 0 bytes
> -----------------------------------------------------------------------------------------------
>
>                 Key: CONNECTORS-1563
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1563
>             Project: ManifoldCF
>          Issue Type: Task
>          Components: Lucene/SOLR connector
>            Reporter: Sneha
>            Assignee: Karl Wright
>            Priority: Major
>         Attachments: managed-schema, solrconfig.xml
>
>
> I am encountering this problem:
> When I check the "Use the Extract Update Handler:" parameter, I get this error from Solr: null:org.apache.solr.common.SolrException: org.apache.tika.exception.ZeroByteFileException: InputStream must have > 0 bytes
> If I ignore the Tika exception, my documents get indexed but don't have a content field in Solr.
> I am using Solr 7.3.1 and ManifoldCF 2.8.1.
> I am using Solr Cell and hence have not configured the external Tika extractor in the ManifoldCF pipeline.
> Please help me with this problem.
> Thanks in advance



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)