You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@manifoldcf.apache.org by "Subasini Rath (JIRA)" <ji...@apache.org> on 2019/02/19 08:22:00 UTC

[jira] [Updated] (CONNECTORS-1563) SolrException: org.apache.tika.exception.ZeroByteFileException: InputStream must have > 0 bytes

     [ https://issues.apache.org/jira/browse/CONNECTORS-1563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Subasini Rath updated CONNECTORS-1563:
--------------------------------------
    Attachment: Manifold and Solr settings_CustomField.docx

Hi Karl,
     Please find below the configuration details which I am using in manifold :

1. Output Connection Type - Solr
2. Repository Connection - Web
3. Job connection Sequence : Repository --> Metadata Adjuster Transformation --> output connection

I tried to write the content in a custom field by unchecking the checkbox in schema tab of output connection.
But after unchecking the checkbox, It is not writing to Solr anything. Please find the attached document for my settings.
Could you please check and let me know if I am missing something.



Thanks & Regards,
Subasini Rath
O: +91-33 6636-8889 
M: +91 983-1234-341
Email: Subasini.Rath@endeavourenergy.com.au



> SolrException: org.apache.tika.exception.ZeroByteFileException: InputStream must have > 0 bytes
> -----------------------------------------------------------------------------------------------
>
>                 Key: CONNECTORS-1563
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-1563
>             Project: ManifoldCF
>          Issue Type: Task
>          Components: Lucene/SOLR connector
>            Reporter: Sneha
>            Assignee: Karl Wright
>            Priority: Major
>         Attachments: Document simple history.docx, Manifold and Solr settings_CustomField.docx, managed-schema, manifold settings.docx, manifoldcf.log, solr.log, solrconfig.xml
>
>
> I am encountering this problem:
> I have checked "Use the Extract Update Handler:" param then I am getting an error on Solr i.e. null:org.apache.solr.common.SolrException: org.apache.tika.exception.ZeroByteFileException: InputStream must have > 0 bytes
> If I ignore tika exception, my documents get indexed but dont have content field on Solr.
> I am using Solr 7.3.1 and manifoldCF 2.8.1
> I am using solr cell and hence not configured external tika extractor in manifoldCF pipeline
> Please help me with this problem
> Thanks in advance



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)