You are viewing a plain text version of this content. The canonical link for it is here.
Posted to users@solr.apache.org by Andrew Beverley <an...@andybev.com> on 2024/01/12 13:35:14 UTC

Re: Skipping indexing of encrypted documents

On 20/12/2023 11:07, Andrew Beverley wrote:
> Hello all,
> 
> I am looking for a way to silently skip the indexing of documents that 
> are encrypted.
> 
> Currently I see an exception:
> 
> 500 Exception => org.apache.solr.common.SolrException: 
> org.apache.tika.exception.EncryptedDocumentException: Unable to pr
> ocess: document is encrypted
> 
> I am wondering whether it is possible to use an Update Request Processor 
> to skip such documents. Or whether there is a way to configure Tika to 
> skip such documents? Let me know if I should be asking this question on 
> the Tika mailing list.

Hello all - I wondered if anyone might be able to help with the above on 
a professional basis?

Many thanks,

Andy

Re: Skipping indexing of encrypted documents

Posted by Andrew Beverley <an...@andybev.com>.
On 12/01/2024 22:31, Mikhail Khludnev wrote:
> Hello Andy,
> May you use ignoreTikaException param described
> https://solr.apache.org/guide/solr/latest/indexing-guide/indexing-with-tika.html#solr-cell-parameters

Thanks Mikhail, that will be useful, at least in the short-term!

Andy

> On Fri, Jan 12, 2024 at 7:02 PM Andrew Beverley <an...@andybev.com> wrote:
> 
>> On 20/12/2023 11:07, Andrew Beverley wrote:
>>> Hello all,
>>>
>>> I am looking for a way to silently skip the indexing of documents that
>>> are encrypted.
>>>
>>> Currently I see an exception:
>>>
>>> 500 Exception => org.apache.solr.common.SolrException:
>>> org.apache.tika.exception.EncryptedDocumentException: Unable to pr
>>> ocess: document is encrypted
>>>
>>> I am wondering whether it is possible to use an Update Request Processor
>>> to skip such documents. Or whether there is a way to configure Tika to
>>> skip such documents? Let me know if I should be asking this question on
>>> the Tika mailing list.
>>
>> Hello all - I wondered if anyone might be able to help with the above on
>> a professional basis?
>>
>> Many thanks,
>>
>> Andy
>>
> 
> 

Re: Skipping indexing of encrypted documents

Posted by Mikhail Khludnev <mk...@apache.org>.
Hello Andy,
May you use ignoreTikaException param described
https://solr.apache.org/guide/solr/latest/indexing-guide/indexing-with-tika.html#solr-cell-parameters
?

On Fri, Jan 12, 2024 at 7:02 PM Andrew Beverley <an...@andybev.com> wrote:

> On 20/12/2023 11:07, Andrew Beverley wrote:
> > Hello all,
> >
> > I am looking for a way to silently skip the indexing of documents that
> > are encrypted.
> >
> > Currently I see an exception:
> >
> > 500 Exception => org.apache.solr.common.SolrException:
> > org.apache.tika.exception.EncryptedDocumentException: Unable to pr
> > ocess: document is encrypted
> >
> > I am wondering whether it is possible to use an Update Request Processor
> > to skip such documents. Or whether there is a way to configure Tika to
> > skip such documents? Let me know if I should be asking this question on
> > the Tika mailing list.
>
> Hello all - I wondered if anyone might be able to help with the above on
> a professional basis?
>
> Many thanks,
>
> Andy
>


-- 
Sincerely yours
Mikhail Khludnev