You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@tika.apache.org by Zheng Lin Edwin Yeo <ed...@gmail.com> on 2019/08/02 07:56:43 UTC

Indexing information on number of attachments and their names in EML file

Hi,

Would like to check, Is there anyway which we can detect the number of
attachments and their names during indexing of EML files in Solr, and index
those information into Solr?

Currently, Solr is able to use Tika and Tesseract OCR to extract the
contents of the attachments. However, I could not find the information
about the number of attachments in the EML file and what are their filename.

I am using Solr 7.6.0 in production, and also trying out on the new Solr
8.2.0. Both uses Tika 1.19.1.

Regards,
Edwin