You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@manifoldcf.apache.org by Julien Massiera <ju...@francelabs.com> on 2016/05/19 16:44:22 UTC

Multiple output documents from one input document in transformation connector

Hi ManifoldCF community,

here is my problem : I would like to crawl '.pst' documents with 
ManifoldCF and index each email within them into a Solr instance.
I'm thinking to crawl the '.pst' files with a FileSystem repository 
connection and then use my custom Transformation connection to extract 
the emails and send them for Solr ingestion through the activities object.

Is my approach correct ? or do I need to consider another solution ?

Thanks for your help.

Julien Massiera