Posted to user@manifoldcf.apache.org by Julien Massiera <ju...@francelabs.com> on 2016/05/19 16:44:22 UTC
Multiple output documents from one input document in transformation connector
Hi ManifoldCF community,
Here is my problem: I would like to crawl '.pst' files with
ManifoldCF and index each email they contain into a Solr instance.
I am thinking of crawling the '.pst' files with a FileSystem repository
connection and then using a custom Transformation connection to extract
the emails and send each one on for Solr ingestion through the activities object.
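To make the idea concrete, here is a rough sketch of the fan-out I have in mind. The interfaces DownstreamActivities and EmailExtractor and the class PstFanOutTransformer are hypothetical stand-ins written only for illustration; they are not ManifoldCF's actual API, and the "#email-N" URI scheme is an assumption. Whether the real pipeline accepts several downstream documents for one input document is exactly what I am unsure about.

```java
import java.util.List;

// Hypothetical stand-in for the "activities" object handed to a
// transformation connector. This is NOT the real ManifoldCF interface.
interface DownstreamActivities {
    void sendDocument(String documentUri, String content);
}

// Hypothetical extractor; a real one would parse the .pst container.
interface EmailExtractor {
    List<String> extractEmails(byte[] pstBytes);
}

final class PstFanOutTransformer {
    private final EmailExtractor extractor;

    PstFanOutTransformer(EmailExtractor extractor) {
        this.extractor = extractor;
    }

    // Sends one downstream document per extracted email and returns
    // the number of documents sent.
    int transform(String pstUri, byte[] pstBytes, DownstreamActivities activities) {
        List<String> emails = extractor.extractEmails(pstBytes);
        for (int i = 0; i < emails.size(); i++) {
            // Assumed convention: derive a distinct URI per email so that
            // each one can be indexed as a separate document.
            String emailUri = pstUri + "#email-" + i;
            activities.sendDocument(emailUri, emails.get(i));
        }
        return emails.size();
    }
}
```

In this sketch a single '.pst' input results in one sendDocument call per email, each with its own URI.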
Is this approach correct, or do I need to consider another solution?
Thanks for your help.
Julien Massiera