You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@tika.apache.org by "Tim Allison (JIRA)" <ji...@apache.org> on 2018/06/07 18:26:00 UTC

[jira] [Created] (TIKA-2662) Add a streaming out option for the Json serialization

Tim Allison created TIKA-2662:
---------------------------------

             Summary: Add a streaming out option for the Json serialization
                 Key: TIKA-2662
                 URL: https://issues.apache.org/jira/browse/TIKA-2662
             Project: Tika
          Issue Type: Task
            Reporter: Tim Allison


Depending on the configuration of the ForkParser, it might be useful for that and also for tika-batch to write out each embedded file once the parse for that embedded file has completed, rather than caching the entire output in memory.

The downside to this is that the main document will now show up at the bottom of the list of metadata objects.  We can re-arrange when we deserialize, but anyone not using our deserialization will see this change in order.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)