Posted to issues@nifi.apache.org by "Andy LoPresto (Jira)" <ji...@apache.org> on 2020/01/09 18:02:00 UTC
[jira] [Created] (NIFI-7002) Handle large files internally in EC toolkit
Andy LoPresto created NIFI-7002:
-----------------------------------
Summary: Handle large files internally in EC toolkit
Key: NIFI-7002
URL: https://issues.apache.org/jira/browse/NIFI-7002
Project: Apache NiFi
Issue Type: Sub-task
Components: Tools and Build
Affects Versions: 1.10.0
Reporter: Andy LoPresto
The EC toolkit cannot handle large flow definition files (>100 MB uncompressed) because it holds the XML content internally in a {{String}} through frequent manipulations. Because Strings are immutable in Java, each manipulation produces a new copy, which causes heavy heap usage and duplication. The reads and writes to the file system also operate on the String directly rather than performing a streaming read/write.
We should change the internal representation of the file content (for all files) to be a {{BufferedInputStream}} or other similar stream representation rather than a String. We should test with very large files (1 GB) and provide clear instructions and warnings if the file size exceeds that.
The logic of value encryption/replacement will have to change to handle this approach.
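As a rough illustration of what a streaming approach could look like, here is a minimal sketch that reads the content line by line and rewrites protected values as it goes, so that only one line is held in memory at a time. It is not the toolkit's actual code: the class name {{StreamingValueReplacer}}, the {{rewrite}} method, and the {{transform}} callback are hypothetical, and it assumes that each protected value appears as {{enc{...}}} and never spans a line break.

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.StringWriter;
import java.io.Writer;
import java.util.function.UnaryOperator;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch: not part of the NiFi code base.
public class StreamingValueReplacer {

    // Assumption: protected values look like enc{...} and sit on a single line.
    private static final Pattern PROTECTED_VALUE = Pattern.compile("enc\\{([^}]*)\\}");

    /**
     * Copies the input to the output one line at a time, passing the body of
     * each enc{...} wrapper through the supplied transform (for example,
     * decrypt with the old key, then encrypt with the new key).
     * Line endings are normalized to '\n' and the output always ends with one.
     */
    public static void rewrite(Reader in, Writer out, UnaryOperator<String> transform)
            throws IOException {
        BufferedReader reader = new BufferedReader(in);
        BufferedWriter writer = new BufferedWriter(out);
        String line;
        while ((line = reader.readLine()) != null) {
            Matcher matcher = PROTECTED_VALUE.matcher(line);
            StringBuffer rewritten = new StringBuffer();
            while (matcher.find()) {
                String replacement = "enc{" + transform.apply(matcher.group(1)) + "}";
                // quoteReplacement guards against '$' or '\' in the new value
                matcher.appendReplacement(rewritten, Matcher.quoteReplacement(replacement));
            }
            matcher.appendTail(rewritten);
            writer.write(rewritten.toString());
            writer.write('\n');
        }
        writer.flush();
    }

    public static void main(String[] args) throws IOException {
        // In-memory demo; real use would wrap file (or GZIP) streams instead.
        StringWriter out = new StringWriter();
        rewrite(new StringReader("<value>enc{abc}</value>\n<name>x</name>"),
                out, String::toUpperCase);
        System.out.print(out);
    }
}
```

With this shape, peak memory depends on the longest line rather than on the total file size. A flow definition serialized as one very long line would defeat that, so a real change would more likely need an XML streaming parser or a chunked reader with overlap handling.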