You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@nifi.apache.org by "Andy LoPresto (Jira)" <ji...@apache.org> on 2020/01/09 18:02:00 UTC

[jira] [Created] (NIFI-7002) Handle large files internally in EC toolkit

Andy LoPresto created NIFI-7002:
-----------------------------------

             Summary: Handle large files internally in EC toolkit
                 Key: NIFI-7002
                 URL: https://issues.apache.org/jira/browse/NIFI-7002
             Project: Apache NiFi
          Issue Type: Sub-task
          Components: Tools and Build
    Affects Versions: 1.10.0
            Reporter: Andy LoPresto


The EC toolkit cannot handle large flow definition files (>100 MB uncompressed) due to its use of {{String}} to hold the XML content internally through frequent manipulations. As Strings are immutable in Java, this causes a lot of heap usage and duplication. The read/write to the file system also operates on the String directly rather than perform a streaming read/write. 

We should change the internal representation of the file content (for all files) to be a {{BufferedInputStream}} or other similar stream representation rather than a String. We should test with very large files (1 GB) and clearly provide instructions and warnings if the file size exceeds that. 

The logic of value encryption/replacement will have to change to handle this approach. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)