You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@atlas.apache.org by "Ashutosh Mestry (Jira)" <ji...@apache.org> on 2020/02/28 21:43:00 UTC

[jira] [Created] (ATLAS-3641) Import Service: Support New 'zipDirect' Format for Import

Ashutosh Mestry created ATLAS-3641:
--------------------------------------

             Summary: Import Service: Support New 'zipDirect' Format for Import
                 Key: ATLAS-3641
                 URL: https://issues.apache.org/jira/browse/ATLAS-3641
             Project: Atlas
          Issue Type: Improvement
          Components:  atlas-core
    Affects Versions: trunk
            Reporter: Ashutosh Mestry
            Assignee: Ashutosh Mestry
             Fix For: trunk


*Background*

Existing exported ZIP formats stores the entities and the order of import in 2 separate files. During import it is thus necessary for both these (entities and order) to be in memory to be able to correctly import.

This causes large import files not function well. Also it puts a lot of pressure on memory.

Existing format also uses 1 zip entry per entity. This does not effectively compress the files.

 The new format addresses these short comings.

*Solution*

Create a format that has:
 * Entities are stored in the order such that they are correctly imported.
 * Lends itself to be able to read as a stream. This will reduce memory constraints.
 * Store entities as 1 zip entry. This will improve compression rate.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)