You are viewing a plain text version of this content. The canonical link for it is here.
Posted to notifications@iotdb.apache.org by "Jialin Qiao (Jira)" <ji...@apache.org> on 2022/06/14 01:09:00 UTC

[jira] [Assigned] (IOTDB-3100) Semantics change of TsFile filename

     [ https://issues.apache.org/jira/browse/IOTDB-3100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jialin Qiao reassigned IOTDB-3100:
----------------------------------

    Assignee: yusicheng  (was: Haiming Zhu)

> Semantics change of TsFile filename
> -----------------------------------
>
>                 Key: IOTDB-3100
>                 URL: https://issues.apache.org/jira/browse/IOTDB-3100
>             Project: Apache IoTDB
>          Issue Type: Improvement
>          Components: Core/Engine
>            Reporter: Haiming Zhu
>            Assignee: yusicheng
>            Priority: Major
>              Labels: pull-request-available
>
> Currently, the filename format of each tsfile is \{file_created_time}-\{version_id}-\{inner_space_merge_num}-\{cross_space_merge_num}.tsfile. In one time partition, the order of tsfiles is guaranteed by the version_id, for example, 1651825804093-2-0-0.tsfile is after 1651825804092-1-0-0.tsfile
>  
> The problem is that filename conflict may occur in the cross space compaction and load scenes. In the cross space compaction, assuming there exists 3-2-0-0.tsfile, 4-3-0-0.tsfile and 5-5-0-0.tsfile in the sequence folder, if file 4-3-0-0.tsfile is selected, compaction cannot generate 3 or more target files because only 2 version_id are left between 2 and 5, so some big target files may be generated. In the load, assuming there exists 3-2-0-0.tsfile, 3-3-0-0.tsfile and 3-3-0-0.tsfile in the sequence folder, no more sequence files cannot be loaded between 3-2-0-0.tsfile and 3-3-0-0.tsfile, they can only be loaded into the unsequence folder.
>  
> In response to these problems, the format won't be changed, but the meaning of file_created_time and version_id will be different. Instead of version_id, we use file_created_time to guarantee the order of tsfiles, and if two tsfiles have the same file_created_time, then we use version_id to guarantee the order. This semantics change may afftect query, compaction and load module.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)