You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ozone.apache.org by "Hemant Kumar (Jira)" <ji...@apache.org> on 2023/01/27 05:26:00 UTC
[jira] [Updated] (HDDS-7845) Wait for checkpoint directory to be created
[ https://issues.apache.org/jira/browse/HDDS-7845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hemant Kumar updated HDDS-7845:
-------------------------------
Description: There is an issue while running a UT that included a snapdiff call, The problem was that it included 2 consecutive createSnapshot calls , The first one created a checkpoint however the second didn’t and gave an error during snapshot read (as checkpoint hadn't been created). although it didn’t give an error during snapshot create and returned a response, It looks like the checkpoint creation is happening async and not blocking call. One more observation is that this occurs only when ratisEnabled set to true . (was: Currently, `RocksDBCheckpointDiffer` does lots of things (e.g. maintaining the DAG, implements RocksDB listener, diff two snapshots and probably remove/update of snapshot from DAG too) which makes it complicated and harder to test because of tight coupling with RocksDB.
`RocksDbCheckpointDiffer` can be simplified by extracting out the DAG to new class `RocksDbCompactionDag` which maintains SST DAG in which you can add nodes & arcs, load the DAG from disk/DB, remove nodes and arcs, etc. DAG is independent of what we use to store compaction log (disk or RocksDB). We should be able to test all the functionality of `RocksDbCompactionDag` independently.)
> Wait for checkpoint directory to be created
> -------------------------------------------
>
> Key: HDDS-7845
> URL: https://issues.apache.org/jira/browse/HDDS-7845
> Project: Apache Ozone
> Issue Type: Sub-task
> Reporter: Hemant Kumar
> Assignee: Hemant Kumar
> Priority: Major
>
> There is an issue while running a UT that included a snapdiff call, The problem was that it included 2 consecutive createSnapshot calls , The first one created a checkpoint however the second didn’t and gave an error during snapshot read (as checkpoint hadn't been created). although it didn’t give an error during snapshot create and returned a response, It looks like the checkpoint creation is happening async and not blocking call. One more observation is that this occurs only when ratisEnabled set to true .
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org