You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@ozone.apache.org by Wei-Chiu Chuang <we...@cloudera.com.INVALID> on 2019/11/07 18:37:30 UTC

Notes from Hadoop storage community online sync

Thanks @Xiaoyu Yao <xy...@cloudera.com> for giving us a great status update
on Ozone!

We had a pretty large group yesterday. Here's my notes for your reference:
<goog_1177019630>
https://docs.google.com/document/d/1jXM5Ujvf-zhcyw_5kiQVx6g-HeKe-YGnFS_1-qFXomI/edit?usp=sharing
11/6/2019
~20 contributors joined the discussion.Weichiu, Xiaoyu, Chen, Haihua,
haiyang, hexiaoqiao, Hui, Jinglun, Li, Lisheng, Oliver, sibyl.lv, Sammi,
Yisheng, aiphago, Dazhuang, haicai and many others.
Xiaoyu led the discussion of Ozone: object store for big data workloads.What
and why, feature set, current development: 0.4 features (security) and 0.5
features (HA), future roadmap: scale and stability improvement.

Decommissioning support in progress

Questions:

   1.

   Python client implementation — S3 or RPC
   1.

      Sammi: Tencent is preparing to introduce Ozone at Tencent. Use case
      1: Hive. Use case 2: Data science use cases, small files. Requires Python
      client.
      2.

   Ozone GA timeline
   3.

   How does client read: is OM involved in reading data? Ans: No. client
   access DataNode directly.
   4.

   What metadata does OM and SCM maintain?
   5.

   When can Ozone be used in production environment? Ans: wait for GA, and
   benchmarks running workloads like TPC-DS.
   6.

   Performance comparison between HDFS and Ozone. Ans: Ozone use RocksDB as
   the persistent store for metadata, and optimization and tuning is required
   for RocksDB.
   7.

   Ozone uses Raft replication protocol. What if it replicates more than 3
   copies? Would the leader become the bottleneck? Ans: multi Raft project is
   undergoing which addresses this problem.
   8.

   Rename? Ozone is flat hierarchy. Does it mean rename is a O(n)
   operation? Ans: Ozone plans to support hierarchy.