You are viewing a plain text version of this content. The canonical link for it is here.
Posted to hdfs-dev@hadoop.apache.org by "Lin,Yiqun(vip.com)" <yi...@vipshop.com> on 2018/06/15 02:25:50 UTC

答复: Regarding Hadoop Erasure Coding architecture

Hi Chaitanya,

I suppose you can get more details that you wanted to know from EC design doc attached in JIRA HDFS-7285(https://issues.apache.org/jira/browse/HDFS-7285). Hope this makes sense to you.

Thanks
Yiqun

-----邮件原件-----
发件人: Chaitanya M V S [mailto:chaitanya.mvs2007@gmail.com]
发送时间: 2018年6月14日 21:02
收件人: hdfs-dev@hadoop.apache.org
抄送: Shreya Gupta
主题: Regarding Hadoop Erasure Coding architecture

Hi!

We a group of people trying to understand the architecture of erasure coding in Hadoop 3.0. We have been facing difficulties to understand few terms and concepts regarding the same.

1. What do the terms Block, Block Group, Stripe, Cell and Chunk mean in the context of erasure coding (these terms have taken different meanings and have been used interchangeably over various documentation and blogs)? How has this been incorporated in reading and writing of EC data?

2. How has been the idea/concept of the block from previous versions carried over to EC?

3. ‎The higher level APIs, that of ErasureCoders and ErasureCodec still hasn't been plugged into Hadoop. Also, I haven't found any new Jira regarding the same. Can I know if there are any updates or pointers regarding the incorporation of these APIs into Hadoop?

4. How is the datanode for reconstruction work chosen?  Also, how are the buffer sizes for the reconstruction work determined?


Thanks in advance for your time and considerations.

Regards,
M.V.S.Chaitanya
本电子邮件可能为保密文件。如果阁下非电子邮件所指定之收件人,谨请立即通知本人。敬请阁下不要使用、保存、复印、打印、散布本电子邮件及其内容,或将其用于其他任何目的或向任何人披露。谢谢您的合作! This communication is intended only for the addressee(s) and may contain information that is privileged and confidential. You are hereby notified that, if you are not an intended recipient listed above, or an authorized employee or agent of an addressee of this communication responsible for delivering e-mail messages to an intended recipient, any dissemination, distribution or reproduction of this communication (including any attachments hereto) is strictly prohibited. If you have received this communication in error, please notify us immediately by a reply e-mail addressed to the sender and permanently delete the original e-mail communication and any attachments from all storage devices without making or otherwise retaining a copy.

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-dev-unsubscribe@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-help@hadoop.apache.org