You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ozone.apache.org by "Hongbing Wang (Jira)" <ji...@apache.org> on 2023/06/01 09:45:00 UTC

[jira] [Commented] (HDDS-8716) ReplicationFactor will be ONE when distcp hdfs ec file to ozone

    [ https://issues.apache.org/jira/browse/HDDS-8716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17728287#comment-17728287 ] 

Hongbing Wang commented on HDDS-8716:
-------------------------------------

distcp -p or -pr will preserve replication number. HDFS ec rep is 1.
In this case, it may be better for ozone to use the default replicationFactor if ozone dir does not has ec policy. 

ONE replicationFactor maybe risks.

>  ReplicationFactor will be ONE when distcp hdfs ec file to ozone
> ----------------------------------------------------------------
>
>                 Key: HDDS-8716
>                 URL: https://issues.apache.org/jira/browse/HDDS-8716
>             Project: Apache Ozone
>          Issue Type: Bug
>    Affects Versions: 1.3.0
>            Reporter: guangbao zhao
>            Assignee: guangbao zhao
>            Priority: Major
>         Attachments: clipboard_image_1685349424008.png, clipboard_image_1685349999645.png
>
>
> When I used distcp to copy data from hdfs to ozone, I found that some data has three replication, and some data has one replication
> I submit with the following command:
> {code:java}
> hadoop distcp \
> -i -p \
> hdfs://xxx/user/hadoop/warehouse/bigdata_admin.db/hive_audit_res/year=2020 \
> ofs://xxx/vol-bigdata-offline/prod-bigdata-offline/hive_audit_res/year=2020 {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@ozone.apache.org
For additional commands, e-mail: issues-help@ozone.apache.org