Posted to dev@apex.apache.org by "Ananth (JIRA)" <ji...@apache.org> on 2017/11/11 11:26:10 UTC

[jira] [Closed] (APEXMALHAR-2484) BlockWriter for writing the part files into the specified directory

     [ https://issues.apache.org/jira/browse/APEXMALHAR-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ananth closed APEXMALHAR-2484.
------------------------------

> BlockWriter for writing the part files into the specified directory
> -------------------------------------------------------------------
>
>                 Key: APEXMALHAR-2484
>                 URL: https://issues.apache.org/jira/browse/APEXMALHAR-2484
>             Project: Apache Apex Malhar
>          Issue Type: Task
>            Reporter: Chaitanya
>            Assignee: Chaitanya
>             Fix For: 3.8.0
>
>
> Use case: Suppose the source file (f1.txt) is 1 GB and the block size is 128 MB. I want to copy the file to the destination as a series of part files:
> f1.txt.part1
> f1.txt.part2
> ....
> By default, each part file is the size of one block (128 MB here); the last part may be smaller.
> Design: Currently, the BlockWriter can write part files only to the HDFS instance on which the app is running. To support the use case above, the operator needs the block index and the relative path of the file. BlockMetadata, the type received on the BlockWriter input port, does not carry this information.
> So, I am creating a new operator (PartFileWriter) that extends BlockWriter and has an input port of type FileMetadata.
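
The part-file layout described in the use case can be sketched in a few lines of Java. This is only an illustration of the arithmetic and the naming pattern (relative path plus ".part" plus block index), not the PartFileWriter code itself. The class PartFileNames and all of its methods are hypothetical, and the 1-based block index is an assumption taken from the f1.txt.part1 example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration only; it is not part of Apache Apex Malhar.
class PartFileNames {

    // Number of part files needed to hold fileSize bytes in blocks of blockSize bytes.
    static long partCount(long fileSize, long blockSize) {
        if (fileSize < 0 || blockSize <= 0) {
            throw new IllegalArgumentException("fileSize must be >= 0 and blockSize > 0");
        }
        return (fileSize + blockSize - 1) / blockSize; // ceiling division
    }

    // Size in bytes of the part with the given index (assumed 1-based).
    static long partSize(long fileSize, long blockSize, long partIndex) {
        long count = partCount(fileSize, blockSize);
        if (partIndex < 1 || partIndex > count) {
            throw new IllegalArgumentException("partIndex out of range: " + partIndex);
        }
        long offset = (partIndex - 1) * blockSize;
        // Every part is a full block except possibly the last one.
        return Math.min(blockSize, fileSize - offset);
    }

    // Part file name: relative path of the source file plus ".part" plus the block index.
    static String partName(String relativePath, long partIndex) {
        return relativePath + ".part" + partIndex;
    }

    // All part file names for one source file, in block order.
    static List<String> partNames(String relativePath, long fileSize, long blockSize) {
        List<String> names = new ArrayList<>();
        long count = partCount(fileSize, blockSize);
        for (long i = 1; i <= count; i++) {
            names.add(partName(relativePath, i));
        }
        return names;
    }

    public static void main(String[] args) {
        long oneGb = 1024L * 1024 * 1024;
        long blockSize = 128L * 1024 * 1024;
        for (String name : partNames("f1.txt", oneGb, blockSize)) {
            System.out.println(name);
        }
    }
}
```

Both inputs to partName (the relative path and the block index) are exactly the two pieces of information the description says BlockMetadata lacks, which is why the new operator takes a richer input type.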



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)