Posted to dev@apex.apache.org by "Chandni Singh (JIRA)" <ji...@apache.org> on 2016/04/22 19:58:12 UTC
[jira] [Commented] (APEXMALHAR-2063) Integrate WAL to FS WindowDataManager
[ https://issues.apache.org/jira/browse/APEXMALHAR-2063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15254325#comment-15254325 ]
Chandni Singh commented on APEXMALHAR-2063:
-------------------------------------------
Will also move WindowDataManager to the new package structure:
org.apache.apex.malhar.lib.wal
> Integrate WAL to FS WindowDataManager
> -------------------------------------
>
> Key: APEXMALHAR-2063
> URL: https://issues.apache.org/jira/browse/APEXMALHAR-2063
> Project: Apache Apex Malhar
> Issue Type: Improvement
> Reporter: Chandni Singh
> Assignee: Chandni Singh
>
> FS Window Data Manager saves meta-data that helps in replaying the tuples of every completed application window after a failure. To do this, it saves the meta-data in one file per window. Having many small files on HDFS causes issues, as highlighted here:
> http://blog.cloudera.com/blog/2009/02/the-small-files-problem/
> Instead, FS Window Data Manager can use the WAL to write the data and maintain a mapping of how much data was flushed to the WAL in each window.
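
The idea in the description could be sketched roughly as below. This is only an illustration under assumptions, not the Malhar implementation: the class name WalWindowIndexSketch and the methods save and retrieve are hypothetical, and an in-memory byte stream stands in for the single WAL file on the file system. Each window's meta-data is appended to one log, and a map records the offset at which each window's data ends, so a window can be read back without keeping one file per window.

```java
import java.io.ByteArrayOutputStream;
import java.util.Arrays;
import java.util.Map;
import java.util.TreeMap;

/** Hypothetical sketch; not the actual WindowDataManager or WAL API. */
class WalWindowIndexSketch {
  // Stand-in for a single append-only WAL file (assumption: in-memory only).
  private final ByteArrayOutputStream wal = new ByteArrayOutputStream();
  // windowId -> offset just past the last byte flushed for that window.
  private final TreeMap<Long, Integer> endOffsets = new TreeMap<>();

  /** Appends the window's meta-data to the log and records where it ends. */
  void save(long windowId, byte[] metaData) {
    if (!endOffsets.isEmpty() && windowId <= endOffsets.lastKey()) {
      throw new IllegalArgumentException("window ids must be increasing");
    }
    wal.write(metaData, 0, metaData.length);
    endOffsets.put(windowId, wal.size());
  }

  /** Returns the bytes saved for the window, or null if the window is unknown. */
  byte[] retrieve(long windowId) {
    Integer end = endOffsets.get(windowId);
    if (end == null) {
      return null;
    }
    // A window's data starts where the previous window's data ended.
    Map.Entry<Long, Integer> previous = endOffsets.lowerEntry(windowId);
    int start = (previous == null) ? 0 : previous.getValue();
    return Arrays.copyOfRange(wal.toByteArray(), start, end);
  }
}
```

In this sketch the per-window mapping is the only extra state; a real implementation would also need to persist that mapping and handle log rotation and recovery, which are left out here.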
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)