You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@ambari.apache.org by "Hemanth Yamijala (JIRA)" <ji...@apache.org> on 2016/07/21 23:36:20 UTC

[jira] [Updated] (AMBARI-17834) HDFS copy thread in OutputHDFSFile is not removing files that are processed

     [ https://issues.apache.org/jira/browse/AMBARI-17834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hemanth Yamijala updated AMBARI-17834:
--------------------------------------
    Assignee: Hayat Behlim

> HDFS copy thread in OutputHDFSFile is not removing files that are processed 
> ----------------------------------------------------------------------------
>
>                 Key: AMBARI-17834
>                 URL: https://issues.apache.org/jira/browse/AMBARI-17834
>             Project: Ambari
>          Issue Type: Bug
>          Components: ambari-logsearch
>            Reporter: Hemanth Yamijala
>            Assignee: Hayat Behlim
>
> Consider the HDFS copy thread here: https://github.com/apache/ambari/blob/trunk/ambari-logsearch/ambari-logsearch-logfeeder/src/main/java/org/apache/ambari/logfeeder/output/OutputHDFSFile.java#L177
> It uses a data structure localReadyFiles that has the list of files to copy to HDFS. But AFAICS, nowhere are processed files removed from this data structure. This could result in multiple incorrect behaviours:
> * Over a period of time, this structure could keep growing indefinitely. While the check for existence of files will allow progress, it will take longer and longer to run.
> *  The check to wait when there are no files to process will never pass because the size of the data structure will never be 0. This could lead to tight looped checks and some utilization issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)