Posted to dev@chukwa.apache.org by "Ari Rabkin (JIRA)" <ji...@apache.org> on 2009/07/13 05:37:14 UTC

[jira] Created: (CHUKWA-346) simplified sink archiver

simplified sink archiver
------------------------

                 Key: CHUKWA-346
                 URL: https://issues.apache.org/jira/browse/CHUKWA-346
             Project: Hadoop Chukwa
          Issue Type: New Feature
          Components: Data Processors
    Affects Versions: 0.3.0
            Reporter: Ari Rabkin
            Assignee: Ari Rabkin
             Fix For: 0.3.0


The current demux-archive plumbing is quite complicated. At Berkeley, we need something much simpler. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


[jira] Updated: (CHUKWA-346) simplified sink archiver

Posted by "Ari Rabkin (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CHUKWA-346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ari Rabkin updated CHUKWA-346:
------------------------------

    Status: Patch Available  (was: Open)

> simplified sink archiver
> ------------------------
>
>                 Key: CHUKWA-346
>                 URL: https://issues.apache.org/jira/browse/CHUKWA-346
>             Project: Hadoop Chukwa
>          Issue Type: New Feature
>          Components: Data Processors
>    Affects Versions: 0.3.0
>            Reporter: Ari Rabkin
>            Assignee: Ari Rabkin
>             Fix For: 0.3.0
>
>         Attachments: sinkArchiver.patch
>
>
> The current demux-archive plumbing is quite complicated. At Berkeley, we need something much simpler. 



[jira] Updated: (CHUKWA-346) simplified sink archiver

Posted by "Ari Rabkin (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CHUKWA-346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ari Rabkin updated CHUKWA-346:
------------------------------

    Resolution: Fixed
        Status: Resolved  (was: Patch Available)

Taking silence for consent, I just committed this.



[jira] Commented: (CHUKWA-346) simplified sink archiver

Posted by "Ari Rabkin (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CHUKWA-346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12730215#action_12730215 ] 

Ari Rabkin commented on CHUKWA-346:
-----------------------------------

No. By default in this patch, the archiver groups by cluster, day, and datatype, which is well suited to our use case: MapReduce analytics of logs.
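
To make the grouping concrete, here is a minimal sketch of a (cluster, day, datatype) grouping key. It is not taken from the patch: the record fields (`cluster`, `datatype`, `timestamp_ms`), the UTC day boundary, and the function names are all assumptions chosen for illustration.

```python
from collections import defaultdict
from datetime import datetime, timezone


def group_key(cluster, datatype, timestamp_ms):
    """Build a (cluster, day, datatype) key for one record.

    Assumption: the day is the UTC calendar date of the record's
    millisecond timestamp. The real archiver may bucket days differently.
    """
    day = datetime.fromtimestamp(timestamp_ms / 1000, tz=timezone.utc)
    return (cluster, day.strftime("%Y-%m-%d"), datatype)


def group_records(records):
    """Group record dicts (hypothetical shape) by their grouping key."""
    groups = defaultdict(list)
    for rec in records:
        key = group_key(rec["cluster"], rec["datatype"], rec["timestamp_ms"])
        groups[key].append(rec)
    return dict(groups)
```

Under these assumptions, every record from one cluster, one day, and one datatype lands in the same group, which would let a later MapReduce job read only the groups it cares about.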



[jira] Updated: (CHUKWA-346) simplified sink archiver

Posted by "Ari Rabkin (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CHUKWA-346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ari Rabkin updated CHUKWA-346:
------------------------------

    Attachment:     (was: sinkArchiver.patch)



[jira] Commented: (CHUKWA-346) simplified sink archiver

Posted by "Ari Rabkin (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CHUKWA-346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12730204#action_12730204 ] 

Ari Rabkin commented on CHUKWA-346:
-----------------------------------

A future enhancement, once we have appends, is to actually merge files during promotion, and not just rename to avoid collision.



[jira] Updated: (CHUKWA-346) simplified sink archiver

Posted by "Ari Rabkin (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CHUKWA-346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ari Rabkin updated CHUKWA-346:
------------------------------

    Attachment: sinkArchiver.patch

Revised, fixes a few unit test problems.



[jira] Updated: (CHUKWA-346) simplified sink archiver

Posted by "Ari Rabkin (JIRA)" <ji...@apache.org>.
     [ https://issues.apache.org/jira/browse/CHUKWA-346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ari Rabkin updated CHUKWA-346:
------------------------------

    Attachment: sinkArchiver.patch

Simple sink archiver.

Copies all the .done files out of the sink, runs an archiver MapReduce job, then merges the output of that job into the archive, renaming files to avoid collisions.

Intended use is to run once every day or two, to empty out the sink.
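
The three steps described above can be sketched roughly as follows. This is an illustration on a local filesystem, not the code in sinkArchiver.patch: the directory layout, the numeric-suffix renaming scheme, the `archive_job` callback standing in for the MapReduce job, and all function names are assumptions.

```python
import shutil
from pathlib import Path


def unique_destination(archive_dir: Path, name: str) -> Path:
    """Pick a path in archive_dir that does not exist yet.

    Assumption: collisions are resolved by appending .1, .2, ... to the
    file name. The actual patch may use a different renaming scheme.
    """
    candidate = archive_dir / name
    counter = 1
    while candidate.exists():
        candidate = archive_dir / f"{name}.{counter}"
        counter += 1
    return candidate


def drain_sink(sink_dir, staging_dir, archive_dir, archive_job):
    """Move .done files out of the sink, run a job, promote its output.

    archive_job(input_dir, output_dir) is a hypothetical stand-in for the
    archiver MapReduce job. staging_dir is assumed to be fresh per run.
    """
    sink_dir, staging_dir = Path(sink_dir), Path(staging_dir)
    archive_dir = Path(archive_dir)
    input_dir = staging_dir / "input"
    output_dir = staging_dir / "output"
    for d in (input_dir, output_dir, archive_dir):
        d.mkdir(parents=True, exist_ok=True)

    # Step 1: take only finished (.done) files, so the sink ends up empty
    # of completed data while files still being written stay put.
    for done_file in sorted(sink_dir.glob("*.done")):
        shutil.move(str(done_file), str(input_dir / done_file.name))

    # Step 2: run the archiving job over the staged input.
    archive_job(input_dir, output_dir)

    # Step 3: promote job output into the archive, renaming on collision
    # instead of overwriting or merging.
    promoted = []
    for out_file in sorted(p for p in output_dir.iterdir() if p.is_file()):
        dest = unique_destination(archive_dir, out_file.name)
        shutil.move(str(out_file), str(dest))
        promoted.append(dest)
    return promoted
```

In this sketch, a second run whose job again emits a file named `part-0` would leave the earlier `part-0` untouched and promote the new one as `part-0.1`.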



[jira] Commented: (CHUKWA-346) simplified sink archiver

Posted by "Jiaqi Tan (JIRA)" <ji...@apache.org>.
    [ https://issues.apache.org/jira/browse/CHUKWA-346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12730210#action_12730210 ] 

Jiaqi Tan commented on CHUKWA-346:
----------------------------------

If there's no Demux, then the purpose of Chukwa will be just to collect logs, and store them in a single jumbled mix of all the log record types?
