Posted to issues@hive.apache.org by "Chengxiang Li (JIRA)" <ji...@apache.org> on 2016/01/08 07:53:39 UTC
[jira] [Commented] (HIVE-12205) Spark: unify spark statistics
aggregation between local and remote spark client
[ https://issues.apache.org/jira/browse/HIVE-12205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15088795#comment-15088795 ]
Chengxiang Li commented on HIVE-12205:
--------------------------------------
[~chinnalalam], thanks for working on this.
In your patch, the statistics aggregation is still computed separately in different methods (although in the same class now) for {{LocalSparkJobStatus}} and {{RemoteSparkJobStatus}}. I suggest adding an initialize method to {{MetricsCollection}} with the parameters {{String jobId, Map<String, List<TaskMetrics>> jobMetrics}}, so that {{LocalSparkJobStatus}} can reuse {{MetricsCollection}} to aggregate statistics as well. What do you think?
Also, could you create a review request on RB for this?
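To illustrate the suggestion, here is a minimal sketch of what a shared initialize-then-aggregate path could look like. It is not the actual Hive code: {{TaskMetricsSketch}}, {{MetricsCollectionSketch}}, the two metric fields, and the assumption that the map key is a stage identifier are all hypothetical stand-ins, and the real {{MetricsCollection}} and {{TaskMetrics}} classes may differ.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for per-task metrics; not Hive's or Spark's real class.
final class TaskMetricsSketch {
    final long executorRunTime;
    final long bytesRead;

    TaskMetricsSketch(long executorRunTime, long bytesRead) {
        this.executorRunTime = executorRunTime;
        this.bytesRead = bytesRead;
    }
}

// Hypothetical simplified collection; the real MetricsCollection API may differ.
final class MetricsCollectionSketch {
    private String jobId;
    private final List<TaskMetricsSketch> tasks = new ArrayList<>();

    // Loads all task metrics for one job. The map key is assumed to be a stage id.
    void initialize(String jobId, Map<String, List<TaskMetricsSketch>> jobMetrics) {
        this.jobId = jobId;
        tasks.clear();
        for (List<TaskMetricsSketch> stageTasks : jobMetrics.values()) {
            tasks.addAll(stageTasks);
        }
    }

    String getJobId() {
        return jobId;
    }

    // Single aggregation path that both local and remote job status could call.
    Map<String, Long> aggregate() {
        long runTime = 0L;
        long bytes = 0L;
        for (TaskMetricsSketch t : tasks) {
            runTime += t.executorRunTime;
            bytes += t.bytesRead;
        }
        Map<String, Long> totals = new LinkedHashMap<>();
        totals.put("ExecutorRunTime", runTime);
        totals.put("BytesRead", bytes);
        totals.put("TaskCount", (long) tasks.size());
        return totals;
    }
}

final class StatsAggregationSketch {
    public static void main(String[] args) {
        // Metrics as a local job status might hold them, grouped by stage.
        Map<String, List<TaskMetricsSketch>> jobMetrics = new LinkedHashMap<>();
        jobMetrics.put("stage-0", Arrays.asList(
                new TaskMetricsSketch(100L, 2048L),
                new TaskMetricsSketch(150L, 1024L)));
        jobMetrics.put("stage-1", Arrays.asList(new TaskMetricsSketch(50L, 512L)));

        MetricsCollectionSketch collection = new MetricsCollectionSketch();
        collection.initialize("job-1", jobMetrics);
        System.out.println(collection.getJobId() + " " + collection.aggregate());
    }
}
```

With something along these lines, the local path would only need to hand its per-stage metrics to the collection, and the summing logic would live in one place.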
> Spark: unify spark statistics aggregation between local and remote spark client
> -------------------------------------------------------------------------------
>
> Key: HIVE-12205
> URL: https://issues.apache.org/jira/browse/HIVE-12205
> Project: Hive
> Issue Type: Task
> Components: Spark
> Affects Versions: 1.1.0
> Reporter: Xuefu Zhang
> Assignee: Chinna Rao Lalam
> Attachments: HIVE-12205.1.patch
>
>
> In classes {{LocalSparkJobStatus}} and {{RemoteSparkJobStatus}}, Spark statistics aggregation is done similarly but in different code paths. Ideally, we should have a unified approach to simplify maintenance.