You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Ashish Thusoo (JIRA)" <ji...@apache.org> on 2008/11/18 23:59:44 UTC

[jira] Commented: (HIVE-69) genMapRedTasks does not use the tree walker and uses implicit state which makes it difficult to enhance

    [ https://issues.apache.org/jira/browse/HIVE-69?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12648792#action_12648792 ] 

Ashish Thusoo commented on HIVE-69:
-----------------------------------

General comment - need a lot of javadocs and documentation of how the state machine works. I think it works correctly but lets document as I have indicated below.

Also the state should not be maintained in the plan but should be in the treewalker. The state is never really used in the execution time and we should only try to put things that are used in the execution time in the plan.

Otherwise, this looks much cleaner than the pervious monolithic blob that we had. Thanks...
Inline Comments
ql/src/java/org/apache/hadoop/hive/ql/exec/TableScanOperator.java:32	Is this just a place holder right now?
ql/src/java/org/apache/hadoop/hive/ql/exec/Task.java:112	Lets add proper javadocs!!
ql/src/java/org/apache/hadoop/hive/ql/exec/Task.java:132	javadocs. What is this used for?
ql/src/java/org/apache/hadoop/hive/ql/parse/GenMapRedTaskProcessor.java:57	Move the new into the constructor?
ql/src/java/org/apache/hadoop/hive/ql/parse/GenMapRedTaskProcessor.java:1	javadocs needed.
ql/src/java/org/apache/hadoop/hive/ql/parse/GenMapRedTaskProcessor.java:117	Please put a comment on how this state machine works. What are the transitions, how the transitions are made and what are the actions taken by each transition.
ql/src/java/org/apache/hadoop/hive/ql/plan/mapredWork.java:28	This is purely compile time information that is used to generate the plan (cut the plan into multiple map reduce plans), so we should not be storing this in the plan. This seems to be specific to the GenMapRedPlanWalker, so I think we should be maintaining this state there in a hashmap.
ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java:2986	Looks much cleaner with the walker stuff... This is cool...
ql/src/java/org/apache/hadoop/hive/ql/parse/GenMapRedWalker.java:44	I thought we were going to move this logic into the dispatcher? no?
ql/src/java/org/apache/hadoop/hive/ql/parse/GenMapRedTaskProcessor.java:116	Also a description of the call stack that leads to there transitions would be very helpful.


> genMapRedTasks does not use the tree walker and uses implicit state which makes it difficult to enhance
> -------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-69
>                 URL: https://issues.apache.org/jira/browse/HIVE-69
>             Project: Hadoop Hive
>          Issue Type: Bug
>            Reporter: Namit Jain
>            Assignee: Namit Jain
>
> In SemanticAnalyzer, genmapredtasks() does not use a tree walker. For map-side joins, the taskplan needs to be enhanced to be possibly
> broken at MapSink also. Basically, the code is very difficult to enhance since there are implicit assumptions that reduce sink is the only
> operator where the plan breaks.
> This should be enhanced so that the user can implement their own task generation logic which is independent of the tree walking.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.