Posted to dev@hive.apache.org by "YoungYik (JIRA)" <ji...@apache.org> on 2011/03/23 08:48:05 UTC
[jira] [Created] (HIVE-2072) set hive.exec.reducers.max=<number> is not working to view
set hive.exec.reducers.max=<number> is not working to view
----------------------------------------------------------
Key: HIVE-2072
URL: https://issues.apache.org/jira/browse/HIVE-2072
Project: Hive
Issue Type: Bug
Components: CLI
Affects Versions: 0.6.0
Environment: Linux 2.6.18-128.el5 x86_64 GNU/Linux, JRE 1.6.0_14
Reporter: YoungYik
I created two views over different partitions of one table and joined the two views in my query. The job used only 1 reducer, the tasks stayed at 82% for a long time, and then it failed.
So I ran set hive.exec.reducers.max=28 before executing the query. This works when I select from joined tables, but when I join the two views Hive still reports "Number of reduce tasks determined at compile time: 1":
1)hive -e "create view view_1(uname, login) as select uname,'this30d' from userlist where domain='$domain' and year=$YYYY and month=$MM and day=$DD and type='all' and period_days=30;"
2)hive -e "create view view_0(uname, login) as select uname,'last30d' from userlist where domain='$domain' and year=$YYYY and month=$MM and day=$DD and type='all' and period_days=30;"
3)hive -e "set mapred.reduce.tasks=28; set; select v0.login,v1.login,count(*) from view_0 v0 full outer join view_1 v1 group by v0.login,v1.login; set;"
The output is:
Total MapReduce jobs = 2
Launching Job 1 out of 2
Number of reduce tasks determined at compile time: 1
In order to change the average load for a reducer (in bytes):
set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
set mapred.reduce.tasks=<number>
In the end, the query still used just 1 reducer.
But when I use "set;" to print the environment, it shows:
mapred.reduce.tasks=28
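
The interplay of the three settings listed in the output above can be sketched roughly as follows. This is a simplified Python model, not Hive's actual code: the function name pick_reducer_count and its parameters are hypothetical, and the rule that a reducer count fixed by the query plan at compile time takes precedence over the session settings is an assumption inferred from the log line "Number of reduce tasks determined at compile time: 1".

```python
from typing import Optional


def pick_reducer_count(
    input_bytes: int,
    bytes_per_reducer: int,              # models hive.exec.reducers.bytes.per.reducer
    max_reducers: int,                   # models hive.exec.reducers.max
    fixed_reduce_tasks: int = -1,        # models mapred.reduce.tasks (-1 = not set)
    compile_time_reducers: Optional[int] = None,  # count fixed by the plan (assumption)
) -> int:
    """Simplified model of how the reducer count might be chosen."""
    # Assumption: a count fixed in the query plan wins over every session setting.
    if compile_time_reducers is not None and compile_time_reducers >= 0:
        return compile_time_reducers

    # A constant number of reducers requested by the user.
    if fixed_reduce_tasks > 0:
        return fixed_reduce_tasks

    # Otherwise estimate from the input size (integer ceiling division),
    # use at least one reducer, and cap at the configured maximum.
    estimated = (input_bytes + bytes_per_reducer - 1) // bytes_per_reducer
    return max(1, min(max_reducers, estimated))


if __name__ == "__main__":
    one_gb = 10**9
    # Estimated from input size, capped by the maximum.
    print(pick_reducer_count(50 * one_gb, one_gb, max_reducers=28))
    # Constant count requested, but the plan already fixed it at 1.
    print(pick_reducer_count(50 * one_gb, one_gb, max_reducers=28,
                             fixed_reduce_tasks=28, compile_time_reducers=1))
```

Under this model, setting mapred.reduce.tasks=28 has no effect once the plan itself has fixed the count at 1, which is consistent with the behaviour reported here.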
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Resolved] (HIVE-2072) test
Posted by "Carl Steinbach (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-2072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Carl Steinbach resolved HIVE-2072.
----------------------------------
Resolution: Incomplete
> test
> ----
>
> Key: HIVE-2072
> URL: https://issues.apache.org/jira/browse/HIVE-2072
> Project: Hive
> Issue Type: Test
> Reporter: YoungYik
> Priority: Trivial
>
[jira] [Commented] (HIVE-2072) set hive.exec.reducers.max=<number> is not working to view
Posted by "YoungYik (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-2072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13010538#comment-13010538 ]
YoungYik commented on HIVE-2072:
--------------------------------
I made a mistake in my query's SQL. Please disregard this issue, sorry.
[jira] [Updated] (HIVE-2072) test
Posted by "YoungYik (JIRA)" <ji...@apache.org>.
[ https://issues.apache.org/jira/browse/HIVE-2072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
YoungYik updated HIVE-2072:
---------------------------
Component/s: (was: CLI)
Description: (was: I created two views over different partitions of one table and joined the two views in my query. The job used only 1 reducer, the tasks stayed at 82% for a long time, and then it failed.
So I ran set hive.exec.reducers.max=28 before executing the query. This works when I select from joined tables, but when I join the two views Hive still reports "Number of reduce tasks determined at compile time: 1":
1)hive -e "create view view_1(uname, login) as select uname,'this30d' from userlist where domain='$domain' and year=$YYYY and month=$MM and day=$DD and type='all' and period_days=30;"
2)hive -e "create view view_0(uname, login) as select uname,'last30d' from userlist where domain='$domain' and year=$YYYY and month=$MM and day=$DD and type='all' and period_days=30;"
3)hive -e "set mapred.reduce.tasks=28; set; select v0.login,v1.login,count(*) from view_0 v0 full outer join view_1 v1 group by v0.login,v1.login; set;"
then the output:
Total MapReduce jobs = 2
Launching Job 1 out of 2
Number of reduce tasks determined at compile time: 1
In order to change the average load for a reducer (in bytes):
set hive.exec.reducers.bytes.per.reducer=<number>
In order to limit the maximum number of reducers:
set hive.exec.reducers.max=<number>
In order to set a constant number of reducers:
set mapred.reduce.tasks=<number>
In the end, the query still used just 1 reducer.
But when I use "set;" to print the environment, it shows:
mapred.reduce.tasks=28)
Priority: Trivial (was: Major)
Environment: (was: Linux 2.6.18-128.el5 x86_64 GNU/Linux, JRE 1.6.0_14)
Affects Version/s: (was: 0.6.0)
Issue Type: Test (was: Bug)
Summary: test (was: set hive.exec.reducers.max=<number> is not working to view)