You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@hive.apache.org by "Carl Steinbach (JIRA)" <ji...@apache.org> on 2011/03/07 21:23:59 UTC

[jira] Resolved: (HIVE-797) mappers should report life in ways other than emitting data

     [ https://issues.apache.org/jira/browse/HIVE-797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Carl Steinbach resolved HIVE-797.
---------------------------------

    Resolution: Invalid

> mappers should report life in ways other than emitting data
> -----------------------------------------------------------
>
>                 Key: HIVE-797
>                 URL: https://issues.apache.org/jira/browse/HIVE-797
>             Project: Hive
>          Issue Type: Bug
>            Reporter: S. Alex Smith
>
> Mappers which are performing a great deal of aggregation can be killed by time out even if they are running successfully.  For example, in the following query the group by operator stops the mapper from returning any rows of data until the map is entirely finished.  If the data processing takes longer than the time-out limit, the job will fail.  The mapper should instead offer the tracker some indication that it is busy working.  Alternatively, the tracker could ping the mapper with an appropriate question / warning before it sends a kill signal.
> FROM (
>   FROM my_table
>   SELECT TRANSFORM(my_data)
>   USING 'my_boolean_function'
>   AS boolean_output) a
> SELECT boolean_output, COUNT(1)
> GROUP BY boolean_output

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira