You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@tajo.apache.org by "Hyoungjun Kim (JIRA)" <ji...@apache.org> on 2014/07/22 13:24:38 UTC

[jira] [Created] (TAJO-973) DISTINCT should support SortAggregation in first stage.

Hyoungjun Kim created TAJO-973:
----------------------------------

             Summary: DISTINCT should support SortAggregation in first stage.
                 Key: TAJO-973
                 URL: https://issues.apache.org/jira/browse/TAJO-973
             Project: Tajo
          Issue Type: Improvement
            Reporter: Hyoungjun Kim
            Assignee: Hyoungjun Kim
            Priority: Minor


Currently Tajo only support HashAggregation in first stage in the case of DISTINCT. If a task for first stage has many records, OutOfMemory occurs. Tajo should support SortAggregation.
Tajo does not know what the algorithm should choose in the first stage. And Tajo does not support query hint clause. So I will add "tajo-distinct-first-stage-algorithm" property in the TajoConf. After TAJO-928 that property can be set in session variable.
 



--
This message was sent by Atlassian JIRA
(v6.2#6252)