You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@drill.apache.org by "Jason Altekruse (JIRA)" <ji...@apache.org> on 2014/08/06 02:13:15 UTC

[jira] [Created] (DRILL-1264) Fix operators to work with embedded types

Jason Altekruse created DRILL-1264:
--------------------------------------

             Summary: Fix operators to work with embedded types
                 Key: DRILL-1264
                 URL: https://issues.apache.org/jira/browse/DRILL-1264
             Project: Apache Drill
          Issue Type: Sub-task
            Reporter: Jason Altekruse


Currently there are several operators such as sort, group by and join that assume a constant schema. One example of a bug introduced by this assumption is the current functionality of breaking a group by grouping any time we hit a schema change in the batch, even if it is not in the column being grouped by. This is a limitation of the constant schema in a batch, even if we have all integers in one column we are grouping by, we cannot hold all of the records matching a group by constraint in a single batch if the type changes in another column. This currently forces a breaking up of the records that should not happen. There are also cases where the column we are interested in for the group by, sort, etc. should be able to incorporate schema changes into the algorithm, such as correctly sorting different numeric types together or doing a similar aggregation with compatible types, such as the different numeric types.



--
This message was sent by Atlassian JIRA
(v6.2#6252)