Posted to issues@carbondata.apache.org by "anubhav tarar (JIRA)" <ji...@apache.org> on 2017/12/20 10:11:00 UTC

[jira] [Created] (CARBONDATA-1920) Sparksql query result is not same as presto on same sql

anubhav tarar created CARBONDATA-1920:
-----------------------------------------

             Summary: Sparksql query result is not same as presto on same sql 
                 Key: CARBONDATA-1920
                 URL: https://issues.apache.org/jira/browse/CARBONDATA-1920
             Project: CarbonData
          Issue Type: Bug
          Components: presto-integration
    Affects Versions: 1.3.0
         Environment: spark2.1,presto 0.187
            Reporter: anubhav tarar
            Assignee: anubhav tarar


I am using CarbonData 1.2.0 and Spark 1.6.0.
My test case:
  1. Create a table:
  cc.sql("create table IF NOT EXISTS test.table5(id string, name String, city String, age int) stored by 'carbondata' tblproperties('DICTIONARY_INCLUDE' = 'age')")

  2. Load CSV data into the table. The data looks like this:
    id,name,city,age
    1,david,shenzhen,31
    88,eason,shenzhen,27
    3,jarry,wuhan,35

  3. Select from Spark SQL. The result is:
+-----+--------+-----------+------+--+
| id  |  name  |   city    | age  |
+-----+--------+-----------+------+--+
| 1   | david  | shenzhen  | 31   |
| 3   | jarry  | wuhan     | 35   |
| 88  | eason  | shenzhen  | 27   |
+-----+--------+-----------+------+--+
  This result is correct.

  4. Select from Presto. The result is:
 id | name  |   city   | age
----+-------+----------+-----
 1  | david | shenzhen |   3
 3  | jarry | wuhan    |   4
 88 | eason | shenzhen |   2
(3 rows)
  The age field is wrong: Presto returns 3, 4 and 2 where Spark SQL returns 31, 35 and 27.

I know why this happens: the age column uses dictionary encoding (the DICTIONARY_INCLUDE table property in step 1).
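
To make the suspected cause concrete, here is a minimal sketch in plain Scala. It is not CarbonData or Presto code. It assumes that a dictionary-encoded column stores small surrogate keys instead of the raw values, that keys are assigned to the sorted distinct values, and that numbering starts at 2. All names in it (DictionaryDecodeSketch, firstKey, readWithoutDecoding, readWithDecoding) are hypothetical. Under those assumptions, a reader that skips the dictionary lookup would return exactly the ages Presto shows above.

```scala
// Hypothetical illustration only: this is NOT CarbonData's actual reader code.
object DictionaryDecodeSketch {
  // Ages from the sample CSV, in load order (david, eason, jarry).
  val ages: Seq[Int] = Seq(31, 27, 35)

  // Assumption: surrogate keys are assigned to the sorted distinct values,
  // starting at 2 (key 1 is assumed to be reserved).
  val firstKey = 2
  val dictionary: Map[Int, Int] =
    ages.distinct.sorted.zipWithIndex.map { case (v, i) => v -> (i + firstKey) }.toMap

  // Reverse lookup: surrogate key -> original value.
  val reverse: Map[Int, Int] = dictionary.map(_.swap)

  // What the column would hold on disk: surrogate keys, not raw values.
  val encoded: Seq[Int] = ages.map(dictionary)

  // A reader that skips decoding hands back the surrogate keys unchanged.
  def readWithoutDecoding(stored: Seq[Int]): Seq[Int] = stored

  // A reader that decodes maps each key back through the dictionary.
  def readWithDecoding(stored: Seq[Int]): Seq[Int] = stored.map(reverse)

  def main(args: Array[String]): Unit = {
    println(readWithoutDecoding(encoded)) // List(3, 2, 4)
    println(readWithDecoding(encoded))    // List(31, 27, 35)
  }
}
```

With these assumptions david (31) maps to 3, eason (27) to 2 and jarry (35) to 4, which matches the Presto output, while the decoded values match the Spark SQL output.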



