Posted to issues@carbondata.apache.org by "anubhav tarar (JIRA)" <ji...@apache.org> on 2017/03/02 07:10:45 UTC
[jira] [Commented] (CARBONDATA-704) data mismatch between hive and carbondata after loading for bigint values
[ https://issues.apache.org/jira/browse/CARBONDATA-704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15891726#comment-15891726 ]
anubhav tarar commented on CARBONDATA-704:
------------------------------------------
The reason for this issue is byte compression of the data.
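To illustrate how a byte-compression step could corrupt bigint values at the boundaries, here is a minimal sketch. It is not CarbonData's actual code: the function names (encode_column, decode_column) and the delta scheme (storing max - value in the narrowest type that seems to fit) are assumptions made only for illustration, and the sketch does not claim to reproduce the exact output quoted in the issue. It shows that when a column holds both the largest and smallest bigint, the 64-bit range computation can wrap around, so a one-byte width may be chosen by mistake.

```python
INT64_MIN = -(2 ** 63)
INT64_MAX = 2 ** 63 - 1


def wrap_int64(n):
    """Wrap an unbounded Python int the way signed 64-bit arithmetic would."""
    return (n + 2 ** 63) % 2 ** 64 - 2 ** 63


def wrap_int8(n):
    """Keep only the low 8 bits, read as a signed byte."""
    return (n + 2 ** 7) % 2 ** 8 - 2 ** 7


def encode_column(values):
    """Hypothetical delta encoder: store (max - value) in a byte when the
    computed range appears to fit, otherwise keep full 64-bit deltas."""
    top = max(values)
    # For a column containing both bigint extremes this subtraction overflows.
    spread = wrap_int64(top - min(values))
    if -128 <= spread <= 127:
        return top, "byte", [wrap_int8(wrap_int64(top - v)) for v in values]
    return top, "long", [wrap_int64(top - v) for v in values]


def decode_column(top, deltas):
    """Reverse the delta step using the stored maximum."""
    return [wrap_int64(top - d) for d in deltas]


if __name__ == "__main__":
    for column in ([1234, 1240, 1300], [1234, INT64_MAX, INT64_MIN, 0]):
        top, width, deltas = encode_column(column)
        restored = decode_column(top, deltas)
        print(width, "round trip ok" if restored == column else "MISMATCH")
```

In this sketch a narrow column round-trips correctly, while the column that spans the full bigint range is silently truncated. Guarding the range computation against overflow before choosing the width would avoid that failure in this hypothetical scheme.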
> data mismatch between hive and carbondata after loading for bigint values
> -------------------------------------------------------------------------
>
> Key: CARBONDATA-704
> URL: https://issues.apache.org/jira/browse/CARBONDATA-704
> Project: CarbonData
> Issue Type: Bug
> Components: data-load
> Affects Versions: 1.0.0-incubating
> Reporter: SWATI RAO
> Assignee: anubhav tarar
> Attachments: Test_Data1 (4).csv
>
>
> carbondata
> 0: jdbc:hive2://localhost:10000> create table Test_Boundary (c1_int int,c2_Bigint Bigint,c3_Decimal Decimal(38,30),c4_double double,c5_string string,c6_Timestamp Timestamp,c7_Datatype_Desc string) STORED BY 'org.apache.carbondata.format' ;
> 0: jdbc:hive2://localhost:10000> LOAD DATA INPATH 'hdfs://localhost:54310/Test_Data1.csv' INTO table Test_Boundary OPTIONS ('DELIMITER'=',','QUOTECHAR'='','BAD_RECORDS_ACTION'='FORCE','FILEHEADER'='');
> 0: jdbc:hive2://localhost:10000> select c2_Bigint from Test_Boundary;
> +----------------------+--+
> | c2_Bigint            |
> +----------------------+--+
> | NULL                 |
> | NULL                 |
> | NULL                 |
> | 9223372036854775807  |
> | 9223372036854775807  |
> | 9223372036854775807  |
> | 9223372036854775807  |
> | 9223372036854775807  |
> | 9223372036854775807  |
> | 9223372036854775807  |
> | 9223372036854775807  |
> | 9223372036854775807  |
> | 9223372036854775807  |
> | 9223372036854775807  |
> | 9223372036854775807  |
> | 9223372036854775807  |
> +----------------------+--+
> but in hive
> create table Test_Boundary_hive (c1_int int,c2_Bigint Bigint,c3_Decimal Decimal(38,30),c4_double double,c5_string string,c6_Timestamp Timestamp,c7_Datatype_Desc string) ROW FORMAT DELIMITED FIELDS TERMINATED BY ",";
> LOAD DATA LOCAL INPATH 'Test_Data1.csv' into table Test_Boundary_hive;
> select c2_Bigint from Test_Boundary_hive;
> +-----------------------+--+
> | c2_Bigint             |
> +-----------------------+--+
> | 1234                  |
> | 2345                  |
> | 3456                  |
> | 4567                  |
> | 9223372036854775807   |
> | -9223372036854775808  |
> | -9223372036854775807  |
> | -9223372036854775806  |
> | -9223372036854775805  |
> | 0                     |
> | 9223372036854775807   |
> | 9223372036854775807   |
> | 9223372036854775807   |
> | NULL                  |
> | NULL                  |
> | NULL                  |
> +-----------------------+--+
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)