You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2021/06/22 08:32:25 UTC

[GitHub] [hudi] deep-teliacompany opened a new issue #3132: [SUPPORT]- HUDI table returns no records when queried using Hive with partition columns in where clause

deep-teliacompany opened a new issue #3132:
URL: https://github.com/apache/hudi/issues/3132


   HUDI table returns no records when queried using Hive with partition columns in where clause. While same query returns expected records perfectly in Impala
   
   Hudi Table created in hive- 
   CREATE EXTERNAL TABLE t_test_hudi (  
   `_hoodie_commit_time` string,
   `_hoodie_commit_seqno` string,
   `_hoodie_record_key` string,
   `_hoodie_partition_path` string,
   `_hoodie_file_name` string,
   `DagName` string,
   `sequence_number` int,
   `dt_timestamp` date,
   cdl_ingest_time string)
   PARTITIONED BY ( 
   `ing_year` int, 
   `ing_month` int, 
   `ing_day` int
     ) 
   ROW FORMAT SERDE 'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe' 
   STORED AS INPUTFORMAT 'org.apache.hudi.hadoop.HoodieParquetInputFormat' 
   OUTPUTFORMAT 'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat' 
   LOCATION '/data/test/swe/base/cusin_gsmahs/mobile_customer/t_cusin_customer_d1/';
   
   While below query works fine -
   "Select * from t_test_hudi"
   
   Expected behavior
   Query  -"Select * from t_test_hudi where ing_year=2021 and ing_month=4 and ing_day=4" 
   should return records
   
   Environment Description
   Hudi version : 0.8.0
   Spark version : 2.4.3 
   Hive version : 3.1
   Hadoop version : Distribution CDH-7.1.4
   Storage (HDFS/S3/GCS..) : HDFS
   Running on Docker? (yes/no) : No
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] nsivabalan commented on issue #3132: [SUPPORT]- HUDI table returns no records when queried using Hive with partition columns in where clause

Posted by GitBox <gi...@apache.org>.
nsivabalan commented on issue #3132:
URL: https://github.com/apache/hudi/issues/3132#issuecomment-991914238


   @deep-teliacompany : would appreciate if you can respond so that we can investigate further. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] nsivabalan closed issue #3132: [SUPPORT]- HUDI table returns no records when queried using Hive with partition columns in where clause

Posted by GitBox <gi...@apache.org>.
nsivabalan closed issue #3132:
URL: https://github.com/apache/hudi/issues/3132


   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] nsivabalan commented on issue #3132: [SUPPORT]- HUDI table returns no records when queried using Hive with partition columns in where clause

Posted by GitBox <gi...@apache.org>.
nsivabalan commented on issue #3132:
URL: https://github.com/apache/hudi/issues/3132#issuecomment-918428229


   @deep-teliacompany : Can you please respond. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] nsivabalan commented on issue #3132: [SUPPORT]- HUDI table returns no records when queried using Hive with partition columns in where clause

Posted by GitBox <gi...@apache.org>.
nsivabalan commented on issue #3132:
URL: https://github.com/apache/hudi/issues/3132#issuecomment-997564895


   @deep-teliacompany : will go ahead and close out the issue if we don't hear back from you. Hive sync tool should work for you as proposed above. Or you can set appropriate hive sync configs and hudi writes will take care of syncing. 
   Feel free to re-open if you still have issues or open a new issue if you are having some other problems. would be happy to assist. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] codope commented on issue #3132: [SUPPORT]- HUDI table returns no records when queried using Hive with partition columns in where clause

Posted by GitBox <gi...@apache.org>.
codope commented on issue #3132:
URL: https://github.com/apache/hudi/issues/3132#issuecomment-905377794


   @deep-teliacompany Are you using [hive sync tool](https://hudi.apache.org/docs/writing_data/#syncing-to-hive) to create hudi tables in hive? If yes, can you share hive_sync properties? If not, can you try with hive sync?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] vinothchandar commented on issue #3132: [SUPPORT]- HUDI table returns no records when queried using Hive with partition columns in where clause

Posted by GitBox <gi...@apache.org>.
vinothchandar commented on issue #3132:
URL: https://github.com/apache/hudi/issues/3132#issuecomment-926238039


   @deep-teliacompany any updates for us?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] deep-teliacompany commented on issue #3132: [SUPPORT]- HUDI table returns no records when queried using Hive with partition columns in where clause

Posted by GitBox <gi...@apache.org>.
deep-teliacompany commented on issue #3132:
URL: https://github.com/apache/hudi/issues/3132#issuecomment-920748080


   Testing this and will revert once have my findings.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] deep-teliacompany edited a comment on issue #3132: [SUPPORT]- HUDI table returns no records when queried using Hive with partition columns in where clause

Posted by GitBox <gi...@apache.org>.
deep-teliacompany edited a comment on issue #3132:
URL: https://github.com/apache/hudi/issues/3132#issuecomment-920748080


   Hi, I am not using hive sync tool, will try and and will share the observations.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org