You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@hudi.apache.org by GitBox <gi...@apache.org> on 2022/03/08 05:01:53 UTC

[GitHub] [hudi] awpengfei opened a new issue #4978: [SUPPORT] Wrong table path when using Hive to query xxx_rt table before the first compaction

awpengfei opened a new issue #4978:
URL: https://github.com/apache/hudi/issues/4978


   **Describe the problem you faced**
   When using Hive to query xxx_rt table,if there is no parquet file but only log files, we get a wrong table path.
   But when the parquet files are generated, the table path is correct and we can get the data.
   Is this expected behavior?
   ```
   ERROR : Job failed with java.io.FileNotFoundException: File does not exist: hdfs://da-hdfs/tmp/hive/hadoop/90b7d231-0e0a-42e5-a72a-6faad6a9ac89/.hoodie
   org.apache.hudi.exception.TableNotFoundException: Hoodie table not found in path hdfs://da-hdfs/tmp/hive/hadoop/90b7d231-0e0a-42e5-a72a-6faad6a9ac89/.hoodie
   Caused by: java.io.FileNotFoundException: File does not exist: hdfs://da-hdfs/tmp/hive/hadoop/90b7d231-0e0a-42e5-a72a-6faad6a9ac89/.hoodie
   ```
   
   **Environment Description**
   
   * Hudi version : 0.10.1
   
   * Hive version : 3.1.2
   
   * Hadoop version : 3.3.1
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org



[GitHub] [hudi] xiarixiaoyao commented on issue #4978: [SUPPORT] Wrong table path when using Hive to query xxx_rt table before the first compaction

Posted by GitBox <gi...@apache.org>.
xiarixiaoyao commented on issue #4978:
URL: https://github.com/apache/hudi/issues/4978#issuecomment-1062470012


   @awpengfei  yes, for now it is a expected behavior,   before call any hudi function, hive will filter out all files which start with '.'  so all the log files  are filtered out.    
   you have two way to sovle this problem
   1: trigger compaction, after compaction parquet file will generate 
   2: modify hive souce code,  not filter out  .log files


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscribe@hudi.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org