You are viewing a plain text version of this content. The canonical link for it is here.

Posted to issues@iceberg.apache.org by "luxl6 (via GitHub)" <gi...@apache.org> on 2023/03/13 07:15:23 UTC

[GitHub] [iceberg] luxl6 opened a new issue, #7092: 数据合并

luxl6 opened a new issue, #7092:
URL: https://github.com/apache/iceberg/issues/7092

   ### Query engine
   
   Flink
   
   ### Question
   
   flink 结合iceberg写入minio，小文件怎么合并


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] luxl6 commented on issue #7092: 数据合并

Posted by "luxl6 (via GitHub)" <gi...@apache.org>.

luxl6 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1465792578

   > 参见：https://iceberg.apache.org/docs/latest/flink/#rewrite-files-action
   
   Exception in thread "main" org.apache.iceberg.exceptions.NoSuchTableException: Table does not exist at location: s3a://iceberg/flink_db/flink_hb
   	at org.apache.iceberg.hadoop.HadoopTables.load(HadoopTables.java:97)
   	at org.apache.iceberg.flink.TableLoader$HadoopTableLoader.loadTable(TableLoader.java:76)
   一直报表不存在，我的catalog用的是jdbc模式存在了pgsql,数据存在了minio里面，这个怎么做呢？？


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] xuzhiwen1255 commented on issue #7092: 数据合并

Posted by "xuzhiwen1255 (via GitHub)" <gi...@apache.org>.

xuzhiwen1255 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1465869375

   You need aws dependency


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] xuzhiwen1255 commented on issue #7092: 数据合并

Posted by "xuzhiwen1255 (via GitHub)" <gi...@apache.org>.

xuzhiwen1255 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1467274274

   I think the documentation is very detailed.
   
   Change the address to the one that accesses minio, change the catalog to the corresponding catalog, and you may need some access-related configuration
   
   ```java
   import org.apache.iceberg.flink.actions.Actions;
    
   TableLoader tableLoader = TableLoader.fromHadoopTable("xxx");
   Table table = tableLoader.loadTable();
   RewriteDataFilesActionResult result = Actions.forTable(table)
           .rewriteDataFiles()
           .execute();
   
   ```
   
   Such as these, of course, not all of them are necessary, I am just an example
   ```java
         config.set("fs.s3a.impl", "org.apache.hadoop.fs.s3a.S3AFileSystem");
         config.set("fs.s3a.path.style.access", "true");
         config.set("fs.s3a.connection.ssl.enabled", "false");
         config.set("fs.s3a.access.key", access);
         config.set("fs.s3a.secret.key", secret);
         config.set("fs.s3a.endpoint", endpoint);
         .....
   ```
   
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] luxl6 commented on issue #7092: 数据合并

Posted by "luxl6 (via GitHub)" <gi...@apache.org>.

luxl6 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1465869105

   You can access minio implementation rewrite using the s3 protocol.
   
   How to do this? ? ask for advice


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] xuzhiwen1255 commented on issue #7092: 数据合并

Posted by "xuzhiwen1255 (via GitHub)" <gi...@apache.org>.

xuzhiwen1255 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1465816702

   >官网这个说的是Hadoop的吧，大佬们，求指教
   You can rewrite using the s3 protocol link minio


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] luxl6 commented on issue #7092: 数据合并

Posted by "luxl6 (via GitHub)" <gi...@apache.org>.

luxl6 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1471379328

   Teacher, the jdbc_catalog data merge that has troubled me for a week has finally been solved, thank you teacher for your previous help
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] luxl6 commented on issue #7092: 数据合并

Posted by "luxl6 (via GitHub)" <gi...@apache.org>.

luxl6 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1465864266

   
   
   
   
   > 参见：https://iceberg.apache.org/docs/latest/flink/#rewrite-files-action
   The official website is about Hadoop.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

Re: [I] 数据合并 [iceberg]

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.

github-actions[bot] closed issue #7092: 数据合并
URL: https://github.com/apache/iceberg/issues/7092


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] xuzhiwen1255 commented on issue #7092: 数据合并

Posted by "xuzhiwen1255 (via GitHub)" <gi...@apache.org>.

xuzhiwen1255 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1465813401

   @luxl6 I suggest you use English. No one here uses Chinese


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

Re: [I] 数据合并 [iceberg]

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.

github-actions[bot] commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1861893339

   This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] luxl6 commented on issue #7092: 数据合并

Posted by "luxl6 (via GitHub)" <gi...@apache.org>.

luxl6 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1467224585

    <!-- https://mvnrepository.com/artifact/org.apache.hadoop/hadoop-aws -->
           <dependency>
               <groupId>org.apache.hadoop</groupId>
               <artifactId>hadoop-aws</artifactId>
               <version>${hadoop.version}</version>
           </dependency>
           <!-- https://mvnrepository.com/artifact/com.amazonaws/aws-java-sdk-bundle -->
           <dependency>
               <groupId>com.amazonaws</groupId>
               <artifactId>aws-java-sdk-bundle</artifactId>
               <version>1.11.271</version>
           </dependency>
   
   exist   My Hadoop is 3.3.1
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] luxl6 commented on issue #7092: 数据合并

Posted by "luxl6 (via GitHub)" <gi...@apache.org>.

luxl6 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1465880663

   > You need aws dependency
   
   <properties>
           <maven.compiler.source>8</maven.compiler.source>
           <maven.compiler.target>8</maven.compiler.target>
           <scala.version>2.12</scala.version>
           <flink.version>1.15.3</flink.version>
           <hadoop.version>3.3.1</hadoop.version>
       </properties>
   
       <dependencies>
           <!-- Flink SQL & Table-->
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-table-runtime</artifactId>
               <version>${flink.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-table-common</artifactId>
               <version>${flink.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-table-api-java</artifactId>
               <version>${flink.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-table-planner_${scala.version}</artifactId>
               <version>${flink.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
   
           <dependency>
               <groupId>junit</groupId>
               <artifactId>junit</artifactId>
               <version>4.11</version>
   <!--            <scope>provided</scope>-->
           </dependency>
   
           <!--        flink 环境包-->
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-java</artifactId>
               <version>${flink.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-streaming-java</artifactId>
               <version>${flink.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-clients</artifactId>
               <version>${flink.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
   
           <!-- https://mvnrepository.com/artifact/org.apache.flink/flink-s3-fs-hadoop -->
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-s3-fs-hadoop</artifactId>
               <version>${flink.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
   
   
   
           <!--        flink连接jdbc-->
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-connector-jdbc</artifactId>
               <version>${flink.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
   
          
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-connector-hive_2.12</artifactId>
               <version>${flink.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-table-api-java-bridge</artifactId>
               <version>${flink.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
   
           <dependency>
               <groupId>org.apache.hadoop</groupId>
               <artifactId>hadoop-client</artifactId>
               <version>${hadoop.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
           <dependency>
               <groupId>org.apache.hadoop</groupId>
               <artifactId>hadoop-common</artifactId>
               <version>${hadoop.version}</version>
   <!--            <scope>provided</scope>-->
           </dependency>
   
           <!-- https://mvnrepository.com/artifact/org.apache.hadoop/hadoop-hdfs-client -->
           <dependency>
               <groupId>org.apache.hadoop</groupId>
               <artifactId>hadoop-hdfs-client</artifactId>
               <version>${hadoop.version}</version>
           </dependency>
   
           <!-- https://mvnrepository.com/artifact/org.apache.hadoop/hadoop-mapreduce-client-core -->
           <dependency>
               <groupId>org.apache.hadoop</groupId>
               <artifactId>hadoop-mapreduce-client-core</artifactId>
               <version>3.3.1</version>
           </dependency>
   
   
           <!-- https://mvnrepository.com/artifact/org.apache.hadoop/hadoop-aws -->
    <dependency>
               <groupId>org.apache.hadoop</groupId>
               <artifactId>hadoop-aws</artifactId>
               <version>${hadoop.version}</version>
           </dependency>
           <!-- https://mvnrepository.com/artifact/com.amazonaws/aws-java-sdk-bundle -->
           <dependency>
               <groupId>com.amazonaws</groupId>
               <artifactId>aws-java-sdk-bundle</artifactId>
               <version>1.11.271</version>
           </dependency>
   
   
   
           <!-- Hive Dependency -->
           <dependency>
               <groupId>org.apache.hive</groupId>
               <artifactId>hive-exec</artifactId>
               <version>2.3.9</version>
   <!--            <scope>provided</scope>-->
           </dependency>
   
   
           <!-- mongodb -->
   
   
           <!-- HBase -->
           <dependency>
               <groupId>org.apache.flink</groupId>
               <artifactId>flink-sql-connector-hbase-2.2</artifactId>
               <version>${flink.version}</version>
               <scope>provided</scope>
           </dependency>
           <dependency>
               <groupId>org.apache.hbase</groupId>
               <artifactId>hbase-server</artifactId>
               <version>1.2.4</version>
               <scope>provided</scope>
           </dependency>
           <dependency>
               <groupId>org.apache.hbase</groupId>
               <artifactId>hbase-client</artifactId>
               <version>1.2.4</version>
               <scope>provided</scope>
           </dependency>
   
           <!-- Flink 操作Iceberg 需要的Iceberg依赖 minio连接包-->
           <dependency>
               <groupId>iceberg</groupId>
               <artifactId>iceberg-flink-runtime-1.15</artifactId>
               <version>1.1.0</version>
   <!--            <scope>provided</scope>-->
           </dependency>
           <dependency>
               <groupId>iceberg</groupId>
               <artifactId>bundle</artifactId>
               <version>2.17.257</version>
   <!--            <scope>provided</scope>-->
           </dependency>
           <dependency>
               <groupId>software.amazon.awssdk</groupId>
               <artifactId>url-connection-client</artifactId>
               <version>2.17.257</version>
   <!--            <scope>provided</scope>-->
           </dependency>
   
           <!--      postgresql连接  -->
           <dependency>
               <groupId>org.postgresql</groupId>
               <artifactId>postgresql</artifactId>
               <version>42.5.1</version>
   <!--            <scope>provided</scope>-->
           </dependency>
           <!--mysql连接        -->
           <dependency>
               <groupId>org.apache</groupId>
               <artifactId>mysql-connector-java</artifactId>
               <version>5.1.48</version>
   <!--            <scope>provided</scope>-->
           </dependency>
       </dependencies>
   
   
   This is all my dependencies, can you see which one is missing


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

Re: [I] 数据合并 [iceberg]

Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.

github-actions[bot] commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1873537276

   This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale'


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] luxl6 commented on issue #7092: 数据合并

Posted by "luxl6 (via GitHub)" <gi...@apache.org>.

luxl6 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1469235521

    Configuration conf = new Configuration();
           conf.set("fs.s3a.connection.ssl.enabled", "false");
           conf.set("fs.s3a.endpoint", "http://ip:9000");
           conf.set("fs.s3a.access.key", "minioadmin");
           conf.set("fs.s3a.secret.key", "minioadmin");
           conf.set("fs.s3a.path.style.access", "true");
           conf.set("fs.s3a.impl", "org.apache.hadoop.fs.s3a.S3AFileSystem");
           conf.set("fs.s3a.fast.upload", "true");
           conf.set("catalog-impl", "org.apache.iceberg.jdbc.JdbcCatalog");
           conf.set("uri", "jdbc:postgresql://ip:5432/icebergcat");
           conf.set("user", "PostgreSQL");
           conf.set("password", "PostgreSQL");
   
   
           TableLoader tableLoader = TableLoader.fromHadoopTable("s3a://iceberg/flink_db/flink_hb",conf);
           tableLoader.open();
           Table table = tableLoader.loadTable();
           RewriteDataFilesActionResult result = Actions.forTable(table)
                   .rewriteDataFiles()
                   .execute();
   
   Teacher, my main problem is that my catalog is not Hadoop catalog, but jdbc catalog. I merged Hadoop catalog successfully. I use jdbc to store catalog, and when I use Hadoop to merge files on minio, it cannot find it. Related table information, so the report does not exist all the time. Table does not exist at location: s3a://iceberg/flink_db/flink_hb


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] xuzhiwen1255 commented on issue #7092: 数据合并

Posted by "xuzhiwen1255 (via GitHub)" <gi...@apache.org>.

xuzhiwen1255 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1467195544

    hadoop-aws aws-java-sdk-bundle 1.11.271 Does it exist?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] luxl6 commented on issue #7092: 数据合并

Posted by "luxl6 (via GitHub)" <gi...@apache.org>.

luxl6 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1465793922

   > 参见：https://iceberg.apache.org/docs/latest/flink/#rewrite-files-action
   
   官网这个说的是Hadoop的吧，大佬们，求指教


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] luxl6 commented on issue #7092: 数据合并

Posted by "luxl6 (via GitHub)" <gi...@apache.org>.

luxl6 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1465862622

   > 参见：https://iceberg.apache.org/docs/latest/flink/#rewrite-files-action
   线程 “main” org.apache.iceberg.exceptions.NoSuchTableException： Table 在 location： s3a：//iceberg/flink_db/flink_hb at org.apache.iceberg.hadoop.HadoopTables.load（HadoopTables.java：97）
   at org.apache.iceberg.flink.TableLoader$HadoopTableLoader.loadTable（TableLoader.java：76）
   
   
   The report does not exist, my catalog uses jdbc mode, pgsql exists, and the data exists in minio, how do I merge data? ?
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] luxl6 commented on issue #7092: 数据合并

Posted by "luxl6 (via GitHub)" <gi...@apache.org>.

luxl6 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1465867360

    org.apache.hadoop.fs.UnsupportedFileSystemException: No FileSystem for scheme "s3"
   
   Does Hadoop not support the s3 protocol?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] xuzhiwen1255 commented on issue #7092: 数据合并

Posted by "xuzhiwen1255 (via GitHub)" <gi...@apache.org>.

xuzhiwen1255 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1465705026

   see:https://iceberg.apache.org/docs/latest/flink/#rewrite-files-action


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org

[GitHub] [iceberg] luxl6 commented on issue #7092: 数据合并

Posted by "luxl6 (via GitHub)" <gi...@apache.org>.

luxl6 commented on issue #7092:
URL: https://github.com/apache/iceberg/issues/7092#issuecomment-1467225621

   > hadoop-aws aws-java-sdk-bundle 1.11.271 Does it exist?
   
   I use flink + iceberg + minio, if I want to merge data, the teacher will teach me


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org

For queries about this service, please contact Infrastructure at:
users@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org