You are viewing a plain text version of this content. The canonical link for it is here.
Posted to issues@iceberg.apache.org by "ming12713 (via GitHub)" <gi...@apache.org> on 2023/05/26 07:52:10 UTC
[GitHub] [iceberg] ming12713 opened a new issue, #7709: org.apache.iceberg.exceptions.ServiceFailureException: Server error: SdkClientException: Received an UnknownHostException when attempting to interact with a service. See cause for the exact endpoint that is failing to resolve. If this is happening on an endpoint that previoorg.apache.iceberg.exceptions.ServiceFailureException: Server error: SdkClientException: Received an UnknownHostException when attempting to interact with a service. See cause for the exact endpoint that is failing to resolve. If this is happening on an endpoint that previously worked, there may be a network connectivity issue or your DNS cache could be storing endpoints for too long.usly worked, there may be a network connectivity issue or your DNS cache could be storing endpoints for too long.
ming12713 opened a new issue, #7709:
URL: https://github.com/apache/iceberg/issues/7709
### Apache Iceberg version
1.2.1 (latest release)
### Query engine
Spark
### Please describe the bug 🐞
Follow this [bkm](https://iceberg.apache.org/spark-quickstart/) test rest catalog is ok
Attempting to modify spark and restcata to with production MinIO address <los.xxxx.com>, there is an issue with can't resolve dns "bucket.los.xxxx.com"
![Screenshot from 2023-05-26 15-41-30](https://github.com/apache/iceberg/assets/5196113/7415a0d3-1e9d-46f2-8d88-fd351e4240a6)
rest cataserver log
```
org.apache.iceberg.exceptions.RESTException: Unhandled error: ErrorResponse(code=500, type=SdkClientException, message=Received an UnknownHostException when attempting to interact with a service. See cause for the exact endpoint that is failing to resolve. If this is happening on an endpoint that previously worked, there may be a network connectivity issue or your DNS cache could be storing endpoints for too long.)
software.amazon.awssdk.core.exception.SdkClientException: Received an UnknownHostException when attempting to interact with a service. See cause for the exact endpoint that is failing to resolve. If this is happening on an endpoint that previously worked, there may be a network connectivity issue or your DNS cache could be storing endpoints for too long.
at software.amazon.awssdk.core.exception.SdkClientException$BuilderImpl.build(SdkClientException.java:111)
at software.amazon.awssdk.awscore.interceptor.HelpfulUnknownHostExceptionInterceptor.modifyException(HelpfulUnknownHostExceptionInterceptor.java:59)
at software.amazon.awssdk.core.interceptor.ExecutionInterceptorChain.modifyException(ExecutionInterceptorChain.java:202)
at software.amazon.awssdk.core.internal.http.pipeline.stages.utils.ExceptionReportingUtils.runModifyException(ExceptionReportingUtils.java:54)
at software.amazon.awssdk.core.internal.http.pipeline.stages.utils.ExceptionReportingUtils.reportFailureToInterceptors(ExceptionReportingUtils.java:38)
at software.amazon.awssdk.core.internal.http.pipeline.stages.ExecutionFailureExceptionReportingStage.execute(ExecutionFailureExceptionReportingStage.java:39)
at software.amazon.awssdk.core.internal.http.pipeline.stages.ExecutionFailureExceptionReportingStage.execute(ExecutionFailureExceptionReportingStage.java:26)
at software.amazon.awssdk.core.internal.http.AmazonSyncHttpClient$RequestExecutionBuilderImpl.execute(AmazonSyncHttpClient.java:193)
at software.amazon.awssdk.core.internal.handler.BaseSyncClientHandler.invoke(BaseSyncClientHandler.java:103)
at software.amazon.awssdk.core.internal.handler.BaseSyncClientHandler.doExecute(BaseSyncClientHandler.java:171)
at software.amazon.awssdk.core.internal.handler.BaseSyncClientHandler.lambda$execute$1(BaseSyncClientHandler.java:82)
at software.amazon.awssdk.core.internal.handler.BaseSyncClientHandler.measureApiCallSuccess(BaseSyncClientHandler.java:179)
at software.amazon.awssdk.core.internal.handler.BaseSyncClientHandler.execute(BaseSyncClientHandler.java:76)
at software.amazon.awssdk.core.client.handler.SdkSyncClientHandler.execute(SdkSyncClientHandler.java:45)
at software.amazon.awssdk.awscore.client.handler.AwsSyncClientHandler.execute(AwsSyncClientHandler.java:56)
at software.amazon.awssdk.services.s3.DefaultS3Client.putObject(DefaultS3Client.java:9321)
at org.apache.iceberg.aws.s3.S3OutputStream.completeUploads(S3OutputStream.java:435)
at org.apache.iceberg.aws.s3.S3OutputStream.close(S3OutputStream.java:269)
at java.base/sun.nio.cs.StreamEncoder.implClose(StreamEncoder.java:347)
at java.base/sun.nio.cs.StreamEncoder.close(StreamEncoder.java:169)
at java.base/java.io.OutputStreamWriter.close(OutputStreamWriter.java:252)
at org.apache.iceberg.TableMetadataParser.$closeResource(TableMetadataParser.java:131)
at org.apache.iceberg.TableMetadataParser.internalWrite(TableMetadataParser.java:131)
at org.apache.iceberg.TableMetadataParser.overwrite(TableMetadataParser.java:114)
at org.apache.iceberg.BaseMetastoreTableOperations.writeNewMetadata(BaseMetastoreTableOperations.java:170)
at org.apache.iceberg.BaseMetastoreTableOperations.writeNewMetadataIfRequired(BaseMetastoreTableOperations.java:160)
at org.apache.iceberg.jdbc.JdbcTableOperations.doCommit(JdbcTableOperations.java:106)
at org.apache.iceberg.BaseMetastoreTableOperations.commit(BaseMetastoreTableOperations.java:135)
at org.apache.iceberg.BaseMetastoreCatalog$BaseMetastoreCatalogTableBuilder.create(BaseMetastoreCatalog.java:199)
at org.apache.iceberg.rest.CatalogHandlers.createTable(CatalogHandlers.java:214)
at org.apache.iceberg.rest.RESTCatalogAdapter.handleRequest(RESTCatalogAdapter.java:319)
at org.apache.iceberg.rest.RESTCatalogAdapter.execute(RESTCatalogAdapter.java:384)
at org.apache.iceberg.rest.RESTCatalogServlet.execute(RESTCatalogServlet.java:100)
at org.apache.iceberg.rest.RESTCatalogServlet.doPost(RESTCatalogServlet.java:78)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:707)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:790)
at org.eclipse.jetty.servlet.ServletHolder.handle(ServletHolder.java:799)
at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:550)
at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143)
at org.eclipse.jetty.server.handler.gzip.GzipHandler.handle(GzipHandler.java:713)
at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
at org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandler.java:235)
at org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandler.java:1434)
at org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:188)
at org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:501)
at org.eclipse.jetty.server.handler.ScopedHandler.nextScope(ScopedHandler.java:186)
at org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandler.java:1349)
at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:141)
at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127)
at org.eclipse.jetty.server.Server.handle(Server.java:516)
at org.eclipse.jetty.server.HttpChannel.lambda$handle$1(HttpChannel.java:388)
at org.eclipse.jetty.server.HttpChannel.dispatch(HttpChannel.java:633)
at org.eclipse.jetty.server.HttpChannel.handle(HttpChannel.java:380)
at org.eclipse.jetty.server.HttpConnection.onFillable(HttpConnection.java:277)
at org.eclipse.jetty.io.AbstractConnection$ReadCallback.succeeded(AbstractConnection.java:311)
at org.eclipse.jetty.io.FillInterest.fillable(FillInterest.java:105)
at org.eclipse.jetty.io.ChannelEndPoint$1.run(ChannelEndPoint.java:104)
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.runTask(EatWhatYouKill.java:338)
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.doProduce(EatWhatYouKill.java:315)
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.tryProduce(EatWhatYouKill.java:173)
at org.eclipse.jetty.util.thread.strategy.EatWhatYouKill.run(EatWhatYouKill.java:131)
at org.eclipse.jetty.util.thread.ReservedThreadExecutor$ReservedThread.run(ReservedThreadExecutor.java:386)
at org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPool.java:883)
at org.eclipse.jetty.util.thread.QueuedThreadPool$Runner.run(QueuedThreadPool.java:1034)
at java.base/java.lang.Thread.run(Thread.java:833)
Caused by: software.amazon.awssdk.core.exception.SdkClientException: Unable to execute HTTP request: xx.los.xxxx.com
at software.amazon.awssdk.core.exception.SdkClientException$BuilderImpl.build(SdkClientException.java:111)
at software.amazon.awssdk.core.exception.SdkClientException.create(SdkClientException.java:47)
at software.amazon.awssdk.core.internal.http.pipeline.stages.utils.RetryableStageHelper.setLastException(RetryableStageHelper.java:223)
at software.amazon.awssdk.core.internal.http.pipeline.stages.RetryableStage.execute(RetryableStage.java:83)
at software.amazon.awssdk.core.internal.http.pipeline.stages.RetryableStage.execute(RetryableStage.java:36)
at software.amazon.awssdk.core.internal.http.pipeline.RequestPipelineBuilder$ComposingRequestPipelineStage.execute(RequestPipelineBuilder.java:206)
at software.amazon.awssdk.core.internal.http.StreamManagingStage.execute(StreamManagingStage.java:56)
at software.amazon.awssdk.core.internal.http.StreamManagingStage.execute(StreamManagingStage.java:36)
at software.amazon.awssdk.core.internal.http.pipeline.stages.ApiCallTimeoutTrackingStage.executeWithTimer(ApiCallTimeoutTrackingStage.java:80)
at software.amazon.awssdk.core.internal.http.pipeline.stages.ApiCallTimeoutTrackingStage.execute(ApiCallTimeoutTrackingStage.java:60)
at software.amazon.awssdk.core.internal.http.pipeline.stages.ApiCallTimeoutTrackingStage.execute(ApiCallTimeoutTrackingStage.java:42)
at software.amazon.awssdk.core.internal.http.pipeline.stages.ApiCallMetricCollectionStage.execute(ApiCallMetricCollectionStage.java:48)
at software.amazon.awssdk.core.internal.http.pipeline.stages.ApiCallMetricCollectionStage.execute(ApiCallMetricCollectionStage.java:31)
at software.amazon.awssdk.core.internal.http.pipeline.RequestPipelineBuilder$ComposingRequestPipelineStage.execute(RequestPipelineBuilder.java:206)
at software.amazon.awssdk.core.internal.http.pipeline.RequestPipelineBuilder$ComposingRequestPipelineStage.execute(RequestPipelineBuilder.java:206)
at software.amazon.awssdk.core.internal.http.pipeline.stages.ExecutionFailureExceptionReportingStage.execute(ExecutionFailureExceptionReportingStage.java:37)
... 59 more
Suppressed: software.amazon.awssdk.core.exception.SdkClientException: Request attempt 1 failure: Unable to execute HTTP request: xx.los.xxxx.com
Suppressed: software.amazon.awssdk.core.exception.SdkClientException: Request attempt 2 failure: Unable to execute HTTP request: xx.los.xxxx.com
Suppressed: software.amazon.awssdk.core.exception.SdkClientException: Request attempt 3 failure: Unable to execute HTTP request: xx.los.xxxx.com
Caused by: java.net.UnknownHostException: xx.los.xxxx.com
```
**However, using IP resolution instead of the domain name does not encounter any problems.**
spark conf
```
spark.sql.extensions org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions
spark.sql.catalog.demo org.apache.iceberg.spark.SparkCatalog
spark.sql.catalog.demo.catalog-impl org.apache.iceberg.rest.RESTCatalog
spark.sql.catalog.demo.uri http://rest:8181
spark.sql.catalog.demo.io-impl org.apache.iceberg.aws.s3.S3FileIO
spark.sql.catalog.demo.warehouse s3a://xx/
spark.sql.catalog.demo.s3.endpoint http://los.xxxx.com:9000
spark.sql.defaultCatalog demo
spark.eventLog.enabled true
spark.eventLog.dir /home/iceberg/spark-events
spark.history.fs.logDirectory /home/iceberg/spark-events
spark.sql.catalogImplementation in-memory
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
[GitHub] [iceberg] ming12713 commented on issue #7709: reset catalog can't reslove s3 domain name
Posted by "ming12713 (via GitHub)" <gi...@apache.org>.
ming12713 commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-1569403840
> Same here! It seems that within the Spark context, a different DNS resolver is used. We currently did an ugly workaround by giving the container a fixed IP and reference this in the config.
>
> Any clean solve is much appreciated!
i also use ip as workaroud solution
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
Re: [I] reset catalog can't reslove s3 domain name [iceberg]
Posted by "github-actions[bot] (via GitHub)" <gi...@apache.org>.
github-actions[bot] commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-1837652801
This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
[GitHub] [iceberg] Fokko commented on issue #7709: reset catalog can't reslove s3 domain name
Posted by "Fokko (via GitHub)" <gi...@apache.org>.
Fokko commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-1570413628
Can you try setting:
```
s3.path-style-access=true
```
This can be done on a catalog level:
```sh
spark.sql.extensions org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions
spark.sql.catalog.demo org.apache.iceberg.spark.SparkCatalog
spark.sql.catalog.demo.catalog-impl org.apache.iceberg.rest.RESTCatalog
spark.sql.catalog.demo.uri http://rest:8181
spark.sql.catalog.demo.io-impl org.apache.iceberg.aws.s3.S3FileIO
spark.sql.catalog.demo.warehouse s3a://xx/
spark.sql.catalog.demo.s3.endpoint http://los.xxxx.com:9000
spark.sql.catalog.demo.s3.path-style-access. true
spark.sql.defaultCatalog demo
spark.eventLog.enabled true
spark.eventLog.dir /home/iceberg/spark-events
spark.history.fs.logDirectory /home/iceberg/spark-events
spark.sql.catalogImplementation in-memory
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
Re: [I] reset catalog can't reslove s3 domain name [iceberg]
Posted by "sg-c0de (via GitHub)" <gi...@apache.org>.
sg-c0de commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-1993790181
> Is there anyone who solved this issue?
Try to add env variable to the rest container: CATALOG_S3_PATH__STYLE__ACCESS: true
It'll be [converted](https://github.com/tabular-io/iceberg-rest-image/blob/2e4d04184e6db38f23a98498151aa18bb6c148ab/src/main/java/org/apache/iceberg/rest/RESTCatalogServer.java#L54) to s3.path-style-access=true on catalog creation and passed to [S3FileIO ](https://github.com/apache/iceberg/blob/43c3397528101859250160f123a0749bae79fb4d/aws/src/main/java/org/apache/iceberg/aws/s3/S3FileIOProperties.java#L178)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
Re: [I] reset catalog can't reslove s3 domain name [iceberg]
Posted by "springbearpro (via GitHub)" <gi...@apache.org>.
springbearpro commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-2028825632
@sg-c0de Thanks. Saved my day. I can't find it in the iceberg document though.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
[GitHub] [iceberg] SvenThies commented on issue #7709: reset catalog can't reslove s3 domain name
Posted by "SvenThies (via GitHub)" <gi...@apache.org>.
SvenThies commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-1568783459
Same here! It seems that within the Spark context, a different DNS resolver is used. We currently did an ugly workaround by giving the container a fixed IP and reference this in the config.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
Re: [I] reset catalog can't reslove s3 domain name [iceberg]
Posted by "sg-c0de (via GitHub)" <gi...@apache.org>.
sg-c0de commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-2028827829
> > > Is there anyone who solved this issue?
> >
> >
> > Try to add env variable to the rest container: `CATALOG_S3_PATH__STYLE__ACCESS: true`
> > It'll be [converted](https://github.com/tabular-io/iceberg-rest-image/blob/2e4d04184e6db38f23a98498151aa18bb6c148ab/src/main/java/org/apache/iceberg/rest/RESTCatalogServer.java#L54) to `s3.path-style-access=true` when the catalog is created and passed to [S3FileIO ](https://github.com/apache/iceberg/blob/43c3397528101859250160f123a0749bae79fb4d/aws/src/main/java/org/apache/iceberg/aws/s3/S3FileIOProperties.java#L178)
> > @sg-c0de
> > Can the jdbc catalog be configured like this?
@springbearpro Sure, you can do it by passing this parameter in the config of ETL tool you use. For spark in spark-defaults.conf it will be like:
spark.sql.catalog.catalof_name.s3.path-style-access true
Full listing
```
spark.sql.extensions org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions
spark.sql.catalog.demo org.apache.iceberg.spark.SparkCatalog
spark.sql.catalog.demo.type jdbc
spark.sql.catalog.demo.uri jdbc:postgresql://192.168.0.16:5432/jdbc_catalog_test
spark.sql.catalog.demo.jdbc.verifyServerCertificate false
spark.sql.catalog.demo.jdbc.useSSL false
spark.sql.catalog.demo.jdbc.user jdbc
spark.sql.catalog.demo.jdbc.password pass
spark.sql.catalog.demo.io-impl org.apache.iceberg.aws.s3.S3FileIO
spark.sql.catalog.demo.warehouse s3://iceberg-test-jdbc/
spark.sql.catalog.demo.s3.endpoint https://minio.example.com
spark.sql.catalog.demo.s3.path-style-access true
spark.sql.defaultCatalog demo
spark.eventLog.enabled true
spark.eventLog.dir /home/iceberg/spark-events
spark.history.fs.logDirectory /home/iceberg/spark-events
spark.sql.catalogImplementation in-memory
```
For trino it will be **s3.path-style-access** in the catalog properties
https://trino.io/docs/current/object-storage/file-system-s3.html
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
Re: [I] reset catalog can't reslove s3 domain name [iceberg]
Posted by "dongsupkim-onepredict (via GitHub)" <gi...@apache.org>.
dongsupkim-onepredict commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-1965763850
Is there anyone who solved this issue?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
Re: [I] reset catalog can't reslove s3 domain name [iceberg]
Posted by "ming12713 (via GitHub)" <gi...@apache.org>.
ming12713 commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-2008705570
> > Is there anyone who solved this issue?
>
> Try to add env variable to the rest container: `CATALOG_S3_PATH__STYLE__ACCESS: true`
>
> It'll be [converted](https://github.com/tabular-io/iceberg-rest-image/blob/2e4d04184e6db38f23a98498151aa18bb6c148ab/src/main/java/org/apache/iceberg/rest/RESTCatalogServer.java#L54) to `s3.path-style-access=true` when the catalog is created and passed to [S3FileIO ](https://github.com/apache/iceberg/blob/43c3397528101859250160f123a0749bae79fb4d/aws/src/main/java/org/apache/iceberg/aws/s3/S3FileIOProperties.java#L178)
@sg-c0de
Can the JDBC type catalog be configured like this?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
Re: [I] reset catalog can't reslove s3 domain name [iceberg]
Posted by "sg-c0de (via GitHub)" <gi...@apache.org>.
sg-c0de commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-2029161591
> > > Is there anyone who solved this issue?
> >
> >
> > Try to add env variable to the rest container: `CATALOG_S3_PATH__STYLE__ACCESS: true`
> > It'll be [converted](https://github.com/tabular-io/iceberg-rest-image/blob/2e4d04184e6db38f23a98498151aa18bb6c148ab/src/main/java/org/apache/iceberg/rest/RESTCatalogServer.java#L54) to `s3.path-style-access=true` when the catalog is created and passed to [S3FileIO ](https://github.com/apache/iceberg/blob/43c3397528101859250160f123a0749bae79fb4d/aws/src/main/java/org/apache/iceberg/aws/s3/S3FileIOProperties.java#L178)
> > @sg-c0de
> > Can the jdbc catalog be configured like this?
@ming12713 Sure, you can do it by passing this parameter in the config of ETL tool you use. For spark in spark-defaults.conf it will be like:
spark.sql.catalog.catalof_name.s3.path-style-access true
Full listing
```
spark.sql.extensions org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions
spark.sql.catalog.demo org.apache.iceberg.spark.SparkCatalog
spark.sql.catalog.demo.type jdbc
spark.sql.catalog.demo.uri jdbc:postgresql://192.168.0.16:5432/jdbc_catalog_test
spark.sql.catalog.demo.jdbc.verifyServerCertificate false
spark.sql.catalog.demo.jdbc.useSSL false
spark.sql.catalog.demo.jdbc.user jdbc
spark.sql.catalog.demo.jdbc.password pass
spark.sql.catalog.demo.io-impl org.apache.iceberg.aws.s3.S3FileIO
spark.sql.catalog.demo.warehouse s3://iceberg-test-jdbc/
spark.sql.catalog.demo.s3.endpoint https://minio.example.com
spark.sql.catalog.demo.s3.path-style-access true
spark.sql.defaultCatalog demo
spark.eventLog.enabled true
spark.eventLog.dir /home/iceberg/spark-events
spark.history.fs.logDirectory /home/iceberg/spark-events
spark.sql.catalogImplementation in-memory
```
For trino it will be **s3.path-style-access** in the catalog properties
https://trino.io/docs/current/object-storage/file-system-s3.html
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
Re: [I] reset catalog can't reslove s3 domain name [iceberg]
Posted by "dongsupkim-onepredict (via GitHub)" <gi...@apache.org>.
dongsupkim-onepredict commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-2008658279
It works Thanks! @sg-c0de
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
[GitHub] [iceberg] ming12713 commented on issue #7709: reset catalog can't reslove s3 domain name
Posted by "ming12713 (via GitHub)" <gi...@apache.org>.
ming12713 commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-1577843182
> Can you try setting:
>
> ```
> s3.path-style-access=true
> ```
>
> This can be done on a catalog level:
>
> ```shell
> spark.sql.extensions org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions
> spark.sql.catalog.demo org.apache.iceberg.spark.SparkCatalog
> spark.sql.catalog.demo.catalog-impl org.apache.iceberg.rest.RESTCatalog
> spark.sql.catalog.demo.uri http://rest:8181
> spark.sql.catalog.demo.io-impl org.apache.iceberg.aws.s3.S3FileIO
> spark.sql.catalog.demo.warehouse s3a://xx/
> spark.sql.catalog.demo.s3.endpoint http://los.xxxx.com:9000
> spark.sql.catalog.demo.s3.path-style-access true
> spark.sql.defaultCatalog demo
> spark.eventLog.enabled true
> spark.eventLog.dir /home/iceberg/spark-events
> spark.history.fs.logDirectory /home/iceberg/spark-events
> spark.sql.catalogImplementation in-memory
>
Thanks @Fokko
But, Failed to modify sql.catalog
```
spark.hadoop.fs.s3a.aws.credentials.provider org.apache.hadoop.fs.s3a.SimpleAWSCredentialsProvider
spark.hadoop.fs.s3a.impl org.apache.hadoop.fs.s3a.S3AFileSystem
spark.hadoop.fs.s3a.endpoint.region cn-north-1
spark.hadoop.fs.s3a.access.key xm2U3ww
spark.hadoop.fs.s3a.secret.key eDMZQAulRQJm
spark.hadoop.fs.s3a.endpoint http://los.xxx.com
spark.hadoop.fs.s3a.bucket.probe 0
spark.hadoop.fs.s3a.change.detection.version.required false
spark.hadoop.fs.s3a.change.detection.mode none
spark.hadoop.fs.s3a.path.style.access true
# spark jdbc
spark.sql.extensions org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions
spark.sql.catalog.jdbc org.apache.iceberg.spark.SparkCatalog
spark.sql.catalog.jdbc.catalog-impl org.apache.iceberg.jdbc.JdbcCatalog
spark.sql.catalog.jdbc.uri jdbc:postgresql://10.0.162.5:31884/jdbc_gm
spark.sql.catalog.jdbc.jdbc.user postgres
spark.sql.catalog.jdbc.jdbc.password QzBodddd
spark.sql.catalog.jdbc.warehouse s3://minio-jdbc
spark.sql.catalog.jdbc.io-impl org.apache.iceberg.aws.s3.S3FileIO
spark.sql.catalog.jdbc.s3.endpoint http://los.xxx.com
spark.sql.catalog.jdbc.s3.path-style-access true
spark.eventLog.enabled true
```
errors output
![Screenshot from 2023-06-06 11-25-34](https://github.com/apache/iceberg/assets/5196113/d565c88a-3ba7-44da-af4c-236717017ef0)
AWS profile is configured in os profile
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
[GitHub] [iceberg] SvenThies commented on issue #7709: reset catalog can't reslove s3 domain name
Posted by "SvenThies (via GitHub)" <gi...@apache.org>.
SvenThies commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-1571479183
Hey @Fokko,
thanks for the suggestions. Unfortunately, it doesn't work. We tried the following
```
spark.sql.extensions org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions
spark.sql.catalog.default org.apache.iceberg.spark.SparkCatalog
spark.sql.catalog.default.catalog-impl org.apache.iceberg.rest.RESTCatalog
spark.sql.catalog.default.uri http://rest:8181
spark.sql.catalog.default.io-impl org.apache.iceberg.aws.s3.S3FileIO
spark.sql.catalog.default.warehouse s3://dev-customer-analytics/
spark.sql.catalog.default.s3.endpoint http://minio:9000
spark.sql.catalog.default.s3.path-style-access true
spark.sql.defaultCatalog default
spark.sql.sources.partitionOverwriteMode dynamic
spark.eventLog.enabled true
spark.eventLog.dir /home/iceberg/spark-events
spark.history.fs.logDirectory /home/iceberg/spark-events
spark.sql.catalogImplementation in-memory
spark.jars.packages org.apache.spark:spark-sql-kafka-0-10_2.12:3.3.2,org.apache.kafka:kafka-clients:2.8.1,org.apache.commons:commons-pool2:2.11.1,org.apache.spark:spark-token-provider-kafka-0-10_2.12:3.3.2,com.google.code.findbugs:jsr305:3.0.0,org.apache.spark:spark-tags_2.12:3.3.2
```
Additionally, we tried setting `spark.hadoop.fs.s3a.path.style.access true` in the `spark-defaults.conf`, but didn't work.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
Re: [I] reset catalog can't reslove s3 domain name [iceberg]
Posted by "ming12713 (via GitHub)" <gi...@apache.org>.
ming12713 commented on issue #7709:
URL: https://github.com/apache/iceberg/issues/7709#issuecomment-2029217628
> > > > Is there anyone who solved this issue?
> > >
> > >
> > > Try to add env variable to the rest container: `CATALOG_S3_PATH__STYLE__ACCESS: true`
> > > It'll be [converted](https://github.com/tabular-io/iceberg-rest-image/blob/2e4d04184e6db38f23a98498151aa18bb6c148ab/src/main/java/org/apache/iceberg/rest/RESTCatalogServer.java#L54) to `s3.path-style-access=true` when the catalog is created and passed to [S3FileIO ](https://github.com/apache/iceberg/blob/43c3397528101859250160f123a0749bae79fb4d/aws/src/main/java/org/apache/iceberg/aws/s3/S3FileIOProperties.java#L178)
> > > @sg-c0de
> > > Can the jdbc catalog be configured like this?
>
> @ming12713 Sure, you can do it by passing this parameter in the config of ETL tool you use. For spark in spark-defaults.conf it will be like: spark.sql.catalog.catalof_name.s3.path-style-access true
>
> Full listing
>
> ```
> spark.sql.extensions org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions
> spark.sql.catalog.demo org.apache.iceberg.spark.SparkCatalog
> spark.sql.catalog.demo.type jdbc
> spark.sql.catalog.demo.uri jdbc:postgresql://192.168.0.16:5432/jdbc_catalog_test
> spark.sql.catalog.demo.jdbc.verifyServerCertificate false
> spark.sql.catalog.demo.jdbc.useSSL false
> spark.sql.catalog.demo.jdbc.user jdbc
> spark.sql.catalog.demo.jdbc.password pass
> spark.sql.catalog.demo.io-impl org.apache.iceberg.aws.s3.S3FileIO
> spark.sql.catalog.demo.warehouse s3://iceberg-test-jdbc/
> spark.sql.catalog.demo.s3.endpoint https://minio.example.com
> spark.sql.catalog.demo.s3.path-style-access true
> spark.sql.defaultCatalog demo
> spark.eventLog.enabled true
> spark.eventLog.dir /home/iceberg/spark-events
> spark.history.fs.logDirectory /home/iceberg/spark-events
> spark.sql.catalogImplementation in-memory
> ```
>
> For trino it will be **s3.path-style-access** in the catalog properties https://trino.io/docs/current/object-storage/file-system-s3.html
@sg-c0de it's work thanks
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org
Re: [I] reset catalog can't reslove s3 domain name [iceberg]
Posted by "ming12713 (via GitHub)" <gi...@apache.org>.
ming12713 closed issue #7709: reset catalog can't reslove s3 domain name
URL: https://github.com/apache/iceberg/issues/7709
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For queries about this service, please contact Infrastructure at:
users@infra.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscribe@iceberg.apache.org
For additional commands, e-mail: issues-help@iceberg.apache.org